Question/Issue:
Hi, I am using a Nicla Vision for Object Detection. I have 5000 images from the Coco dataset (for testing, will be 75000) which encompasses 5 classes and I am trying to train the model with a GPU but it returns this error:
Node: ‘model/block_2_expand/Conv2D’
OOM when allocating tensor with shape[128,48,160,160] and type float on /job:localhost/replica:0/task:0/device:GPU:0 by allocator GPU_0_bfc
[[{{node model/block_2_expand/Conv2D}}]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info. This isn’t available when running in Eager mode.
[Op:__inference_train_function_502751]
Attached to job 19331449…
Application exited with code 1
Job failed (see above)
The model was generated successfully on CPU but took a very long time. When training with GPU, is the training occurring locally or via Edge Impulse’s servers?
Either way, any advice on training the models with GPU would be greatly appreciated.
Project ID:
Context/Use case:
Object Detection. The objective is to compare a model trained on large datasets (Coco) vs trained on smaller, custom datasets.