Scott Hoang
Hi zee-fee, you can download the fix here: https://github.com/jwyang/faster-rcnn.pytorch/pull/894
@tjiagoM yes, I did: https://github.com/ultralytics/yolov3. I achieved much better results using their model.
@Khalifa1997 maybe I was a bit hasty. One hundred images is a very small sample, but it can be artificially multiplied with proper data augmentation. It depends on how many cls...
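As a rough illustration of how augmentation multiplies a small dataset, here is a minimal sketch in plain Python. It assumes images are 2D lists of pixel values; the helper names (hflip, vflip, augment) are hypothetical, and a real pipeline would more likely use an image library and would also have to transform any labels, such as bounding boxes, in the same way.

```python
def hflip(image):
    """Mirror the image left to right."""
    return [row[::-1] for row in image]


def vflip(image):
    """Mirror the image top to bottom."""
    return [row[:] for row in image[::-1]]


def augment(images):
    """Return each original image plus three flipped variants (4x the data)."""
    out = []
    for img in images:
        out.extend([img, hflip(img), vflip(img), hflip(vflip(img))])
    return out


# Toy 2x2 "image" standing in for a real sample (assumption for illustration).
dataset = [[[1, 2], [3, 4]]]
augmented = augment(dataset)
print(len(augmented))
```

Flips alone give a fourfold increase; crops, color jitter, and scaling can be stacked on top in the same manner.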
Are you using the PyTorch distributed package? If so, did you correctly set the default CUDA device for each local process rank? If not, this happens.
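For reference, a minimal sketch of binding each process to its own GPU. It assumes the job is launched with torchrun, which sets the LOCAL_RANK environment variable; the helper names (local_rank_from_env, bind_process_to_gpu) are hypothetical, and the torch import is guarded so the snippet still loads on a machine without PyTorch.

```python
import os


def local_rank_from_env(env=os.environ):
    """Read the per-node process index; defaults to 0 for single-process runs."""
    return int(env.get("LOCAL_RANK", "0"))


def bind_process_to_gpu(env=os.environ):
    """Make this process's default CUDA device match its local rank."""
    local_rank = local_rank_from_env(env)
    try:
        import torch
    except ImportError:
        # PyTorch not installed: nothing to bind, just report the rank.
        return local_rank
    if torch.cuda.is_available():
        # Without this call every process defaults to device 0.
        torch.cuda.set_device(local_rank)
    return local_rank
```

Calling bind_process_to_gpu() once at startup, before creating the model or the process group, is the usual place for it.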
self.cached prevents the model from recomputing the normalized graph over and over; it simply memorizes the result, since the graph is the same in the transductive setting. Without cached, each run might...
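To make the caching idea concrete, here is a minimal sketch in plain Python. It is not the library's actual implementation: the class name NormalizedAdjacency and the num_computations counter are hypothetical, and the graph is assumed to be a small dense adjacency matrix with non-negative weights. It computes the symmetric normalization D^{-1/2} (A + I) D^{-1/2} once and reuses it while cached is set.

```python
import math


class NormalizedAdjacency:
    def __init__(self, cached=True):
        self.cached = cached
        self._cache = None
        self.num_computations = 0  # counts how often the normalization runs

    def _normalize(self, adj):
        n = len(adj)
        # Add self-loops: A_hat = A + I.
        a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
                 for i in range(n)]
        deg = [sum(row) for row in a_hat]
        self.num_computations += 1
        # Symmetric normalization: A_hat[i][j] / sqrt(deg[i] * deg[j]).
        return [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
                for i in range(n)]

    def __call__(self, adj):
        # Reuse the stored result; the new input is ignored once cached.
        if self.cached and self._cache is not None:
            return self._cache
        result = self._normalize(adj)
        if self.cached:
            self._cache = result
        return result


adj = [[0.0, 1.0], [1.0, 0.0]]
with_cache = NormalizedAdjacency(cached=True)
without_cache = NormalizedAdjacency(cached=False)
for _ in range(3):
    with_cache(adj)
    without_cache(adj)
print(with_cache.num_computations, without_cache.num_computations)
```

Because the cached version ignores its input after the first call, it is only safe when the graph never changes between forward passes.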
Have you tried reducing the expansion_ratio?
Does this PR attempt to resolve the issue that arises from training with packed data?
So in a no_sync context, gradients are accumulated in FP32?
Extending on this (and maybe unrelated to the overall topic): in the current implementation of FSDP1, parameters are sharded across nodes in a multi-node scenario (the ZeRO-3 approach). Is there...