TransUNet
Scaling issue
The model does not scale across multiple GPUs attached to a single CPU. When using distributed data parallel, it creates multiple copies of the model and places all of the load on cuda:0 instead of distributing it evenly across all CUDA devices. This results in out-of-memory errors. I would appreciate your help on this.
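For reference, here is a minimal sketch of the per-process device pinning that PyTorch's DistributedDataParallel generally expects. It is not taken from the TransUNet code. It assumes the script is launched with torchrun (which sets the LOCAL_RANK environment variable) and one process per GPU. The helper names `local_device_index` and `wrap_model_for_ddp` are hypothetical. A common cause of every process landing on cuda:0 is that this pinning step is missing, though that may not be the cause here.

```python
import os


def local_device_index(local_rank: int, num_gpus: int) -> int:
    """Return the GPU index a process should own (hypothetical helper).

    Assumes one process per GPU, so local_rank must be in [0, num_gpus).
    """
    if not 0 <= local_rank < num_gpus:
        raise ValueError(
            f"local_rank {local_rank} has no matching GPU (found {num_gpus})"
        )
    return local_rank


def wrap_model_for_ddp(model):
    """Pin this process to its own GPU, then wrap the model in DDP.

    Assumes a torchrun launch; torch is imported here so the helper
    above can be used without it.
    """
    import torch
    import torch.distributed as dist
    from torch.nn.parallel import DistributedDataParallel as DDP

    local_rank = int(os.environ["LOCAL_RANK"])  # set by torchrun
    idx = local_device_index(local_rank, torch.cuda.device_count())

    # Make this GPU the default for the process, so bare .cuda() calls
    # do not fall back to device 0.
    torch.cuda.set_device(idx)
    dist.init_process_group(backend="nccl")

    model = model.to(f"cuda:{idx}")
    return DDP(model, device_ids=[idx], output_device=idx)
```

If a checkpoint is loaded before wrapping, passing `map_location` to `torch.load` (for example the process's own `cuda:<idx>` device) can also keep the restored tensors off cuda:0.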