Justin

Results 2 issues of Justin

Adds broadcasting support for gather by adding dimensions (unsqueezing through _force_order using an overlapping order) and expanding.

Thanks for the code release! Heads up for other users who want to resume training from a checkpoint: you will want to 1. de-indent DDP_main.py:80 so that all devices can...