oslo
oslo copied to clipboard
Fix TP embedding layers
Describe a TODO feature
- Force tp_wrapper do not parallelize emb-layer if model has not embedding layer. (for vision model competible) https://discord.com/channels/729741769192767510/1012603449910759504/1083785802930192434
Assignees
- @jason9693