christopher-motional
Hi @mh0797, apologies for the delayed response, and thank you for the investigation. We are looking into this and making sure empty agent features are being handled correctly.
Hi @zhangdongkun98, yes, sorry for the delay, and thanks for the catch. Those do indeed sound like bugs; we will confirm and aim to get a fix up shortly.
Hi @Fan-Yixuan, sorry for the delay. Regarding your question about parameter scaling, the effective batch size is actually increased when using DDP, as the same specified batch size is used...
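To make the batch-size point concrete, here is a minimal sketch of the arithmetic, assuming the specified batch size applies per DDP process (the usual setup when each process has its own data loader). The function names and the linear learning-rate rule are illustrative assumptions, not something taken from the codebase discussed here; linear scaling is just one common heuristic.

```python
def effective_batch_size(per_process_batch_size: int, world_size: int) -> int:
    """Samples consumed per optimizer step across all DDP processes.

    Assumes every process loads `per_process_batch_size` samples per step.
    """
    if per_process_batch_size < 1 or world_size < 1:
        raise ValueError("batch size and world size must be positive")
    return per_process_batch_size * world_size


def linearly_scaled_lr(base_lr: float, world_size: int) -> float:
    """Hypothetical helper: scale the learning rate with the process count.

    This is the common linear-scaling heuristic, shown only as an example
    of how a hyperparameter might be adjusted for the larger batch.
    """
    if world_size < 1:
        raise ValueError("world size must be positive")
    return base_lr * world_size


if __name__ == "__main__":
    # Example: batch size 8 specified in the config, 4 GPUs in total.
    print(effective_batch_size(8, 4))
    print(linearly_scaled_lr(1e-4, 4))
```

With 4 processes and a specified batch size of 8, each optimizer step sees 32 samples in total, which is why hyperparameters tuned on a single GPU may need adjusting.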
Regarding your follow-up questions: 1. See the discussion @patk-motional linked to for a little more information, but a given lidar_pc can potentially be tagged with multiple...
Is this with distributed caching? If so, could you compare the results between single-node and multi-node runs?