kunfupanda-hw
> I have created a new branch that updates NVSHMEM to version 3.2.5. However, I don't have a RoCE environment for verification. Can you test this out? [@Baibaifan](https://github.com/Baibaifan) Branch: https://github.com/deepseek-ai/DeepEP/tree/roce-support...
> > > I have created a new branch that updates NVSHMEM to version 3.2.5. However, I don't have a RoCE environment for verification. Can you test this out? [@Baibaifan](https://github.com/Baibaifan)...
> Hi [@sphish](https://github.com/sphish), The process works, but the performance does not seem to meet expectations.
>
> env:
>
> 1. H100 80GB HBM3 *8/HPC
> 2. 4 * CX7...
> > Following the logs, we found that the call trace was `test_loop() --> buffer = deep_ep.Buffer() --> __init__ --> self.runtime.sync(device_ids, ipc_handles, root_unique_id)` in DeepEP and then `nvshmemt_ibrc_connect_endpoints --> nvshmemt_ibrc_ep_connect -->`...
> > perftest > > Could you please send me the command to run `perftest` for reference? [@kunfupanda-hw](https://github.com/kunfupanda-hw) server: `ib_write_bw -d your_card_name -a --report_gbits -n 1000 -q 16 --CPU-freq -p`...
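
For anyone who wants to script the server-side run, here is a minimal sketch that assembles the `ib_write_bw` argument list from the flags visible in the command above. The function name and its parameters are hypothetical, chosen only for illustration. The `-p` option is left out because its value is cut off in the quoted command, and the client-side invocation is not shown in the thread, so neither is reproduced here.

```python
import shlex


def build_ib_write_bw_server_cmd(device, iterations=1000, queue_pairs=16):
    """Build the server-side ib_write_bw argv using only the flags quoted in the thread.

    `device` is a placeholder for the RDMA device name (the thread writes
    `your_card_name`). The truncated `-p` option is intentionally omitted.
    """
    return [
        "ib_write_bw",
        "-d", device,            # RDMA device to test
        "-a",                    # sweep over message sizes
        "--report_gbits",        # report bandwidth in Gbit/s
        "-n", str(iterations),   # iterations per message size
        "-q", str(queue_pairs),  # number of queue pairs
        "--CPU-freq",            # flag copied as-is from the quoted command
    ]


if __name__ == "__main__":
    # Print the command so it can be reviewed before running it by hand.
    print(shlex.join(build_ib_write_bw_server_cmd("your_card_name")))
```

Keeping the command as a list makes it straightforward to pass to `subprocess.run` later without shell quoting issues, and printing it first lets you compare it against the command posted in the thread.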