koanho

Results 6 comments of koanho

Thank you for your kind reply :)

@sphish Hi, I got a similar issue. When testing ```./shmem_put_bw```, got an error below. ``` Runtime options after parsing command line arguments min_size: 4, max_size: 4194304, step_factor: 2, iterations: 10,...

Thank you for reply @sphish. I think nvidia-peermem is correctly installed and loaded. ``` Singularity> modinfo nvidia-peermem filename: /lib/modules/5.14.0-284.11.1.el9_2.x86_64/extra/nvidia-peermem.ko version: 550.54.15 license: Dual BSD/GPL description: NVIDIA GPU memory plug-in author:...

Thank you @sphish. I couldn't modify the driver configuration because I don't have root permissions on my training cluster 😞 It seems the error may have occurred because IBGDA is...

> I've found a "good old version" that works with "IBGDA disabled" machines, which is [a84a248](https://github.com/deepseek-ai/DeepEP/commit/a84a24808fb0ea732f49b874cc456a69dde69076) @vinjn I've tested commit [a84a248](https://github.com/deepseek-ai/DeepEP/commit/a84a24808fb0ea732f49b874cc456a69dde69076), but encountered a runtime error during DeepEP setup. It...