xpu_timer Python package
I am trying to learn and use xpu_timer as described in the following article:
"Troubleshooting is hard? xpu_timer leaves no blind spots in large-model training!" https://mp.weixin.qq.com/s/OYkv4gXh_l_HpHXHqK6Ijw
The article references a Python package, shown in the image below, but I could not find any information about it. Is it open source? If so, how can I install it?
We will release xpu_timer this month.
Any updates on this issue?
@cos120 has it been released yet?
@aqwertaqwert @dafu-wu @issaccv please refer to: https://github.com/intelligent-machine-learning/dlrover/discussions/1350