Boris Fomitchev
Boris Fomitchev
bfloat16 support is sorely missed in cupy. Would really appreciate it getting fixed! We are currently forced to work around it like this (thankfully we have torch.view): ``` x =...
Can you please post exact command lines you have used? I keep getting OOM errors on startup if I preallocate 0.95 .. 0.5. If I do not preallocate, the errors...
@cshanbo: I have recompiled and reinstalled - still same issue. Can in be nccl 2.0 incompatibility ? How exactly did you run the benchmark ?
@cshanbo : What exactly should be added to PATH/LD_LIBRARY_PATH? What happens is this call returns garbage: self._local_id = gpucoll.GpuCommCliqueId(context=self.gpuctx) And then utf8 decode fails on it: response = self.send_req("platoon-get_platoon_info", info={'device':...
NCCL is in standard system path (installed via .deb), tried adding it to LD_LIBRARY_PATH - no effect. GpuCommCliqueId would have failed in it was not found, right? Instead it returns...