Ilya Markov

Results 5 comments of Ilya Markov
trafficstars

@ymjiang, That image is built with an error. There was a conflict between torchvision and pillow. So when I run my script it fails with an error: `ImportError: cannot import...

@bobzhuyb I tried the proposed solutions with moving tensors to gpu on K80 GPUs and V100 gpus. The result is the same: ``` F byteps/common/nccl_manager.cc:37] Check failed: e == cudaSuccess...

I am talking about the first scenario, the one without parameter server. The behaviour in second scenario is a bit different. I checked horovod NCCL training in docker container. It...

`benchmark_byteps.py` and `benchmark_cross_barrier_byteps.py` work fine in both scenarios. Also I found out that in order to launch two consecutive benchmarks in second scenario, I need to restart server and scheduler....

@ymjiang Sorry for the late response. I tried it again - mnist example still crashes in 2 worker 1 server mode. And it outputs meaningless accuracy in any kind of...