Jae-Won Chung

Results 56 comments of Jae-Won Chung

I am very sorry for the delayed response. I have had little free time recently to actively maintain this repository. Similar issues have arisen quite frequently, and a PR is welcome....

It looks like the cause is https://github.com/pypa/setuptools_scm/issues/457. Reproduction steps:
1. `docker run -it --gpus all deepspeed/deepspeed:latest_torch111 bash` - Probably doesn't exactly have to be `latest_torch111`.
2. `git clone --depth=1 https://github.com/microsoft/deepspeech.git`...
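If the failure really matches setuptools_scm#457 (version detection fails on a shallow `--depth=1` clone), a minimal sketch of two possible workarounds would be:

```shell
# Sketch, assuming the error is setuptools_scm failing to detect a version
# from a shallow clone. Run inside the cloned repository.
#
#   git fetch --unshallow --tags   # restore the history/tags setuptools_scm inspects
#
# Or pin a placeholder version so installation proceeds without git metadata:
export SETUPTOOLS_SCM_PRETEND_VERSION=0.0.0
echo "setuptools_scm will report version: $SETUPTOOLS_SCM_PRETEND_VERSION"
```

The `SETUPTOOLS_SCM_PRETEND_VERSION` route is handy in throwaway containers where the reported version number does not matter.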

Nah, I just reverted to an older version of DeepSpeech2 that didn't use PyTorch Lightning and integrated adaptdl there.

> There are two potential approaches to address this issue, although additional options may also exist:
>
> * Making a change at the NVML library's side to reduce the...

Hi @tohtana :) Maybe you meant this, but I think what's happening is `RecvActivation(buffer_id=0)` writing to `self.pipe_buffers['inputs'][0]`, thereby removing the tensors that hold the gradients (vector-Jacobian products) produced by...

EDIT: Wrong

Manually fixing three lines would look like:
```diff
>>> pprint(list(FixBufferTrainSchedule(8, 4, 2)), width=120)
[[-1],
 [-2],
 [0, RecvActivation(buffer_id=0), ForwardPass(buffer_id=0)],
 [-1, SendActivation(buffer_id=0)],
 [1, RecvActivation(buffer_id=1), ForwardPass(buffer_id=1)],
 [0, SendActivation(buffer_id=1), RecvGrad(buffer_id=0), BackwardPass(buffer_id=0)],
-...
```

> I confirmed that `RecvActivation(buffer_id=0)` updated `self.pipe_buffers['inputs'][buffer_id]` once I fixed `num_pipe_buffers()`. The new value does not have `.grad` and is not even the one computed from the desired microbatch. Is...

Oh I see. Putting `RecvGrad` in front of `SendActivation` is not a problem, because `RecvGrad` actually doesn't write to `self.pipe_buffers` but rather `self.grad_layer`, and the output buffer ID is only...
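The buffer separation described above can be sketched in a few lines. This is a hypothetical minimal model (the class and method names are stand-ins, not DeepSpeed's real implementation) showing why ordering `RecvGrad` before `SendActivation` is safe: the received gradient lands in a dedicated `grad_layer` slot, so it cannot clobber the activation waiting in `pipe_buffers`.

```python
# Hypothetical sketch of the buffer layout discussed above.
class PipeEngineSketch:
    def __init__(self, num_buffers: int):
        # Activation buffers, indexed by buffer_id (as in self.pipe_buffers).
        self.pipe_buffers = {"inputs": [None] * num_buffers,
                             "outputs": [None] * num_buffers}
        # Incoming gradients land here, independent of any buffer_id
        # (as in self.grad_layer).
        self.grad_layer = None

    def recv_activation(self, buffer_id, activation):
        self.pipe_buffers["inputs"][buffer_id] = activation

    def send_activation(self, buffer_id):
        return self.pipe_buffers["outputs"][buffer_id]

    def recv_grad(self, grad):
        self.grad_layer = grad  # does NOT touch pipe_buffers


engine = PipeEngineSketch(num_buffers=2)
engine.pipe_buffers["outputs"][1] = "act-1"
engine.recv_grad("grad-0")         # RecvGrad first...
sent = engine.send_activation(1)   # ...SendActivation still sees its activation
assert sent == "act-1" and engine.grad_layer == "grad-0"
```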

The schedule looks good to me. You could try out the alexnet example in DeepSpeedExamples with a fixed random seed and compare the loss value before and after the...

Thank you for running these! My understanding is that all computation inputs and outputs should be bit-level equivalent before and after the fix, and thus for every training step, the...