Results 36 comments of Harry Thomas
trafficstars

only approve the last workflow, please. i had commented some code in master that i shouldnt have when i renamed master to main. Rust 112 passes on branch: https://github.com/infosechoudini/libpnet/actions/runs/2208929740

resolved the conflicts

this can be merged now

I'm facing the same issue with both persistent and non-persistent deployments. It's not loading the model on the GPUs. I've tried deepspeed and zero2 and zero3. ``` model_id = "codellama/CodeLlama-7b-Instruct-hf"...

``` [2023-09-04 12:23:19,159] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect) -------------------------------------------------- DeepSpeed C++/CUDA extension op report -------------------------------------------------- NOTE: Ops not installed will be just-in-time (JIT) compiled at runtime if...

Hey, Deepspeed works fine. I just finished training a model with deepspeed yesterday. I was messing around with it but couldnt find a solution. It just hangs on waiting for...