Mayank Mishra
I only see 4 processes in the yaml ^^ You can always enable CPU offloading.
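(For reference, a minimal sketch of what enabling CPU offloading looks like in a DeepSpeed config; the ZeRO-3 offload keys are standard DeepSpeed config fields, but the stage and batch-size values here are placeholders, not taken from the thread.)

```python
# Minimal sketch of a DeepSpeed config dict with CPU offloading enabled.
# Stage 3 shards parameters; offload_param / offload_optimizer push them to CPU.
ds_config = {
    "train_micro_batch_size_per_gpu": 1,  # placeholder value
    "fp16": {"enabled": True},
    "zero_optimization": {
        "stage": 3,
        "offload_param": {"device": "cpu", "pin_memory": True},
        "offload_optimizer": {"device": "cpu", "pin_memory": True},
    },
}
```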
I think this issue needs revisiting @tjruwase. This is very much needed for a lot of transformer models.
@TingchenFu The size mismatch looks a bit weird to me. I have not seen this before. The following is how I load it, it's a bit unclean but it works...
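(The snippet itself is truncated above; a minimal sketch along the same lines, assuming a BLOOM checkpoint loaded through transformers and wrapped with `deepspeed.init_inference` — the model name and settings are illustrative, not the exact code from the thread.)

```python
import torch
import deepspeed
from transformers import AutoModelForCausalLM, AutoTokenizer

# Illustrative checkpoint; swap in the model you are actually loading.
model_name = "bigscience/bloom-1b3"

tokenizer = AutoTokenizer.from_pretrained(model_name)
# Load weights in fp16 first, then hand the module to DeepSpeed.
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.float16)

# Wrap for DS-inference; kernel injection swaps in DeepSpeed's fused kernels.
model = deepspeed.init_inference(
    model,
    mp_size=1,                       # tensor-parallel degree (placeholder)
    dtype=torch.float16,
    replace_with_kernel_inject=True,
)
```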
This is weird. I'll look into this one.
`max_split_size_mb` won't work with DeepSpeed inference, I think. It only applies to pure native PyTorch code.
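(For context, in plain PyTorch this knob is set through the caching allocator's env var; a minimal sketch, the 128 value is just an example.)

```python
import os

# PYTORCH_CUDA_ALLOC_CONF must be set before CUDA is first initialized;
# max_split_size_mb caps the block size the caching allocator will split,
# which can reduce fragmentation-driven OOMs.
os.environ["PYTORCH_CUDA_ALLOC_CONF"] = "max_split_size_mb:128"  # example value

import torch  # imported after setting the env var so the allocator picks it up
```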
@younesbelkada related issue that we had closed before: https://github.com/huggingface/transformers/issues/18809
I don't think that's the case. I will try to run this on my end :)
Hey, no specific reason. It's mostly to dig into the code and the optimizations done by the DeepSpeed team. Is it not openly available?
`init_inference` is fine, it's in `forward` @mrwyattii
@RezaYazdaniAminabadi @mrwyattii @jeffra https://github.com/bigcode-project/bigcode-inference-benchmark

You can run

```shell
sh scripts/run_batch_size.sh ds-inference-1b-bloom-fp16
```

This will run BLOOM 1.3B (randomly initialized) using DS-inference in fp16 in batch sizes 1 to 128 (doubled...