Mayank Mishra
Hi @philippmtk, unfortunately they don't lead to identical outputs. Resharding checkpoints changes the order of operations, and this is a problem with floating-point arithmetic: fp operations are not...
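To make the point concrete, here is a minimal demonstration of why reordering operations changes floating-point results (plain Python, unrelated to any specific checkpoint code):

```python
# Floating-point addition is not associative: summing the same values in a
# different order (as resharding effectively does) can change the result.
a, b, c = 0.1, 0.2, 0.3

left = (a + b) + c   # 0.6000000000000001
right = a + (b + c)  # 0.6

print(left == right)  # False
```

In a large model the per-element differences are tiny, but they can compound across layers and flip sampled tokens, so bit-identical outputs after resharding are not expected.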
Seems like there is a check in place that prevents the new weights from working with MII.
Any updates on this? @jeffra @RezaYazdaniAminabadi
https://github.com/huggingface/transformers-bloom-inference/blob/abe365066fec6e03ce0ea2cc8136f2da1254e2ea/bloom-inference-server/ds_inference/grpc_server.py#L33 @cderinbogaz I hacked my way around it for now: I pass the downloaded model path and the checkpoint dict for the model I need to use, along with model="bigscience/bloom". I...
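Roughly, the workaround amounts to building a checkpoints dict that points at locally downloaded shards while still passing model="bigscience/bloom" so the config/tokenizer resolve. A minimal sketch (the local path and exact dict keys here are illustrative assumptions; check the linked grpc_server.py for the real ones):

```python
import glob
import json

# Assumed local directory containing the downloaded BLOOM shards.
local_path = "/data/models/bloom"

# Checkpoint dict pointing DS-inference at the local shard files.
# Key names ("type", "checkpoints", "version") are the commonly used
# DS-inference layout, but treat them as an assumption here.
checkpoints_json = {
    "type": "BLOOM",
    "checkpoints": sorted(glob.glob(f"{local_path}/*.pt")),
    "version": 1.0,
}

with open("checkpoints.json", "w") as f:
    json.dump(checkpoints_json, f)
```

The server is then given this file for weights while the model name stays "bigscience/bloom".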
@mrwyattii I believe your commit yesterday has fixed this? Let me know. I am closely watching this repo :)
Hi @TahaBinhuraib, I think MII doesn't support int8 models. Can you try vanilla DS-inference? https://github.com/huggingface/transformers-bloom-inference/tree/main/bloom-inference-server You can try running it via the CLI or deploy a generation server as given in the...
Thanks @mrwyattii
https://github.com/microsoft/DeepSpeed/issues/2382 is related to this
Running the new PR with queries = ["cat " * 2000] * 4:
- max_new_tokens = 10 → generated_tokens = [10, 10, 10, 10]
- max_new_tokens = 100 → generated_tokens = [100, 100, 100, 99]
- max_new_tokens = 300 → [299, 300, 299, ...
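For reference, the generated_tokens counts above can be derived as output length minus prompt length per sequence. A hypothetical helper (not from the repo) illustrating the computation:

```python
def count_generated(input_ids, output_ids):
    """Number of newly generated tokens per sequence:
    output length minus prompt length."""
    return [len(out) - len(inp) for inp, out in zip(input_ids, output_ids)]

# Example: 4 prompts of length 5, outputs of length 15, 15, 15, 14
# (the last sequence stopped one token early, e.g. on EOS).
print(count_generated([[0] * 5] * 4,
                      [[0] * 15, [0] * 15, [0] * 15, [0] * 14]))  # [10, 10, 10, 9]
```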
Hi, any update on the above @RezaYazdaniAminabadi ^^? Were you able to find the error?