Mayank Mishra

Results 187 comments of Mayank Mishra

Hmm, @sgugger can you tell me which library you guys are using for the inference API, if not Flask?

Thanks, I can confirm that this issue is not occurring with Starlette or FastAPI (which is built on top of Starlette). Not sure why it happens with Flask. Closing this ❤️

@muellerzr @sgugger Never mind, this is still happening even with this minimal working example: As you can see, I am not even storing any variable, only the model and tokenizer. This...

If I call torch.cuda.empty_cache() after this, then this happens:

Could it be that this is expected behaviour? @ydshieh I am seeing a memory blowup with gpt2 as well, after replacing bigscience/bloom with gpt2. I am not sure if this is...

With pdb, I am seeing a blowup too. But my guess is that this is not the right way to measure memory, since I see something similar with GPT2 as well....

> You may also be able to get a bit more by doing garbage collection as well, after deleting the model in python > > E.g.: > > ```python >...
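The quoted suggestion (delete the model, then garbage-collect) can be sketched with a plain Python object standing in for the model, since the truncated example above isn't visible. `FakeModel` is a hypothetical stand-in; with a real transformers model on GPU you would also call `torch.cuda.empty_cache()` afterwards to return cached CUDA blocks to the driver.

```python
import gc


class FakeModel:
    """Hypothetical stand-in for a large model object."""

    def __init__(self):
        # Simulate a large in-memory allocation.
        self.weights = [0.0] * 1_000_000


model = FakeModel()

# Drop the last reference, then force a collection pass so any
# reference cycles still holding the object are broken.
del model
unreachable = gc.collect()  # number of unreachable objects collected

# With a CUDA model you would additionally run (not executable here):
#   torch.cuda.empty_cache()
```

Note that `del` alone frees an acyclic object immediately; `gc.collect()` mainly helps when the object is kept alive by reference cycles.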

Yes, thanks, I think I'll try to watch the memory usage over time by running it in a for loop or something. To see how this changes memory (both in the server...
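That loop idea can be sketched with stdlib tooling; this uses `tracemalloc` as a stand-in for the GPU counters (a real check on the server would read `torch.cuda.memory_allocated()` per iteration instead), and `allocate` is a hypothetical placeholder for one inference request.

```python
import tracemalloc


def allocate(n):
    # Hypothetical stand-in for the work done by one inference request.
    return [0] * n


tracemalloc.start()
samples = []
for step in range(5):
    buf = allocate(100_000)
    current, peak = tracemalloc.get_traced_memory()
    samples.append(current)  # memory currently traced at this step
    del buf  # release the per-iteration allocation
tracemalloc.stop()

# If nothing is leaking, the per-iteration readings stay roughly flat
# instead of growing with each pass through the loop.
```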

This is not an issue anymore. Thanks for helping, guys. Closing this :)