Marut Pandya
@avacaondata Sorry for the inconvenience. I can understand the frustration. We changed the flow a bit to make it easier to update the vLLM version, but there's a model caching...
@ashwinb @yanxi0830 @hardikjshah When can I expect a review? Thanks.
@ashwinb Sure. Thanks for the update. Looking forward to getting this merged soon.
@ashwinb Let me update the implementation. Thanks.
@ashwinb Closed the stale PR and created a new one. Hoping it gets reviewed soon.
@ashwinb What does it take to get this PR reviewed? Stale PRs end up with merge conflicts, which then becomes yet another reason not to review/merge....
@raghotham Really appreciate your reviews. Totally understand the priorities and timeline. Hoping to get this merged soon.
@michaelinva Do you mean in the OpenAI completion API? We set it via an ENV variable because it can be set during engine initialisation.
Makes sense. Let me work on that. Thanks for the feedback. @michaelinva
@casper-hansen Do you encounter this issue when using the OpenAI-compatible API?