Peter Devine

18 comments by Peter Devine

But if I want to make a serverless endpoint (as described at https://aws.amazon.com/about-aws/whats-new/2021/12/amazon-sagemaker-serverless-inference/), then I cannot supply an instance type, as this option explicitly has no defined instance. In...

Bump. I'd love this feature so that I can run vLLM with my quantized model. Thanks.

@fmmoret Any way I can help? This would be an awesome feature so I'd hate to see its inclusion falter.

Bump. It would be really handy to be able to evaluate continuously on a specified dataset different from the training dataset so that we could control early stopping etc. based...

PS, being able to change the ratio (or number of examples) of the test set [when it is automatically generated](https://github.com/artidoro/qlora/blob/7f4e95a68dc076bea9b3a413d2b512eca6d004e5/qlora.py#L547) (i.e., making it so that instead of a setting test_size...

I would like to second this request. As I understand it, this is simply free increased efficiency for training with no degradation in accuracy, right? I think this would be...

Legend. Superman has posters of you on his wall. Thanks so much for all of your work!