Arithmo
Arithmo copied to clipboard
Small and Efficient Mathematical Reasoning LLMs
Hi, when I am trying to use your model for inference on my data, I get 'CUDA is out of memory' error. when i try to quantize the model using...
would be good to release the training process as well :)
Hi, Could you please add a new baseline——MuggleMATH in the Comparing Arithmo-Mistral-7B with other LLM models section on the github webpage?MuggleMATH mainly investigates the scaling law and generalization for data...
Hello, I truly admire your work on fine-tuning LLMs for mathematical reasoning and I have a few questions about the data preprocessing. I would appreciate some insights into the following...
I appreciate your dataset for math reasoning, But can you provide me more details for how you construct your test data (11k size in listed in the huggingface)? https://huggingface.co/datasets/akjindal53244/Arithmo-Data