Arithmo

Small and Efficient Mathematical Reasoning LLMs

Results: 5 Arithmo issues

Hi, when I try to use your model for inference on my data, I get a 'CUDA is out of memory' error. When I try to quantize the model using...

It would be good to release the training process as well :)

Hi, could you please add a new baseline, MuggleMATH, to the "Comparing Arithmo-Mistral-7B with other LLM models" section on the GitHub page? MuggleMATH mainly investigates the scaling law and generalization for data...

Hello, I truly admire your work on fine-tuning LLMs for mathematical reasoning and I have a few questions about the data preprocessing. I would appreciate some insights into the following...

I appreciate your dataset for math reasoning, but could you provide more details on how you constructed your test data (listed as 11k examples on Hugging Face)? https://huggingface.co/datasets/akjindal53244/Arithmo-Data