Arithmo issues

Results 5 Arithmo issues

Sort by recently updated

CUDA is out of memory

Hi, when I am trying to use your model for inference on my data, I get 'CUDA is out of memory' error. when i try to quantize the model using...

poojitharamachandra

training script?

would be good to release the training process as well :)

bugface

Ask for adding a new baseline——MuggleMATH in the Comparing Arithmo-Mistral-7B with other LLM models section

Hi, Could you please add a new baseline——MuggleMATH in the Comparing Arithmo-Mistral-7B with other LLM models section on the github webpage?MuggleMATH mainly investigates the scaling law and generalization for data...

ChengpengLi1003

Inquiry on Data Deduplication, Random Lower-Casing, and PoT Prompts Diversity

Hello, I truly admire your work on fine-tuning LLMs for mathematical reasoning and I have a few questions about the data preprocessing. I would appreciate some insights into the following...

lyf-00

how do you construct your test set (11k) ?

I appreciate your dataset for math reasoning, But can you provide me more details for how you construct your test data (11k size in listed in the huggingface)? https://huggingface.co/datasets/akjindal53244/Arithmo-Data

JianqiaoLu

Arithmo
Arithmo copied to clipboard

Metadata

CUDA is out of memory

training script?

Ask for adding a new baseline——MuggleMATH in the Comparing Arithmo-Mistral-7B with other LLM models section

Inquiry on Data Deduplication, Random Lower-Casing, and PoT Prompts Diversity

how do you construct your test set (11k) ?

← Metadata

Owner

Metadata

Arithmo Arithmo copied to clipboard

Metadata

CUDA is out of memory

training script?

Ask for adding a new baseline——MuggleMATH in the Comparing Arithmo-Mistral-7B with other LLM models section

Inquiry on Data Deduplication, Random Lower-Casing, and PoT Prompts Diversity

how do you construct your test set (11k) ?

← Metadata

Owner

Metadata

Arithmo
Arithmo copied to clipboard