BUG: NotImplementedError

Open oYoungCo opened this issue 1 year ago • 3 comments

When I run the script from Training.embedding_model, I get the following error. How can I fix it?

File /gritlm/training/run.py", line 166, in main
    else: raise NotImplementedError
NotImplementedError

oYoungCo avatar Feb 29 '24 07:02 oYoungCo

if you check line 166 you see that it's cuz your training data is not in the right format, what data are you using & whats ur script?

Muennighoff avatar Feb 29 '24 07:02 Muennighoff

> if you check line 166 you see that it's cuz your training data is not in the right format, what data are you using & whats ur script?

I ran the script below, and the data is the toy_data provided in the code.

torchrun --nproc_per_node 1 \
    -m training.run \
    --output_dir test_path \
    --model_name_or_path openaccess-ai-collective/tiny-mistral \
    --train_data training/toy_data \
    --learning_rate 1e-5 \
    --num_train_epochs 5 \
    --per_device_train_batch_size 2 \
    --dataloader_drop_last True \
    --normalized True \
    --temperature 0.02 \
    --query_max_len 32 \
    --passage_max_len 128 \
    --train_group_size 2 \
    --negatives_cross_device \
    --mode unified

oYoungCo avatar Feb 29 '24 09:02 oYoungCo

It should have been fixed now (by just removing the raise)! Thanks for raising this!

Muennighoff avatar Feb 29 '24 13:02 Muennighoff
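
For anyone hitting a similar error, the failure mode described in this thread (a loader that raises NotImplementedError when a data file does not match an expected format) can be sketched roughly as below. This is not the code from training/run.py. The field names ("query", "pos", "text"), the one-JSON-object-per-line layout, and both helper functions are assumptions made for illustration only. It can be handy for checking what a data directory contains before launching a run.

```python
import json
from pathlib import Path


def classify_record(record: dict) -> str:
    """Guess which kind of training sample a record is.

    The field names are hypothetical and chosen for illustration;
    they are not confirmed from the repository.
    """
    if "query" in record and "pos" in record:
        return "embedding"
    if "text" in record:
        return "generative"
    # Same failure mode as in the issue: unknown format, no handler.
    raise NotImplementedError(f"Unrecognized record fields: {sorted(record)}")


def inspect_data_dir(data_dir: str) -> dict:
    """Map each .jsonl file name in data_dir to the kind of its first record."""
    kinds = {}
    for path in sorted(Path(data_dir).glob("*.jsonl")):
        with path.open(encoding="utf-8") as f:
            first_line = f.readline()
        if not first_line.strip():
            kinds[path.name] = "empty"  # nothing to classify
            continue
        kinds[path.name] = classify_record(json.loads(first_line))
    return kinds
```

Calling inspect_data_dir("training/toy_data") would then either return a mapping of file names to sample kinds or raise NotImplementedError for the first file whose records match neither assumed layout.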