sandeep-krutrim

Results: 9 issues opened by sandeep-krutrim

I am trying to add new tokens to the tokenizer and want to train them during fine-tuning. For that, the model embedding size has to be increased. For standard BERT architectures...
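For reference, a minimal sketch of the usual Hugging Face pattern for this, assuming a standard BERT checkpoint (the model name and tokens below are placeholders, not the actual ones from the issue):

```python
from transformers import AutoModelForMaskedLM, AutoTokenizer

# Placeholder checkpoint; substitute the actual model being fine-tuned.
tokenizer = AutoTokenizer.from_pretrained("bert-base-multilingual-cased")
model = AutoModelForMaskedLM.from_pretrained("bert-base-multilingual-cased")

# Register the new tokens; add_tokens returns how many were actually new.
num_added = tokenizer.add_tokens(["<new_tok_1>", "<new_tok_2>"])

# Grow the embedding matrix so the new token ids get trainable rows.
if num_added > 0:
    model.resize_token_embeddings(len(tokenizer))
```

The new embedding rows are randomly initialized, so they are learned during fine-tuning along with the rest of the (unfrozen) weights.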

Can you share details on fine-tuning for IndicGLUE tasks, such as hyperparameters and whether you used full fine-tuning or froze the BERT layers? This would be helpful for a fair comparison.

I am trying to reproduce the retrieval task results on the FLORES dataset from IndicXTREME. I am using the fine-tuning/retrieval/retrieval.py code. However, the accuracy returned by the script is nowhere close...

Why are the numbers for XLMR so weird in this table -

I am trying to speed up inference using a quantized version of the llm2vec models. I have trained a Gemma-2B model on custom data. This is my inference code: `import...`
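One common route is loading the backbone in 4-bit with bitsandbytes. A sketch of just the quantized load, assuming a hypothetical local path to the fine-tuned checkpoint; wiring the quantized model into llm2vec's encode path is not shown here:

```python
import torch
from transformers import AutoModel, AutoTokenizer, BitsAndBytesConfig

# 4-bit NF4 quantization via bitsandbytes.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

# Hypothetical local path to the fine-tuned Gemma-2B checkpoint.
model = AutoModel.from_pretrained(
    "path/to/gemma-2b-finetuned",
    quantization_config=bnb_config,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("path/to/gemma-2b-finetuned")
```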

I have trained a model using supervised contrastive learning. I saved the model using `l2v.save('/llm2vec_models/final_merged_model', merge_before_save=True, save_config=True)`. Now when I try to run mteb_eval.py using `!python experiments/mteb_eval.py --model_name model_name...`
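A sketch of loading the merged checkpoint back for a sanity check, under the assumption that `merge_before_save=True` folded the LoRA weights into the base model so no separate PEFT path is needed (worth verifying against the saved config):

```python
import torch
from llm2vec import LLM2Vec

# Merged checkpoint from l2v.save(...); assumed to load without a
# peft_model_name_or_path since the adapters were merged before saving.
l2v = LLM2Vec.from_pretrained(
    "/llm2vec_models/final_merged_model",
    device_map="cuda" if torch.cuda.is_available() else "cpu",
    torch_dtype=torch.bfloat16,
)
embeddings = l2v.encode(["a quick sanity-check sentence"])
```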

Please provide a script for fine-tuning the supervised contrastive models on sentence classification and NLI tasks. Is LoRA fine-tuning required, or can AutoModelForSequenceClassification be used for training?
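The two options can also be combined: wrap an `AutoModelForSequenceClassification` head with PEFT's LoRA adapters. A sketch with placeholder checkpoint path and illustrative hyperparameters, not the repo's own recipe:

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForSequenceClassification

# Placeholder path; substitute the supervised contrastive checkpoint.
model = AutoModelForSequenceClassification.from_pretrained(
    "path/to/supervised-contrastive-model",
    num_labels=3,  # e.g. NLI: entailment / neutral / contradiction
)

# Attach LoRA adapters instead of full fine-tuning (illustrative values).
lora_config = LoraConfig(
    task_type="SEQ_CLS",
    r=8,
    lora_alpha=16,
    lora_dropout=0.05,
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```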

Hi, can this be fine-tuned with LoRA without any additional script? Also, during fine-tuning, if we take a sequence length of 512 or 1k, will it affect inference for higher...

Hi, I am trying to train Llama 3.2 models using LLM2Vec. I am getting the following error: ``ValueError: `rope_scaling` must be a dictionary with two fields, `type` and `factor`,...``
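This validation error typically comes from a transformers version that predates the extended `rope_scaling` schema shipped with Llama 3.1/3.2 configs, so upgrading transformers is usually the cleanest fix. If upgrading is not possible, a commonly shared stopgap is flattening the config on disk before loading; a sketch, where the path and fallback factor are assumptions and the flattened scaling may change long-context behavior:

```python
import json
from pathlib import Path

# Placeholder path to a locally downloaded Llama 3.2 checkpoint.
cfg_path = Path("path/to/llama-3.2-checkpoint/config.json")
cfg = json.loads(cfg_path.read_text())

# Older transformers accept only {"type", "factor"}; flatten the richer
# Llama 3.2 rope_scaling dict (rope_type, low_freq_factor, ...) as a stopgap.
rs = cfg.get("rope_scaling") or {}
cfg["rope_scaling"] = {"type": "linear", "factor": rs.get("factor", 8.0)}
cfg_path.write_text(json.dumps(cfg, indent=2))
```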