Vaibhav Adlakha
> In my experiments, the results are _attn_implementation dependent...

I am not sure what you mean by this. Can you elaborate? We do not evaluate the MNTP task separately as it...
Feel free to re-open if you have any more questions about this issue.
Yes, definitely. Our [scripts](https://github.com/McGill-NLP/llm2vec/tree/main/experiments) provide examples of how to use LoRA fine-tuning for masked next token prediction (MNTP) and supervised contrastive learning. You can similarly use LoRA fine-tuning for...
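For reference, a minimal sketch of wrapping a decoder with a LoRA adapter using `peft` looks roughly like this. The base checkpoint, rank, and target modules below are illustrative choices, not the exact values from our training configs; see the linked scripts for those.

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

# Illustrative base model; any decoder-only checkpoint you have access to works.
model = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-v0.1")

# Example LoRA hyperparameters; tune these for your own setup.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    bias="none",
    task_type="CAUSAL_LM",
)

model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapter weights are trainable
```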
It is generally recommended to keep the instructions in a style similar to those used in training. You can check Table 10 in our [paper](https://arxiv.org/pdf/2404.05961) to see the instructions we used...
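As a sketch, instruction-prefixed encoding with the released models looks like this. Queries are passed as `[instruction, text]` pairs; the model names and instruction string below follow the repository README, so check there for the exact loading arguments.

```python
from llm2vec import LLM2Vec

# Base MNTP model plus the supervised contrastive adapter on top.
l2v = LLM2Vec.from_pretrained(
    "McGill-NLP/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp",
    peft_model_name_or_path="McGill-NLP/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp-supervised",
)

# Keep the instruction in the same style as the training instructions (Table 10).
instruction = "Given a web search query, retrieve relevant passages that answer the query:"
queries = [
    [instruction, "how much protein should a female eat"],
]
q_reps = l2v.encode(queries)  # one embedding per [instruction, text] pair
```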
Feel free to re-open if you have any more questions regarding this issue.
Which version of llm2vec are you using? The latest version of llm2vec is compatible with `transformers` 4.40.1. Can you share the exact command that resulted in the error?
Closing as it is stale. Feel free to re-open if you have any more questions regarding this issue.
I changed the code slightly on my end as I did not have access to `data2vec`.

```python
import time
import matplotlib.pyplot as plt
import random
import numpy as np
import ...
```
For CPU, I got this plot:
Can you try encoding with different batch sizes using any of the [sentence-transformers models](https://huggingface.co/models?library=sentence-transformers&sort=trending)? Similar to LLM2Vec, you call the `encode` function with a list of sentences. This will help us determine...
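Something along these lines should be enough. This is a minimal sketch; the checkpoint, sentence count, and batch sizes are arbitrary choices for illustration.

```python
import time

from sentence_transformers import SentenceTransformer

# Any sentence-transformers checkpoint works; this one is just a common choice.
model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2")
sentences = ["This is a test sentence."] * 512

# Time encode() across batch sizes, mirroring the LLM2Vec benchmark above.
for batch_size in [8, 16, 32, 64, 128]:
    start = time.time()
    model.encode(sentences, batch_size=batch_size)
    print(f"batch_size={batch_size}: {time.time() - start:.2f}s")
```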