Vaibhav Adlakha
@a-green-hand-jack Can you try to run [Instructor XL](https://huggingface.co/hkunlp/instructor-xl)? It is a sentence-transformers model similar in size to Sheared Llama, so it will be a more suitable comparison.
Closing as it is stale. @a-green-hand-jack - Feel free to re-open if you still need help on this.
Llama-3 has been added to the [model list](https://github.com/McGill-NLP/llm2vec?tab=readme-ov-file#model-list)
Fixed, thanks!
We did. [Here](https://x.com/vaibhav_adlakha/status/1785406274273751315?s=46&t=U0ISFfo1VSM4DDGABmLQjg) is a Twitter/X thread summarizing our findings. All MTEB scores are also present in the Hugging Face repos of these models. The results should be updated on the...
> @vaibhavad I have prepared an llm2vec version for Qwen2 but there are some problems, I wonder if you can give me the process to prepare code for a new...
@Iambestfeed It looks like your implementation is similar to Llama's; however, looking at [modeling_qwen2.py](https://github.com/huggingface/transformers/blob/v4.40.1/src/transformers/models/qwen2/modeling_qwen2.py#L912) in the transformers library, it seems the implementation is closer to Mistral's. Different models differ on how...
Closing as it is stale. Feel free to re-open if there are any additional questions related to this issue.
Yes, it is definitely possible; all that is required is to change the model name and the batch size in the [config](https://github.com/McGill-NLP/llm2vec/blob/main/train_configs/mntp/Llama2.json). For example, for Llama 13B, you should...
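As a rough sketch, the change might look like the fragment below. The field names are assumed to follow the linked Llama2 MNTP config (which uses Hugging Face Trainer-style arguments), and the specific checkpoint name and batch-size value here are illustrative assumptions, not tested settings:

```json
{
  "model_name_or_path": "meta-llama/Llama-2-13b-hf",
  "per_device_train_batch_size": 16
}
```

All other entries in the config can typically stay as they are; the batch size usually needs to be reduced for the larger model to fit in GPU memory.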
Feel free to re-open if you have any more questions about this issue.