dtamayo
dtamayo
The current implementation only allows to set `intermediate_size` for Llama models, but I would like to be capable to change the `intermediate_size` in GPT-NeoX models. I have tested this implementation...
Hi, Thanks for your excellent work in developing this forward. I am interested in adding the task explained in [Yarn](https://arxiv.org/pdf/2309.00071) related to computing the perplexity for long contexts. However, I...
**Your question** When we want to make a training in LLMs with a lot of corpora, I understand that the usual approach is to introduce the documents with the following...
### Confirm that this is a metadata correction - [X] I want to file corrections to make the metadata match the PDF file hosted on the ACL Anthology. ### Anthology...