Eljan Mahammadli

Results 5 comments of Eljan Mahammadli

@rasbt can you please also guide on the #1095 as well. Essentially, it is similar problem but your approach would not work as I have different config such as `n_layer`,...

@awaelchli I am using the "Pretrain LLMs - TinyLlama 1.1B" template from the studio. Below is the code with minimal changes. I have changed the column name. And my data...

I am in the "Pretrain LLMs - TinyLlama 1.1B" on Lightning AI Studios. First we get custom tokenizer: ```bash python lit-gpt/scripts/download.py \ --repo_id eljanmahammadli/simhash_dedup_tokenizer_1_5M \ --access_token HF_TOKEN_HERE \ --tokenizer_only true...

I don't see any option to specify the username for sharing. Could you please elaborate?

Hey @tchaton. On this quote "would fail at some point when reaching the outlier samples". Until you spot any bugs, am I good to go ahead and train the model...