Teknium comments

Results 81 comments of


                                            Teknium

Inquiry about the maximum number of tokens that Llama can handle

> @teknium1 how would one do so? I am a bit of a newbie here, is the process easy? Else I may find help elsewhere. That's why I asked because...

Inquiry about the maximum number of tokens that Llama can handle

I'm hearing you can just raise the max sequence length and fine tune it on longer prompts

Inquiry about the maximum number of tokens that Llama can handle

> Is there any parameter that needs to be optimized for the maximum length? It should just be that the training data has not seen a longer one, so the...

Windows 64-bit, Microsoft Visual Studio - it works like a charm after those fixes!

Any chance we could publish binaries for windows?

Fix few issues with the dataset

Would the dataset benefit from multiple prompt:response chains rather than just single prompt>response? i.e. Question:Answer:FollowupQ:FollowupA

Fix few issues with the dataset

for prompts it seems a good idea to keep typos

Solve BUG:AttributeError: module transformers has no attribute LLaMATokenizer

I see the same but fixing the capitalization didnt fix for me ![image](https://user-images.githubusercontent.com/127238744/225819984-f3a3f682-3370-4aba-b224-a383f67211f8.png)

Solve BUG:AttributeError: module transformers has no attribute LLaMATokenizer

Am using transformers 4.27.1, is it a different version?

Solve BUG:AttributeError: module transformers has no attribute LLaMATokenizer

Yeah you have to install from Transformers github. I had thought since it was merged it was in an updated pip package but its not yet. `pip install git+https://github.com/huggingface/transformers.git` works...

Bad dataset

> Maybe tangentially related, but @tloen curious why you might want to leave typos in the dataset (per [#32 (comment)](https://github.com/tloen/alpaca-lora/pull/32#issuecomment-1474454667)) Not my place to respond, but I would say leaving...