juanps90

Results: 12 comments by juanps90

> FYI: I just submitted this pull request to integrate llama.cpp into langchain: #2242

Thank you very much!! Do you think it would be possible to run LLaMA on GPU...

I'm interested in finetuning as well. Does anyone have any recommendations for this?

Are you able to train 7B using dual RTX 3090s? Do you think you could set up a notebook on Colab? Thank you!!

This appears to be related to CodeLlama 34B specifically, as the 13B variant works with LoRA and about 13K context (haven't tried more).

I am using Neko-Institute-of-Science_LLaMA-30B-4bit-128g with no context-scaling training at all. As I understand it, NTK RoPE scaling does not require any finetuning, unlike SuperHOT. Am I setting the...

> I think you need to call `config.calculate_rotary_embedding_base()` with the current way RoPE NTK scaling is implemented for the settings to properly take effect. Make sure `config.alpha_value` is already set...
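
That ordering matters: set `config.alpha_value` first, then call `config.calculate_rotary_embedding_base()`, and only then build the model. A minimal sketch of what that might look like with exllama, assuming the repo's `ExLlama`/`ExLlamaCache` classes and module layout (paths and values are placeholders):

```
# Sketch only: names besides alpha_value / calculate_rotary_embedding_base()
# are assumed from the exllama repo layout.
from model import ExLlama, ExLlamaCache, ExLlamaConfig

model_config_path = "/path/to/config.json"      # placeholder
model_path = "/path/to/model.safetensors"       # placeholder

config = ExLlamaConfig(model_config_path)
config.model_path = model_path
config.max_seq_len = 8192                  # extended context target
config.alpha_value = 2.0                   # NTK RoPE scaling factor (illustrative)
config.calculate_rotary_embedding_base()   # recompute RoPE base AFTER setting alpha_value

model = ExLlama(config)                    # model is built with the scaled base
cache = ExLlamaCache(model)
```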

I'm having a weird issue where the model skips or adds digits in numbers. For example, if there's a phone number in the prompt, the generated text may add another...

> I've seen that effect while running a linear-scaled LoRA (SuperHOT or Airoboros 8k or 16k) with the wrong compress_pos_emb value. If it's set to anything other than what it...
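
So for a linear-scaled LoRA, `compress_pos_emb` has to equal the scaling factor the LoRA was trained with (for an 8k SuperHOT-style LoRA on a 2048-token base, that's 8192 / 2048 = 4). A hedged sketch of the matching config, reusing the same assumed `ExLlamaConfig` attributes with illustrative values:

```
# Sketch: compress_pos_emb must match the LoRA's training-time scale factor.
from model import ExLlamaConfig

config = ExLlamaConfig("/path/to/config.json")    # placeholder path
config.model_path = "/path/to/model.safetensors"  # placeholder path
config.max_seq_len = 8192        # context the 8k LoRA was trained for
config.compress_pos_emb = 4.0    # 8192 / 2048; a mismatched value garbles output
config.alpha_value = 1.0         # leave NTK scaling off when linear scaling is used
```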

Well, LLaMA v2 13B GPTQ from The-Bloke goes NUTS after I do:

```
config = ExLlamaConfig(model_config_path)  # create config from config.json
config.model_path = model_path             # supply path to model weights...
```
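
For reference, the rest of the load-and-generate path typically continues from that config. A rough sketch, assuming exllama's `ExLlamaTokenizer`, `ExLlamaCache` and `ExLlamaGenerator` classes and the `generate_simple()` helper (all paths and settings are placeholders):

```
# Rough end-to-end sketch; class and module names assumed from the exllama repo.
from model import ExLlama, ExLlamaCache, ExLlamaConfig
from tokenizer import ExLlamaTokenizer
from generator import ExLlamaGenerator

model_config_path = "/path/to/config.json"       # placeholder paths
model_path = "/path/to/model.safetensors"
tokenizer_path = "/path/to/tokenizer.model"

config = ExLlamaConfig(model_config_path)   # create config from config.json
config.model_path = model_path              # supply path to model weights

model = ExLlama(config)                     # load the quantized weights
tokenizer = ExLlamaTokenizer(tokenizer_path)
cache = ExLlamaCache(model)                 # KV cache sized from config.max_seq_len
generator = ExLlamaGenerator(model, tokenizer, cache)

print(generator.generate_simple("Once upon a time,", max_new_tokens=64))
```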