Awni Hannun
That is very odd. The tokenizer copying is very simple in MLX LM: we basically load it with Hugging Face and then save it with Hugging Face. There is no MLX...
@fblissjr you can reproduce the behavior with:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("CohereForAI/c4ai-command-r-plus")
tokenizer.save_pretrained(".")
```

I feel that should not break the tokenizer.. so it might be worth...
Curious.. what machine are you on? What OS?
The command runs fine for me with our default dataset:

```
python -m mlx_lm.lora \
    --model google/gemma-2b-it \
    --train \
    --data ../lora/data \
    --iters 600 --adapter-path .
```
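For context, the default data is just JSONL files (this assumes you're running from inside an mlx-examples checkout, as in the command above):

```
# Each split is a JSONL file with one {"text": "..."} example per line.
ls ../lora/data
# train.jsonl  valid.jsonl  test.jsonl
```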
Hmm, that's annoying. It's definitely not from MLX; we don't use OpenMP. If I had to guess, it's probably from `numba`.
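One quick way to see whether `numba` is even in your environment (just a guess at the culprit, as above):

```
pip show numba
```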
I'm not sure.. I never saw that error before. It seems related to too much resource use (e.g. OOM). Does it run if you use a smaller batch size? `--batch-size=1`?
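As a sketch, that's the LoRA command from earlier with the batch size flag added (model and data paths here are placeholders for whatever you're actually running):

```
python -m mlx_lm.lora \
    --model google/gemma-2b-it \
    --train \
    --data ../lora/data \
    --batch-size 1
```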
I'm running the command you shared.
@danny-su what version of MLX / MLX LM are you using? `python -c "import mlx.core as mx; print(mx.__version__)"` If it's not the latest, please update and try again. So far...
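If it helps, the checks and the upgrade look roughly like this (assuming a pip-managed install):

```
python -c "import mlx.core as mx; print(mx.__version__)"
pip show mlx-lm
pip install -U mlx mlx-lm
```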
> Can I use the same size data to fine-tune Mistral-7B-Instruct-v0.2, quantized or fp16?
Wow, that is a really long sequence length: `102400`. I can't imagine you have enough memory on your machine for a sequence that long. Just the attention scores...
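As a rough back-of-the-envelope (assuming fp16 scores and ignoring everything else the model needs):

```
# 102400 x 102400 attention scores at 2 bytes each, for a single head in a single layer
python -c "print(102400 * 102400 * 2 / 1024**3, 'GiB')"
# -> ~19.5 GiB per head per layer, before weights, activations, or the KV cache
```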