Nicolas Patry

978 comments by Nicolas Patry

@Wumpf Maybe? After letting this mature a bit and implementing the `Matrix` in https://github.com/huggingface/candle/ I feel like this PR should be trimmed down to only add the `MPSMatrix` support...

@altryne @ArthurZucker While deep diving into whisper, I've noticed `openai/whisper` uses timestamps ALL the time, while `transformers` doesn't (you have to ask for timestamps for us to use them)....
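
For reference, a minimal sketch of opting into timestamps on the `transformers` side (the checkpoint name and audio file here are just placeholders):

```python
from transformers import pipeline

# Any Whisper checkpoint works here; "openai/whisper-small" is just an example.
asr = pipeline("automatic-speech-recognition", model="openai/whisper-small")

# transformers only predicts timestamps when explicitly asked for them,
# whereas openai/whisper always emits them.
result = asr("sample.wav", return_timestamps=True)
print(result["chunks"])  # each chunk carries {"timestamp": (start, end), "text": ...}
```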

Hmm, the `check_file_size` is a pretty rough sanity check; the file might actually be OK, but it's hard to tell without looking at it. You could try deactivating the check?...
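
To give an idea of what that check amounts to, here is a rough, illustrative sketch, assuming it simply compares the original pickle file and the converted safetensors file sizes (the function name, signature, and threshold are assumptions, not the actual implementation):

```python
import os

def check_file_size(sf_filename: str, pt_filename: str, tolerance: float = 0.01) -> None:
    """Illustrative sanity check: flag conversions whose output size deviates too much."""
    sf_size = os.stat(sf_filename).st_size
    pt_size = os.stat(pt_filename).st_size
    if abs(sf_size - pt_size) / pt_size > tolerance:
        raise RuntimeError(
            f"File sizes differ too much: {sf_filename} is {sf_size} bytes, "
            f"{pt_filename} is {pt_size} bytes"
        )
```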

Can you point to the actual model on the Hub you're using too? Because we don't have any issue with "official" checkpoints.

> The funny thing is the code works well with 7B and 65B models but fails for 13B and 30B

I converted 30B like 3 times today for quantization purposes...

> Throws an InvalidHeaderDeserialization. Loading the same files using LlamaForCausalLM works fine in a notebook.

You're trying to load `pickle` files (which would work for `LlamaForCausalLM`), it seems.
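
If it helps to sanity-check what kind of file you actually have: a safetensors file starts with an 8-byte little-endian header length followed by a JSON header, while recent torch `pickle` checkpoints are zip archives. A small sketch (the path is a placeholder):

```python
import json
import struct
import zipfile

def describe_checkpoint(path: str) -> str:
    """Best-effort guess at whether a file is safetensors or a torch pickle archive."""
    if zipfile.is_zipfile(path):
        return "torch pickle (zip) checkpoint -> load with torch / LlamaForCausalLM"
    try:
        with open(path, "rb") as f:
            header_len = struct.unpack("<Q", f.read(8))[0]
            json.loads(f.read(header_len))
        return "safetensors file with a readable header"
    except Exception:
        return "neither a zip archive nor a readable safetensors header"

print(describe_checkpoint("pytorch_model-00001-of-00243.bin"))
```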

> - /usr/src/llama-30b-supercot/pytorch_model-00001-of-00243.bin: 537
> - /usr/src/llama-30b-supercot/model-00001-of-00243.safetensors: 48

This is probably an empty file or one containing a super small tensor...

No no, I inspected the first file: it's completely empty. My paranoia almost expected a payload here, but no, it's just empty.
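
For anyone who wants to verify this kind of thing themselves, a quick sketch listing what a shard actually contains with `safetensors.safe_open` (the path is a placeholder):

```python
from safetensors import safe_open

path = "model-00001-of-00243.safetensors"  # point this at the suspicious shard

with safe_open(path, framework="pt", device="cpu") as f:
    names = list(f.keys())
    print(f"{len(names)} tensors in {path}")
    for name in names:
        print(name, f.get_slice(name).get_shape())
```

An essentially empty shard like the 48-byte one above would list zero tensors.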

> Awesome! Documenting that this PR should fix #1689 - right?

Indeed it would! Not in full generality quite yet (we need to transfer part of the token counting logic...