Serhii Korol
The problem on Mac is with the underlying download script (it uses associative arrays and a nested loop). It's Linux-oriented and should be adapted to macOS. TBH, I quit trying to fix different...
Because it's not the root cause. It never enters this [loop](https://github.com/juncongmoo/pyllama/blob/main/llama/download_community.sh#L134-L139).
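For anyone hitting this: macOS still ships bash 3.2, which predates associative arrays (`declare -A`, added in bash 4), so a Linux-oriented script that relies on them can fail before that loop is ever reached. A minimal guard one could put near the top of the script (a sketch, not part of the actual pyllama script):

```shell
# Sketch of a bash-version guard; NOT part of the actual pyllama script.
# macOS ships bash 3.2, where `declare -A` (associative arrays, bash 4+)
# fails, so Linux-oriented array bookkeeping is never populated.
if ! declare -A _probe 2>/dev/null; then
  echo "bash ${BASH_VERSION} lacks associative arrays; install bash 4+ (e.g. 'brew install bash') and re-run" >&2
  exit 1
fi
```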
```shell
python3 quant_infer.py --wbits 4 --load pyllama-7B4b.pt --text "The meaning of life is" --max_length 24 --cuda cuda:0
```
Several people are complaining about garbage in the output here: #58.
Noticed the same on the 4-bit model: just garbage in the output. Now I'm trying to quantize from the downloaded files. Will post the result here later.
BTW, found an interesting observation in #58: `--groupsize 128` affects the results somehow. Need to try quantizing w/o this flag.
Yeah, seems like it works w/o `--groupsize`.
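For reference, the quantization invocation I'm comparing looks roughly like this. The `llama.llama_quant` entry point and the HF model id follow the pyllama README as I understand it; treat both as assumptions and adjust to your setup:

```shell
# Assumed pyllama quantization entry point (per its README); adjust model id/paths.
# Without --groupsize: output looks sane for me.
python -m llama.llama_quant decapoda-research/llama-7b-hf c4 --wbits 4 --save pyllama-7B4b.pt

# With --groupsize 128: produced garbage output in my runs (see #58).
python -m llama.llama_quant decapoda-research/llama-7b-hf c4 --wbits 4 --groupsize 128 --save pyllama-7B4b.pt
```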
@DrewSBAI this number would be different even for a single device if you re-run it 10 times. For me, slam-toolbox prints a new number in the ~447-453 range on almost every run...
Any updates? The JB plugin still doesn't work.
Never mind, it's fixed in the dev branch. Just build it and install it from the .zip.