tuanhe issues

Repositories
Issues
Comments

Results 1 issues of


                                            tuanhe

reproduce Llama2 7b failure : RuntimeError: The expanded size of the tensor (4608) must match the existing size (4096) at non-singleton dimension 3. Target sizes: [65, 32, 512, 4608]. Tensor sizes: [65, 1, 512, 4096]

I wanna reproduce the llama2 steps followed by the scripts/llama2_example.sh on RTX4090 I just run the commad `python -m awq.entry --model_path /data/models/Llama-2-7b-chat-hf --w_bit 4 --q_group_size 128 --run_awq --dump_awq awq_cache/Llama-2-7b-chat-hf-w4-g128.pt `...