tuanhe
Results
1
issues of
tuanhe
I wanna reproduce the llama2 steps followed by the scripts/llama2_example.sh on RTX4090 I just run the commad `python -m awq.entry --model_path /data/models/Llama-2-7b-chat-hf --w_bit 4 --q_group_size 128 --run_awq --dump_awq awq_cache/Llama-2-7b-chat-hf-w4-g128.pt `...