Wenhua Cheng

Results 24 issues of Wenhua Cheng

File "/home/wenhuach/anaconda3/envs/autoround/lib/python3.10/site-packages/triton/runtime/jit.py", line 167, in return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) File "/home/wenhuach/auto-round/auto_round_extension/cuda/triton_utils/custom_autotune.py", line 131, in run key = tuple([2 ** int(math.log2(x) + 0.5) for x in key])...

please consolidate with activation quantization

enhancement

# What does this PR do? New Feature: Support AutoRound quantization algorithm which has shown benefit for finetuning https://substack.com/@bnjmnmarie/p-147609171 ## Before submitting - [x] Did you read the [contributor guideline](https://github.com/hiyouga/LLaMA-Factory/blob/main/.github/CONTRIBUTING.md)?...

### Reminder - [X] I have read the README and searched the existing issues. ### System Info None ### Reproduction None ### Expected behavior None ### Others Hi, Thank you...

pending