Sehun Heo
Results
2
comments of
Sehun Heo
trafficstars
Is it clear?
@tyler-romero Thank you for quickly response. I simply used Liger through the `--use_liger_kernel=True` option in the Huggingface trainer. While it is true that Qwen-2.5 uses the same architecture as Qwen-2,...