jiaqiw09

Results 4 issues of jiaqiw09

## what this PR do issue: #1797 This PR add NPU support for huggingface.py. It just does some fix of existing code to support NPU device. ## what part to...

## Feature request It would be nice if the lm_eval could be used directly on the Ascend NPU without modifying the source code of `huggingface.py`. ## Motivation In China, Ascend...

# What does this PR do? Aggregated RuntimeArgs Container - Introduced RuntimeArgs dataclass that aggregates DataArguments, ModelArguments, TrainingArguments, and SampleArguments - Supports partial aggregation (e.g., only model+training for training scenarios)...

I trained model qwen2.5-7B in GPU with llamafactory deepspeed zero2 + autotp. And there is no obvious memory reduction. When individually using zero2, the average memory of 8 cards is...