NanoCode012

Results 46 issues of NanoCode012

# Description Closes https://github.com/axolotl-ai-cloud/axolotl/issues/2658 ## Motivation and Context ## How has this been tested? Untested! ## Screenshots (if appropriate) ## Types of changes ## Social Handles (Optional)

# Description - Updated grpo trl docs in config qmd ~- Added `log_sweep` as possible scheduler in schema (it was supported long ago, but not allowed in schema)~ - Clarified...

documentation

# Description RL trainer was not loading some basic configs like `logging_steps` etc. This PR consolidates the setting of these params and cleans them up. This PR also fixes a...

# Description Still a WIP, but pushed to share the user journey and take feedbacks. This requires Attention Enum PR to be merged first. ## Motivation and Context Assist users...

### ⚠️ Please check that this feature request hasn't been suggested before. - [x] I searched previous [Ideas in Discussions](https://github.com/axolotl-ai-cloud/axolotl/discussions/categories/ideas) didn't find any similar feature requests. - [x] I searched...

enhancement

### ⚠️ Please check that this feature request hasn't been suggested before. - [X] I searched previous [Ideas in Discussions](https://github.com/OpenAccess-AI-Collective/axolotl/discussions/categories/ideas) didn't find any similar feature requests. - [X] I searched...

enhancement

# Description VLLM should support torch2.5-2.7 equally. Prev, torch2.5 would install 0.7.4 which would break GRPO as seen in #2601 ## Motivation and Context ## How has this been tested?...

# Description Found out from a lot of tests that `lm_head` has some issues with linearized model. Can't even `.clone()` it. Chose to detach then reattach the gradient. The conversion...

hold

# Description Separated from #2504 Deepseekv3 not working, but added to be used as future ref code for any potential contributors ## Motivation and Context ## How has this been...

# Description Updated xformers llama attention patch for transformers v4.50.0 ## Motivation and Context ## How has this been tested? ## Screenshots (if appropriate) ## Types of changes ## Social...