वेदांत
वेदांत
Related to #1321 Did 2 fies test_bitpacking test_affine_quantized_float cc @jerryzh168
## What does this PR do? This PR addresses the memory leak issue during prediction in PyTorch Lightning. It adds proper memory management when `return_predictions=False` and includes comprehensive tests to...
Skip nvrtc-builtins dependency on Windows This PR modifies the CMake configuration to handle the nvrtc-builtins library differently on Windows platforms where the library naming convention differs. Description ----------- This is...
What does this PR address? Feat: Add native SpaCy runner for v1.0.0 architecture This PR adds native SpaCy integration for BentoML's v1.0.0+ architecture, providing a complete implementation of the SpaCy...
Pass config during wandb.init() in offline mode instead of using config.update() Fixes #3607 similar [https://github.com/huggingface/transformers/issues/38968](https://github.com/huggingface/transformers/issues/38968) ## Before submitting - [X] Was this discussed/approved via a Github issue or the forum?...
# What does this PR do? add's warning for invalid keys in check_device_map as discussed in #3584 Fixes #3584 ## Before submitting - [X] Did you read the [contributor guideline](https://github.com/huggingface/accelerate/blob/main/CONTRIBUTING.md#submitting-a-pull-request-pr),...
# What does this PR do? - Add truncate_dataset and gradient_steps parameters to BatchSamplerShard - Implement optimal dataset size calculation - Add truncation logic to both split and non-split batch...
fix passing model type bu `cce_patch` ``` File "/workspace/ml-cross-entropy/cut_cross_entropy/transformers/patch.py", line 202, in cce_patch [202/3480] return PATCH_FNS[model_type](model_type_or_model, patch_options) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/workspace/ml-cross-entropy/cut_cross_entropy/transformers/kimi_linear.py", line 106, in patch_kimi_linear model_config = AutoConfig.from_pretrained(maybe_model, trust_remote_code=True) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File...
# Description sample gen support sft related to https://github.com/axolotl-ai-cloud/axolotl/pull/3067 ## Motivation and Context #3139 ## How has this been tested? Adding ## Summary by CodeRabbit ## Release Notes * **New...