Tanay Mehta issues

Results: 13 issues of Tanay Mehta

Add EfficientNet model for PyTorch

# What does this PR do? This PR adds the EfficientNet model family, proposed in #15759, to HuggingFace transformers (PyTorch only for this PR). The implementation is based on that...

[WIP] Add Cross Validators to cuml

Adding scikit-learn cross validators to cuML. Discussion in issue #4662. cc @divyegala

Cython / Python

[FEA] Add Cross Validators to cuml

**Is your feature request related to a problem? Please describe.** I would really love to see cross validators such as KFold, StratifiedKFold, GroupKFold, etc. in cuml. It will help make...

feature request

inactive-30d

Fix `BatchSizeFinder` leaving model in train state

## What does this PR do? This PR patches the bug where `BatchSizeFinder` would leave the model in train state if used with `trainer.validate`. Fixes #18813 Before submitting - [x]...

bug

tuner

community

Tokenizer for `prepare_tulu_data.py` not found

### 🐛 Describe the bug First of all, thanks a lot for open-sourcing OLMo! I tried running `scripts/prepare_tulu_data.py` and faced the following error: ``` 2024-02-02 05:36:05.619 5a0b0b9dc92e:0 olmo.util:152...

type/bug

Enhance Lance Image Docs (DNC)

This issue tracks all the enhancements to Lance documentation pertaining to images.

documentation

Huggingface Integration doesn't work when `streaming=True`

Huggingface integration for converting an HF dataset to Lance doesn't work when using streaming mode. Below is the snippet to reproduce the error. ```python import lance import pyarrow as pa...

bug

enhancement

Tracking Lance Datasets (DNC)

This issue tracks the progress of deep learning datasets created using Lance:
- [x] [Text] [Codeparrot GitHub Code dataset variants](https://huggingface.co/datasets/codeparrot/github-code)
  - [Codeparrot-1M Python](https://www.kaggle.com/datasets/heyytanay/codeparrot-1m)
- [ ] [Text] [Wikitext](https://huggingface.co/datasets/wikitext)
- [...

good first issue

Docs: Gemma-7B knowledge distillation with Lance

Add "Fill-in-Middle" pipeline

# What does this PR do? This PR adds the Fill-in-Middle (FIM) pipeline to 🤗 transformers. The FIM objective was proposed in [Efficient Training of Language Models to Fill in the Middle](https://arxiv.org/abs/2207.14255)....