Tanay Mehta

Results: 13 issues by Tanay Mehta

# What does this PR do? This PR adds the EfficientNet model family, proposed in #15759, to Hugging Face transformers (PyTorch only for this PR). The implementation is based on that...

Adding scikit-learn cross-validators to cuML. Discussion in issue #4662. cc @divyegala

Cython / Python

**Is your feature request related to a problem? Please describe.** I would really love to see cross-validators such as KFold, StratifiedKFold, GroupKFold, etc. in cuML. It will help make...

feature request
inactive-30d

## What does this PR do? This PR patches the bug where `BatchSizeFinder` would leave the model in train state if used with `trainer.validate`. Fixes #18813. Before submitting - [x]...

bug
tuner
community
pl

### 🐛 Describe the bug First of all, thanks a lot for open-sourcing OLMo! I tried running `scripts/prepare_tulu_data.py` and hit the following error: ``` 2024-02-02 05:36:05.619 5a0b0b9dc92e:0 olmo.util:152...

type/bug

This issue tracks all the enhancements to Lance documentation pertaining to images.

documentation

The Hugging Face integration for converting an HF dataset to Lance doesn't work in streaming mode. Below is a snippet to reproduce the error. ```python import lance import pyarrow as pa...

bug
enhancement

This issue tracks the progress of Deep learning datasets created using Lance - [x] [Text] [Codeparrot GitHub Code dataset variants](https://huggingface.co/datasets/codeparrot/github-code) - [Codeparrot-1M Python](https://www.kaggle.com/datasets/heyytanay/codeparrot-1m) - [ ] [Text] [Wikitext](https://huggingface.co/datasets/wikitext) - [...

good first issue

# What does this PR do? This PR adds the Fill-in-Middle (FIM) pipeline to 🤗 transformers. The FIM objective was proposed in [Efficient Training of Language Models to Fill in the Middle](https://arxiv.org/abs/2207.14255)....