text
text copied to clipboard
Models, data loaders and abstractions for language processing, powered by PyTorch
## 🐛 Bug **Describe the bug** A clear and concise description of what the bug is. torchtext.legacy.data module is not available in torchtext==0.11.0 (torch==1.11) **Environment** Please copy and paste the...
## 🚀 Feature `SentencePieceTokenizer` (or similar) should return IDs. **Motivation** Currently using `SentencePieceTokenizer` as a transform requires a second transform which uses torchtext's `Vocab` abstraction in order to ID-ify the...
## 🐛 Bug **Describe the bug** The output of OpenAI's CLIP tokenizer is different than Torchtext's tokenizer, when using the same inputs & settings. **To Reproduce** Steps to reproduce the...
## 🚀 Feature **Motivation** * To avoid pitfall with shuffling and sharding of datapipes in distributed training environments * To ensure consistent experience of TorchData based datasets across domains. **Pitch**...
## 📚 Documentation **Description** I didn't get what i need to provide for the class for it to work and there is no documentation on what is `merges_path` or what...
Summary: As we may automatically switch to use nested tensor, we need further support of this in torchtext, especially for return_all_layers Reviewed By: mikekgfb, parmeet Differential Revision: D36213184
## 🚀 Feature TorchData is currently an optional dependency which means users have to `pip install torchdata` if they want to be able to use our datasets. Since datasets are...
## ❓ Questions and Help **Description** I am trying to implement a sequence (multi-output) regression task using `torchtext`, but I am getting the error in the title. torch version: 1.10.1...
## searching the ideal torchtext pipeline **Description** hey there, so ive been using the legacy version of torchtext for quite sometime as it provides easier ways to load custom dataset...
## 🚀 Feature Revamp Pytorch Benchmark implementations for pytorch_struct and Attention is All You Need models **Motivation** Since we deprecated pytorch.legacy, we need to revamp these tutorials to make them...