text
text copied to clipboard
Models, data loaders and abstractions for language processing, powered by PyTorch
## 🐛 Bug **Describe the bug** While following the torchtext tutorial here https://pytorch.org/tutorials/beginner/torchtext_translation_tutorial.html I am unable to download english & german gzipped datasets from the urls specified in the example,...
## 🚀 Feature **Motivation** The legacy `BucketIterator` was convenient because it could batch samples by length to minimize padding. It had many disadvantages because of its API and non-comformance with...
## 🚀 Feature **Motivation** The current method of training a sentencepiece model requires a file to be passed. It would be nice if this was not required. **Pitch** Like other...
Hello everyone, I want to implement a Fast WordPiece Tokenization algorithm introduced by Google. # Fast WordPiece algortihm Google introduced a new algorithm called LinMaxMatch for WordPiece tokenization has time...
## ❓ Questions and Help **Description** Hello, I want to train custom data with lstm, the data looks like: text, value a a a a b a ... 0.2 a...
Hi There! To help folks sort through the clutter of libraries, please add a comparison of PyTorch-NLP and pytorch/text. https://github.com/PetrochukM/PyTorch-NLP Thanks!
User may want to additionally project the features from encoder. This PR add support for projecting features to different dimensional space.
## ❓ Questions and Help **Description** Hey folks, I was hoping someone could tell me a better way to deal with this issue. I am getting a `StopIteration` error on...
This PR introduces the new issue forms that replace issue templates. This is similar to what was done in torchvision https://github.com/pytorch/vision/pull/4299, you can see the end result here: https://github.com/pytorch/vision/issues/new/choose (click...