text
Models, data loaders and abstractions for language processing, powered by PyTorch
## ❓ Questions and Help **Description** I was trying to use the `torchtext.experimental` module in my code and used `pip install --pre torch torchtext -f https://download.pytorch.org/whl/nightly/cu102/torch_nightly.html` to install the package....
## 🚀 Feature Outputs for the current CLIP tokenizer appear to be a list of strings of numbers, rather than a tensor or even a list of numbers: ``` clip_tokenizer...
## 🐛 Bug It seems to me that the first line is ignored due to it being a comment about the file in open_clip and OpenAI's vocab file, but this...
## ❓ How to build vocab from GloVe embeddings? **Description** How do I build a vocab from GloVe embeddings? I have gone through the documentation and the release update, I got to...
## ❓ Questions and Help **Description** Three packages are under [pytorch text third_party](https://github.com/pytorch/text/tree/main/third_party). However, I personally prefer using system-installed packages: `libre2-dev`, `libdouble-conversion-dev`, and `libsentencepiece-dev`. In addition, isn't...
## 🚀 Feature Tracking issue for [this discussion](https://github.com/pytorch/text/pull/1622#discussion_r812166562). File-reading should probably be handled in a classmethod so files don't need to be written to construct certain objects. One example would...
CI is generally unhappy because of caching, but something has broken with the OS X conda builds. For whatever reason, the OS X Python 3.8 builds are OK. cc @seemethere
## ❓ Questions and Help **Description** For some reason, calling `__getitem__()` on the torchtext Multi30k dataset raises a `NotImplementedError` for me, despite the dataset being properly downloaded and calling `next(iter())`...
## 🚀 Feature torchtext datasets should provide an optional `transforms` kwarg. **Motivation** Other domain libraries provide a `transform` and `target_transform` kwargs to datasets for common operations (e.g., resizing, scaling, and...
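To make the `transform` / `target_transform` request above concrete, here is a minimal sketch of what such kwargs could look like on a map-style dataset. It is written in plain Python and is not torchtext code: the class `LabeledTextDataset`, its constructor arguments, and the sample data are all hypothetical names chosen for illustration.

```python
from typing import Any, Callable, Iterator, List, Optional, Tuple


class LabeledTextDataset:
    """Hypothetical map-style dataset of (label, text) pairs.

    Not a torchtext class; it only illustrates optional transform kwargs.
    """

    def __init__(
        self,
        samples: List[Tuple[str, str]],
        transform: Optional[Callable[[str], Any]] = None,
        target_transform: Optional[Callable[[str], Any]] = None,
    ) -> None:
        self._samples = list(samples)
        self._transform = transform
        self._target_transform = target_transform

    def __len__(self) -> int:
        return len(self._samples)

    def __getitem__(self, index: int) -> Tuple[Any, Any]:
        label, text = self._samples[index]
        # Apply each transform only if the caller supplied one.
        if self._transform is not None:
            text = self._transform(text)
        if self._target_transform is not None:
            label = self._target_transform(label)
        return label, text

    def __iter__(self) -> Iterator[Tuple[Any, Any]]:
        for index in range(len(self)):
            yield self[index]


# Made-up sample data, for illustration only.
raw = [("pos", "A great movie"), ("neg", "Not my favourite")]

dataset = LabeledTextDataset(
    raw,
    transform=lambda text: text.lower().split(),                 # lowercase + whitespace tokenize
    target_transform=lambda label: 1 if label == "pos" else 0,   # string label -> int
)
```

With this shape, indexing `dataset[0]` yields the transformed pair, while constructing the dataset without either kwarg returns the raw `(label, text)` tuples unchanged.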