PyTorch-NLP
Basic Utilities for PyTorch Natural Language Processing (NLP)
GLUE datasets are standard for evaluating NLU tasks. "In pursuit of this objective, we introduce the General Language Understanding Evaluation benchmark (GLUE), a tool for evaluating and analyzing..."
## Behaviors The following code snippet is taken directly from the `README.md` of this library (see [here](https://github.com/PetrochukM/PyTorch-NLP/blob/master/README.md)). I expect the following `n_class` to be equal to 2 (i.e. there...
Is there a straightforward way to specify the maximum number of tokens per batch in a sampler (e.g., `BucketBatchSampler`)? Reducing the amount of padding per batch is critical for performance...
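To make the request concrete, here is a minimal sketch of token-budget batching. It is not part of PyTorch-NLP: the function name `batches_by_max_tokens` and its parameters are hypothetical, and it assumes the token length of every example is known up front. A batch is closed as soon as its padded size (number of examples times the longest example) would exceed the budget.

```python
from typing import Iterator, List, Sequence


def batches_by_max_tokens(lengths: Sequence[int], max_tokens: int) -> Iterator[List[int]]:
    """Yield lists of example indices whose padded size stays within `max_tokens`.

    Hypothetical helper, not a PyTorch-NLP API. Padded size is
    (number of examples in the batch) * (longest example in the batch).
    """
    # Sort indices by length so that neighbouring examples need little padding.
    order = sorted(range(len(lengths)), key=lambda i: lengths[i])
    batch: List[int] = []
    longest = 0
    for i in order:
        if lengths[i] > max_tokens:
            raise ValueError(
                f"example {i} has {lengths[i]} tokens, over the budget of {max_tokens}"
            )
        candidate_longest = max(longest, lengths[i])
        # Close the current batch if adding this example would exceed the budget.
        if batch and candidate_longest * (len(batch) + 1) > max_tokens:
            yield batch
            batch, candidate_longest = [], lengths[i]
        batch.append(i)
        longest = candidate_longest
    if batch:
        yield batch
```

The yielded index lists could be shuffled and passed to the `batch_sampler` argument of `torch.utils.data.DataLoader`, which accepts any iterable of index lists.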
Resolved an existing issue and added the ROUGE metric (`rouge_n` and `rouge_l`).
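For readers unfamiliar with the two metrics, the sketch below shows what ROUGE-N and ROUGE-L measure. It is an illustration under simplifying assumptions, not the code from this pull request: the function names are hypothetical, inputs are pre-tokenized lists, and only the recall variant is computed.

```python
from collections import Counter
from typing import List, Sequence


def ngrams(tokens: Sequence[str], n: int) -> Counter:
    """Count the n-grams in a token sequence."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n_recall(hypothesis: List[str], reference: List[str], n: int = 1) -> float:
    """Fraction of reference n-grams that also appear in the hypothesis."""
    ref = ngrams(reference, n)
    hyp = ngrams(hypothesis, n)
    total = sum(ref.values())
    if total == 0:
        return 0.0
    # Counter intersection keeps the minimum count of each shared n-gram.
    overlap = sum((ref & hyp).values())
    return overlap / total


def lcs_length(a: Sequence[str], b: Sequence[str]) -> int:
    """Length of the longest common subsequence, via row-by-row dynamic programming."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_recall(hypothesis: List[str], reference: List[str]) -> float:
    """Longest common subsequence length divided by the reference length."""
    if not reference:
        return 0.0
    return lcs_length(hypothesis, reference) / len(reference)
```

Full ROUGE implementations usually also report precision and an F-measure; those are omitted here to keep the sketch short.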
## Expected Behavior PyTorch-NLP might be hard to understand if you are starting an NLP project; I expect a blog post to exist to help fresh faces. Example links: https://hn.algolia.com/?query=nlp&sort=byPopularity&prefix&page=0&dateRange=all&type=story...
Hi, I have multiple large-scale datasets in TFDS format, which need to be converted to iterative datasets, and I want to train a large-scale T5 model on TPUs with them, for...
`fork_rng_wrap` does not propagate function arguments to the wrapped function, and it should.
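The expected behavior can be sketched with a decorator that forwards `*args` and `**kwargs`. This is not the library's implementation: `fork_rng_wrap_sketch` is a hypothetical name, and the standard-library `random` module stands in for the RNG handling the real `fork_rng_wrap` performs.

```python
import functools
import random


def fork_rng_wrap_sketch(function):
    """Run `function` without letting it disturb the global `random` state.

    Hypothetical stand-in for `fork_rng_wrap`; it only saves and restores the
    state of Python's `random` module.
    """

    @functools.wraps(function)
    def wrapper(*args, **kwargs):
        state = random.getstate()
        try:
            # Forward every positional and keyword argument unchanged.
            return function(*args, **kwargs)
        finally:
            # Restore the RNG state even if the wrapped function raises.
            random.setstate(state)

    return wrapper


@fork_rng_wrap_sketch
def noisy_sum(a, b, scale=1.0):
    """Example function that takes arguments and consumes random numbers."""
    return a + b + scale * random.random()
```

Calling `noisy_sum(1, 2, scale=0.5)` passes all three arguments through to the wrapped function and leaves the caller's random state untouched.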
The `Encoder` is unnecessarily complex: - It has OOB that's hard to reason about - The `Encoder` implementation is coupled with special tokens - The `batch_encoding` isn't helpful We could...
Add two simple parameters to the `FastText` class, making it possible to load a fastText model from a custom file.
Based on some initial experiments, it looks like the `Encoder` classes cannot be exported to the ONNX format using `torch.onnx.export`. It would be great to be able to package...