PyTorch-NLP icon indicating copy to clipboard operation
PyTorch-NLP copied to clipboard

Basic Utilities for PyTorch Natural Language Processing (NLP)

Results 24 PyTorch-NLP issues
Sort by recently updated
recently updated
newest added

GLUE datasets are standard for evaluating NLU tasks. > In pursuit of this objective, we introduce the General Language Understanding Evaluation benchmark > (GLUE), a tool for evaluating and analyzing...

enhancement
help wanted
good first issue

## Behaviors The following code snippet is directly taken from `README.md` of the this library (see [here](https://github.com/PetrochukM/PyTorch-NLP/blob/master/README.md)). I am expecting the following `n_class` to be equal to 2 (i.e. there...

Is there any straightforward way to specify the maximum number of tokens per batch in a sampler (eg.: BucketBatchSampler)? Reducing the amount of padding per batch is critical for performance...

Cleared an existing issue, added the rouge metric - rouge_n and rouge_l

## Expected Behavior PyTorch-NLP might be hard to understand if you are starting a NLP project, I expect a blog post to exist to help fresh faces. Example links: https://hn.algolia.com/?query=nlp&sort=byPopularity&prefix&page=0&dateRange=all&type=story...

help wanted

Hi, I have multiple large-scale datasets in TFDS format, which needs to be converted to iterative datasets, and I want to trani large-scale T5 model on TPUs with them, for...

`fork_rng_wrap` does not propagate function arguments `fork_rng_wrap`, and it should.

The `Encoder` is unnecessarily complex: - It has OOB that's hard to reason with - The `Encoder` implementation is coupled with special tokens - The `batch_encoding` isn't helpful We could...

Add two simple parameters to class FastText, making it possible to load fasttext model from custom file.

Based on some initial experiments, it looks like the Encoder classes can not be exported to the ONNX format using `torch.onnx.export`. It would be great to be able to package...