fairseq icon indicating copy to clipboard operation
fairseq copied to clipboard

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Results 636 fairseq issues
Sort by recently updated
recently updated
newest added

## 🐛 Bug Hello i have downloaded the many to many mbart50 and i want to test it in en-fr with data from wmt. It did not work and I...

bug
needs triage

#### What is your question? I successfully trained a translation model with scalar quantization and I am wondering how to convert the fake INT8 weights to regular INT8 weights? When...

question
needs triage

Hi, I'm currently training an own Data2Vec text model with the latest `fairseq` master version. When I'm loading the model checkpoint with: ```bash cd examples/data2vec/models ``` then: ```python from data2vec_text...

bug
needs triage

Excuse me,can someone answer this quesion? When i run the pyfile,fairseq/examples/hubert/dump_mfcc_feature.py,it will report an error:No module named 'npy_append_array'.And there are no relevant files in the fairseq I downloaded. Please ask...

question
needs triage

Hi, #4513 introduced PyTorch version checks, that are problematic when using NVIDIA PyTorch images. Let's have a look at one code snippet, where the version check is performed: https://github.com/facebookresearch/fairseq/blob/5307a0e078d7460003a86f4e2246d459d4706a1d/fairseq/modules/transformer_layer.py#L118-L138 The...

bug
needs triage

We find that the vocabulary obtained with `--bpe` in fairseq-proprecess is the same regardless of being specified as [`character/subword-nmt/fast_bpe` etc.], when we use the following command. ```bash fairseq-preprocess --source-lang mo...

question
needs triage

## 🐛 Bug According to the [doc](https://github.com/pytorch/fairseq/tree/master/examples/nonautoregressive_translation#knowledge-distillation), it said "The easiest way of performing distillation is to follow the instructions of training a standard transformer model on the same data,...

bug

## ❓ Questions and Help ### Before asking: 1. search the issues. 2. search the docs. #### What is your question? I try to train a multilingual translation model with...

question
needs triage

This PR adds the code for the following methods to the Non-Autoregressive Transformer: - Glancing Transformer (GLAT) from "[Glancing Transformer for Non-Autoregressive Neural Machine Translation](https://aclanthology.org/2021.acl-long.155.pdf)" (Qian et al., 2021) -...

CLA Signed

Hi! Recently I stumbled across your repo and wmt models. They showed pretty good results on my data out-of-the-box (I uploaded them via HuggingFace) but I failed to find any...

question
needs triage