Wagtail

56 comments by Wagtail

Have you tried normalizing your input text, e.g. with `input.capitalize()`? The SentencePiece tokenizer chunks rare words into many small pieces, especially if they appear in uppercase but are normally not uppercase.
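
As a minimal sketch (the `t5-small` checkpoint and the example word are placeholders, chosen only because T5 uses a SentencePiece tokenizer), you can compare how the raw and normalized variants get split:

```python
from transformers import AutoTokenizer

# Assumes any SentencePiece-based checkpoint; t5-small is used purely for illustration.
tokenizer = AutoTokenizer.from_pretrained("t5-small")

word = "HYPERPARAMETER"  # hypothetical rare, all-uppercase word
print(tokenizer.tokenize(word))               # typically many small pieces
print(tokenizer.tokenize(word.capitalize()))  # usually far fewer pieces
```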

I am currently [researching language modeling](https://gitlab.com/Bachstelze/instructionbert).

@Leolty It is possible that the model generates multiple words if it was pretrained with longer masked spans, as in the [UL2 mixture of denoisers](https://ai.googleblog.com/2022/10/ul2-20b-open-source-unified-language.html). Sometimes the T5 models already...
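
As a hedged illustration (the checkpoint and prompt below are placeholders, not the model discussed here), a plain T5 checkpoint already fills a single sentinel token with a multi-word span:

```python
from transformers import T5Tokenizer, T5ForConditionalGeneration

# Illustrative only: t5-base and the prompt are stand-ins for the actual setup.
tokenizer = T5Tokenizer.from_pretrained("t5-base")
model = T5ForConditionalGeneration.from_pretrained("t5-base")

inputs = tokenizer("The Eiffel Tower <extra_id_0> in Paris.", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=10)

# The span predicted for <extra_id_0> can contain several words, not just one token.
print(tokenizer.decode(outputs[0], skip_special_tokens=False))
```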

What is the status? The logs of the checks have expired.

> If you have less than the default number of GPUs (8)

Who has a default number of 8 GPUs?

@conceptofmind Sorry, I got confused by this figure from [UL2](https://ai.googleblog.com/2022/10/ul2-20b-open-source-unified-language.html) and concluded that they switched completely to encoder-decoder models: ![image](https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiozftwuxITX87OmCkAwkBouHRkjmpZHlfHCZYxRdp6_E5rLigiia3l1JlxvSnhih67iQ_CI1lQmtfffvuXNLGhuO5rFsrifmT1rk5wfLTCKcYK-6ngoendoOUzqUP1SENoQs9WvB-nsu7QDgha57NZXVMU6OpxOrbu9Mh4qKzsE3t6a0BGhlyMYhSLkw/w400-h346/image1.png) Description: In both decoder-only and encoder-decoder setups, UL2 strikes a...

@conceptofmind Thank you for your interest and contribution! To my knowledge, there is no research showing that a decoder-only modification performs better than an encoder-decoder architecture. The...

What is your base model? Flan-T5? Is there documentation? [GPT4ALL](https://github.com/nomic-ai/gpt4all) released weights and data for code instructions.

> For a generation problem, it is usually better to use GPT2 as the decoder, over BERT.

Why should this be the case, if you have enough data to train...
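
For reference, a minimal sketch of the kind of setup this question assumes: warm-starting an encoder-decoder with BERT on both sides via Hugging Face's `EncoderDecoderModel` (the checkpoint names are placeholders, not necessarily what this thread uses):

```python
from transformers import BertTokenizer, EncoderDecoderModel

# Sketch of a BERT2BERT warm start; "bert-base-uncased" is only an example checkpoint.
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = EncoderDecoderModel.from_encoder_decoder_pretrained(
    "bert-base-uncased", "bert-base-uncased"
)

# The decoder needs explicit start and pad token ids before training or generation.
model.config.decoder_start_token_id = tokenizer.cls_token_id
model.config.pad_token_id = tokenizer.pad_token_id
```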

> > > For a generation problem, it is usually better to use GPT2 as the decoder, over BERT.
> >
> > Why should this be the...