MAMMOTH: MAssively Multilingual Modular Open Translation @ Helsinki
A model component (e.g. the Swahili encoder) is likely to exist on multiple devices. Because each device samples its own task sequence, it is possible that when a gradient synchronization...
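A minimal sketch of what such a synchronization could look like, assuming one `torch.distributed` process group per shared module; the function name and the zero-filling of unused replicas are illustrative assumptions, not MAMMOTH's actual implementation:

```python
# Minimal sketch, not MAMMOTH's actual API: all-reduce the gradients of one
# shared module across only the ranks that hold a replica of it, using a
# dedicated torch.distributed process group.
import torch
import torch.distributed as dist

def sync_module_grads(module: torch.nn.Module, group) -> None:
    """Average gradients of `module` over the ranks in `group`.

    Because each device samples its own task sequence, a rank may not have
    used this module in the current step; its grads are then None and must
    be zero-filled so the collective call still lines up across ranks.
    """
    world_size = dist.get_world_size(group=group)
    for param in module.parameters():
        if param.grad is None:
            param.grad = torch.zeros_like(param)  # unused replica: send zeros
        dist.all_reduce(param.grad, op=dist.ReduceOp.SUM, group=group)
        param.grad /= world_size
```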
The existing translation server from OpenNMT-py was refurbished, and a demo frontend was implemented using Streamlit.
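A minimal sketch of what such a Streamlit frontend could look like; the server URL and the JSON payload shape follow OpenNMT-py's REST server conventions but are assumptions here, not the demo's actual code:

```python
# Minimal sketch of a Streamlit demo frontend; URL and payload are assumed.
import requests
import streamlit as st

SERVER_URL = "http://localhost:5000/translator/translate"  # hypothetical

st.title("MAMMOTH translation demo")
text = st.text_area("Text to translate")
model_id = st.number_input("Model id on the server", value=100, step=1)

if st.button("Translate") and text:
    payload = [{"src": text, "id": int(model_id)}]
    response = requests.post(SERVER_URL, json=payload, timeout=60)
    response.raise_for_status()
    st.json(response.json())
```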
Closes #63. Same idea as v2; didn't bother porting from it. Does not include bucket states, although this could maybe be done by picking the line indices from all...
Currently, the `--train_from` option provides no means of restoring corpora states, so training resumes from the beginning of the bitexts. This means that resumed models train on a subset...
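A hypothetical sketch of the line-index idea mentioned above: remember how many lines each corpus has yielded, save that alongside the model checkpoint, and skip that many lines when training resumes. The class and attribute names are illustrative, not part of MAMMOTH:

```python
# Hypothetical sketch: a corpus reader whose position can be checkpointed
# and restored, so --train_from could continue mid-bitext.
import itertools

class ResumableCorpus:
    def __init__(self, path: str, lines_consumed: int = 0):
        self.path = path
        self.lines_consumed = lines_consumed

    def __iter__(self):
        with open(self.path, encoding="utf-8") as fh:
            # Skip the prefix that was already consumed before checkpointing.
            for line in itertools.islice(fh, self.lines_consumed, None):
                self.lines_consumed += 1
                yield line.rstrip("\n")

    def state_dict(self) -> dict:
        return {"path": self.path, "lines_consumed": self.lines_consumed}

# At checkpoint time: persist corpus.state_dict() next to the model weights.
# Under --train_from: rebuild with ResumableCorpus(**saved_state) so training
# continues from the first unseen line rather than the start of the bitext.
```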
Hi, is it possible to use Mammoth for other seq2seq problems, such as multilingual video/image captioning? What I have in mind is to prepare video features in this format (batch,...
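The feature format in the question is truncated, but assuming precomputed features of shape `(batch, time, feature_dim)`, a generic sketch of feeding them to a transformer encoder in place of token embeddings might look like this; it is plain PyTorch, not MAMMOTH's input pipeline:

```python
# Generic PyTorch sketch: precomputed video features (assumed shape
# (batch, time, feature_dim)) are projected and encoded in place of
# token embeddings.
import torch
import torch.nn as nn

d_model = 512
project = nn.Linear(2048, d_model)  # map feature_dim -> model dim
encoder = nn.TransformerEncoder(
    nn.TransformerEncoderLayer(d_model, nhead=8, batch_first=True),
    num_layers=6,
)

features = torch.randn(4, 32, 2048)   # (batch, time, feature_dim), dummy data
memory = encoder(project(features))   # (batch, time, d_model)
# `memory` would then condition a text decoder through cross-attention.
```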
closes #60
Going through the existing catalogue of options listed in our docs, a number of them seem not to be plugged in. The list below is most likely not exhaustive. ###...
Currently, we only support training encoder-decoder models. We might want to support encoder-only models (e.g. BERT) and decoder-only models (e.g. GPTs). This could be inferred automatically from the types of sharing...
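One way this inference could look, as a hedged sketch with hypothetical config keys (the actual sharing options may be named differently):

```python
# Hedged sketch with hypothetical config keys: infer the architecture from
# which sharing groups the config actually defines.
def infer_architecture(config: dict) -> str:
    has_enc = bool(config.get("enc_sharing_groups"))
    has_dec = bool(config.get("dec_sharing_groups"))
    if has_enc and has_dec:
        return "encoder-decoder"
    if has_enc:
        return "encoder-only"   # BERT-style
    if has_dec:
        return "decoder-only"   # GPT-style
    raise ValueError("config defines neither encoder nor decoder modules")
```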
Add a feature to learn virtual embeddings for prompt/prefix learning on a pretrained model. This would depend on #24 being implemented first.
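A generic PyTorch sketch of what such virtual embeddings could look like: trainable prefix vectors prepended to frozen pretrained token embeddings. This is illustrative, not a MAMMOTH interface:

```python
# Sketch of virtual embeddings for prefix learning: trainable prefix vectors
# are prepended to frozen pretrained token embeddings.
import torch
import torch.nn as nn

class PrefixEmbedding(nn.Module):
    def __init__(self, token_embedding: nn.Embedding, prefix_len: int):
        super().__init__()
        self.token_embedding = token_embedding
        for p in self.token_embedding.parameters():
            p.requires_grad = False  # the pretrained embeddings stay frozen
        dim = token_embedding.embedding_dim
        self.prefix = nn.Parameter(torch.randn(prefix_len, dim) * 0.02)

    def forward(self, token_ids: torch.Tensor) -> torch.Tensor:
        # token_ids: (batch, seq_len) -> (batch, prefix_len + seq_len, dim)
        embedded = self.token_embedding(token_ids)
        prefix = self.prefix.unsqueeze(0).expand(token_ids.size(0), -1, -1)
        return torch.cat([prefix, embedded], dim=1)
```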
Currently, we rely on custom-made layer / encoder definitions for our modules. Cf. for instance this class:
https://github.com/Helsinki-NLP/mammoth/blob/c6a193b1cc16bf7140520c44712bcf82701ec87d/mammoth/modules/transformer_encoder.py#L13
This entails that any architectural variant we wish to test has to...
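A hedged sketch of a registry/builder pattern that would let architectural variants be selected from config instead of requiring edits to hardcoded classes; all names here are made up for illustration:

```python
# Sketch of a registry/builder pattern for pluggable encoder layers.
ENCODER_LAYER_REGISTRY: dict = {}

def register_encoder_layer(name: str):
    def decorator(cls):
        ENCODER_LAYER_REGISTRY[name] = cls
        return cls
    return decorator

@register_encoder_layer("vanilla")
class VanillaEncoderLayer:
    def __init__(self, d_model: int, heads: int):
        self.d_model, self.heads = d_model, heads

def build_encoder_layer(config: dict):
    # A new variant only needs a @register_encoder_layer decorator to become
    # selectable via config["layer_type"], with no edits to the builder.
    cls = ENCODER_LAYER_REGISTRY[config["layer_type"]]
    return cls(d_model=config["d_model"], heads=config["heads"])
```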