Etienne Monneret comments

Results: 106 comments by Etienne Monneret

How to solve this no checkpoints found problem?

Hi, ModernMT stores your model in the "engines" subfolder and some temporary data in the "runtime" subfolder. If previously stored data exists, it tries to start from the last state,...

Idea : add extracted n-gram pairs to the neural training

The total number of words is less than twice the number of words in the 16M original sentences. Here is a quite complex example (also showing how I'm doing some...

Idea : add extracted n-gram pairs to the neural training

An RNN does not learn phrases the way an SMT system does. It learns word sequences, on both the encoder and decoder layers: what is the most probable word N after having...

Idea : add extracted n-gram pairs to the neural training

I used our own tool. We are specialists in this kind of technical work. See for example our historical software Similis (now free).

Idea : add extracted n-gram pairs to the neural training

For your information: after a few real tests, our translators are impressed by the quality of the new model, trained with this new chunk-enriched data. I'm now working on 2...

Idea : add extracted n-gram pairs to the neural training

Thanks! I'm giving it a try right now! :) As for your question, it is all said in this sentence from my original post: "I had the idea to extract...

Idea : add extracted n-gram pairs to the neural training

Of course. As the chunk pairs are added to the training data, they are also used by MMT in its online adaptation, like the original sentence pairs.

Idea : add extracted n-gram pairs to the neural training

Yes: 16M sentence pairs + 50M chunk pairs (my new algorithm is a bit more selective than the first one), all in the training data. To avoid too fast...

Idea : add extracted n-gram pairs to the neural training

I did 2 things: 1) I improved my chunking/pairing algorithm, and rejected chunk pairs whose quality estimate was too low; 2) I used the chunking coverage to build an estimation...
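
To make the first point concrete, here is a minimal Python sketch of rejecting low-quality chunk pairs before adding the rest to the sentence pairs. It is only an illustration under assumptions, not the actual tool: the `ChunkPair` type, the `quality` score in [0, 1], the `min_quality` threshold and both function names are hypothetical, and the real quality estimation is not described in this thread.

```python
from typing import Iterable, List, NamedTuple, Tuple


class ChunkPair(NamedTuple):
    """A candidate chunk pair (hypothetical structure, for illustration)."""
    source: str
    target: str
    quality: float  # assumed quality estimate in [0, 1]; higher is better


def filter_chunk_pairs(pairs: Iterable[ChunkPair],
                       min_quality: float) -> List[ChunkPair]:
    """Keep only the chunk pairs whose quality estimate reaches the threshold."""
    return [p for p in pairs if p.quality >= min_quality]


def build_training_data(sentence_pairs: Iterable[Tuple[str, str]],
                        chunk_pairs: Iterable[ChunkPair],
                        min_quality: float = 0.5) -> List[Tuple[str, str]]:
    """Return the original sentence pairs followed by the accepted chunk pairs."""
    kept = filter_chunk_pairs(chunk_pairs, min_quality)
    return list(sentence_pairs) + [(p.source, p.target) for p in kept]


if __name__ == "__main__":
    sentences = [("the cat sleeps", "le chat dort")]
    chunks = [
        ChunkPair("the cat", "le chat", 0.9),  # kept
        ChunkPair("the", "dort", 0.1),         # rejected: quality too low
    ]
    for src, tgt in build_training_data(sentences, chunks, min_quality=0.5):
        print(f"{src}\t{tgt}")
```

The threshold value is arbitrary here; in practice it would have to be tuned against the quality estimation actually used.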

Idea : add extracted n-gram pairs to the neural training

Yes. You should finally get this: ![image](https://user-images.githubusercontent.com/25932245/39172647-ac9f2646-47a2-11e8-9438-6ec352d3d221.png)