mtmonkey
mtmonkey copied to clipboard
Distributed infrastructure for Machine Translation web services (using Moses, Python, JSON-RPC/web interface)
Modified MT-monkey that encapsulates perl pre- and post-processors.
We should add an option that will force the system to treat the request as a single sentence. I suggest `segmentSentences=False`.
If detokenize=false, should the result (still tokenized) be in "text" or in "tgt-tokenized"?
I think the API should allow more reporting than just errors. Like warnings, debug info, other meta info. But how? Perhaps a single mechanism in combination with errors? ``` msgLevel:...
It might be useful to have timing info in the result. See, for example: http://zardoz.service.rug.nl:9070/rpc?action=translate&sourceLang=nl&targetLang=en&text=Dit+is+een+test.
Output actual translation scores. Nice to have: get scores using some baseline quality estimation approach instead of the rather uninformative translation probability output by Moses.