
Please provide a phrase table demo

Open keyboardWitch opened this issue 7 years ago • 8 comments

Hi, I found this alignment tool very useful, and I want to train a model of my own. However, I do not have a phrase table. Could you provide a demo phrase table? Many thanks!

keyboardWitch avatar Feb 13 '18 02:02 keyboardWitch

Same here, looking for a Chinese dictionary (phrase table)

echan00 avatar Oct 24 '18 04:10 echan00

Did you have any luck?

echan00 avatar Oct 24 '18 04:10 echan00

I switched to another alignment program, one based on Gale-Church alignment and the BLEU score of machine translation.

keyboardWitch avatar Oct 24 '18 04:10 keyboardWitch

Was it bleualign? Or something else? It would be great if you can share what you used.

echan00 avatar Oct 24 '18 04:10 echan00

Yes, bleualign. I built a task queue to automatically align the parallel web pages downloaded by Scrapy. The machine translation comes from Google.

keyboardWitch avatar Oct 24 '18 05:10 keyboardWitch

Thanks! I was also using bleualign initially, but I had too many documents to align, and Google Translate is too expensive for my project.

echan00 avatar Oct 24 '18 05:10 echan00

You can use the free Google translation: slow down the requests and increase the number of concurrent requests. Yalign needs language models. I think bleualign is more useful for aligning short pages.
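A minimal sketch of the "slow down, but run concurrently" idea, assuming Python and a thread pool. `fake_translate` is a hypothetical placeholder and not a real translation API; the `Throttle` class, the worker count and the interval are illustrative assumptions, and a real service may impose its own limits and terms.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor


class Throttle:
    """Allow at most one call every `min_interval` seconds across all threads."""

    def __init__(self, min_interval):
        self.min_interval = min_interval
        self._lock = threading.Lock()
        self._next_slot = time.monotonic()

    def wait(self):
        # Reserve the next free time slot under the lock, then sleep outside it
        # so other threads can reserve their own slots in the meantime.
        with self._lock:
            now = time.monotonic()
            slot = max(now, self._next_slot)
            self._next_slot = slot + self.min_interval
        delay = slot - now
        if delay > 0:
            time.sleep(delay)


def fake_translate(sentence):
    # Hypothetical stand-in for a real translation request.
    return sentence.upper()


def translate_all(sentences, translate=fake_translate, workers=4, min_interval=0.05):
    """Translate sentences concurrently while spacing out request start times."""
    throttle = Throttle(min_interval)

    def task(sentence):
        throttle.wait()
        return translate(sentence)

    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map returns results in the same order as the input sentences.
        return list(pool.map(task, sentences))
```

Under these assumptions, `translate_all(sentences, translate=my_client, min_interval=1.0)` would start at most one request per second while several slow requests are in flight at once; `my_client` here is any callable you supply.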

keyboardWitch avatar Oct 24 '18 05:10 keyboardWitch

Same problem here, although I saw that someone already has their language pair set up for alignment. I also found a paper about improving the performance of Yalign: https://arxiv.org/abs/1512.01641. It doesn't mention how to create the dictionary (phrase table), but it might be helpful.

LukeTu avatar Oct 05 '19 08:10 LukeTu