cdleong

Results 19 comments of cdleong

Ah, well we'd have to update the notebooks as well, as they point directly to the forked version

Some languages, e.g. `ady`, lack alignment files for English: https://opus.nlpl.eu/JW300.php

[test_letter_a_new.zip](https://github.com/masakhane-io/masakhane-mt/files/6712076/test_letter_a_new.zip) Did every language code which starts with the letter "a". Here's the ones that weren't already in there.

Got to `bfi` before I started actually practicing "quality at a glance" and looking at the data. Turns out `bfi` is just... English data?

Oh, it's "British Sign Language". What the heck? https://en.wikipedia.org/wiki/British_Sign_Language

[test_ba_thru_btg_new.zip](https://github.com/masakhane-io/masakhane-mt/files/6712237/test_ba_thru_btg_new.zip) `ba` thru `btg` codes, not already in the global test set

Steps that need to be done: - [ ] (optional) assign yourself in "Assignees" over to the right - [ ] Try running the notebooks, in Google Colab - [...

So for example, this section breaks because JW300 is no longer downloadable: ![image](https://user-images.githubusercontent.com/4109253/138313280-551d2907-edd8-47c8-ad18-87b0f344d2e7.png)

I think it is still relevant, yes. And I just got done with my semester so I might have more free time as well, after the holidays On Mon, Dec...