cdleong
cdleong
I'm on Ubuntu 18. I created a fresh conda environment with pip installed, and tried to follow the instructions from https://github.com/heartexlabs/label-studio-ml-backend#quickstart. I got this: ``` INFO: pip is looking at...
Currently it does not seem to be possible to resume a previous training where it left off, if there's already a model directory. If overwrite is set to `False`, https://github.com/joeynmt/joeynmt/blob/cd6974f862922757129fa7d50b5fd842baa996f0/joeynmt/helpers.py#L34...
I'm on ubuntu, made a fresh conda environment and conda installed pip within it, then ran `pip install -r requirements.txt` I got this ``` ERROR: sentence-transformers 0.2.5.1 has requirement transformers==2.3.0,...
Slack discussion: https://masakhane-nlp.slack.com/archives/C01JAP67HRV/p1634844082006400  https://github.com/joeynmt/joeynmt/blob/master/joey_demo.ipynb is the Tatoeba example.
# Edit: see #200, maybe we should leave the old JW300 notebooks up, and instead create new ones # The problem JW300 has been taken down for copyright reasons. At...
Following #157, check what languages are not covered in https://github.com/juliakreutzer/masakhane/tree/master/jw300_utils/test, and create custom test sets for those. @juliakreutzer I think I can give this a go, but do I need...
For example, `/content/drive/My Drive/masakhane/$src-$tgt-$tag` can cause issues, but also the following situation caused an error for me: ```python source_file = f"/content/drive/My Drive/Research/Hani Machine Translation/hni_story_corpus/v2/hani_story_corpus_train.{source_language}" target_file = f"/content/drive/My Drive/Research/Hani MachineTranslation/hni_story_corpus/v2/hani_story_corpus_train.{target_language}" #...
Hello, I was attempting to adapt [this guide](https://huggingface.co/blog/how-to-train) for use with Charformer Pytorch. Colab notebook for that guide is [here](https://colab.research.google.com/github/huggingface/blog/blob/master/notebooks/01_how_to_train.ipynb). I'd like to be able to use GBST on the...
Add https://www.openslr.org/32/ to the speech datasets in https://github.com/masakhane-io/masakhane-community/blob/master/list-of-datasets.md
This link currently doesn't work:  Should it be be pointed to https://arxiv.org/abs/2109.04020 instead? Hopefully this is helpful, sorry if you were already planning to fix this!