cdleong

Results 10 issues of cdleong

I'm on Ubuntu 18. I created a fresh conda environment with pip installed, and tried to follow the instructions from https://github.com/heartexlabs/label-studio-ml-backend#quickstart. I got this: ``` INFO: pip is looking at...

Currently it does not seem possible to resume a previous training run where it left off if a model directory already exists. If overwrite is set to `False`, https://github.com/joeynmt/joeynmt/blob/cd6974f862922757129fa7d50b5fd842baa996f0/joeynmt/helpers.py#L34...
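
To make the request concrete, here is a minimal sketch of the behaviour I have in mind. It is not JoeyNMT's actual code: `prepare_model_dir` and its `resume` flag are hypothetical names used only for illustration, and the real helper linked above may be structured differently.

```python
import shutil
from pathlib import Path


def prepare_model_dir(model_dir, overwrite=False, resume=False):
    """Hypothetical helper (not JoeyNMT's API): set up a model directory.

    - resume=True: reuse an existing directory and keep its contents.
    - overwrite=True: delete an existing directory and start fresh.
    - neither: raise if the directory already exists.
    """
    path = Path(model_dir)
    if path.is_dir():
        if resume:
            # Keep existing checkpoints so training can continue from them.
            return path
        if not overwrite:
            raise FileExistsError(
                f"Model directory {path} exists and overwriting is disabled."
            )
        # Overwrite requested: remove the old directory entirely.
        shutil.rmtree(path)
    path.mkdir(parents=True)
    return path
```

With something like this, `prepare_model_dir("models/my_run", resume=True)` would leave earlier checkpoints in place, while the default call would still fail loudly when the directory exists.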

I'm on Ubuntu. I made a fresh conda environment, conda-installed pip within it, and then ran `pip install -r requirements.txt`. I got this: ``` ERROR: sentence-transformers 0.2.5.1 has requirement transformers==2.3.0,...

Slack discussion: https://masakhane-nlp.slack.com/archives/C01JAP67HRV/p1634844082006400 ![image](https://user-images.githubusercontent.com/4109253/138348855-36b06965-a97d-47d2-b940-f70a3b1624cc.png) https://github.com/joeynmt/joeynmt/blob/master/joey_demo.ipynb is the Tatoeba example.

# Edit: see #200, maybe we should leave the old JW300 notebooks up, and instead create new ones

# The problem

JW300 has been taken down for copyright reasons. At...

Following #157, check what languages are not covered in https://github.com/juliakreutzer/masakhane/tree/master/jw300_utils/test, and create custom test sets for those. @juliakreutzer I think I can give this a go, but do I need...

For example, `/content/drive/My Drive/masakhane/$src-$tgt-$tag` can cause issues, but also the following situation caused an error for me: ```python source_file = f"/content/drive/My Drive/Research/Hani Machine Translation/hni_story_corpus/v2/hani_story_corpus_train.{source_language}" target_file = f"/content/drive/My Drive/Research/Hani MachineTranslation/hni_story_corpus/v2/hani_story_corpus_train.{target_language}" #...
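
As a rough illustration of one way to guard against this, the sketch below quotes paths before they are interpolated into a shell command. This assumes the failure comes from unquoted spaces in a shell command, which the truncated snippet above does not confirm. The `cp` command, the `build_copy_command` name, and the example paths are all made up for illustration.

```python
import shlex


def build_copy_command(src, dst):
    """Build a shell command string that is safe for paths with spaces.

    Hypothetical helper for illustration: shlex.quote wraps each path so the
    shell treats it as a single argument.
    """
    return f"cp {shlex.quote(src)} {shlex.quote(dst)}"


# Illustrative paths only (not the real corpus locations).
source_file = "/content/drive/My Drive/data/train.en"
target_dir = "/content/work dir/"

command = build_copy_command(source_file, target_dir)

# Splitting the command the way a POSIX shell would recovers the original
# paths intact, spaces included.
assert shlex.split(command) == ["cp", source_file, target_dir]
```

If the command is run from Python anyway, passing the arguments as a list to `subprocess.run` avoids the quoting question entirely, since no shell parsing is involved.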

Hello, I was attempting to adapt [this guide](https://huggingface.co/blog/how-to-train) for use with Charformer Pytorch. The Colab notebook for that guide is [here](https://colab.research.google.com/github/huggingface/blog/blob/master/notebooks/01_how_to_train.ipynb). I'd like to be able to use GBST on the...

Add https://www.openslr.org/32/ to the speech datasets in https://github.com/masakhane-io/masakhane-community/blob/master/list-of-datasets.md

This link currently doesn't work: ![image](https://user-images.githubusercontent.com/4109253/133619403-bb1b823e-0e72-48dc-8f3e-164171254181.png) Should it be pointed to https://arxiv.org/abs/2109.04020 instead? Hopefully this is helpful, sorry if you were already planning to fix this!