
Translation to Russian

Open artyomboyko opened this issue 1 year ago • 36 comments

Hi there 👋

Let's translate the course to Russian so that the whole community can benefit from this resource 🌎!

Below are the chapters and files that need translating - let us know here if you'd like to translate any and we'll add your name to the list. Once you're finished, open a pull request and tag this issue by including #issue-number in the description, where issue-number is the number of this issue.

🙋 If you'd like others to help you with the translation, you can also post in our forums or tag @_lewtun on Twitter to gain some visibility.

Chapters

UNIT 0. WELCOME TO THE COURSE!

UNIT 1. WORKING WITH AUDIO DATA

UNIT 2. A GENTLE INTRODUCTION TO AUDIO APPLICATIONS

UNIT 3. TRANSFORMER ARCHITECTURES FOR AUDIO

UNIT 4. BUILD A MUSIC GENRE CLASSIFIER

UNIT 5. AUTOMATIC SPEECH RECOGNITION

UNIT 6. FROM TEXT TO SPEECH

UNIT 7. PUTTING IT ALL TOGETHER

UNIT 8. FINISH LINE

Course Events

Adding extra material:

  • [X] #136 @blademoon
  • [X] #137 @blademoon

artyomboyko avatar Jul 26 '23 19:07 artyomboyko

Good evening. I have added a file, translation_agreements.txt, where I record words and phrases that need to be translated uniformly. This keeps the translation style consistent if several translators are involved.

artyomboyko avatar Jul 27 '23 19:07 artyomboyko

@lewtun Hello. Can you post a review? As soon as I finish translating Unit 1, I plan to push everything translated so far to the official repository.

artyomboyko avatar Jul 31 '23 17:07 artyomboyko

Hi! I can take chapter 2 for translation.

Lightmourne avatar Aug 01 '23 14:08 Lightmourne

@Lightmourne Good news! The two of us can do more!

artyomboyko avatar Aug 01 '23 15:08 artyomboyko

@MKhalusova Good afternoon, Maria. Can you suggest who can do a review of our contribution to the repository?

artyomboyko avatar Aug 01 '23 16:08 artyomboyko

@blademoon Thanks for initiating and organizing this effort! I can handle the reviews.

MKhalusova avatar Aug 02 '23 17:08 MKhalusova

@MKhalusova That's great!

artyomboyko avatar Aug 03 '23 14:08 artyomboyko

@MKhalusova I have one question: will the course include an example of how to train Whisper for multilingual ASR? For example, how to fine-tune Whisper for two languages, English and Russian? That would be very helpful.

artyomboyko avatar Aug 03 '23 15:08 artyomboyko

@blademoon Whisper is already multilingual, and you can further fine-tune on any language. In the course, we show how to fine-tune it on a language it wasn't trained on - Dhivehi. But you can apply the same principles to other languages.

MKhalusova avatar Aug 04 '23 12:08 MKhalusova

@MKhalusova Good afternoon. Yes, I've already looked into the Whisper fine-tuning notebook. I thought there might be some difference when fine-tuning Whisper on two languages at the same time. After all, I assumed each language would at least need its own tokenizer configured accordingly. But so far, I don't understand how to do that...

artyomboyko avatar Aug 06 '23 09:08 artyomboyko

Hi. Chapter 4 [BUILD A MUSIC GENRE CLASSIFIER] translated to Russian.

Lightmourne avatar Aug 06 '23 12:08 Lightmourne

@blademoon I'll take Chapter 5 [AUTOMATIC SPEECH RECOGNITION] for translation.

Lightmourne avatar Aug 08 '23 04:08 Lightmourne

@Lightmourne OK 😉

artyomboyko avatar Aug 08 '23 06:08 artyomboyko

@MKhalusova Good afternoon, Maria. The new part of the translation has been sent to PR https://github.com/huggingface/audio-transformers-course/pull/122

Besides, Sergey and I have agreed that once the translation is finished, we will reread the whole course (working through it at the same time to check everything) and make minor edits. This should improve the quality of our work.

artyomboyko avatar Aug 09 '23 15:08 artyomboyko

@Lightmourne @MKhalusova Good afternoon. I decided to add a new marker to our task list: MINOR_FIX_DONE. It will mark the files that we have reread and corrected for minor errors.

artyomboyko avatar Aug 10 '23 07:08 artyomboyko

@MKhalusova Good evening, Maria. Can you tell me what a "rainbow passage" is? Is it a book or something else?

artyomboyko avatar Aug 10 '23 17:08 artyomboyko

@blademoon The rainbow passage is a specific piece of text (this one) that is often used in English language speech and voice research to assess different aspects of speech. It includes a variety of phonetic sounds and linguistic patterns that can help researchers understand how speech sounds are produced by individuals with different accents or speech characteristics.

MKhalusova avatar Aug 11 '23 13:08 MKhalusova

@MKhalusova Thank you, I will add your clarification to the translated version. It will seriously simplify understanding.

artyomboyko avatar Aug 11 '23 15:08 artyomboyko

@MKhalusova In file "pre-trained_model.mdx" (Chapter 6):

## SpeechT5 

[SpeechT5](https://arxiv.org/abs/2110.07205) is a model published by Junyi Ao et al. from Microsoft that is capable of

Junyi Ao is a person's name. But what is "et al."? Could it be a typo?

artyomboyko avatar Aug 11 '23 15:08 artyomboyko

@MKhalusova In Catalan et al means "and others"))) But I don't know Catalan)))

artyomboyko avatar Aug 11 '23 15:08 artyomboyko

@blademoon et al. comes from Latin, and does mean "and others". It's often encountered in paper citations, and is very common in English: https://www.merriam-webster.com/dictionary/et%20al.

MKhalusova avatar Aug 11 '23 16:08 MKhalusova

@MKhalusova Good afternoon, Maria. Could you review Sergey's PR https://github.com/huggingface/audio-transformers-course/pull/123 if possible? I am working hard on the translation of the last two units. Sergey is busy checking for minor bugs and coordinating the translation across all our work. We are trying to finish the translation and improve its quality. Thank you.

artyomboyko avatar Aug 15 '23 08:08 artyomboyko

@MKhalusova Good afternoon, Maria. A small question about translating the title House-keeping. Almost all the translations I can find relate to household management and agriculture, but that is clearly not the meaning here. The content of the section itself didn't help much in choosing a synonym either... Can you explain the meaning of this phrase more precisely?

artyomboyko avatar Aug 15 '23 13:08 artyomboyko

@MKhalusova Good evening, Maria. For your convenience, my PR https://github.com/huggingface/audio-transformers-course/pull/124 should go after Sergey's PR https://github.com/huggingface/audio-transformers-course/pull/123.

artyomboyko avatar Aug 17 '23 16:08 artyomboyko

@Lightmourne When the PR is accepted and you're ready, tag me. We'll go through the course and make minor adjustments to the translation. Okay?

artyomboyko avatar Aug 17 '23 16:08 artyomboyko

@MKhalusova Good afternoon, Maria ;) Can you give us an idea of when we should come back here to "polish" our translation? We managed to translate quickly, but we would also like to align on terminology. There are objective shortcomings, and they need to be fixed.

artyomboyko avatar Aug 29 '23 13:08 artyomboyko

@blademoon Chapter 5 needs to be fixed (make style), then it can be merged. I'll look at "Completion of the stage of translation of the course into Russian" #124 today or tomorrow. After that, the corrections from the "polishing" pass can be added on top as a separate PR.

MKhalusova avatar Aug 29 '23 15:08 MKhalusova

@MKhalusova That's exactly what we planned: merge first, then just minor edits.

A separate question: will the RL course be translated?

artyomboyko avatar Aug 29 '23 15:08 artyomboyko

@MKhalusova Regarding the translation of that one sentence: I got carried away and made an extra commit. Your commit does the same thing, and I approved it as well. Apparently I just need to get some sleep too, or better yet, take a vacation))) Thank you for your attentiveness, Maria!

artyomboyko avatar Aug 30 '23 16:08 artyomboyko

@Lightmourne Have a great birthday and thanks so much for your help! 🍸

artyomboyko avatar Aug 30 '23 16:08 artyomboyko