subsync icon indicating copy to clipboard operation
subsync copied to clipboard

Dictionaries and speech recognition models requests

Open sc0ty opened this issue 4 years ago • 19 comments

This is aggregated issue to request support for new languages. If you see one of the following errors:

Synchronization between languages xxx - yyy is currently not supported.

Synchronization with xxx audio is currently not supported.

Instead of opening new ticket, just write comment here and I will add it to the list.

Dictionaries:

  • nothing right now

Speech recognition models:

  • [ ] Korean #18
  • [ ] Finnish #25
  • [ ] Danish #31
  • [ ] Japanese #57
  • [ ] Hindi #58
  • [ ] Polish
  • [ ] Czech
  • [ ] Estonian
  • [ ] French
  • [ ] (Brazilian) Protugese

If you want to help by creating assets (requested here or not) see here for some technical description. All help is appreciated.

sc0ty avatar May 19 '20 17:05 sc0ty

couldn't you change the frequency range and just keep vocals and check when subtitles start and end and try to shift the subtitles to match by display length of the subtitle and audio length. this might be a very generic approach

fawzib avatar Jun 06 '20 07:06 fawzib

There is another project that does exactly that. I'm planning to do something similar eventually, but my synchronizer architecture would poorly fit this approach, so I will need to implement separate synchronization engine. That means lots of work, so don't expect it to be done in the near future.

sc0ty avatar Jun 07 '20 12:06 sc0ty

A comment to ask if you could add French language support :-)

hista avatar Aug 17 '20 19:08 hista

Added to the list.

sc0ty avatar Aug 18 '20 16:08 sc0ty

It will be great to have Japanese and korean recognition. There is stll a chance to get them ? Thank you for your app. It's very useful.

abelrod666 avatar Aug 25 '20 10:08 abelrod666

Sorry @abelrod666, I've missed your reply.

Subsync is using Sphinx speech recognition engine with language models taken from the internet. I don't have knowledge to make new models, and there is no publicly available models for languages on this list (that I know of). That's why we don't support them. If you know of any missing models or you are able to create one then I can add it to subsync, otherwise I can't help you.

sc0ty avatar Sep 02 '20 16:09 sc0ty

@sc0ty No problem. I understand. Gonna see if I can find it.... Thank you for your app. It's great and saves me a lot of time.

abelrod666 avatar Sep 04 '20 19:09 abelrod666

please add persian language support

srmajid avatar Jun 02 '21 09:06 srmajid

please add Thai language support too.

K0ng2 avatar Feb 18 '22 14:02 K0ng2

please add Vietnamese language support too

kid1485 avatar Mar 09 '22 10:03 kid1485

The new openai whisper look really interesting specailly as it can detect multiple languages and isolate words from noise.

https://github.com/openai/whisper

Dnkhatri avatar Oct 07 '22 06:10 Dnkhatri

please add Turkish language support (your subsync is exellent!)

fuatsarperasli avatar Oct 18 '23 13:10 fuatsarperasli

please add Cantonese support

ronohkmo avatar Nov 28 '23 22:11 ronohkmo

Any possibility of adding Ukrainian support? Thank you for all your work on this!

doththouevenhoist avatar Jan 11 '24 17:01 doththouevenhoist

Please add Polish. Thanks in advance.

JanikSi avatar Mar 05 '24 16:03 JanikSi

Hello, is it possible to add speech-cze and speech-pol. Thanks in advance.

JanikSi avatar Mar 08 '24 10:03 JanikSi