spoken-to-signed-translation
spoken-to-signed-translation copied to clipboard
New another language to add or change.
Error:
download_lexicon.py --name 'vgt' --directory ./vgt
usage: download_lexicon.py [-h] --name {signsuisse} --directory DIRECTORY
download_lexicon.py: error: argument --name: invalid choice: 'vgt' (choose from 'signsuisse')
I have add 'VGT'
from .types import Gloss from .common import load_spacy_model
LANGUAGE_MODELS_SPACY = { "de": "de_core_news_lg", "fr": "fr_core_news_lg", "vgt": "vgt_core_news_lg", "en": "en_core_web_lg", }
def text_to_gloss(text: str, language: str, ignore_punctuation: bool = False) -> Gloss:
if language not in LANGUAGE_MODELS_SPACY:
raise NotImplementedError("Don't know language '%s'." % language)
model_name = LANGUAGE_MODELS_SPACY[language]
# disable unnecessary components to make lemmatization faster
spacy_model = load_spacy_model(model_name, disable=("parser", "ner"))
doc = spacy_model(text)
glosses = [] # type: Gloss
for token in doc:
if ignore_punctuation is True:
if token.is_punct:
continue
gloss = (token.text, token.lemma_)
glosses.append(gloss)
return glosses
I see that you are trying to use this repository with VGT.
The download_lexicon script does not support any VGT dataset, so to support Flemish, you would have to go through the following process:
- Collect a lexicon (Download videos from https://vlaamsegebarentaal.be/signbank/signs/show_all/ or collect your own)
- Extract poses using this library and the command
video_to_pose --format mediapipe -i example.mp4 -o example.pose - Construct a lexicon CSV file with the words, matching the poses, for example https://github.com/sign-language-processing/spoken-to-signed-translation/blob/main/assets/dummy_lexicon/index.csv
path,spoken_language,signed_language,start,end,words,glosses,priority
sgg/kleine.pose,de,sgg,0,0,kleine,Kleine,0
sgg/kinder.pose,de,sgg,0,0,kinder,Kinder,0
Now, once you have this index.csv, under a directory called, let's say, lexicon, you can run for example:
text_to_gloss_to_pose \
--text "Hallo mijn naam is john." \
--glosser "simple" \
--lexicon "lexicon" \
--spoken-language "nl" \
--signed-language "vgt" \
--pose "quick_test.pose"
hello,I have one problem, look at.
download_lexicon
--name
--directory <path_to_directory>
What should I put in "name" and "directory" in this code?
which file should i run to use this program?
hello,I have one problem, look at.
download_lexicon --name --directory <path_to_directory>
What should I put in "name" and "directory" in this code?
the only dataset available in this repository is signsuisse.
If you have a further issue that is not related to the issue at hand, please create a different issue.
https://www.corpusvgt.be/ might work
Or https://woordenboek.vlaamsegebarentaal.be/search, used by https://github.com/m-decoster/VGT-SL-Dictionary
Thanks for the tip!
On Wed, Jun 12, 2024 at 2:31 AM cleong110 @.***> wrote:
Or https://woordenboek.vlaamsegebarentaal.be/search, used by https://github.com/m-decoster/VGT-SL-Dictionary
— Reply to this email directly, view it on GitHub https://github.com/sign-language-processing/spoken-to-signed-translation/issues/28#issuecomment-2161630553, or unsubscribe https://github.com/notifications/unsubscribe-auth/A5KO7YAXRRSNKBSTGE732RTZG5UCTAVCNFSM6AAAAABE6JYQ5KVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDCNRRGYZTANJVGM . You are receiving this because you commented.Message ID: <sign-language-processing/spoken-to-signed-translation/issues/28/2161630553 @github.com>
I see that you are trying to use this repository with VGT.
The
download_lexiconscript does not support any VGT dataset, so to support Flemish, you would have to go through the following process:
- Collect a lexicon (Download videos from https://vlaamsegebarentaal.be/signbank/signs/show_all/ or collect your own)
- Extract poses using this library and the command
video_to_pose --format mediapipe -i example.mp4 -o example.pose- Construct a lexicon CSV file with the words, matching the poses, for example https://github.com/sign-language-processing/spoken-to-signed-translation/blob/main/assets/dummy_lexicon/index.csv
path,spoken_language,signed_language,start,end,words,glosses,priority sgg/kleine.pose,de,sgg,0,0,kleine,Kleine,0 sgg/kinder.pose,de,sgg,0,0,kinder,Kinder,0Now, once you have this
index.csv, under a directory called, let's say,lexicon, you can run for example:text_to_gloss_to_pose
--text "Hallo mijn naam is john."
--glosser "simple"
--lexicon "lexicon"
--spoken-language "nl"
--signed-language "vgt"
--pose "quick_test.pose"
hello @AmitMY , i have already create step 1 until 3 you told using Bahasa Malaysia Sign Language and it works if dont change the spoken lang = "sgg" and sign lang = "de" column in my new index.csv file. When I want put signed lang = "xml" or "msl" and spoken lang = "ms" in index.csv file, its receive error, mind to explain why its not working and what steps need to be taken?
##error output usage: text_to_gloss_to_pose [-h] --text TEXT --glosser {simple,spacylemma,rules,nmt} --spoken-language {de,fr,it,en} --signed-language {sgg,gsg,bfi,ase} --lexicon LEXICON --pose POSE text_to_gloss_to_pose: error: argument --spoken-language: invalid choice: 'ms' (choose from 'de', 'fr', 'it', 'en')
This is because these values are hardcoded here: https://github.com/sign-language-processing/spoken-to-signed-translation/blob/main/spoken_to_signed/bin.py#L71-L72 We might want to load them dynamically.
thank you very much for important information.
On Thu, Mar 20, 2025 at 11:03 AM Amit Moryossef @.***> wrote:
This is because these values are hardcoded here:
https://github.com/sign-language-processing/spoken-to-signed-translation/blob/main/spoken_to_signed/bin.py#L71-L72 We might want to load them dynamically.
— Reply to this email directly, view it on GitHub https://github.com/sign-language-processing/spoken-to-signed-translation/issues/28#issuecomment-2739812786, or unsubscribe https://github.com/notifications/unsubscribe-auth/A5KO7YG3YJIDC25NPPW3AR32VKG6DAVCNFSM6AAAAABZK7NX4SVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDOMZZHAYTENZYGY . You are receiving this because you commented.Message ID: <sign-language-processing/spoken-to-signed-translation/issues/28/2739812786 @github.com> [image: AmitMY]AmitMY left a comment (sign-language-processing/spoken-to-signed-translation#28) https://github.com/sign-language-processing/spoken-to-signed-translation/issues/28#issuecomment-2739812786
This is because these values are hardcoded here:
https://github.com/sign-language-processing/spoken-to-signed-translation/blob/main/spoken_to_signed/bin.py#L71-L72 We might want to load them dynamically.
— Reply to this email directly, view it on GitHub https://github.com/sign-language-processing/spoken-to-signed-translation/issues/28#issuecomment-2739812786, or unsubscribe https://github.com/notifications/unsubscribe-auth/A5KO7YG3YJIDC25NPPW3AR32VKG6DAVCNFSM6AAAAABZK7NX4SVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDOMZZHAYTENZYGY . You are receiving this because you commented.Message ID: <sign-language-processing/spoken-to-signed-translation/issues/28/2739812786 @github.com>