seamless_communication issues

torch.compile + CUDA Graph optimization for bs=1

1

PR request for Pytorch blog post. Summary: This post is the fourth part of a multi-series blog focused on how to accelerate generative AI models with pure, native PyTorch. In...

YJYJLee

CLA Signed

seamless v2 finetunning assert model.t2u_model.target_vocab_info == unit_tokenizer.vocab_info

1

![image](https://github.com/facebookresearch/seamless_communication/assets/137498662/3a802125-db80-40de-85db-be977a8d0e70)

adnankarimjs

t2st

Excuse me, there is a buzzing sound in the translated voice, how to deal with it?

xialixx

Thanks for releasing w2v-bert 2.0! i have two questions 1. is released checkpoint fine-tuned on seamless? or checkpoint trained on only self-supervised objective? 2. is there any data statistics to...

seastar105

AttributeError: 'torch.dtype' object has no attribute 'vocab_info'

3

import torch from seamless_communication.inference import Translator translator = Translator( "seamlessM4T_v2_large", "vocoder_v2", torch.device("cuda:0"), torch.float16, ) translated_text, _, _ = translator.predict( "./input_s2tt.wav", "s2tt", tgt_lang="eng" ) print(translated_text) ----------------------------------------------------- python s2tt1.py Using the cached...

bk111

model inference

1

having the following error while running v2 model ` translator = Translator( File "/opt/conda/lib/python3.10/site-packages/seamless_communication/inference/translator.py", line 145, in __init__ pad_value=self.text_tokenizer.vocab_info.pad_idx or 0, pad_to_multiple=2 AttributeError: 'torch.dtype' object has no attribute 'vocab_info' `

haiderasad

after T2TT, the English period became the double question mark in Chinese!

1

Guess how much I love you. Oh, yes, yes, it was just like that. One autumn morning, little nut-brown hair, big nut-brown hair, and little field mouse sat bathed in...

bk111

text to text not showing even as an option m4t_predict: error: argument --task: invalid choice: 'T2TT' (choose from 'ASR', 'S2ST', 'S2TT')

Text to text is not available for some reason for task. The instructions in the read me don't work. m4t_predict "Hello how are you doing?" --task T2TT --tgt_lang arz --src_lang...

gloomiebloomie

Any plans to release finetuning script for Seamless Streaming?

Dear Developers, As a title says, I would like to know whether you can share scripts for funetuning seamless streaming. As I inspected it is far from primitive to figure...

mussakhojayeva

seamless_communication
seamless_communication copied to clipboard

Metadata

fix typo

torch.compile + CUDA Graph optimization for bs=1

seamless v2 finetunning assert model.t2u_model.target_vocab_info == unit_tokenizer.vocab_info

t2st

question about w2v-bert 2.0

AttributeError: 'torch.dtype' object has no attribute 'vocab_info'

model inference

after T2TT, the English period became the double question mark in Chinese!

text to text not showing even as an option m4t_predict: error: argument --task: invalid choice: 'T2TT' (choose from 'ASR', 'S2ST', 'S2TT')

Any plans to release finetuning script for Seamless Streaming?

← Metadata

Owner

Metadata

seamless_communication seamless_communication copied to clipboard

Metadata

← Metadata

Owner

Metadata

seamless_communication
seamless_communication copied to clipboard