seamless_communication
ASR inference
The following error comes up when running ASR inference with this command:
m4t_predict /workspace/english/data/lib_light/100_sea_fairies_0812_librivox_64kb_mp3_01_baum_sea_fairies_64kb_0.wav --task asr --tgt_lang "eng" --model_name seamlessM4T_v2_large
warnings.warn("torch.nn.utils.weight_norm is deprecated in favor of torch.nn.utils.parametrizations.weight_norm.")
2024-01-05 16:53:03,472 INFO -- seamless_communication.cli.m4t.predict.predict: text_generation_opts=SequenceGeneratorOptions(beam_size=5, soft_max_seq_len=(1, 200), hard_max_seq_len=1024, step_processor=None, unk_penalty=0.0, len_penalty=1.0)
2024-01-05 16:53:03,474 INFO -- seamless_communication.cli.m4t.predict.predict: unit_generation_opts=SequenceGeneratorOptions(beam_size=5, soft_max_seq_len=(25, 50), hard_max_seq_len=1024, step_processor=None, unk_penalty=0.0, len_penalty=1.0)
2024-01-05 16:53:03,474 INFO -- seamless_communication.cli.m4t.predict.predict: unit_generation_ngram_filtering=False
2024-01-05 16:53:03,479 WARNING -- seamless_communication.inference.translator: Transposing audio tensor from (bsz, seq_len) -> (seq_len, bsz).
Traceback (most recent call last):
File "/opt/conda/bin/m4t_predict", line 8, in <module>