seamless_communication issues

pip dependency conflict when installing

I tried to install it in kaggle using the command !pip install -e ./seamless_communication after i cloned the official repo. But there are A LOT of dependency conflicts. ERROR: pip's...

ivanhe123

S2S aligned metadata "extension" is a subset of prior metadata release?

The metadata files in [docs/m4t/seamless_align_README.md](https://github.com/facebookresearch/seamless_communication/blob/main/docs/m4t/seamless_align_README.md) come in several dated revisions. From what I've checked of `enA-ptA` and `enA-esA` at least, it seems like the "extension" from Nov 30 is a...

arlofaria-cartesia

Obtain the speech embedding

How do we get the final speech embedding (output of the length adaptor) for SeamlessM4T?

Sameep-c

Reproduciblity/seed

Hi, I am not able to reproduce ASR results after fine-tuning the seamlessM4T_medium. After fine-tuning in the same setting I am getting different results. I tried setting seeds but having...

Satyam52

How to find out speaker id for certain languages? Is there any reference?

Just as the title, "vocoder_num_spkrs": 200, there are 200 different speakers. How to find a suitable one for certain language?

NBStarry

Deployment of Seamless M4T Model - Exporting text.decoder to ONNX or Using torch.jit.trace

2

#### Description I am currently working on deploying the Seamless M4T model for text-to-text translation on a Triton server. I have successfully exported the `text.encoder` to ONNX and traced it...

HesamAlavian