seamless_communication
seamless_communication copied to clipboard
how to identify input audio language?
For Input audio --> output audio language before translating or converting anything
The Seamless project did not release a speech language identification model. However, you can use a speech LID model from a related project called MMS: https://github.com/facebookresearch/fairseq/blob/main/examples/mms/README.md#lid.
In https://github.com/facebookresearch/seamless_communication/issues/325 I give some more details.