seamless_communication issues

fix cmn ASR eval in SeamlessExpressive

SeamlessExpressive evaluation of eng-cmn normalizes both transcribed and ground truth texts into simplified Chinese texts. It is observed that there is a mix of traditional and simplified Chinese characters in...

hygong-fb

CLA Signed

Analysis of Audio Frame Alignment Discrepancies in Metadata Retrieval Process

Hello, I have noticed the following concerning the audio frames provided for retrieval; they seem to be slightly erroneous. For instance, the following metadata (enA-mtA and enA-mlt): - [enA-mtA metadata...

nassergharbi

seamlessM4T_v2_large finetuning on speech translation task

I'm trying to fine-tune the seamlessM4T_v2_large model on a speech translation task. Would there be a reason for the model to return NaN values? ` tokens, units = model(batch) ` **Output**...

laleye

Missing key(s) and size mismatch error for SeamlessM4T_medium

9

Hi, I am trying to use the SeamlessM4T_medium checkpoint for evaluation, but I am getting the following error while loading it. I just added `--model_name seamlessM4T_medium` to the command, is there...

YJYJLee

finetune.run failed on assert batch.text_to_units.prev_output_tokens is not None

4

I tried to fine-tune on a new language using the m4t_cli scripts, without success. I get the following error, which I cannot understand. However, it is indicated in the dataloader that `...

laleye

Is it possible to run SeamlessStreaming on an Apple M1 Pro?

I want to build a simple desktop app that translates a user's language in real-time. Using Blackhole, I'd like to stream the audio from say a Zoom call into the...

snmishra311

make T2TT in bilingual model auto-concatenated

# What? For a bilingual model, running T2TT returns a Result in which only the `transcription` and `word_confidence_score` are of interest. In such a case, it is more convenient to...

antoine-tran

CLA Signed

Support Pip Wheel / PyPi Package

Hi all, Great work on Seamless! I am using parts of `seamless_communication` (in particular, some of the alignment models) in an industry engineering project, and rather than cloning the repository...

moinnadeem

Trouble Extracting Monolingual Datasets from SeamlessAlign

### Problem Description The dataset provided at [this link](https://github.com/facebookresearch/seamless_communication/blob/main/docs/m4t/seamless_align_README.md) presents challenges in extracting Maltese datasets. Specifically, the metadata for Textual Audio alignment includes a subset seemingly sourced from common-crawl, with...

nassergharbi

Question about Maltese Dataset Consistency - Extension of the previous S2S release (November 30, 2023)

Hello everyone, ## Issue Description ### Observation The Maltese dataset dated November 30, 2023, is strictly identical to the previous version, without any observable extension. The dataset's metadata is provided...

nassergharbi
