seamless_communication issues

Hardware Requirements for Deploying seamless_communication on a Linux Server

5

Hello, I'm interested in deploying the seamless_communication project on my own Linux server. Before proceeding, I'd like to ensure that my server meets the necessary hardware requirements. Could you please...

alongLFB

multi-gpu

how to use multi-gpu in app.py,thx

world2025

Do ASR must specify the parameter “tgt_lang” ? (ASR 必须要指定tgt_lang这个参数吗)

1

import torchaudio from transformers import AutoProcessor, SeamlessM4Tv2Model processor = AutoProcessor.from_pretrained("facebook/seamless-m4t-v2-large") model = SeamlessM4Tv2Model.from_pretrained("facebook/seamless-m4t-v2-large") fileName="asr.wav" audio, orig_freq = torchaudio.load(fileName) audio = torchaudio.functional.resample(audio, orig_freq=orig_freq, new_freq=16000) audio_inputs = processor(audios=audio, return_tensors="pt") output_tokens = model.generate(**audio_inputs,...

lilongwei5054

Update README.md

2

add missing import of `Collater`

nguyenvulong

CLA Signed

Microphone is not working well for Large V2 model demo!

2

Dear authors and contributors, I wanted to test demo for the new version of SeamlessM4T V2 Large however found out that the microphone cannot be used properly! It'd record voices...

arash-aut

Poor output for Japanese S2TT tasks, and `mps` device

I have tried the following S2TT tasks: | # | Device | Input language | Output language | Result |--|-|-|-----|--- | 1 | `cpu` | Russian | English | ✅...

bcherny

https://github.com/arXiv/arxiv-docs/blob/arxiv-dois/help%2Farxiv_doi.mdhelp/arxiv_doi.mdhttps://doi.org/10.48550/arXiv.2201.NNNNN

# - [ ] - - [**_[]()~~~~_**

Backuser617

Release

1

https://github.com/arXiv/arxiv-docs/releases/tag/2.0

Backuser617

Captcha

https://github.com/git-ecosystem/git-credential-manager/issues/1496

Backuser617

https://developers.facebook.com/products/ai/

1

import json from models import User import requests from super_secret_environment_file import api_key, api_secret from settings import env if env == 'dev': host = "https://staging.bbot.menu" else: host = "https://bbot.menu" new_guest =...

Backuser617

seamless_communication
seamless_communication copied to clipboard

Metadata

Hardware Requirements for Deploying seamless_communication on a Linux Server

multi-gpu

Do ASR must specify the parameter “tgt_lang” ? (ASR 必须要指定tgt_lang这个参数吗)

Update README.md

Microphone is not working well for Large V2 model demo!

Poor output for Japanese S2TT tasks, and `mps` device

https://github.com/arXiv/arxiv-docs/blob/arxiv-dois/help%2Farxiv_doi.mdhelp/arxiv_doi.mdhttps://doi.org/10.48550/arXiv.2201.NNNNN

Release

Captcha

https://developers.facebook.com/products/ai/

← Metadata

Owner

Metadata

seamless_communication seamless_communication copied to clipboard

Metadata

← Metadata

Owner

Metadata

seamless_communication
seamless_communication copied to clipboard