Codec-SUPERB icon indicating copy to clipboard operation
Codec-SUPERB copied to clipboard

Audio Codec Speech processing Universal PERformance Benchmark

Results 9 Codec-SUPERB issues
Sort by recently updated
recently updated
newest added

Hello, in the released development set, different test sets have varying sampling rates such as 8kHz, 16kHz, 44.1kHz, and 48kHz, as well as different audio formats like WAV and FLAC....

Hi, I found that codec_superb_data contains many datasets and does not give the code for data preprocessing, does it mean that I need to resynthesize each dataset separately by myself...

The checkpoint 16k_320d_large_uni has mAP of 28.65 in the paper while only have 16.19 on the leaderboard

Thanks for your meaningful work. Could you share a minimal examples like superb with us to help us test our own model? includeing: dataset, how to evaluate with a specific...

Here is the result for [SpeechTokenizer](https://github.com/ZhangXInFD/SpeechTokenizer). The bit rate is 2kbps, following are the results: **Results in exps/results.txt** Codec SUPERB application evaluation Stage 1: Run speech emotion recognition. Acc: 72.15%...

for the 16kHz Codec model: the bitrate is 2kbps; for the 44.1kHz Codec model: the bitrate is 6.89kbps; for the 48kHz Codec model: the bitrate is 7.5kbps; #1、Here is the...

Scores updated: ------- Acc_ground_truth: 93.85% Acc_resync_audio: 16.10% Cos_similarity: 36.48% ACC: 16.10% --------- Log results -------------------------------------------------- File Name: crema_d.log Codec SUPERB objective metric evaluation on crema_d Stage 1: Run SDR evaluation....

# 16 kHz 2kbps ## parameter size: encoder (including quantizer) : 29MB decoder: 40MB ### exps/results.txt Codec SUPERB application evaluation Stage 1: Run speech emotion recognition. Acc: 74.93% Stage 2:...

Here is the result for [SemantiCodec](https://haoheliu.github.io/SemantiCodec/) This is a 16Khz codec with three different bit rates: 1. For token rate 100 with book size 16384 the bit rate is 1.35...