Audio-WestlakeU

9 repositories owned by Audio-WestlakeU

FullSubNet

511 stars, 148 forks

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

NBSS

163 stars, 19 forks

The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation

audiossl

75 stars, 8 forks

A library built for easier audio self-supervised training and downstream task evaluation

McNet

91 stars, 11 forks

The official repo of "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement" [ICASSP 2023]

RVAE-EM

31 stars, 3 forks

Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP 2024]

FS-EEND

76 stars, 4 forks

The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors" [ICASSP 2024]

ATST-SED

84 stars, 13 forks

The official implementation of "Fine-tune the pretrained ATST model for sound event detection"

FN-SSL

65 stars, 6 forks

The official PyTorch implementation of FN-SSL & IPDnet for sound source localization