Yuan Gong

12 repositories owned by Yuan Gong

ast

1.0k stars, 200 forks

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

ssast

347 stars, 56 forks

Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".

vocalsound

91 stars, 10 forks

Dataset and baseline code for the VocalSound dataset (ICASSP 2022).

psla

129 stars, 16 forks

Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".

gopt

121 stars, 24 forks

Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".

python-compute-eer

35 stars, 5 forks

Simple Python script to compute equal error rate (EER) for machine learning model evaluation.

realtime-adversarial-attack

20 stars, 3 forks

Code for the IJCAI 2019 paper "Real-time Adversarial Attack".

ReMASC

35 stars, 2 forks

ReMASC: Realistic Replay Attack Corpus for Voice Controlled Systems.

whisper-at

272 stars, 22 forks

Code and pretrained models for the Interspeech 2023 paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers".

ltu

306 stars, 22 forks

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".