Heinrich Dinkel
AudioCaption
Dataset and baseline for the first AudioCaption task
Datadriven-GPVAD
The codebase for Data-driven general-purpose voice activity detection.
GPV
Repository for our Interspeech 2020 general-purpose voice activity detection (GPVAD) paper
PLDA
An LDA/PLDA estimator using Kaldi in Python for speaker verification tasks
CDur
Repository for the paper "Towards duration robust weakly supervised sound event detection"
PSL
Source code for the ICASSP 2022 paper "Pseudo Strong labels for large scale weakly supervised audio tagging"
Speaker-Anti-Spoofing-Classifiers
Baselines and Classifiers for speaker anti-spoofing detection
text_based_depression
Source code for the paper "Text-based Depression Detection: What Triggers An Alert"
CED
Source code for Consistent ensemble distillation for audio tagging