semantic_speech_emotion_recognition
semantic_speech_emotion_recognition copied to clipboard
This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103.02993.pdf
Speech Emotion Recognition using Semantic Information
This repository provides training and evaluation code for the paper Speech Emotion Recognition using Semantic Information (ICASSP 2021). If you use this codebase in your experiments please cite:
Tzirakis, P., Nguyen, A., Zafeiriou, S., & Schuller, B. W. (2021). Speech Emotion Recognition using Semantic Information. arXiv preprint arXiv:2103.02993.
This repository provides the following:
- Word2Vec embeddings trained on the German SWC corpus.
- Speech2Vec embeddings trained on the SEWA DB.
- Code to train/evaluate word2vec and our unified model, i. e., paralinguistic and semantic feature extrators with a LSTM cell on top.
Requirements
Below are listed the required modules to run the code.
- aeneas
- librosa
- nltk
- numpy
- stop-words
- tensorflow
- torch
Steps
- Create the speech2vec segmentation by running
speech2word_mapping.pyin speech2vec folder. - Run
data_generator.pyto create tfrecords. - Run
train.pyto train the models, andeval.pyto evaluate.