lisa_emotiw
Bidirectional recurrent net for speech-to-emotion
Implement and train a bidirectional recurrent net for speech-to-emotion that maps the feature sequence to a single probabilistic output through a final max-pooling (or other) aggregation over time.
Also try pooling the top hidden layer rather than the output layer, i.e., apply the sequence-wide pooling to the hidden states first and follow it with a linear (or non-linear) classifier.
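The two pooling placements described above can be compared in a minimal NumPy sketch of the forward pass. It is illustrative only, not the project's implementation. Assumptions: a plain tanh recurrence, max-pooling applied to pre-softmax scores in the first variant, and hypothetical names (`rnn_pass`, `init_params`, `predict_pool_outputs`, `predict_pool_hidden`) and sizes (40 features per frame, 7 classes). Training is omitted.

```python
import numpy as np

def rnn_pass(x, W_in, W_rec, b, reverse=False):
    """Run a tanh RNN over x of shape (T, n_in); return states of shape (T, n_hid)."""
    T, n_hid = x.shape[0], W_rec.shape[0]
    h = np.zeros(n_hid)
    states = np.zeros((T, n_hid))
    # The backward direction reads the sequence from the last frame to the first.
    steps = range(T - 1, -1, -1) if reverse else range(T)
    for t in steps:
        h = np.tanh(x[t] @ W_in + h @ W_rec + b)
        states[t] = h
    return states

def softmax(z):
    z = z - z.max(axis=-1, keepdims=True)  # subtract the max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def init_params(n_in, n_hid, n_classes, seed=0):
    """Small random weights; the sizes and the 0.1 scale are arbitrary (assumption)."""
    rng = np.random.default_rng(seed)

    def direction():
        return (0.1 * rng.standard_normal((n_in, n_hid)),
                0.1 * rng.standard_normal((n_hid, n_hid)),
                np.zeros(n_hid))

    return {"fwd": direction(), "bwd": direction(),
            "W_out": 0.1 * rng.standard_normal((2 * n_hid, n_classes)),
            "b_out": np.zeros(n_classes)}

def bidirectional_states(x, params):
    """Concatenate forward and backward hidden states per frame: (T, 2 * n_hid)."""
    h_fwd = rnn_pass(x, *params["fwd"])
    h_bwd = rnn_pass(x, *params["bwd"], reverse=True)
    return np.concatenate([h_fwd, h_bwd], axis=1)

def predict_pool_outputs(x, params):
    """Variant 1: per-frame class scores, max-pooled over time, then softmax."""
    h = bidirectional_states(x, params)
    scores = h @ params["W_out"] + params["b_out"]  # (T, n_classes)
    return softmax(scores.max(axis=0))

def predict_pool_hidden(x, params):
    """Variant 2: max-pool the top hidden layer over time, then a linear classifier."""
    h = bidirectional_states(x, params)
    pooled = h.max(axis=0)  # (2 * n_hid,)
    return softmax(pooled @ params["W_out"] + params["b_out"])

if __name__ == "__main__":
    params = init_params(n_in=40, n_hid=16, n_classes=7)
    x = np.random.default_rng(1).standard_normal((50, 40))  # 50 frames of fake features
    print(predict_pool_outputs(x, params))
    print(predict_pool_hidden(x, params))
```

Both variants return one probability vector per utterance regardless of sequence length. Swapping `max` for `mean` in either function gives the "other" aggregation mentioned above, and replacing the final linear map in `predict_pool_hidden` with a small MLP gives the non-linear classifier.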