Clarification on Annotation Order for CH-SIMS Dataset

Open ilucasgoncalves opened this issue 1 year ago • 1 comments

Hello!

Thank you for sharing the CH-SIMS dataset—it’s a valuable resource for multimodal sentiment analysis, and I'm excited to work with it!

I’m looking for clarification regarding the order of the modality-specific annotations in the provided CSV file. Here’s an example row from the file for reference:

video_0001  1  [transcription text]  -1  -1  -1   -1  Negative  train
video_0001  2  [transcription text]   1   1  0.8   1  Positive  train

In this row, there are four numerical sentiment labels following the [transcription text]. Based on the dataset description in your paper, I understand these labels likely correspond to the Text-only, Audio-only, Visual-only, and Multimodal annotations. Could you please confirm the exact order of these columns?

Specifically, is the order of the labels? See below:

Text-only
Audio-only
Visual-only
Multimodal

Or is it arranged differently?

Nov 13 '24 19:11 ilucasgoncalves

Hello, Could you send me a label.csv in CH-SIMS dataset ? I need it very much. Because the format of the cvs file I download from the google drive is in a mess. My email is [email protected]. Do you know the difference between the CH-SIMS and CH-SIMS v2 in the google drives provided by the author? Thanks for your time in advance!

Jan 15 '25 09:01 EssenceCC