Cannot reproduce AED (audio event detection) performance on the ESC-50 dataset; F1 for Sneeze is only 4.88%

Open hariiiseldon opened this issue 4 months ago • 0 comments

Notice: To help resolve issues more efficiently, please raise issues following the template and fill in the details.

❓ Questions and Help

Before asking:

  1. search the issues.
  2. search the docs.

What is your question?

Code

What have you tried?

  • Applause: F1 = 0.5455
  • Breath: F1 = 0.7879
  • Cough: F1 = 0.7632
  • Cry: F1 = 0.7500
  • Laughter: F1 = 0.7097
  • Sneeze: F1 = 0.0488
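For reference, a minimal sketch of how a per-class F1 like the ones above can be computed, assuming a one-vs-rest definition (the actual evaluation script is not shown in this report). The function name `per_class_f1` and the example breakdown of predictions are hypothetical; the only figures taken from the report are 40 Sneeze clips with a single correct prediction. Under that assumption, 1 true positive, 39 false negatives, and no false positives give 2/41 ≈ 0.0488, which matches the reported Sneeze score.

```python
from collections import Counter


def per_class_f1(y_true, y_pred):
    """One-vs-rest F1 for every label that appears in y_true."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fn[truth] += 1  # missed the true class
            fp[pred] += 1   # wrongly credited the predicted class
    scores = {}
    for label in sorted(set(y_true)):
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores[label] = 2 * tp[label] / denom if denom else 0.0
    return scores


# Hypothetical breakdown: 1 correct, the rest split between Speech and Cough.
# The 34/5 split is an assumption; only "1 of 40 correct" comes from the report.
y_true = ["Sneeze"] * 40
y_pred = ["Sneeze"] + ["Speech"] * 34 + ["Cough"] * 5

scores = per_class_f1(y_true, y_pred)
print(round(scores["Sneeze"], 4))  # 0.0488
```

Note that this only reproduces the arithmetic; it says nothing about why the model labels the clips as speech.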

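Of the 40 sneezing samples, the vast majority are recognized as speech, a few are recognized as cough, and only one is correctly recognized as sneeze. In addition, some samples in every class are recognized as Event_UNK. Is there a parameter similar to ban_emo_unk in SER that can be used for events?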
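To quantify how often each event label (including Event_UNK) is produced, the predicted tags can be tallied from the raw output text. The sketch below assumes the output begins with `<|...|>` tags in the order language, emotion, event, ITN; that ordering, the helper name `event_tag`, and the sample strings are assumptions for illustration, not confirmed behaviour. It does not answer whether an event-level counterpart of ban_emo_unk exists.

```python
import re
from collections import Counter

# Matches tags of the form <|Something|> and captures the inner name.
TAG_RE = re.compile(r"<\|([^|]+)\|>")


def event_tag(text, position=2):
    """Return the tag at `position` among the <|...|> tags, or None if absent.

    position=2 assumes the order: language, emotion, event, ITN.
    """
    tags = TAG_RE.findall(text)
    return tags[position] if len(tags) > position else None


# Hypothetical model outputs, made up for this example.
outputs = [
    "<|en|><|EMO_UNKNOWN|><|Speech|><|woitn|>",
    "<|nospeech|><|EMO_UNKNOWN|><|Event_UNK|><|woitn|>",
    "<|nospeech|><|EMO_UNKNOWN|><|Cough|><|woitn|>",
]

counts = Counter(event_tag(text) for text in outputs)
for label, n in counts.most_common():
    print(label, n)
```

Run per ground-truth class, a tally like this makes it easy to see which classes lose samples to Event_UNK and which lose them to Speech.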

What's your environment?

  • OS (e.g., Linux):
  • FunASR Version (e.g., 1.0.0):
  • ModelScope Version (e.g., 1.11.0):
  • PyTorch Version (e.g., 2.0.0):
  • How you installed funasr (pip, source):
  • Python version:
  • GPU (e.g., V100M32)
  • CUDA/cuDNN version (e.g., cuda11.7):
  • Docker version (e.g., funasr-runtime-sdk-cpu-0.4.1)
  • Any other relevant information:

hariiiseldon, Sep 16 '25 12:09