mmaction2

Action classification is inaccurate and rather odd

Open Sleepingbug opened this issue 1 year ago • 1 comment

Branch

main branch (1.x version, such as v1.0.0, or dev-1.x branch)

Prerequisite

Environment

Ubuntu 20.04, CUDA 12.2, cuDNN 8.6.3. Latest code as of December 9, 2024.

Describe the bug

A simple jump-and-land motion is recognized as four different action classes: dancing gangnam style, jumpstyle dancing, tossing coin, and high kick.

Reproduces the problem - code sample

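1. Run the command: python demo/webcam_demo.py configs/recognition/timesformer/timesformer_spaceOnly_8xb8-8x32x1-15e_kinetics400-rgb.py checkpoints/timesformer_spaceOnly_8xb8-8x32x1-15e_kinetics400-rgb_20220815-78f05367.pth tools/data/kinetics/label_map_k400.txt --average-size 5 --threshold 0.1

2. Actual action: standing in front of the webcam, I stand, bend my knees, jump, land, and stand again, repeated several times.

3. Recognized as: dancing gangnam style: 15.33; jumpstyle dancing: 12.91. Just before landing it was recognized as tossing coin: 16.63. Twice, when one knee turned slightly outward, it was recognized as high kick: 11.86.

4. Expected: more standardized action classes that are not tied to popular dances. For example, jumping up should consistently be recognized as jump, not as a dance.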
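To make the expectation in item 4 concrete, here is a rough sketch of post-processing that folds several fine-grained labels into one coarse class. It is not part of mmaction2: `COARSE_LABEL`, `merge_scores`, and `top_label` are hypothetical names, the choice of which labels count as "jump" is my own assumption, and the scores in the usage note are made-up illustrative values, not measurements.

```python
from collections import defaultdict

# Hypothetical mapping from fine-grained labels to a coarse class.
# The label names are the ones reported above; the grouping is an assumption.
COARSE_LABEL = {
    "dancing gangnam style": "jump",
    "jumpstyle dancing": "jump",
    "high kick": "jump",
}


def merge_scores(scores, coarse_label=COARSE_LABEL):
    """Sum per-label scores into coarse classes.

    Labels without an entry in `coarse_label` keep their original name.
    """
    merged = defaultdict(float)
    for label, score in scores.items():
        merged[coarse_label.get(label, label)] += score
    return dict(merged)


def top_label(scores):
    """Return the (label, score) pair with the highest score."""
    return max(scores.items(), key=lambda kv: kv[1])
```

For example, with illustrative scores `{"dancing gangnam style": 0.15, "jumpstyle dancing": 0.13, "tossing coin": 0.17, "high kick": 0.12}`, `top_label` on the raw scores picks tossing coin, while `top_label(merge_scores(...))` picks the merged jump class, because the three mapped labels are summed.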

Reproduces the problem - command or script

python demo/webcam_demo.py configs/recognition/timesformer/timesformer_spaceOnly_8xb8-8x32x1-15e_kinetics400-rgb.py checkpoints/timesformer_spaceOnly_8xb8-8x32x1-15e_kinetics400-rgb_20220815-78f05367.pth tools/data/kinetics/label_map_k400.txt --average-size 5 --threshold 0.1

Stand in front of the webcam: stand, bend the knees, jump, land, and stand again; repeat several times.
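For context on the two flags in the command, my understanding is that `--average-size 5` averages the scores of the last 5 predictions and `--threshold 0.1` hides labels whose averaged score is below 0.1. The sketch below illustrates that idea only; it is not the demo's actual code, `smooth_and_filter` is a hypothetical helper, and it assumes each prediction is a dict of label to probability in [0, 1] with the same labels every time.

```python
from collections import deque


def smooth_and_filter(score_stream, average_size=5, threshold=0.1):
    """Yield averaged (label, score) lists once the window is full.

    Assumption: every item in `score_stream` is a dict with the same labels.
    """
    window = deque(maxlen=average_size)  # keeps only the newest predictions
    for scores in score_stream:
        window.append(scores)
        if len(window) < average_size:
            continue  # not enough predictions to average yet
        averaged = {
            label: sum(s[label] for s in window) / average_size
            for label in window[0]
        }
        # Keep labels at or above the threshold, highest score first.
        kept = [(l, v) for l, v in averaged.items() if v >= threshold]
        kept.sort(key=lambda kv: kv[1], reverse=True)
        yield kept
```

If the numbers shown on screen are percentages, which I have not verified, then the values I reported (11.86 to 16.63) would sit only slightly above a 0.1 threshold, meaning the model is not confident about any of these labels.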

Reproduces the problem - error message

(There is no error message; the results are simply incorrect.)

Additional information

None.

Sleepingbug, Dec 10 '24 01:12

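Following the demo at the link below, I tried to recognize jumping in place in a recorded video, and it did not succeed even once. Skeleton detection was accurate, but recognition of the jump itself was poor. Relying on the Kinetics dataset to detect jumping in place does not seem to be dependable. https://github.com/open-mmlab/mmaction2/blob/main/demo/README.md#video-structuralize-demo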

Sleepingbug, Dec 13 '24 08:12