mmpose icon indicating copy to clipboard operation
mmpose copied to clipboard

Roadmap of MMPose

Open hellock opened this issue 4 years ago • 76 comments

We keep this issue open to collect feature requests from users and hear your voice. Our monthly release plan is also available here.

You can either:

  1. Suggest a new feature by leaving a comment.
  2. Vote for a feature request with 👍 or be against with 👎. (Remember that developers are busy and cannot respond to all feature requests, so vote for your most favorable one!)
  3. Tell us that you would like to help implement one of the features in the list or review the PRs. (This is the greatest things to hear about!)

hellock avatar Jul 13 '20 13:07 hellock

i hope that MMPose can support 21 hand landmark detetion, thanks

aa12356jm avatar Jul 15 '20 12:07 aa12356jm

i hope that MMPose can support 21 hand landmark detetion, thanks

Good suggestions! We will add this feature in our TODO list. Thank you.

jin-s13 avatar Jul 22 '20 09:07 jin-s13

TODO List (continuously updated... [last edit: 2023.1.14]) : Here is a collection of feature requests. Items that have already been implemented in MMPose will be removed from the list.

  1. More popular backbones
  1. Add more popular datasets:
  1. More 2d human pose estimation method.
  1. More 2d face alignment algorithms.

  2. More 3d human pose algorithms.

  1. Support 2d video pose estimation and tracking
  1. Support Vehicle pose estimation
  1. Add 3D Pose Consistency Benchmark #828

  2. Mano based hand keypoints detection

  3. Depth-based 3d hand pose estimation

  1. Multi-view 3d pose estimation
  1. Support memonger

  2. Support Pytorch AMP training #339

  3. Hyperparameter tuner Optuna

  4. Support Unity plugin

  5. print loss during evaluation. #333

  6. Quantization Aware Training #359

  7. Easier Usage (API)

  8. Export to Torchscript #576

jin-s13 avatar Jul 23 '20 06:07 jin-s13

Would you mind add https://github.com/open-mmlab/mmpose/issues/31#issuecomment-663334223 to the TODO list.

gachiemchiep avatar Jul 26 '20 05:07 gachiemchiep

Would you mind add #31 (comment) to the TODO list.

Sure.

jin-s13 avatar Jul 26 '20 12:07 jin-s13

Speed up inference https://github.com/open-mmlab/mmpose/issues/40

innerlee avatar Jul 30 '20 02:07 innerlee

Support video pose estimation #67

jin-s13 avatar Aug 16 '20 06:08 jin-s13

Would be great to add support for whole body pose estimation dataset (body+face+hands) via COCO-WholeBody

pablovela5620 avatar Aug 16 '20 21:08 pablovela5620

Also add support for MPII in mmdetection.

lisa676 avatar Aug 26 '20 08:08 lisa676

It would be great to add support for pose tracking dataset i.e. posetrack2017/2018.

OasisYang avatar Aug 26 '20 21:08 OasisYang

Support to convert pytorch model to onnx by the way.Thx!

flynnamy avatar Aug 28 '20 07:08 flynnamy

@OasisYang could you elaborate? is loading the data and processing on a frame basis is enough, or you want the tracking part also?

@flynnamy sounds like a request for a general tool. maybe we can provide such tools for the whole mm-series (just saying, not a confirmation).

innerlee avatar Aug 31 '20 13:08 innerlee

@innerlee If possible, adding both data loading and tracking part would be great. However, the tracking part seems a little bit complicate and always comes with some extra modules. Maybe, the first step is basically to support the data loading and processing. Thanks

@OasisYang could you elaborate? is loading the data and processing on a frame basis is enough, or you want the tracking part also?

@flynnamy sounds like a request for a general tool. maybe we can provide such tools for the whole mm-series (just saying, not a confirmation).

OasisYang avatar Sep 02 '20 06:09 OasisYang

Support ShuffleNet V2 & MobileNet V3 backbones. #94

jin-s13 avatar Sep 02 '20 10:09 jin-s13

Support for InterHand2.6

pablovela5620 avatar Sep 02 '20 13:09 pablovela5620

Add Yolov4 and OpenPose

Witek- avatar Sep 23 '20 20:09 Witek-

Please make it possible to obtain estimated heatmaps from methods

fhir-obesity avatar Oct 03 '20 21:10 fhir-obesity

@hamedcan could you explain more about the usage? do you want a visualizer of heatmaps during training, or a visualization tool for demo, or anything else?

innerlee avatar Oct 05 '20 02:10 innerlee

Bottom up for MPII dataset?

san-guy avatar Oct 05 '20 13:10 san-guy

@innerlee First, I really want to thank you for the MMPose. It really helped me. I want to compare different models' performance on hard poses. So I need to be able to observe generated heatmaps. I want a visualization tool for demo.

fhir-obesity avatar Oct 05 '20 14:10 fhir-obesity

support memonger : https://github.com/Lyken17/pytorch-memonger

HuangJunJie2017 avatar Oct 23 '20 08:10 HuangJunJie2017

Support multi-head networks #219

jin-s13 avatar Oct 30 '20 06:10 jin-s13

Please support mpii_trb demo and mpi_inf_3dhp datasets!

flynnamy avatar Nov 03 '20 03:11 flynnamy

Support 3d hand keypoint estimation!!!!!!

yanglilian avatar Nov 27 '20 08:11 yanglilian

Support log info when dataset is tinty, #333

yulong314 avatar Dec 04 '20 08:12 yulong314

Support PyTorch AMP training, thanks. #339

YibinXie avatar Dec 09 '20 04:12 YibinXie

Support GCN-based methods for refining top-down results. https://arxiv.org/pdf/2003.10506v3.pdf https://github.com/lingtengqiu/OPEC-Net https://arxiv.org/abs/2007.10599

YibinXie avatar Dec 09 '20 06:12 YibinXie

Would be great to see the integration of a hyperparameter tuner like Optuna

pablovela5620 avatar Dec 10 '20 17:12 pablovela5620

A Unity plugin would be amazing to have, using json input data and/or real-time pose estimation with a webcam and seeing it reflected on a 3D model.

MaxGodTier avatar Dec 14 '20 09:12 MaxGodTier

@MaxGodTier do you have experience in developing unity plugin? contributions are welcome :D

innerlee avatar Dec 14 '20 09:12 innerlee

I don't, but a dirty implementation may be possible using an existing repo , it reads pose data from simple text files each representing a single frame , I see two solutions: (1) If pose_results from mmpose were translated into the same format expected from that repo, it will work out of the box without needing to change a single line of code or (2) edit the repo code (C#) to use mmpose rules instead of theirs.

MaxGodTier avatar Dec 15 '20 04:12 MaxGodTier

Quantization Aware Training for models to get the int8 models ,int8 models will greatly improve inference speed #359,thanks

aa12356jm avatar Dec 18 '20 02:12 aa12356jm

Support DetTrack and KeyTrack. http://arxiv.org/abs/2003.13743 & https://arxiv.org/abs/1912.02323

zgs731 avatar Dec 28 '20 08:12 zgs731

Albumentations augmetnations similar to mmclassification

pablovela5620 avatar Dec 30 '20 19:12 pablovela5620

i hope that MMPose can support 3D hand landmarks detetion, thanks

aa12356jm avatar Jan 20 '21 07:01 aa12356jm

Does MMPose support Single Person Pose Estimation? Currently I found only multi-person versions are supported.

rhiver avatar Jan 22 '21 08:01 rhiver

@rhiver single is a case of multi

innerlee avatar Jan 22 '21 09:01 innerlee

@rhiver single is a case of multi

Sort of. But multi-person version has two stages, person detection and pose estimation, which have to infer on two models. So this method doesn't work for realtime pose estimation in mobile devices since it takes too long on the inference. MobileNetV2 is good enough for simple pose estimation. But for best FPS, it's better to let it do both single person detection and pose estimation.

rhiver avatar Jan 22 '21 09:01 rhiver

I see it has supported heatmap method of face datasets now. Please support regression method of face dataset!

flynnamy avatar Jan 26 '21 04:01 flynnamy

I see it has supported heatmap method of face datasets now. Please support regression method of face dataset!

Do you have any recommended papers/codes ?

jin-s13 avatar Jan 26 '21 04:01 jin-s13

I see it has supported heatmap method of face datasets now. Please support regression method of face dataset!

Do you have any recommended papers/codes ?

Yes,wingloss:https://arxiv.org/pdf/1711.06753.pdf, and GCN+softwing loss: https://arxiv.org/pdf/2006.11697.pdf

flynnamy avatar Jan 26 '21 06:01 flynnamy

Support FashionAI https://tianchi.aliyun.com/competition/entrance/231648/introduction

innerlee avatar Jan 28 '21 03:01 innerlee

I want to use mmpose with the pypi package much more easily than now; such as:

from mmpose import top_down

top_down("darkpose", "COCO_wholebody", video_path="hoge.mp4", output_json_title="hoge") # Analyze hoge.mp4 with COCO wholebody on darkpose and output the result as hoge/hoge000000000000.json, hoge/hoge000000000001.json, hoge/hoge000000000002.json, ....

CHRIBUR0309 avatar Feb 06 '21 07:02 CHRIBUR0309

MPII multi-person dataset for bottom-up methods is really needed!

yeshaokai avatar Feb 10 '21 13:02 yeshaokai

hi ,can you add handlandmark filtering algorithm for eliminating handlandmark jittering in videos? thanks

aa12356jm avatar Mar 22 '21 06:03 aa12356jm

Adding Vehicle pose estimation to the pipe line using CarFusion dataset. Similar to Occlusion-net, and Apollocar3D.

dineshreddy91 avatar Apr 15 '21 16:04 dineshreddy91

Export to Torchscript !

simba0703 avatar Apr 16 '21 01:04 simba0703

Lite-HRNet, its already built with mmpose, so including into the main repo should be super simple. Would be amazing if it could work with the pytorch2onnx tool for deployment

pablovela5620 avatar Apr 17 '21 12:04 pablovela5620

Please support Halpe data set: https://github.com/Fang-Haoshu/Halpe-FullBody

It has 3 useful points in addition to the COCO-WholeBody.

ioctl-user avatar Apr 21 '21 18:04 ioctl-user

Hi everyone,

I intend to create my own keypoints dataset with 3 points of interest (two endpoints and one center point). Can anyone kindly help me on how I can create annotations to be loaded into mmpose? Because I believe that the repo is based on mmcv, how can I get my own dataloader? Any help in this regard will be highly appreciated. Thank you

rubeea avatar Apr 24 '21 16:04 rubeea

Support 3dpw dataset #682

jin-s13 avatar May 29 '21 05:05 jin-s13

Do you have any plans for the mano based hand keypoints detection? Also optimization with the IK loss

aqsc avatar Jul 24 '21 00:07 aqsc

Add 3D Pose Consistency Benchmark - https://github.com/open-mmlab/mmpose/issues/828

AmitMY avatar Jul 31 '21 07:07 AmitMY

Add https://github.com/mks0601/3DMPPE_POSENET_RELEASE into MMPose

znoland3 avatar Oct 13 '21 00:10 znoland3

It would be nice to add "PoseFormer". It based on VideoPose3D, which already supported.

Broken1334 avatar Nov 06 '21 11:11 Broken1334

It would be nice to add "CenterNet". it is a bottom up based 2d human pose estimation method and it groups keypoints of one person by combine regression and heatmap of keypoints which is quite different from associated embedding and affinity fields

ee-lynn avatar Nov 24 '21 13:11 ee-lynn

Blog:Next-Generation Pose Detection with MoveNet and TensorFlow.js 这里有movenet的简单介绍,https://storage.googleapis.com/movenet/MoveNet.SinglePose%20Model%20Card.pdf

APeiZou avatar Dec 18 '21 06:12 APeiZou

Add RLE into MMPose

canghaiyunfan avatar Dec 24 '21 09:12 canghaiyunfan

Background : 3d pose estimation (with video generation) with a high number of people (es: official video, minute 00:19 sec, but with a lot of people

Result video: the original video is put on the top-left, with the subsuquent 3d pose of the people on the right. If there are a lot of people, the final video has strange resolution (i.e 6000x400) because every people detected is on put on the same row.

What could be improve: split the people 3d pose visualization into multiple row

francescotaioli avatar Jan 20 '22 09:01 francescotaioli

It would be great to have a 'score_per_joint' option in test_cfg in order to output one score per joint, instead of having only a global score for the pose, my use case is related to associative embedding

piercus avatar Feb 04 '22 23:02 piercus

update Interhand2.6M dataset which contains MANO hand mesh parameters.......

liwenssss avatar Feb 17 '22 02:02 liwenssss

It would be nice to have Depth-Based 3D Hand Pose Estimation methods like A2J.

walsvid avatar Apr 11 '22 11:04 walsvid

It would be great to have SmoothNet trained on 3DPW and AIST++ :)

vra avatar Apr 21 '22 09:04 vra

It would be nice to add SmoothNet training code about pose estimation, hoping it could easily retrain on my own dataset.

qinb avatar Apr 24 '22 05:04 qinb

3D Human Mesh frankmocap

Joyako avatar May 16 '22 07:05 Joyako

3D Human Mesh frankmocap

Thanks for your feedback. 3D human mesh recovery is no longer supported in MMPose. We have MMHuman3D for this task and you are welcome to submit an issue there about your request.

ly015 avatar May 16 '22 12:05 ly015

It would be so helpful for better analysis if AP for each type of body joints are printed, for example 17 AP value for 17 kinds of body joints are given when inferencing a model in MS COCO body-keypoint dataset.

yshMars avatar May 21 '22 14:05 yshMars

Will be really helpful to implement MIPNet into mmpose:

  • It is particularly useful to tackle data where there are crowded/highly occluded humans. Was previously the SOTA on OCHuman before ViTPose came along. Within the realms of convnets, it should still be the SOTA, and it seems like the idea is general enough to be applied to different types of backbones.

levan92 avatar May 25 '22 09:05 levan92

Also similar to #1389 request, will be nice to integrate ViTPose into mmpose. ViTPose is already implemented in mmpose, so I expect integration to be much easier 😄

levan92 avatar May 25 '22 09:05 levan92

It would be so helpful for better analysis if AP for each type of body joints are printed, for example 17 AP value for 17 kinds of body joints are given when inferencing a model in MS COCO body-keypoint dataset.

@yshMars Already supported in https://github.com/open-mmlab/mmpose/pull/1170

jin-s13 avatar May 25 '22 15:05 jin-s13

It would be nice to support ConvNeXt backbones. It is a very simple model that is purely convolutional. They can serve as a drop-in replacement for ResNet or Swin Transformer architectures. ImageNet-22k pretrained ConvNeXt variants are considered state-of-the-art in this regime.

Official code: https://github.com/facebookresearch/ConvNeXt ConvNeXt was also implemented in the mmsegmentation and mmdetection libraries.

Thanks!

liuzhuang13 avatar Jun 07 '22 08:06 liuzhuang13