VPS icon indicating copy to clipboard operation
VPS copied to clipboard

Official website for "Video Polyp Segmentation: A Deep Learning Perspective (MIR 2022)"

Video Polyp Segmentation: A Deep Learning Perspective

license: mit LAST COMMIT ISSUES STARS ARXIV PAPER Gitter PWC PWC


Contents

  • 1. Features
  • 2. News
  • 3. VPS Dataset
  • 4. VPS Baseline
  • 5. VPS Benchmark
  • 6. Tracking Trends
  • 7. Citations
  • 8. FAQ
  • 9. License
  • 10. Acknowledgements

1. Features

In the deep learning era, we present the first comprehensive video polyp segmentation (VPS) study. Over the years, developments on VPS are not moving forward with ease since large-scale fine-grained segmentation masks are still not made publicly available. To tackle this issue, we first introduce a long-awaited high-quality per-frame annotated VPS dataset. There are four features of our work:

  • VPS Dataset: We recognize the importance of annotated medical data for substantial progress in research on medical AI systems’ development. And thus, our SUN-SEG dataset is open access, a non-profit database of the high-quality, large-scale, densely-annotated dataset for facilitating the colonoscopy diagnosis, localization, and derivative tasks. Our vision aims to provide data and knowledge to aid and educate clinicians, and also for the development of automated medical decision support systems.
  • VPS Baseline: We propose a simple but efficient baseline, which outperforms the 13 cutting-edge polyp segmentation approaches and run in super real-time (170fps). We hope such a baseline could attract more researchers to join our community and inspire them to develop more interesting solutions.
  • VPS Benchmark: For a fair comparison, we build an online leaderboard to keep up with the new progress of VPS community. Besides, we provide an out-of-the-box evaluation toolbox for the VPS task.
  • Tracking Trends: We elaborately collect a paper reading list to continuously track the latest updates in this rapidly advancing field.

2. News

  • [August/24/2022] :boom: Our paper has been accepted by Machine Intelligence Research (MIR) journal.
  • [July/03/2022] :boom: We update a new version of SUN-SEG with more fine-grained data splits, including seen/unseen senarios. More details refer to our technical report. Also, the new PaperWithCode page refer to SUN-SEG-Easy & SUN-SEG-Hard.
  • [May/11/2022] Release rejected labels: SUN-SEG-Rejected-Labels. More details see here.
  • [March/27/2022] Release pretrained checkpoints and whole benchamrks results.
  • [March/18/2022] Upload the whole training/testing code for our enhanced model PNS+.
  • [March/15/2022] Release the evaluation toolbox for the VPS task. Add a Awesome_Video_Polyp_Segmentation.md for tracking latest trends of this community.
  • [March/14/2022] Create the project page.

3. VPS Dataset


Figure 1: Annotation of SUN-SEG dataset. The object-level segmentation masks in SUN-SEG dataset of different pathological categories, which is densely annotated with experienced annotators and verified by colonoscopy-related researchers to ensure the quality of the proposed dataset.

Notably, based on some necessary privacy-preserving considerations from the SUN dataset, we could not directly share the download link of the video dataset with you without authorization. And please inform us of your institution and the purpose of using SUN-SEG in the email. Thank you for your understanding!

  • How to get access to our SUN-SEG dataset? Please refer to DATA_PREPARATION.
  • If you wanna know more descriptions about our SUN-SEG dataset. Please refer to our DATA_DESCRIPTION.md.

4. VPS Baseline

This work is the extension version of our conference paper (Progressively Normalized Self-Attention Network for Video Polyp Segmentation) accepted at MICCAI-2021. More details could refer to arXiv and Github Link


Figure 2: The pipeline of the proposed (a) PNS+ network, which is based on (b) the normalized self-attention (NS) block.

There are three simple-to-use steps to access our project code (PNS+):

  • Prerequisites of environment:

      conda create -n PNS+ python=3.6
      conda activate PNS+
      conda install pytorch=1.1.0 torchvision -c pytorch
      pip install tensorboardX tqdm Pillow==6.2.2
      pip install git+https://github.com/pytorch/tnt.git@master
    
  • Compiling the project:

    cd ./lib/PNS
    python setup.py build develop
    
  • Training:

    python ./scripts/my_train.py
    
  • Testing:

    Downloading pre-trained weights and move it into snapshot/PNSPlus/epoch_15/PNSPlus.pth, which can be found in this download link: OneDrive / Baidu Drive (Password: g7sa, Size: 108MB).

    python ./scripts/my_test.py
    

5. VPS Benchmark

We provide an out-of-the-box evaluation toolbox for the VPS task, which is written in Python style. You can just run it to generate the evaluation results on your custom approach. Or you can directly download the complete VPS benchmark including prediction map of each competitor at download link: OneDrive / Baidu Drive (Password: 2t1l, Size: 5.45G).

  • More instructions about Evaluation Toolbox refer to PageLink.

We also build an online leaderboard to keep up with the new progress of other competitors. We believe this is a fun way to learn about new research directions and stay in tune with our VPS community.

Here, we present a variety of qualitative and quantitative results of VPS benchamrk:

  • Visual prediction of top-performance competitors:


Figure 3: Qualitative comparison of three video-based models (PNS+, PNSNet, and 2/3D) and two image-based models (ACSNet, and PraNet).

  • Model-based performance:


Figure 4: Quantitative comparison on two testing sub-datasets, i.e., SUN-SEG-Easy (Unseen) and SUN-SEG-Hard (Unseen). `R/T' represents we re-train the non-public model, whose code is provided by the original authors. The best scores are highlighted in bold.

  • Attribute-based performance:


Figure 5: Visual attributes-based performance on our SUN-SEG-Easy (Unseen) and SUN-SEG-Hard (Unseen) in terms of structure measure.

6. Tracking Trends


To better understand the development of this field and to quickly push researchers in their research process, we elaborately build a Paper Reading List. It includes 119 colonoscopy imaging-based AI scientific research in recent 12 years. It includes several fields, such as image polyp segmentation, video polyp segmentation, image polyp detection, video polyp detection, and image polyp classification. Besides, we will provide some interesting resources about human colonoscopy.

Note: If we miss some treasure works, please let me know via e-mail or directly push a PR. We will work on it as soon as possible. Many thanks for your active feedbacks.

7. Citations

If you have found our work useful, please use the following reference to cite this project:

@article{ji2022vps,
    title={Video Polyp Segmentation: A Deep Learning Perspective},
    author={Ji, Ge-Peng and Xiao, Guobao and Chou, Yu-Cheng and Fan, Deng-Ping and Zhao, Kai and Chen, Geng and Van Gool, Luc},
    journal={Machine Intelligence Research},
    year={2022}
}

@inproceedings{ji2021pnsnet,
    title={Progressively Normalized Self-Attention Network for Video Polyp Segmentation},
    author={Ji, Ge-Peng and Chou, Yu-Cheng and Fan, Deng-Ping and Chen, Geng and Jha, Debesh and Fu, Huazhu and Shao, Ling},
    booktitle={MICCAI},
    pages={142--152},
    year={2021}
}

8. FAQ

  • Thanks to Tuo Wang for providing a great solution to upgrade the CUDA version when compling the NS block.

9. License

The dataset and source code is free for research and education use only. Any commercial usage should get formal permission first.

  • Video Source: SUN (Showa University and Nagoya University) Colonoscopy Video Database is the colonoscopy-video database for the evaluation of automated colorectal-polyp detection. The database comprises still images of videos, which are collected at the Showa University Northern Yokohama Hospital. Mori Laboratory, Graduate School of Informatics, Nagoya University developed this database. Every frame in the database was annotated by the expert endoscopists at Showa University.

  • Intended Use: This database is available for only non-commercial use in research or educational purpose. As long as you use the database for these purposes, you can edit or process images and annotations in this database. Without permission from Mori Lab., commercial use of this dataset is prohibited even after copying, editing, processing or any operations of this database. Please contact us for commercial use or if you are uncertain about the decision.

  • Distribution: It is prohibited to sell, transfer, lend, lease, resell, distribute, etc., as it is, or copy, edit, or process this database, in whole or in part.

10. Acknowledgements

  • Our dataset is built upon SUN (Showa University and Nagoya University) Colonoscopy Video Database, thanks very much for their wonderful work!
  • This codebase is based on our conference version PNSNet, which is accepted by MICCAI-2021 conference.