papers issues

CRIS: CLIP-Driven Referring Image Segmentation

## 論文概要テキストの内容に従いセグメンテーションを行う。CLIPを直接利用すると画像レベルと画素レベルの予測との間に矛盾があるため好ましくない結果が得られてしまう。そこでCLIPモデルの知識を伝達するCRISというフレームワークを提案。従来手法に圧倒的な性能差をつけてSOTA。 ![bib_20220618 00](https://user-images.githubusercontent.com/16313809/174427926-f2e7cc3f-1e50-4d9f-83d2-1bd7ad8ac179.jpg) https://openaccess.thecvf.com/content/CVPR2022/html/Wang_CRIS_CLIP-Driven_Referring_Image_Segmentation_CVPR_2022_paper.html ## Code 未確認。

tkuri

Conference: CVPR

Year: 2022

Application: Segmentation

Subject: CLIP

Pixel Screening Based Intermediate Correction for Blind Deblurring

## 論文概要ブラインドのデブラー。中間画像をスクリーニングすることでより良いカーネルを推定する。中間画像中の不要な構造の特定をするために線形劣化モデルが有用であることを示唆。特に大きなブラーに対して優位性を発揮。 ![bib_20220617 00](https://user-images.githubusercontent.com/16313809/174315171-4ce3b172-5465-45ea-9040-80aeecd4a5ef.jpg) https://openaccess.thecvf.com/content/CVPR2022/html/Zhang_Pixel_Screening_Based_Intermediate_Correction_for_Blind_Deblurring_CVPR_2022_paper.html ## Code 未確認。

tkuri

Conference: CVPR

Application: Deblurring

Year: 2022

Infrared Invisible Clothing: Hiding From Infrared Detectors at Multiple Angles in Real World

## 論文概要サーマル(遠赤外線)画像の歩行者検出器を欺くことができる「赤外敵対的衣服」の提案。遠赤外線パターンは印刷するのが困難なため新素材であるエアロジェルを用いて最適なコードパターンの衣服を設計。この衣服を着るとYOLOv3のAPを64.6%下げられ、ステルスすることができる。 ![bib_20220615 00](https://user-images.githubusercontent.com/16313809/173840435-2073d4ab-d703-4ccf-864b-d59122449b62.jpg) https://openaccess.thecvf.com/content/CVPR2022/html/Zhu_Infrared_Invisible_Clothing_Hiding_From_Infrared_Detectors_at_Multiple_Angles_CVPR_2022_paper.html ## Code 未確認。

tkuri

Conference: CVPR

Input: Thermal

Application: Pedestrian Detection

Year: 2022

Context-Aware Video Reconstruction for Rolling Shutter Cameras

## 論文概要ローリングシャッター歪みを補正する。従来手法は等速度・等加速度モーションの仮定から外れた場合にゴーストやアーティファクトを生む。そこで動き解釈モジュールを採用し運動場を推定し、2つのRSフレームを共通のGSフレームにワープさせ、次いでリファインメントをする。 ![bib_20220602 00](https://user-images.githubusercontent.com/16313809/171653261-05a92e43-1a0a-418b-a0d9-dec3814897be.jpg) https://arxiv.org/abs/2205.12912 ## Code https://github.com/GitCVfb/CVR

tkuri

Conference: CVPR

Year: 2022

Semantic Segmentation for Thermal Images: A Comparative Survey

## 論文概要 RGB＋遠赤外線（もしくは遠赤外線のみ）のセマセグの広範なサーベイ。データセットやDeepなセマセグ手法のレビュー等。（遠）赤外線は天候や照明条件に影響されず詳細な情報を捉えることができるため、セグメンテーションの高性能化に大きく寄与する。 ![bib_20220602 00](https://user-images.githubusercontent.com/16313809/171652742-dea3d8c1-e165-4c04-89ee-96e10983292a.jpg) https://arxiv.org/abs/2205.13278 ## Code 未確認

tkuri

Input: Thermal

Year: 2022

Application: Segmentation

Conference: CVPRW

TransforMatcher: Match-to-Match Attention for Semantic Correspondence

## 論文概要 Self-Attentionを用いたFeasibleな計算量のマッチングパイプラインTransforMatcher。Match-to-Match Attentionを導入しマッチ間のグローバルな相互作用を考慮し長距離の関連性を捉える。特に困難な外観の変動下において有効。SPair-71kとPF-PASCALでSOTA。 ![bib_20220530 00](https://user-images.githubusercontent.com/16313809/171008830-22d56fa9-70f5-49cf-9c90-3f31c0564dd8.jpg) https://arxiv.org/abs/2205.11634 ## Code 未確認。

tkuri

Conference: CVPR

Year: 2022

EdgeConv With Attention Module for Monocular Depth Estimation

## 論文概要単眼Depth推定(SIDE)の性能を上げるために、空間上で互いに近い画像パッチ間の関係性をEdgeConvを用いて学習し構造情報を抽出する。SIDEにEdgeConvを用いたのは初。NYUv2とKITTIデータセットでSOTA。 ![bib_20220507 00](https://user-images.githubusercontent.com/16313809/167258750-1bed6e98-eb23-4d91-b633-69b7a0bf5cbb.jpg) https://openaccess.thecvf.com/content/WACV2022/html/Lee_EdgeConv_With_Attention_Module_for_Monocular_Depth_Estimation_WACV_2022_paper.html ## Code 未確認。

tkuri

Conference: WACV

Application: SIDE

Year: 2022

Toward Fast, Flexible, and Robust Low-Light Image Enhancement

## 論文概要低照度環境での撮影において高速かつ柔軟かつロバストな画像強調を実現する新しい学習フレームワーク。自己キャリブモジュールの影響下で各ステージの出力に制約をかけることで多様なシーンに適応能力を付与。暗い環境化での顔検出や夜間の車載セマセグ等への応用が可能。 ![bib_20220503 00](https://user-images.githubusercontent.com/16313809/166466916-25a771a4-db1e-4c6e-a8aa-55fa19d19da2.jpg) https://arxiv.org/abs/2204.10137 ## Code https://github.com/vis-opt-group/SCI

tkuri

Conference: CVPR

Year: 2022

Subject: Self-Supervised

Application: Image Enhancement

SmartPortraits: Depth Powered Handheld Smartphone Dataset of Human Portraits for State Estimation, Reconstruction and Synthesis

## 論文概要スマホとIMUおよび高品質な外部Depthカメラで取得した、人間のポートレートのビデオシーケンス1000からなるデータセット。カメラポーズ推定用途。様々な場所・照明条件で取得され、一定間隔でスマホからフラッシュが照射される。Visual SLAM、SfM等の手法をベンチマーク。 ![bib_20220426 00](https://user-images.githubusercontent.com/16313809/165317389-4d2c7361-d737-4ee2-be89-a5ca1794467b.jpg) https://arxiv.org/abs/2204.10211 ## Code 未確認。 ## Dataset https://mobileroboticsskoltech.github.io/SmartPortraits/

tkuri

Subject: Dataset

Conference: CVPR

Input: LiDAR (dToF/iTOF)

Year: 2022

Application: SLAM

Application: SfM

Novel-View Synthesis of Human Tourist Photos

## 論文概要人物が写った写真の新規ビュー合成を行う。人物と背景を別々に（人物はメッシュで、背景は3次元点群によって）モデル化する。また合成された背景に合わせて人物を再照明するための外観変換手法を提案。 ![bib_20220424 00](https://user-images.githubusercontent.com/16313809/164978886-90566a46-400f-4790-890a-b72472cd9e82.jpg) https://openaccess.thecvf.com/content/WACV2022/html/Freer_Novel-View_Synthesis_of_Human_Tourist_Photos_WACV_2022_paper.html ## Code 未確認。

tkuri

Conference: WACV

Year: 2021

Subject: Neural Renderer

Application: View Synthesis

papers
papers copied to clipboard

Metadata

CRIS: CLIP-Driven Referring Image Segmentation

Pixel Screening Based Intermediate Correction for Blind Deblurring

Infrared Invisible Clothing: Hiding From Infrared Detectors at Multiple Angles in Real World

Context-Aware Video Reconstruction for Rolling Shutter Cameras

Semantic Segmentation for Thermal Images: A Comparative Survey

TransforMatcher: Match-to-Match Attention for Semantic Correspondence

EdgeConv With Attention Module for Monocular Depth Estimation

Toward Fast, Flexible, and Robust Low-Light Image Enhancement

SmartPortraits: Depth Powered Handheld Smartphone Dataset of Human Portraits for State Estimation, Reconstruction and Synthesis

Novel-View Synthesis of Human Tourist Photos

← Metadata

Owner

Metadata

papers papers copied to clipboard

Metadata

← Metadata

Owner

Metadata

papers
papers copied to clipboard