Paper – Page 51 – Robotics and Computer Vision Lab

[CVPR 2018] Pyramid Stereo Matching Network

안녕하세요, 열아홉 번째 X-Review입니다. 이번 논문은 2018년도 ICCV에 게재된 Pyramid Stereo Matching논문입니다. 그럼 바로 리뷰 시작하겠습니다 ! 1. Introduction Stereo Matching은 서로 다른 시점의 stereo…

Paper X-Review

[RA-L 2022]MP6D: An RGB-D Dataset for Metal Parts’ 6D Pose Estimation

안녕하세요, 이번에도 6D pose estimation을 위한 데이터셋을 구성하기 위해 데이터셋 논문을 읽어보았습니다. 이번 논문의 핵심은 GT pose를 구성하였을 때, 좀 더 정확한 GT pose 정보를…

Paper X-Review

[CVPR 2023] Clover: Towards A Unified Video-Language Alignment and Fusion Model

이 논문의 주요 키워드 Universal Video-Language Pre-training Multi-modal Fusion & Alignment Semantic Enhanced Masked Language Modeling 이 논문을 깊게 이해하려면 다음 지식이 필요합니다. Multi-modal contrastive…

Paper X-Review

[ICCV-2021]Emerging Properties in Self-Supervised Vision Transformers

안녕하세요, 열여덟 번째 X-Review입니다. 이번 논문은 2021년도 ICCV에 게재된 Emerging Properties in Self-Supervised Vision Transformers 논문입니다. 그럼 바로 리뷰 시작하겠습니다 ! 1. Introduction Transformer는 visual…

Paper X-Review

[CVPR 2019] RepMet: Representative-based metric learning for classification and few-shot object detection

안녕하세요. 스물 두번째 리뷰입니다. 최근 작성 중인 Pedestrian Detection과는 별도로, 논문은 Few-shot (One or Few) Object Detection에 대한 관한 논문입니다. Few-shot Classification에 관한 연구는 성황리에…

Paper X-Review

[CVPR 2023] CompletionFormer: Depth Completion with Convolutions and Vision Transformers

안녕하세요, 열여덟번째 x-review 입니다. 이번 논문은 2023년도 CVPR에 게재된 CompletionFormer으로 컨볼루션과 트랜스포머를 함께 사용하는 Depth Completion 논문 입니다. 그럼 바로 리뷰 시작하겠습니다 ! 1. Introduction…

Paper X-Review

[2022 IROS] 6-DoF Pose Estimation of Household Objects for Robotic Manipulation: An Accessible Dataset and Benchmark

안녕하세요, 이번에도 6D pose estimation 관련 논문입니다. 데이터셋을 취득하기 위해 기존 데이터셋이 어떻게 물체를 정의하였는지, 어떤 시나리오로 구성하였는지, Annotation은 어떻게 했는지에 대해 아이디어를 제공 받기…

Paper X-Review

[IEEE Wireless Communication 2018] Active Learning for Wireless IoT Intrusion Detection

안녕하세요, 허재연입니다. 요즘 6종 데이터셋에 대한 통일된 Active Learning 적용을 주제로 한 논문을 작성하고자 하고 있습니다. 6종 데이터 중 하나가 IoT(사물 인터넷) 데이터인데, 이와 관련된…

Paper X-Review

[ICCV 2023] UATVR: Uncertainty-Adaptive Text-Video Retrieval

이번 주차 X-Review는 23년도 ICCV에 게재된 <UATVR: Uncertainty-Adaptive Text-Video Retrieval>이라는 논문입니다. 중국 바이두에서 연구된 논문이네요. Text-Video Retrieval(이하 TVR)이라는 task는 비디오와 text 두 모달 간 공통의…

Paper X-Review

[MM 2022] X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval

이런 분들께 이 논문을 추천드립니다. CLIP을 비디오에 적용하는 방식에 흥미가 있으신 분 Video Text Retrieval에서 fine-grained와 coarse-grained를 모두 활용하는 cross-grained 방식이 궁굼하신 분 이 논문을…

Category: Paper

[CVPR 2018] Pyramid Stereo Matching Network

[RA-L 2022]MP6D: An RGB-D Dataset for Metal Parts’ 6D Pose Estimation

[CVPR 2023] Clover: Towards A Unified Video-Language Alignment and Fusion Model

[ICCV-2021]Emerging Properties in Self-Supervised Vision Transformers

[CVPR 2019] RepMet: Representative-based metric learning for classification and few-shot object detection

[CVPR 2023] CompletionFormer: Depth Completion with Convolutions and Vision Transformers

[2022 IROS] 6-DoF Pose Estimation of Household Objects for Robotic Manipulation: An Accessible Dataset and Benchmark

[IEEE Wireless Communication 2018] Active Learning for Wireless IoT Intrusion Detection

[ICCV 2023] UATVR: Uncertainty-Adaptive Text-Video Retrieval

[MM 2022] X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval

Conference Deadline

NEW POST

New Comment