X-Review – Page 59 – Robotics and Computer Vision Lab

[arXiv 2024] Emotion-LLaMA

Hello. I've brought a truly hot-off-the-press MER (Multimodal Emotion Recognition) paper. It was posted to arXiv in June and, like EMER, is currently under review. As I started looking into the new field of MER…

Paper X-Review

[ICCV 2023] Multi-modal 3D Object Detection with Object-Centric Fusion

Hello, this is my thirty-ninth X-Review. This time the paper is Multi-modal 3D Object Detection with Object-Centric Fusion, published at ICCV 2023. Let's get straight into the review! 1. Introduction 3D…

Paper X-Review

[CVPR 2024] Unified Entropy Optimization for Open-Set Test-Time Adaptation

Hello. The paper I'm reviewing today is an open-set TTA paper presented at CVPR 2024. Although its experiments are limited to classification, the open-set keyword drew me in, so I gave it a read. Let's start the review right away…

Paper X-Review

[arXiv 2023] Open World Object Detection in the Era of Foundation Models (FOMO)

Hello. This week's paper is FOMO, an Open World Object Detection (OWOD) paper that makes use of a foundation model. Following the lab's basic training in the first half of 2024, and ultimately with the robotics team…

Paper X-Review

[CVPR 2024] Plug and Play Active Learning for Object Detection

Recommended readers: those who are interested in Active Learning research and want to extend it to Object Detection research. Contribution: Generalized Method. As the title indicates, this is a plug-and-play method, and various object…

X-Review

[CVPR 2024] Retrieval-Augmented Open-Vocabulary Object Detection

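Hello. The paper I'm reviewing today is Retrieval-Augmented Open-Vocabulary Object Detection, which addresses the OVOD task. While digging through CVPR papers, I came across a paper written by Korea University and Samsung Research on the OVOD task I had been curious about, so OVOD…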

X-Review

[ICCV 2019] Rethinking ImageNet Pre-Training

Hello, this is Jaeyeon Heo (허재연). Lately I have been thinking about how to initialize detection weights well using only the KAIST PD dataset. The size of the KAIST dataset, compared with ImageNet, is considerably…

Conference X-Review

[NeurIPS 2023] Visual Instruction Tuning

As I announced at the CVPR seminar, starting this week I plan to review multi-modal (text, image) models. First up is LLaVA, a vision-language model built on LLaMA, Meta's LLM. Conference: NeurIPS…

News Paper X-Review

[CVPR 2022] Crafting Better Contrastive Views for Siamese Representation Learning

Hello, this is researcher Euicheol Jeong (정의철). The paper I'm introducing this time is 'Crafting Better Contrastive Views for Siamese Representation Learning', published at CVPR 2022. This paper, for the two views in contrastive learning…

X-Review

[ICCV 2023] SurroundOcc: Multi-camera 3D Occupancy Prediction for Autonomous Driving

Hello. The paper I'm reviewing today is SurroundOcc: Multi-camera 3D Occupancy Prediction for Autonomous Driving, published at ICCV 2023. Before starting the review, to briefly introduce what this paper sets out to do,…
