Paper – Page 30 – Robotics and Computer Vision Lab

[NIPS 2023] Understanding the latent space of diffusion models through the lens of riemannian geometry

안녕하세요, 정의철 연구원입니다. 이번에 소개할 논문은 지난번 KCCV 학회에 참관했을 때 포스터 세션에서 접하게 된 논문인데, 제목은 ‘Latent Space Geometry in Diffusion Models’입니다. 이 논문은…

Paper X-Review

[ECCV 2024] BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos

안녕하세요. 이번주 X-Review는 오랜만에 다시 비디오의 Moment Retrieval task 논문으로 돌아왔습니다. 소개해드릴 BAM-DETR이라는 논문은, 보통 Moment Retrieval과 Highlight Detection이라는 2가지 task를 동시에 수행하는 DETR 기반의…

Paper X-Review

[ECCV 2024] OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation

안녕하세요, 마흔 세번째 x-review 입니다. 이번 논문은 2024년도 ECCV에 게재된 OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation 입니다. 23년도에 처음으로 3D detection에서…

Conference Paper X-Review

[ICLR 2024] VISION TRANSFORMERS NEED REGISTERS

이번 논문은 Vision Transformers 기반 large model의 특징 표현력을 향상시키기 위해 원인을 찾아 분석하고 이에 대한 해결책을 제시한 논문입니다. 해당 기법에 주목하게 된 계기는 Vision…

Paper X-Review

[ICLR 2022] Open-Vocabulary Object Detection via Vision and Language Knowledge Distillation

오늘 가져온 논문은 Open-Vocabulary Object Detection 분야의 논문입니다. 일반적인 Detection 모델과는 달리, 임의로 주어지는 text input 에 해당하는 object를 이미지 내에서 찾는 task 이죠.현존하는 Object…

Paper X-Review

[arXiv 2024] UniMEEC: Towards Unified Multimodal Emotion Recognition and Emotion Cause

pdf 안녕하세요. 오늘 가져온 논문은 MER(Multimodal Emotion Recognition)을 팔로업 하는 사람들이면 알법한 논문인 UniMSE의 저자의 후속 논문입니다. UniMSE가 2022년도 논문인데 최근 논문을 보니 계속해서 affecting…

Paper X-Review

[CVPR 2022] Entropy-based Active Learning for Object Detection with Progressive Diversity Constraint

오랜만에 Object Detection을 위한 Active Learning 논문을 리뷰해보겠습니다. Multi-class가 존재하는 object detection 태스크에서, 보다 정확하고 균일한 데이터셋을 선택하는 방식을 제안한 연구입니다. Conference: CVPR 2022 Title:…

News Paper X-Review

[CVPR2022] Grounded Language-Image Pre-training(GLIP)

안녕하세요. 오늘 소개 시켜드릴 논문은 Grounded Language-Image Pre-training이란 논문으로 VLM분야의 foundation모델로 GLIP이란 모델과 학습법을 제안한 논문이 되겠습니다. 해당 논문을 읽게 된 이유는 센서과제에서 학습 때…

Paper X-Review

[NeurIPS 2022] Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture of Experts

안녕하세요. 이번 주 X-Review에서는 22년도 NeurIPS에 게재된 논문 <Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture of Experts>를 소개해드리겠습니다. 본 논문은 현재 구글 딥마인드로 병합된…

Paper X-Review

[CVPR 2023] CLIP the Gap: A Single Domain Generalization Approach for Object Detection

오늘 리뷰할 논문은 Single Domain Generalization (SDG) 분야의 논문입니다.일반적인 Domain Generalization 에서는 여러 source dataset을 사용하는 데에 반해, 본 SDG 분야에서는 단일 source dataset만을 사용해서…

Category: Paper

[NIPS 2023] Understanding the latent space of diffusion models through the lens of riemannian geometry

[ECCV 2024] BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos

[ECCV 2024] OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation

[ICLR 2024] VISION TRANSFORMERS NEED REGISTERS

[ICLR 2022] Open-Vocabulary Object Detection via Vision and Language Knowledge Distillation

[arXiv 2024] UniMEEC: Towards Unified Multimodal Emotion Recognition and Emotion Cause

[CVPR 2022] Entropy-based Active Learning for Object Detection with Progressive Diversity Constraint

[CVPR2022] Grounded Language-Image Pre-training(GLIP)

[NeurIPS 2022] Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture of Experts

[CVPR 2023] CLIP the Gap: A Single Domain Generalization Approach for Object Detection

Conference Deadline

NEW POST

New Comment