X-Review – Page 2 – Robotics and Computer Vision Lab

[CVPR 2026] SRA-Det: Learning Omni-Grained Open-Vocabulary Detection Beyond Category Names

Abstract: OVD aims to detect objects described by arbitrary text, but most methods operate at a coarse level and struggle with fine-grained attributes. This paper, from both the model and the data perspective…

[ICCV 2023] Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture

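Hello, this is 손우진. The paper I am reviewing this time is I-JEPA, a self-supervised learning method. JEPA is an approach led by Yann LeCun, so it drew a lot of attention from the moment it was announced, and while attending ICML 2026 recently…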

X-Review

[ICRA 2026] MIDAS Hand: Modular low-Impedance Direct-driven Anthropomorphic Sensing Hand

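Hello, this is 최인하. The paper I am reviewing this time is a hand hardware paper. I went back and forth on whether to review it at all, but as for what sets it apart from existing hand hardware enough to get accepted at ICRA, I was very…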

X-Review

[ICML 2026] Omni-Perception Policy Optimization for Multimodal Emotion Reasoning

Hello. This time I read the paper Omni-Perception Policy Optimization for Multimodal Emotion Reasoning. When an emotion AI infers emotion by looking at a person's facial expression, voice, and the content of their speech together, this paper…

Paper X-Review

[ICML 2026] Position Is All You Need: A Free Lunch Token Compression Strategy for MLLM-based Referring Expression Segmentation

Hello. Among the poster papers I saw while attending ICML, one came from a group working in the same area as my own research, so I am bringing it here. Previously, the VQA task…

X-Review

[ICML 2026] Think in Latent, Explain in Language: Self-Explainable Latent Reasoning

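Hello. Today I would like to introduce a paper from ICML 2026 that covers an interesting topic. I have been following latent reasoning with interest lately, and since ICML had a related paper, I read it and ended up reviewing…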

X-Review

[ICRA 2026] Spatially-anchored Tactile Awareness for Robust Dexterous Manipulation

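Hello. The paper I read this time is about tactile sensors. Tactile sensors are going to be used in the humanoid demonstration project we are carrying out. I have long been interested in the topic, but as for actually using them in training…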

Paper X-Review

[ICML 2026] Plug-and-Play Label Map Diffusion for Universal Goal-Oriented Navigation

Hello. The paper I have brought for review this time is Plug-and-Play Label Map Diffusion for Universal Goal-Oriented Navigation, which appeared at ICML 2026. As for the core idea of this paper, the ones I have reviewed so far…

Paper X-Review

[ICML 2026] VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding

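This paper provides a newly collected, high-quality QA benchmark dataset and, by showing that model performance improves when it is used, demonstrates that a well-constructed dataset is essential to improving model performance…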

X-Review

[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving

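Introduction: To drive autonomously, a system must first perceive its surroundings, predict future motion, and then plan its own path based on that. Let's take a look at the well-known autonomous driving methods that came before this paper (UniAD,…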
