Robotics and Computer Vision Lab

최 인하 on [ICML 2021] Learning Transferable Visual Models From Natural Language Supervision11/02/2025
안녕하세요 찬미님. 평소에 관심이 있던 논문이었는데 좋은 리뷰 감사합니다! 읽으면서 이해가 잘 안되는 부분 질문 드리겠습니다! 전문화되거나 추상적인 task에서 약한…
권 석준 on [ICRA 2025] HeLiOS: Heterogeneous LiDAR Place Recognition via Overlap-based Learning and Local Spherical Transformer10/29/2025
댓글 감사합니다. 이해하신 과정이 맞습니다. Descriptor 라는 것은 '현재 입력으로 들어간 이미지/point clouds 데이터를 대표하는 global vector' 라고 생각하시면 됩니다.…
권 석준 on [ICRA 2025] HeLiOS: Heterogeneous LiDAR Place Recognition via Overlap-based Learning and Local Spherical Transformer10/29/2025
댓글 감사합니다. A1: 네, 2D image에서 H*W 패치를 나누어 입력하는 개념과 유사합니다. 본 논문에서는 3D 공간을 다루기에 x*y*z 세 축이…
황 유진 on [ACCV2024]Vision language models are blind: Failing to translate detailed visual features into words10/27/2025
안녕하세요 질문 감사드립니다 우선 해당 결과는 학습 데이터 편향으로 보시면 좋을 것 같습니다. late fusion 구조의 한계란, VLM 모델이 질문에…
황 유진 on [ACCV2024]Vision language models are blind: Failing to translate detailed visual features into words10/27/2025
안녕하세요 질문 감사드립니다 먼저 윗 질문에 대해서는 확인하지 못한 것 같습니다. 다음 질문에 대해서도 말씀드리자면 본 논문은 기존에 지각하지 못했던…

Recent Posts

[ICCV 2023] HiTeA: Hierarchical Temporal-Aware Video-Language Pre-training

ORCA: An open-Source, Reliable, Cost-Effective, Anthropomorphic Robotic Hand for Uninterrupted Dexterous Task Learning

2025 자율주행 인공지능 챌린지 후기

[NeurIPS 2025]Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension

[WACV 2025] DDS: Decoupled Dynamic Scene-Graph Generation Network

[CoRL 2025(Oral)] SAVOR: Skill Affordance Learning from Visuo-Haptic Perception for Robot-Assisted Bite Acquisition

[IROS 2025] Empirical Analysis of Sim-and-Real Cotraining of Diffusion Policies for Planar Pushing from Pixels

[ArXiv 2025] VLA-0: Building State-of-the-Art VLAs with Zero Modification

[ICRA 2025] HeLiOS: Heterogeneous LiDAR Place Recognition via Overlap-based Learning and Local Spherical Transformer

[ICCV 2023] Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval

Conference Deadline

NEW POST

New Comment