Robotics and Computer Vision Lab

AI in Sensing, AI in Perception, AI in Action

Profile

홍 주영

Posts
[CVPR 2025] Language-Guided Image Tokenization for Generation
  • Posted on: 07/13/2025
  • Comments: 4 Comments
[ECCV 2024] KDProR: A Knowledge-Decoupling Probabilistic Framework for Video-Text Retrieval
  • Posted on: 07/07/2025
  • Comments: No Comments
[CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval
  • Posted on: 06/29/2025
  • Comments: 4 Comments
[CVPR 2025] Rethinking Noisy Video-Text Retrieval via Relation-aware Alignment
  • Posted on: 06/08/2025
  • Comments: 4 Comments
[CVPR 2025] MultiVENT 2.0: A Massive Multilingual Benchmark for Event-Centric Video Retrieval
  • Posted on: 05/26/2025
  • Comments: 6 Comments
[ICLR 2025] TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval
  • Posted on: 05/19/2025
  • Comments: 6 Comments
[CVPR 2025] Video-ColBERT: Contextualized Late Interaction for Text-to-Video Retrieval
  • Posted on: 05/05/2025
  • Comments: 6 Comments
[CVPR 2025] Learning Audio-guided Video Representation with Gated Attention for Video-Text Retrieval
  • Posted on: 04/27/2025
  • Comments: 4 Comments
[CVPR 2025] Narrating the Video: Boosting Text-Video Retrieval via Comprehensive Utilization of Frame-Level Captions
  • Posted on: 04/21/2025
  • Comments: No Comments
[CVPR 2023] Clover: Towards A Unified Video-Language Alignment and Fusion Model
  • Posted on: 04/14/2025
  • Comments: 2 Comments