Skip to content

Robotics and Computer Vision Lab

AI in Sensing, AI in Perception, AI in Action

  • About
    • History
    • Photo
    • Admission
  • Members
  • Publications
    • Patents
  • X-Review
  • X-Diary
  • Peer Review

Profile

홍 주영

About Posts
[ICCV 2023] Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval
  • Posted on: 10/19/2025 –
  • Comments: 2 Comments
[ICCV 2023] UATVR: Uncertainty-Adaptive Text-Video Retrieval
  • Posted on: 10/12/2025 –
  • Comments: 2 Comments
[CVPR 2025] SmartCLIP: Modular Vision-language Alignment with Identification Guarantees
  • Posted on: 09/21/2025 –
  • Comments: 6 Comments
[Arxiv 2025] GAID: Frame-Level Gated Audio-Visual Integration with Directional Perturbation for Text-Video Retrieval
  • Posted on: 09/13/2025 –
  • Comments: 4 Comments
[ICCV 2025] MobileViCLIP: An Efficient Video-Text Model for Mobile Devices
  • Posted on: 09/06/2025 –
  • Comments: 2 Comments
[ICCV 2023] Audio-Enhanced Text-to-Video Retrieval using Text-Conditioned Feature Alignment
  • Posted on: 08/31/2025 –
  • Comments: 12 Comments
[ICCV 2025] DynImg: Key Frames with Visual Prompts are Good Representation for Multi-Modal Video Understanding
  • Posted on: 08/18/2025 –
  • Comments: 3 Comments
[NAACL 2025] DREAM: Improving Video-Text Retrieval Through Relevance-Based Augmentation Using Large Foundation Models
  • Posted on: 08/11/2025 –
  • Comments: 2 Comments
[ICCV 2025] Everything is a Video: Unifying Modalities through Next-Frame Prediction
  • Posted on: 07/28/2025 –
  • Comments: 8 Comments
2025년 상반기 회고문 @홍주영
  • Posted on: 07/21/2025 –
  • Comments: 2 Comments
1 2 … 11 12 Older Posts

Conference Deadline

NEW POST

  • 2025 자율주행 인공지능 챌린지 후기
  • [NeurIPS 2025]Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension
  • [WACV 2025] DDS: Decoupled Dynamic Scene-Graph Generation Network
  • [CoRL 2025(Oral)] SAVOR: Skill Affordance Learning from Visuo-Haptic Perception for Robot-Assisted Bite Acquisition
  • [IROS 2025] Empirical Analysis of Sim-and-Real Cotraining of Diffusion Policies for Planar Pushing from Pixels

New Comment

  1. 권 석준 on [ICRA 2025] HeLiOS: Heterogeneous LiDAR Place Recognition via Overlap-based Learning and Local Spherical Transformer10/29/2025

    댓글 감사합니다. 이해하신 과정이 맞습니다. Descriptor 라는 것은 '현재 입력으로 들어간 이미지/point clouds 데이터를 대표하는 global vector' 라고 생각하시면 됩니다.…

  2. 권 석준 on [ICRA 2025] HeLiOS: Heterogeneous LiDAR Place Recognition via Overlap-based Learning and Local Spherical Transformer10/29/2025

    댓글 감사합니다. A1: 네, 2D image에서 H*W 패치를 나누어 입력하는 개념과 유사합니다. 본 논문에서는 3D 공간을 다루기에 x*y*z 세 축이…

  3. 황 유진 on [ACCV2024]Vision language models are blind: Failing to translate detailed visual features into words10/27/2025

    안녕하세요 질문 감사드립니다 우선 해당 결과는 학습 데이터 편향으로 보시면 좋을 것 같습니다. late fusion 구조의 한계란, VLM 모델이 질문에…

  4. 황 유진 on [ACCV2024]Vision language models are blind: Failing to translate detailed visual features into words10/27/2025

    안녕하세요 질문 감사드립니다 먼저 윗 질문에 대해서는 확인하지 못한 것 같습니다. 다음 질문에 대해서도 말씀드리자면 본 논문은 기존에 지각하지 못했던…

  5. 황 유진 on [ACL Findings 2025] Detecting and Mitigating Challenges in Zero-Shot Video Summarization with Video LLMs10/27/2025

    안녕하세요 질문 감사드립니다 본 논문에서는 1)NLP 분야에서 두 문장(정답/예측)간의 단어적 겹침 정도를 평가하는 Syntactic similarity 지표인(R1/R2/RL/Recall/F1-score)와, 2)LLM을 기반으로 맥락적 유사도를…

  • Sign-in
  • RCV-Calendar
  • RCV-Github
  • Paper R/W
    • Arxiv
    • Deadline
    • Overleaf
  • Coding
    • OnlineJudge
    • Kaggle

포기하지 않는 강한 집념 만이 작은 차이를 만든다.

Design by SejongRCV