
Robotics and Computer Vision Lab

AI in Sensing, AI in Perception, AI in Action


Profile

홍 주영

About Posts
[CVPR 2020] End-to-End Learning of Visual Representations from Uncurated Instructional Videos
  • Posted on: 04/07/2025
  • Comments: 2 Comments
[2022 Neurocomputing] CLIP4Clip: An empirical study of CLIP for end to end video clip retrieval and captioning
  • Posted on: 03/31/2025
  • Comments: 6 Comments
[Arxiv 2024] Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation
  • Posted on: 02/02/2025
  • Comments: 2 Comments
[EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
  • Posted on: 01/19/2025
  • Comments: 2 Comments
[CVPR 2024] MAFA: Managing False Negatives for Vision-Language Pre-training
  • Posted on: 01/13/2025
  • Comments: 2 Comments
[EMNLP 2024] Investigating and Mitigating Object Hallucinations in Pretrained Vision-Language (CLIP) Models
  • Posted on: 01/12/2025
  • Comments: 2 Comments
[홍주영] Looking Back on 2024
  • Posted on: 12/30/2024
  • Comments: No Comments
[ECCV 2024] HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts
  • Posted on: 11/03/2024
  • Comments: 2 Comments
[ECCV 2024] Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention
  • Posted on: 09/30/2024
  • Comments: No Comments
[CVPR 2022] Grounded Language-Image Pre-training
  • Posted on: 09/09/2024
  • Comments: 8 Comments

