Robotics and Computer Vision Lab

AI in Sensing, AI in Perception, AI in Action

Profile

홍 주영

Posts
[CVPR 2025] Learning Audio-guided Video Representation with Gated Attention for Video-Text Retrieval
  • Posted on: 04/27/2025
  • Comments: 4 Comments
[CVPR 2025] Narrating the Video: Boosting Text-Video Retrieval via Comprehensive Utilization of Frame-Level Captions
  • Posted on: 04/21/2025
  • Comments: No Comments
[CVPR 2023] Clover: Towards A Unified Video-Language Alignment and Fusion Model
  • Posted on: 04/14/2025
  • Comments: 2 Comments
[CVPR 2020] End-to-End Learning of Visual Representations from Uncurated Instructional Videos
  • Posted on: 04/07/2025
  • Comments: 2 Comments
[2022 Neurocomputing] CLIP4Clip: An empirical study of CLIP for end to end video clip retrieval and captioning
  • Posted on: 03/31/2025
  • Comments: 6 Comments
[arXiv 2024] Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation
  • Posted on: 02/02/2025
  • Comments: 2 Comments
[EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
  • Posted on: 01/19/2025
  • Comments: 2 Comments
[CVPR 2024] MAFA: Managing False Negatives for Vision-Language Pre-training
  • Posted on: 01/13/2025
  • Comments: 2 Comments
[EMNLP 2024] Investigating and Mitigating Object Hallucinations in Pretrained Vision-Language (CLIP) Models
  • Posted on: 01/12/2025
  • Comments: 2 Comments
[홍주영] Looking Back on 2024
  • Posted on: 12/30/2024
  • Comments: No Comments