Skip to content

Robotics and Computer Vision Lab

AI in Sensing, AI in Perception, AI in Action

  • About
    • History
    • Photo
    • Admission
  • Members
  • Publications
    • Patents
  • X-Review
  • X-Diary
  • Peer Review

Profile

황 유진

About Posts
[CVPR2024] Towards Automated Movie Trailer Generation
  • Posted on: 07/07/2025 –
  • Comments: 6 Comments
[CVPR2023]Causalainer: Causal Explainer for Automatic Video Summarization
  • Posted on: 06/30/2025 –
  • Comments: 2 Comments
[CVPR2023]Align and Attend: Multimodal Summarization with Dual Contrastive Losses
  • Posted on: 06/09/2025 –
  • Comments: 6 Comments
[arXiv2025]Video Summarization with Large Language Models
  • Posted on: 05/26/2025 –
  • Comments: 2 Comments
[AAAI2024]V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning
  • Posted on: 05/19/2025 –
  • Comments: 4 Comments
[NeurIPS2021]CLIP-It! Language-Guided Video Summarization
  • Posted on: 05/12/2025 –
  • Comments: 4 Comments
[CVPR2024]Scaling Up Video Summarization Pretraining with Large Language Models
  • Posted on: 05/05/2025 –
  • Comments: 11 Comments
[arXiv 2025] Video-T1: Test-Time Scaling for Video Generation
  • Posted on: 04/21/2025 –
  • Comments: 4 Comments
[arXiv 2025]Video-R1: Reinforcing Video Reasoning in MLLMs
  • Posted on: 04/07/2025 –
  • Comments: 4 Comments
[PMLR 2020]Being Bayesian, Even Just a Bit, Fixes Overconfidence in ReLU Networks
  • Posted on: 03/17/2025 –
  • Comments: 4 Comments
Newer Posts 1 2 3 … 14 15 Older Posts

Conference Deadline

NEW POST

  • 2025 자율주행 인공지능 챌린지 후기
  • [NeurIPS 2025]Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension
  • [WACV 2025] DDS: Decoupled Dynamic Scene-Graph Generation Network
  • [CoRL 2025(Oral)] SAVOR: Skill Affordance Learning from Visuo-Haptic Perception for Robot-Assisted Bite Acquisition
  • [IROS 2025] Empirical Analysis of Sim-and-Real Cotraining of Diffusion Policies for Planar Pushing from Pixels

New Comment

  1. 권 석준 on [ICRA 2025] HeLiOS: Heterogeneous LiDAR Place Recognition via Overlap-based Learning and Local Spherical Transformer10/29/2025

    댓글 감사합니다. 이해하신 과정이 맞습니다. Descriptor 라는 것은 '현재 입력으로 들어간 이미지/point clouds 데이터를 대표하는 global vector' 라고 생각하시면 됩니다.…

  2. 권 석준 on [ICRA 2025] HeLiOS: Heterogeneous LiDAR Place Recognition via Overlap-based Learning and Local Spherical Transformer10/29/2025

    댓글 감사합니다. A1: 네, 2D image에서 H*W 패치를 나누어 입력하는 개념과 유사합니다. 본 논문에서는 3D 공간을 다루기에 x*y*z 세 축이…

  3. 황 유진 on [ACCV2024]Vision language models are blind: Failing to translate detailed visual features into words10/27/2025

    안녕하세요 질문 감사드립니다 우선 해당 결과는 학습 데이터 편향으로 보시면 좋을 것 같습니다. late fusion 구조의 한계란, VLM 모델이 질문에…

  4. 황 유진 on [ACCV2024]Vision language models are blind: Failing to translate detailed visual features into words10/27/2025

    안녕하세요 질문 감사드립니다 먼저 윗 질문에 대해서는 확인하지 못한 것 같습니다. 다음 질문에 대해서도 말씀드리자면 본 논문은 기존에 지각하지 못했던…

  5. 황 유진 on [ACL Findings 2025] Detecting and Mitigating Challenges in Zero-Shot Video Summarization with Video LLMs10/27/2025

    안녕하세요 질문 감사드립니다 본 논문에서는 1)NLP 분야에서 두 문장(정답/예측)간의 단어적 겹침 정도를 평가하는 Syntactic similarity 지표인(R1/R2/RL/Recall/F1-score)와, 2)LLM을 기반으로 맥락적 유사도를…

  • Sign-in
  • RCV-Calendar
  • RCV-Github
  • Paper R/W
    • Arxiv
    • Deadline
    • Overleaf
  • Coding
    • OnlineJudge
    • Kaggle

포기하지 않는 강한 집념 만이 작은 차이를 만든다.

Design by SejongRCV