Robotics and Computer Vision Lab

AI in Sensing, AI in Perception, AI in Action

Profile

홍 주영

Posts
[CVPR 2020] End-to-End Learning of Visual Representations from Uncurated Instructional Videos
  • Posted on: 04/07/2025
  • Comments: 2 Comments
[Neurocomputing 2022] CLIP4Clip: An empirical study of CLIP for end to end video clip retrieval and captioning
  • Posted on: 03/31/2025
  • Comments: 6 Comments
[arXiv 2024] Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation
  • Posted on: 02/02/2025
  • Comments: 2 Comments
[EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
  • Posted on: 01/19/2025
  • Comments: 2 Comments
[CVPR 2024] MAFA: Managing False Negatives for Vision-Language Pre-training
  • Posted on: 01/13/2025
  • Comments: 2 Comments
[EMNLP 2024] Investigating and Mitigating Object Hallucinations in Pretrained Vision-Language (CLIP) Models
  • Posted on: 01/12/2025
  • Comments: 2 Comments
[홍주영] Looking Back on 2024
  • Posted on: 12/30/2024
  • Comments: No Comments
[ECCV 2024] HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts
  • Posted on: 11/03/2024
  • Comments: 2 Comments
[ECCV 2024] Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention
  • Posted on: 09/30/2024
  • Comments: No Comments
[CVPR 2022] Grounded Language-Image Pre-training
  • Posted on: 09/09/2024
  • Comments: 8 Comments
Only a strong tenacity that never gives up makes the small difference.