Skip to content

Robotics and Computer Vision Lab

AI in Sensing, AI in Perception, AI in Action

  • About
    • History
    • Photo
    • Admission
  • Members
  • Publications
    • Patents
  • X-Review
  • X-Diary
  • Peer Review

Profile

홍 주영

About Posts
[CVPR 2025] Language-Guided Image Tokenization for Generation
  • Posted on: 07/13/2025 –
  • Comments: 2 Comments
[ECCV 2024] KDProR: A Knowledge-Decoupling Probabilistic Framework for Video-Text Retrieval
  • Posted on: 07/07/2025 –
  • Comments: No Comments
[CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval
  • Posted on: 06/29/2025 –
  • Comments: 4 Comments
[CVPR 2025] Rethinking Noisy Video-Text Retrieval via Relation-aware Alignment
  • Posted on: 06/08/2025 –
  • Comments: 4 Comments
[CVPR 2025] MultiVENT 2.0: A Massive Multilingual Benchmark for Event-Centric Video Retrieval
  • Posted on: 05/26/2025 –
  • Comments: 6 Comments
[ICLR 2025] TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval
  • Posted on: 05/19/2025 –
  • Comments: 6 Comments
[CVPR 2025] Video-ColBERT: Contextualized Late Interaction for Text-to-Video Retrieval
  • Posted on: 05/05/2025 –
  • Comments: 6 Comments
[CVPR 2025] Learning Audio-guided Video Representation with Gated Attention for Video-Text Retrieval
  • Posted on: 04/27/2025 –
  • Comments: 4 Comments
[CVPR 2025] Narrating the Video: Boosting Text-Video Retrieval via Comprehensive Utilization of Frame-Level Captions
  • Posted on: 04/21/2025 –
  • Comments: No Comments
[CVPR 2023] Clover : Towards A Unified Video-Language Alignment and Fusion Model
  • Posted on: 04/14/2025 –
  • Comments: 2 Comments
1 2 … 10 11 Older Posts

Conference Deadline

NEW POST

  • [CoRL 2024] 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations
  • [CVPR 2025] Unbiased Video Scene Graph Generation via Visual and Semantic Dual Debiasing
  • [TPAMI 2018] SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition
  • [NIPS 2024] Introspective Planning: Aligning Robots’ Uncertainty with Inherent Task Ambiguity
  • [ECCV 2024]FreeZe: Training-free zero-shot 6D pose estimation with geometric and vision foundation models

New Comment

  1. 류 지연 on [TPAMI 2018] SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition07/14/2025

    안녕하세요 윤서님 질문 감사합니다. 1. 단어를 구성하는 인접한 character 묶음을 하나의 subword입니다. 예를 들어 subword의 길이를 다음과 같이 정해두고 (l_min…

  2. 정 윤서 on [AAAI 2024](Oral) AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models07/14/2025

    댓글 감사합니다. 물론 인택님이 언급한 것처럼 성능을 보면 어느정도 일반화 가능한 것은 사실이지만, 실제로 논문에서는 simulation한 anomaly sample과 실제 anomaly…

  3. 정 윤서 on [CVPR 2024] PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection07/14/2025

    안녕하세요. 좋은 리뷰 감사합니다. V-V attn은 value 간의 attention만 수행하는 것입니다. 기존 CLIP attention 방식이 보통 CLS 토큰이 모든 patch에…

  4. 손 건화 on [arXiv 2025] Perfecting Depth: Uncetrainty-Aware Enhancement of Metric Depth07/14/2025

    안녕하세요, 리뷰 읽어주셔서 감사합니다. 말씀하신 분산이 높다고 해서 반드시 센서 depth가 신뢰할 수 없다는 뜻은 아닙니다. 실제로 depth 자체는 맞는…

  5. 정 윤서 on [arXiv 2024] Char-SAM: Turning Segment Anything Model into Scene Text Segmentation Annotator with Character-level Visual Prompts07/14/2025

    댓글 감사합니다. 넵 맞습니다. 이론적으로는 SAM말고 다른 segmentation 모델에 적용가능합니다. 다만 본 논문에서 제안된 CBR, CGR 모듈이 SAM 기반으로 설계되었기에…

  • Sign-in
  • RCV-Calendar
  • RCV-Github
  • Paper R/W
    • Arxiv
    • Deadline
    • Overleaf
  • Coding
    • OnlineJudge
    • Kaggle

포기하지 않는 강한 집념 만이 작은 차이를 만든다.

Design by SejongRCV