Robotics and Computer Vision Lab

AI in Sensing, AI in Perception, AI in Action

Profile

천 혜원

Posts
[IEEE Trans Affect Comput 2022] Multitask Learning From Augmented Auxiliary Data for Improving Speech Emotion Recognition
  • Posted on: 06/09/2024
  • Comments: 4
[ICASSP 2024] Improving Speech Emotion Recognition with Unsupervised Speaking Style Transfer
  • Posted on: 06/02/2024
  • Comments: 3
[ICASSP 2022] Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction
  • Posted on: 05/28/2024
  • Comments: 4
[ICASSP 2024] RaD-Net: A Repairing and Denoising Network for Speech Signal Improvement
  • Posted on: 05/20/2024
  • Comments: 8
[INTERSPEECH 2021] Rethinking Evaluation in ASR: Are Our Models Robust Enough?
  • Posted on: 05/06/2024
  • Comments: 1
[Interspeech 2023] Episodic Memory For Domain-Adaptable, Robust Speech Emotion Recognition
  • Posted on: 04/29/2024
  • Comments: 1
[ICCV 2023] Boosting Multi-modal Model Performance with Adaptive Gradient Modulation
  • Posted on: 04/17/2024
  • Comments: 1
[NAACL 2022] Analyzing Modality Robustness in Multimodal Sentiment Analysis
  • Posted on: 03/17/2024
  • Comments: 2
[CVPR 2022] Balanced Multimodal Learning via On-the-fly Gradient Modulation
  • Posted on: 03/03/2024
  • Comments: 1
[ICLR 2017] Pruning Filters for Efficient ConvNets
  • Posted on: 02/04/2024
  • Comments: 4