Paper – Page 81 – Robotics and Computer Vision Lab

[ICASSP 2020] Multi-Conditioning and Data Augmentation Using Generative Noise Model for Speech Emotion Recognition in Noisy Conditions

이번에도 Speech Emotion Recognition (SER) 관련 논문입니다. 음성 인식 분야에서는 노이즈(잡음)가 모델의 성능에 영향을 끼치는 중요한 요인 중 하나입니다. 본 논문은 ‘생성 모델’을 사용하여 만든…

Paper X-Review

[ICLR2020] PSEUDO-LIDAR++: ACCURATE DEPTH FOR 3D OBJECT DETECTION IN AUTONOMOUS DRIVING

신정민 군이 Pseudo-Lidar v1을 리뷰를 해서 곧 v2를 할거 같은 예감이 들어 호다닥 제가 작성해버리는 리뷰가 되겠습니다. 인터셉트… ㅎ 먼저 개요는 pseudo-Lidar와 동일합니다. Lidar는 비싸며…

Paper X-Review

[CVPR 2022] Hierarchical Self-supervised Representation Learning for Movie Understanding

오늘 리뷰할 논문은 이번 CVPR 2022에 게재승인된 “Hierarchical Self-supervised Representation Learning for Movie Understanding”이라는 논문입니다. 영화와 같이 길이가 길고 여러 이벤트가 얽혀있는 비디오에 대해 self-supervised…

Paper X-Review

[ICCV 2021] An Empirical Study of Training Self-Supervised Vision Transformers

정말 오랜만에 엑스리뷰를 쓰게 되었네요. 오랜만에 돌아온 만큼, 지난 리뷰도 다시 상기시키면서 최신 방법론까지 익힐 수 있는 논문에 대한 리뷰를 해보려고 합니다. 바로 self-supervised leaning…

Paper X-Review

[CVPR 2022] Unsupervised Pre-training for Temporal Action Localization Tasks

Before Review 오래간만에 논문 리뷰입니다. 지난 아듀 세미나에서 소개했던 Temporal Localization과 관련된 논문입니다. 논문의 컨셉 자체는 제목에도 적혀있지만, Temporal Action Localization Task를 위한 비지도 학습 기반 사전학습 방법입니다. 당연한 방향이기도 하고 저도 작년에…

Paper X-Review

Optimization Theory (Convex Optimization Problems)

이제 본격적으로 최적화에 대해서 알아보도록 하겠습니다. 이전까지는 Convex Set이 무엇인지 그리고 Convex Function이 무엇인지 알아보는 과정을 거쳤습니다. 그러한 지식을 바탕으로 이제 최적화가 무엇이고 그 중…

Paper X-Review

[EAIS 2020] Emotions Understanding Model from Spoken Language using Deep Neural Networks and Mel-Frequency Cepstral Coefficients

음성으로부터 사람의 감정을 인식하는 문제, Speech Emotion Recognition (SER) 관련 논문입니다. 본 논문의 핵심 아이디어는 CNN 기반 모델을 이용하여 SER 문제를 해결하는 것입니다. 해당 모델은…

News Paper X-Review

[arXiv2015]Particular object retrieval with integral max-pooling of CNN activations

Abstract CNN feature를 이용한 이미지 representation 은 기존의 short-vector represnetation방식보다 좋은 성능을 낸다. 그러나 기하학적 정보가 필요한 re-ranking방식과 호환되지 않으며, 정확한 descriptor매칭, 기하학적 re-ranking, 또는…

Paper X-Review

[CVPR2022] MonoDTR: Monocular 3D Object Detection with Depth-Aware Transformer

TransDSSL 논문을 작성하면서 다음 연구주제로 생각하고 있던 것은 Self-supervised 로 Depth estimation을 한 후, 예측한 Depth를 Pseudo-LiDAR로 사용해서 3D object detection을 하는 것입니다. 따라서 현재…

Paper X-Review

[CVPR 2022] End-to-End Compressed Video Representation Learning for Generic Event Boundary Detection

이번에 가져온 리뷰는 “Event boundary detection”입니다. CVPR 2022 논문들 중에서 딱 보이길래… 눈길이 가서 읽었습니다. 이 “Event boundary detection”는 일반적인 이벤트의 경계를 찾는 task입니다. CVPR…

Category: Paper

[ICASSP 2020] Multi-Conditioning and Data Augmentation Using Generative Noise Model for Speech Emotion Recognition in Noisy Conditions

[ICLR2020] PSEUDO-LIDAR++: ACCURATE DEPTH FOR 3D OBJECT DETECTION IN AUTONOMOUS DRIVING

[CVPR 2022] Hierarchical Self-supervised Representation Learning for Movie Understanding

[ICCV 2021] An Empirical Study of Training Self-Supervised Vision Transformers

[CVPR 2022] Unsupervised Pre-training for Temporal Action Localization Tasks

Optimization Theory (Convex Optimization Problems)

[EAIS 2020] Emotions Understanding Model from Spoken Language using Deep Neural Networks and Mel-Frequency Cepstral Coefficients

[arXiv2015]Particular object retrieval with integral max-pooling of CNN activations

[CVPR2022] MonoDTR: Monocular 3D Object Detection with Depth-Aware Transformer

[CVPR 2022] End-to-End Compressed Video Representation Learning for Generic Event Boundary Detection

Conference Deadline

NEW POST

New Comment