Paper – Page 115 – Robotics and Computer Vision Lab

DepressNet – Visually Interpretable Representation Learning for Depression Recognition from Facial Images

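This is a review of a depression paper, written for the IITP research notes. Today's paper predicts depression using the AVEC depression dataset, the same dataset used in the paper I reviewed last time. For reference, in 2018…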

Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks (CycleGAN)

This review covers CycleGAN. The paper has been out for quite a while, and because the method is used so widely in GAN-related papers, plenty of write-ups already exist, but simply reading them…

Paper X-Review Uncategorized

Unsupervised Monocular Depth Estimation & Multi-view Geometry

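1. Background for covering this paper: This week I studied Multi-view Geometry and learned about the relationship between the camera and world coordinate systems. That naturally extended my study to Depth Estimation. In the case of a stereo camera…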

Paper X-Review

[ICCV 2019] SlowFast Networks for Video Recognition

This paper was inspired by the observation that human visual cells respond to fast motion and slow motion at the same time. Typically, even if a person who is walking speeds up and starts to run, that…

Paper X-Review Uncategorized

PointPillars: Fast Encoders for Object Detection from Point Clouds

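Paper introduction: The paper reviewed this time is about 3D object detection. 3D object detection is widely used in autonomous driving and robotics, and it is considerably harder than its 2D counterpart. Difficulty here means training difficulty, coding difficulty,…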

Conference Paper X-Review

[ICCV 2019] Weakly Aligned Cross-Modal Learning for Multispectral Pedestrian Detection

This method improves performance by using an additional module to align the misalignment between modalities captured with different devices, which arises because Multispectral Pedestrian Detection datasets use a single bounding box for prediction…

Conference Paper X-Review

[BMVC2020] Anchor-free Small-scale Multispectral Pedestrian Detection

In April 2020 I introduced CSP, an anchor-free Pedestrian Detection network of that time that was published at CVPR. Today, nine months later, I am introducing a paper that brings CSP to Multispectral Pedestrian…

Paper X-Review

FRAME ATTENTION NETWORKS FOR FACIAL EXPRESSION RECOGNITION IN VIDEOS

I came across this paper while reading the SMART paper. Before introducing it, I will go over the Attention and Relation models from SMART. (SMART is the paper I reviewed last time; the link is as follows.) 1. Attention: The concept of attention…

Paper X-Review

What Makes a Video a Video: Analyzing Temporal Information in Video Understanding Models and Datasets

This paper addresses the question "Do these Models Really Capture Temporal Information?" In general, the deeper a model gets, the more low-level information fades away. In Figure 1 below, the original video…

Paper

Depression Status Estimation by Deep Learning based Hybrid Multi-Modal Fusion Model

I looked up this paper because the IITP project continues this year. Its direction seemed similar to our own research, so I read it. Application Usage Overview: This paper also uses Android phones for the subjects' data…

