황 유진 – Page 11 – Robotics and Computer Vision Lab

Meta Pseudo Labels

소개 해당 논문은 새로운 semi-supervised learning 방식에 대해 소개한다. 해당 논문은 ImageNet에서 top-1 accuracy 기준 90.2%로 높은 성능을 보였으며 supervised learning 방식에 대적할 정도로 높은…

Continue Reading

[2021CVPR]Global2Local: Efficient Structure Search for Video Action Segmentation

논문이 생각보다 어려워 리뷰 시간이 늦어졌습니다.. 논문 소개 해당 논문은 Video Action Segmentation를 위해 Temporal receptive fields를 조절하여 모델의 구조를 찾기위한 논문입니다. 제안하는 논문은 Large…

Continue Reading

Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning

해당 논문은 pixel-level로 Contrastive learning을 하는 방식을 제안한다. 기존의 unsupervised learning에서 Contrastive learning은 이미지 전체인 instance level로 이루어지곤 했다. 이러한 학습 방식은 image classification task에서는…

Continue Reading

Three Birds with One Stone: Multi-Task Temporal Action Detection via Recycling Temporal Annotations

Introduction 해당 논문은 temporal action detection task 를 위한 논문이다. temporal action detection 이란 비디오에서 action 영역의 시작과 끝을 (SSD의 bounding box)같이 localization과 classification을 진행하는…

Continue Reading

[CVPR2021] Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution

소개 해당 연구는 Probabilistic Abduction and Execution (PrAE) learner 를 제안한다. 이를 통해 기존의 후보군 중 카테고리를 선택하는 학습법이 아닌 확률적인 방법으로 답을 예측하며, visual…

Continue Reading

[CVPR2021] Spatiotemporal Contrastive Video Representation Learning

해당 논문은 spatio, temporal 정보를 이용한 video augmentation을 통해 contrastive learning을 진행한는 방식을 소개한 논문이다. Temporal Augmentation하나의 비디오에서 두개의 clip을 positive pair로 사용하는 방식이다. 이때…

Continue Reading

Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild with Pose Annotations

오늘 소개드릴 objectron은 3D object detection 데이터셋으로 mediapipe에서 mobile real-time 3D object detection 을 위해 학습 데이터로 사용한 데이터입니다. 오늘 리뷰는 크게 1. 3D object…

Continue Reading

Momentum Contrast for Unsupervised Visual Representation Learning (MoCo)

해당 논문은 unsupervised visual learning을 dynamic dictionaries 구축 과정으로 정의하며 contrastive learning을 위한 크고 지속적인 사전을 구축하기 위한 연구를 소개합니다.IntroductionUnsupervised representation learning은 language processing 분야에서…

Continue Reading

A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning

해당 논문은 이미지 기반 unsupervised 방법론 모델을 통해 video에 대한 representation learning 실험을 진행한다.이미지 기반 방법론은 SimCLR, MoCo, BYOL, SwAV를 이용한다. 이미지기반 unsupervised 방법론들은 하나의…

Continue Reading

[CVPR2021] On Semantic Similarity in Video Retrieval

해당 논문은 Semantic Similarity의 범위에 대해 논하는 논문이다. 처음 논문을 선택한 이유는 similarity의 범위에 대해 분석하고 있는 내용인지 궁금하여 선택하였다. 해당 논문에서 다루는 video retrieval은…

Continue Reading