X-Review – Page 84 – Robotics and Computer Vision Lab

[IEEE Wireless Communication 2018] Active Learning for Wireless IoT Intrusion Detection

안녕하세요, 허재연입니다. 요즘 6종 데이터셋에 대한 통일된 Active Learning 적용을 주제로 한 논문을 작성하고자 하고 있습니다. 6종 데이터 중 하나가 IoT(사물 인터넷) 데이터인데, 이와 관련된…

[arXiv 2018]Deep Residual Learning for Image Recognition

안녕하세요 이번에 제가 리뷰할 논문은 ‘ResNet: Deep Residual Learning for Image Recognition’입니다. 2015년에 Kaiming He 등의 연구진은 Residual Networks(ResNet)라는 아키텍처를 제안하며 네트워크의 깊이가 증가함에 따라…

X-Review

[ICASSP 2022] Wav2CLIP: Learning Robust Audio Representations from Clip

이번 주 리뷰는 Wav2CLIP이라는 논문으로 Contrastive Language–Image Pre-training (CLIP)에서 파생된 audio representation learning method입니다. 우리가 기존에 알고 있던 CLIP은 image와 text를 동일 feature space로 projection하고…

Paper X-Review

[ICCV 2023] UATVR: Uncertainty-Adaptive Text-Video Retrieval

이번 주차 X-Review는 23년도 ICCV에 게재된 <UATVR: Uncertainty-Adaptive Text-Video Retrieval>이라는 논문입니다. 중국 바이두에서 연구된 논문이네요. Text-Video Retrieval(이하 TVR)이라는 task는 비디오와 text 두 모달 간 공통의…

X-Review

[CVPR2016]Deep Residual Learning for Image Recognition

안녕하세요 오늘의 X-Review는 ResNet입니다. ResNet은 2015년도 ImageNet Classification 대회인 ILSVRC 대회에서 1등을 차지하고 현재까지 backbone모델로 많이 사용되는 모델입니다. 다들 익숙하신 내용이겠지만 CNN과 VGG모델을 알고 있다는…

Conference

[ICCV2023] EfficientViT: Multi-Scale Linear Attention for High-Resolution Dense Prediction

이번에 소개드릴 논문은 ICCV2023에 게재된 EfficientViT라는 방법론입니다. backbone에 대한 논문이며, image classification 같은 task 대신 segmentation, super resolution과 같은 dense level prediction task에 초점을 맞추어…

Paper X-Review

[MM 2022] X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval

이런 분들께 이 논문을 추천드립니다. CLIP을 비디오에 적용하는 방식에 흥미가 있으신 분 Video Text Retrieval에서 fine-grained와 coarse-grained를 모두 활용하는 cross-grained 방식이 궁굼하신 분 이 논문을…

X-Review

[CVPR 2023] PMR: Prototypical Modal Rebalance for Multimodal Learning

오늘도 멀티모달 논문입니다! 제가 이제까지 VQA 논문을 읽은 이유는 Multimodal bias에 관심이 많아서 인데요. 두개의 모달리티를 모두 사용하지만 하나의 모달리티만 학습되는 상황에 어떻게 대처를 하는가에…

Paper X-Review

[ICCV 2023] Segment anything

안녕하세요, 열일곱번째 X-Review 입니다. 이번 논문은 2023년도 ICCV에 게재된 Segment anything 논문 입니다. 그럼 바로 리뷰 시작하겠습니다 ! 1. Introduction 대용량 데이터셋으로 학습한 LLM은 NLP…

Paper X-Review

[CVPR 2018] Deep Depth Completion of a Single RGB-D Image

안녕하세요, 열일곱번째 x-review 입니다. 이번 논문은 2018년도 CVPR에 게재된 Deep Depth Completion of a Single RGB-D Image이라는 Depth Completion 논문 입니다. 그럼 바로 리뷰 시작하겠습니다 !…

Category: X-Review

[IEEE Wireless Communication 2018] Active Learning for Wireless IoT Intrusion Detection

[arXiv 2018]Deep Residual Learning for Image Recognition

[ICASSP 2022] Wav2CLIP: Learning Robust Audio Representations from Clip

[ICCV 2023] UATVR: Uncertainty-Adaptive Text-Video Retrieval

[CVPR2016]Deep Residual Learning for Image Recognition

[ICCV2023] EfficientViT: Multi-Scale Linear Attention for High-Resolution Dense Prediction

[MM 2022] X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval

[CVPR 2023] PMR: Prototypical Modal Rebalance for Multimodal Learning

[ICCV 2023] Segment anything

[CVPR 2018] Deep Depth Completion of a Single RGB-D Image

Conference Deadline

NEW POST

New Comment