07/22/2024 – Robotics and Computer Vision Lab

[Neurips 2020] What Makes for Good Views for Contrastive Learning

1. Introduction 대조 학습(contrastive multiview learning)은 동일한 장면의 두 view을 representation space에서 가깝게 하고, 다른 장면의 두 view을 멀어지게 합니다. 이는 자연스럽고 강력한 아이디어이지만 중요한…

Paper X-Review

[ECCV2022]Detecting Twenty-thousand Classes using Image-level Supervision

#676478 이번에 리뷰드릴 논문은 Object Detection 데이터셋의 다양성 한계를 극복하는 방법론을 다루는 논문입니다. Meta AI(이하, 메타)와 텍사스 대학에서 발표된 연구이며 ECCV 2022에 등재되었습니다. 그럼 리뷰를…

Conference X-Review

[NerulPS 2022] Flamingo: a Visual Language Model for Few-Shot Learning

당분간 LMM 및 여러 VLM를 리뷰해보려고 하는데요, 이번에 리뷰할 논문은 구글 딥마인드에서 발표한 Visual Language Model(VLM)인 Flamingo 라는 논문입니다. 제목에서와 같이 Few-shot으로도 다양한 task를 수행할…

X-Review

[ICASSP 2023]Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation

본 논문은 speech enhancemeht와 speech separation task를 e2e 방식으로 수행하며, downstream인 separation에 유효한 정보의 손실을 막기 위해 gradient modulation을 사용하는 방법론에 관한 것으로, speech enhancemet를…

Conference News Paper X-Review

[CoRL 2023 oral] VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models

이번 논문은 아주 재밌는 논문 입니다. LLM을 활용해 명시적인 명령어로부터 로봇 조작의 추론 및 명령어 생산하고 VLM(~OVD)을 활용해 로봇을 위한 3차원 공간에 대한 이해를 얻어…

일	월	화	수	목	금	토
	1	2	3	4	5	6
7	8	9	10	11	12	13
14	15	16	17	18	19	20
21	22	23	24	25	26	27
28	29	30	31

[일:] 2024년 07월 22일

[Neurips 2020] What Makes for Good Views for Contrastive Learning

[ECCV2022]Detecting Twenty-thousand Classes using Image-level Supervision

[NerulPS 2022] Flamingo: a Visual Language Model for Few-Shot Learning

[ICASSP 2023]Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation

[CoRL 2023 oral] VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models

학술대회 마감

최신 글

최신 댓글

[Neurips 2020] What Makes for Good Views for Contrastive Learning

[ECCV2022]Detecting Twenty-thousand Classes using Image-level Supervision

[NerulPS 2022] Flamingo: a Visual Language Model for Few-Shot Learning

[ICASSP 2023]Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation

[CoRL 2023 oral] VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models

학술대회 마감

태그

카테고리

최신 글

최신 댓글