X-Review – Page 40 – Robotics and Computer Vision Lab

[arXiv 2025] RE3SIM: Generating High-Fidelity Simulation Data via 3D-Photorealistic Real-to-Sim for Robotic Manipulation

Hello. This week I will review a remarkably realistic real-to-sim system that uses 3D reconstruction and neural rendering. Introduction: For data collection through expert teleoperation in real environments, the strong generalization ability…

Paper X-Review

[NeurIPS 2025] Enhancing Semi-Supervised Learning via Representative and Diverse Sample Selection

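As the title suggests, the paper I am introducing today is about a method for selecting high-value data that considers diversity and representativeness at the same time. These two criteria frequently appear in prior active learning work such as Coresets [arXiv]…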

Paper X-Review

[arXiv 2024] InstructOCR: Instruction Boosting Scene Text Spotting

Hello, this is my 53rd X-Review. This time the paper is InstructOCR: Instruction Boosting Scene Text Spotting, posted to arXiv in 2024. Let's get started. 1. Introduction: Recently, vision and text together…

Paper X-Review

[AAAI 2025] Zero-shot Depth Completion via Test-time Alignment with Affine-invariant Depth Prior

Hello, this is my 56th X-Review. This is a depth completion paper published at AAAI 2025, and its method builds on Marigold, which I reviewed previously. Let's get right into the review! 1….

Conference Paper X-Review

[arXiv 2024] Occam’s LGS: A Simple Approach for Language Gaussian Splatting

The paper in this review covers a 3D language feature splatting technique. You will notice the term Occam in the title. That term, Occam's Razor, the aesthetic of simplicity…

Conference News X-Review

[ICRA 2024] Language-Conditioned Affordance-Pose Detection in 3D Point Clouds

Abstract: Recognizing affordances and estimating poses are both important for robotic manipulation, and fusing the two to generate poses for grasping task-relevant affordances can improve a robot's manipulation ability….

Paper X-Review

[CVPR 2023] Deep Deterministic Uncertainty: A New Simple Baseline

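Hello. This review introduces a paper that proposes a baseline for inferring uncertainty in ordinary deep learning models (deterministic models). As covered earlier, the advantage of Bayesian learning is that uncertainty can be defined theoretically and then estimated…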

Paper X-Review

[ICML 2021] ALIGN: Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision

Hello, this is Jaeyeon Heo. The paper I am reviewing today is ALIGN: Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision, published by Google Research at ICML 2021. Along with CLIP…

Paper X-Review

[CVPR 2024] ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting

Hello, this is my 52nd X-Review. This time the paper is ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting, published at CVPR 2024. Let's get started….

X-Review

[CVPR 2023] Exploring the Effect of Primitives for Compositional Generalization in Vision-and-Language

Hello, this is researcher Seongjun Park. The paper I am reviewing today was published at CVPR 2023 and concerns compositional generalization ability in vision-language tasks. Introduction: Compositionality is one of the important abilities in human cognition…
