김 태주 – Page 11 – Robotics and Computer Vision Lab

Author: 김 태주

[ISCAS 2021] Monocular 3D Pedestrian Localization Fusing with Bird’s Eye View

[ICRA 2021] MonStereo: When Monocular and Stereo Meet at the Tail of 3D Human Localization

[ICCV 2019] MonoLoco: Monocular 3D Pedestrian Localization and Uncertainty Estimation

[CVPR 2021] Categorical Depth Distribution Network for Monocular 3D Object Detection

[CVPR 2019] Coloring With Limited Data: Few-Shot Colorization via Memory-Augmented Networks

[TPAMI 2020] Parallax Attention for Unsupervised Stereo Correspondence Learning

[CVPR 2021] CFNet: Cascade and Fused Cost Volume for Robust Stereo Matching

[NeurIPS 2020] Wasserstein Distances for Stereo Disparity Estimation

[CVPR 2020] Self-Supervised Deep Visual Odometry with Online Adaptation

Protected: [ICCV 2021, PeerReview] 7158
