김 영규 – Robotics and Computer Vision Lab

이 재윤 on [CVPR 2026] SARMAE : Masked Autoencoder for SAR Representation Learning05/11/2026
안녕하세요 우진님, 좋은 질문 감사합니다. 이쪽 분야를 접한 이유는 저희 팀 기업 과제가 task가 SAR object detection이고, 과제 팔로우업을 겸해서…
이 재윤 on [CVPR 2026] SARMAE : Masked Autoencoder for SAR Representation Learning05/11/2026
안녕하세요 정우님, 좋은 질문 감사합니다. DINOv3는 frozen 상태로 optical branch에서 이미지 패치 feature를 추출하는 용도로만 사용되며, SAR branch에서는 일반적인 ViT…
이 재윤 on [CVPR 2026] SARMAE : Masked Autoencoder for SAR Representation Learning05/11/2026
안녕하세요 인택님, 좋은 질문 감사합니다. 말씀주신 대로 SAR-1M 데이터셋은 SAR 이미지 중 매칭된 광학 이미지 쌍이 존재하는 경우도 있고, 아닌…
이 재윤 on [AAAI 2025] Does VLM Classification Benefit from LLM Description Semantics?05/11/2026
안녕하세요 예은님, 좋은 리뷰 감사합니다. description selection 과정에서, 단순히 타겟 클래스의 이미지와 가장 유사도가 높은 텍스트를 고르는 것에 그치지 않고…
최 인하 on [RSS 2025] DEXOP: A Device for Robotic Transfer of Dexterous Human Manipulation05/11/2026
안녕하세요 승현님 좋은 질문 감사합니다 프로젝트 페이지에 따로 fingertip nail을 사용해서 task를 수행한 정성적인 영상 결과가 있습니다. 예를 들어서 바닥에…

Author: 김 영규

GR00T : An Open Foundation Model for Generalist Humanoid Robots

[arXiv 2026] PokeVLA: Empowering Pocket-Sized Vision-Language-Action Model with Comprehensive World Knowledge Guidance

[arXiv 2026] Beyond Imitation: Reinforcement Learning-Based Sim-Real Co-Training for VLA Models

[ICLR 2026] Emergent Dexterity via Diverse Resets and Large-Scale Reinforcement Learning

[ICLR 2026] Self-Improving Vision-Language-Action Models with Data Generation via Residual RL

[arXiv 2026] Pretrained Vision-Language-Action Models are Surprisingly Resistant to Forgetting in Continual Learning

[arXiv 2026] How to Peel with a Knife : Aligning Fine-Grained Manipulation with Human Preference

[arXiv 2026] Observing and Controlling Features in Vision-Language-Action Models

[arXiv 2026] Rethinking the Practicality of Vision-language-action Model: A Comprehensive Benchmark and An Improved Baseline

[arXiv 2026] EgoScale: Scaling Dexterous Manipulation with Diverse Egocentric Human Data

Conference Deadline

NEW POST

New Comment