Robotics and Computer Vision Lab

이 재윤 on [CVPR 2026] DIvide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding05/26/2026
안녕하세요 찬미님, 좋은 리뷰 감사합니다. 저도 최근에 adaptive frame sampling 논문을 읽게 되서 제가 읽은 논문과 어떤 차이가 있을지 궁금해서…
이 재윤 on [arXiv 2026] Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video05/26/2026
안녕하세요 성준님, 좋은 리뷰 감사합니다. Fig 7에서 일부 모델에서는 thinking 활성화 후 성능 드랍(regression)이 관찰되며 wo. subtitle 설정에서 더 두드러지는데,…
손 우진 on [arXiv 2026]Can Vision Foundation Models Navigate? Zero-Shot Real-World Evaluation and Lessons Learned05/26/2026
안녕하세요 우현님 좋은리뷰 감사합니다 요약하면 VNM들이 real-world에서 SR만으로는 안 드러나는 failure mode가 많고 복잡한 architecture가 단순한 GNM보다 항상 낫지는 않다는…
손 우진 on [CVPR 2026] EgoX: Egocentric Video Generation from a Single Exocentric Video05/26/2026
안녕하세요 정우님 좋은 리뷰 감사합니다 질문 한가지 남깁니다.. Egocentric camera pose는 어떻게 받는 건가요? 학습 때는 Ego-Exo4D의 GT pose를 그대로…
황 찬미 on [ICLR 2026] AgilePruner: An Empirical Study of Attention and Diversity for Adaptive Visual Token Pruning in Large Vision-Language Models05/26/2026
안녕하세요 인택님 리뷰 감사합니다! 뭔가 단순한 image-aware adjustment 설계만으로도 성능이 잘 나온다는 점이 흥미로웠습니다. 결국 이미지가 단순한지 복잡한지에 따라 두…

Recent Posts

[NeurIPS 2025] FastVID: Dynamic Density Pruning for Fast Video Large Language Models

[HRI 2026] Learning Human Preferences over a Human-Robot Collaboration Based on Explicit and Implicit Human Feedback

[CoRL 2024] APRICOT : Active Preference Learning and Constraint-Aware Task Planning with LLMs

[ICML 2026] VideoBrain : Learning Adaptive Frame Sampling for Long Video Understanding

[ICLR 2026 Workshop] World Action Models are Zero-shot Policies

[CVPR 2026] EgoX: Egocentric Video Generation from a Single Exocentric Video

[ICLR 2026] AgilePruner: An Empirical Study of Attention and Diversity for Adaptive Visual Token Pruning in Large Vision-Language Models

[arXiv 2026] Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video

[arXiv 2026]Can Vision Foundation Models Navigate? Zero-Shot Real-World Evaluation and Lessons Learned

[arXiv 2026] Zero-shot World Models Are Developmentally Efficient Learners

Conference Deadline

NEW POST

New Comment