Robotics and Computer Vision Lab

신 인택 on [CVPR2025] GeoDepth08/04/2025
안녕하세요 정민님, 불친절한 논문에서 구체적인 리뷰 감사합니다. 뭔가 SOTA는 아니라곤 하지만 CVPR이 아니라면 저자의 방법론이 더 확장될 가능성이 있다 생각하는지…
이 상인 on LLMDet: Learning Strong Open-Vocabulary Object Detectors under theSupervision of Large Language Models08/04/2025
안녕하세요. 리뷰 잘 읽었습니다. 리뷰 내용 중에 때 "object query는 일반적으로 하나의 작은 피처만을 포함하므로, LLM 내부에 cross-attention layer를 새로…
신 인택 on LLMDet: Learning Strong Open-Vocabulary Object Detectors under theSupervision of Large Language Models08/04/2025
안녕하세요 우현님 리뷰 재밌게 읽었습니다. 뭔가 classification용 데이터셋에서 수도라벨을 만들때, MM-GDINO를 통해 만든 박스 시각화 이미지를 하나쯤 첨부했으면? 그 모델의…
이 상인 on [arXiv 2025] Always Clear Depth: Robust Monocular Depth Estimation under Adverse Weather08/04/2025
안녕하세요. 리뷰 잘 읽었습니다. Distillation Learning파트에서 중간 출력을 활용해서 loss를 설계하는 모습을 보이는데, 이게 그럼 출력이 아니라 feature map만으로 loss를…
안 우현 on [ICCV 2025] SVTRv2: CTCBeats Encoder-Decoder Models in Scene Text Recognition08/04/2025
안녕하세요 지연님 좋은 리뷰 감사합니다. SGM 설명해주시는 부분에서 저는 문맥상 SGM이 학습시에만 사용되서 visual feature가 context정보를 학습하도록 돕는다고 이해했는데 "SGM은…

[CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval

[WACV 2024] DTrOCR: Decoder-only Transformer for Optical Character Recognition

[CVPR2023]Causalainer: Causal Explainer for Automatic Video Summarization

[ECCV 2020] End-to-End Object Detection with Transformers

CVPR 2025 참관기

[arXiv 2025] [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster

[arXiv 2025] Splatting Physical Scenes: End-to-End Real-to-Sim from Imperfect Robot Data

[AAAI 2025] Zero-shot Depth Completion via Test-time Alignment with Affine-invariant Depth Prior

[arXiv 2025] DidSee: Diffusion-Based Depth Completion for Material-Agnostic Robotic Perception and Manipulation

[CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval

Conference Deadline

NEW POST

New Comment