Robotics and Computer Vision Lab

신 인택 on [CVPR2025] GeoDepth08/04/2025
안녕하세요 정민님, 불친절한 논문에서 구체적인 리뷰 감사합니다. 뭔가 SOTA는 아니라곤 하지만 CVPR이 아니라면 저자의 방법론이 더 확장될 가능성이 있다 생각하는지…
이 상인 on LLMDet: Learning Strong Open-Vocabulary Object Detectors under theSupervision of Large Language Models08/04/2025
안녕하세요. 리뷰 잘 읽었습니다. 리뷰 내용 중에 때 "object query는 일반적으로 하나의 작은 피처만을 포함하므로, LLM 내부에 cross-attention layer를 새로…
신 인택 on LLMDet: Learning Strong Open-Vocabulary Object Detectors under theSupervision of Large Language Models08/04/2025
안녕하세요 우현님 리뷰 재밌게 읽었습니다. 뭔가 classification용 데이터셋에서 수도라벨을 만들때, MM-GDINO를 통해 만든 박스 시각화 이미지를 하나쯤 첨부했으면? 그 모델의…
이 상인 on [arXiv 2025] Always Clear Depth: Robust Monocular Depth Estimation under Adverse Weather08/04/2025
안녕하세요. 리뷰 잘 읽었습니다. Distillation Learning파트에서 중간 출력을 활용해서 loss를 설계하는 모습을 보이는데, 이게 그럼 출력이 아니라 feature map만으로 loss를…
안 우현 on [ICCV 2025] SVTRv2: CTCBeats Encoder-Decoder Models in Scene Text Recognition08/04/2025
안녕하세요 지연님 좋은 리뷰 감사합니다. SGM 설명해주시는 부분에서 저는 문맥상 SGM이 학습시에만 사용되서 visual feature가 context정보를 학습하도록 돕는다고 이해했는데 "SGM은…

[CVPR 2025] Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion

[CVPR 2024]SAM-6D: Segment Anything Model Meets Zero-Shot 6D Object Pose Estimation

[arXiv 2024]ScaleDepth: Decomposing Metric Depth Estimation into Scale Prediction and Relative Depth Estimation

[AAAI 2024](Oral) AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models

[CVPR 2024] WorDepth: Variational Language Prior for Monocular Depth Estimation

[TIP 2024] CLIP4STR: A Simple Baseline for Scene TextRecognition with Pre-trained Vision-LanguageModel

[ECCV 2022]Simple Open-Vocabulary Object Detection with Vision Transformers

[CVPR2025] Masking meets Supervision: A Strong Learning Alliance

[CVPR 2024] PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection

[ICRA 2024] Universal Visual Decomposer: Long-Horizon Manipulation Made Easy

Conference Deadline

NEW POST

New Comment