Robotics and Computer Vision Lab

김 영규 on [CVPR 2025] Any6D : Model-free 6D Pose Estimation of Novel Objects08/07/2025
안녕하세요 건화님 댓글 감사합니다. 제가 설명을 부정확하게 한 것 같습니다. 단일 RGB 이미지를 통해 3D mesh를 만들어내는 image to 3D모델을…
안 우현 on [CVPR 2023]Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation08/06/2025
안녕하세요 영규님 좋은 댓글 감사합니다. 저도 학습 초기에 마스크가 bbox 보다 더 예측에 유리하다라는 부분이 되게 흥미로웠습니다! 마스크로부터 bbox를 역으로…
안 우현 on [CVPR 2025]LLMDet: Learning Strong Open-Vocabulary Object Detectors under theSupervision of Large Language Models08/06/2025
안녕하세요 윤서님 좋은 댓글 감사합니다. Q1 . 본 논문에서 GroundingCap-1M이라는 대규모 OVOD 데이터를 수집할 때의 후처리 절차중에 무의미하거나, 깨진 문장을…
안 우현 on [CVPR 2025]LLMDet: Learning Strong Open-Vocabulary Object Detectors under theSupervision of Large Language Models08/06/2025
안녕하세요 지연님 좋은 댓글 감사드립니다. 사실 해당 논문은 이미지 캡셔닝에 대한 연구는 아닙니다. 연구자체는 OVOD이며 OVOD에서 객체에 대한 정보 뿐만…
안 우현 on [CVPR 2025]LLMDet: Learning Strong Open-Vocabulary Object Detectors under theSupervision of Large Language Models08/06/2025
안녕하세요 유진님 좋은 댓글 감사합니다. 답변 먼저 드리면 Language Modeling Loss는 Ground Truth caption과 직접적으로 유사해지도록 학습됩니다. Figure 4에 GT…

Optimization Theory (Convex Optimization Problems)

[AAAI2017] Unsupervised Deep Learning for Optical Flow

[EAIS 2020] Emotions Understanding Model from Spoken Language using Deep Neural Networks and Mel-Frequency Cepstral Coefficients

[arXiv2015]Particular object retrieval with integral max-pooling of CNN activations

[CVPR2022] MonoDTR: Monocular 3D Object Detection with Depth-Aware Transformer

[CVPR 2022] End-to-End Compressed Video Representation Learning for Generic Event Boundary Detection

LEA-Net: Layer-wise External Attention Network for Efficient Color Anomaly Detection

[ACMM2017] Multispectral Object Detection for Autonomous Vehicles

Boosting Contrastive Self-Supervised Learning with False Negative Cancellation

[arXiv 2022] Cross Modal Retrieval with Querybank Normalisation

Conference Deadline

NEW POST

New Comment