CVPR 2026 Foundational Few-Shot Object Detection Challenge

Published:

Challenge Summary

Can foundation models replace human annotators for 2D data annotation? Vision-Language Models (VLMs) like GroundingDINO have demonstrated remarkable zero-shot 2D detection performance on standard benchmarks like COCO. However, such foundational models may still be sub-optimal for specific target applications like medical and aerial image analysis Indeed, this well-known observation has created the ad-hoc practice of prompt engineering, where users actively search for a textual prompt that elicits the desired zero-shot behavior. Instead, we argue that one can principally address the challenge of aligning foundational models to target concepts through the lens of few-shot recognition by presenting VLMs with a few visual examples of the target concept. Crucially, such examples can be multi-modal, using both text and visual cues, mimicking the natural few-shot multi-modal instructions that are often given to human annotators when defining a target concept of interest. Concretely, we evaluate how well foundation models can learn from multi-modal annotator instructions using the Roboflow-VL dataset. We will evaluate detectors pre-trained on any external datasets and fine-tuned on multi-modal (text and visual) 10-shot examples per class. Methods will be evaluated on a held-out set of images and will be ranked by mean average precision (mAP).

Overall Track Challenge Winner

Team: Superb AI

Members: Kyeongryeol Go, Hyundong Jin, Taewoong Jang, Wooseong Choi

[Dowload Technical Report] [Code]

Overall Track Runner Up

Team: FDUROILab Lenovo

Members: Lingyi Hong, Mingxi Cheng, Xingqi He, Runze Li, Xingdong Sheng, Wenqiang Zhang

[Dowload Technical Report] [Code]

Other Teams

Team: Aliz

Members: Ali Alavi

[Dowload Technical Report] [Code]

Team: ANMSPro

Members: Abu Noman Md Sakib

[Dowload Technical Report] [Code]

In-Context Prompting Track Winner

Team: Lababa

Members: Pu Luo, Cong Xu, Yumei Li, Licheng Jiao, Puha Chen, Dan Zhang

[Dowload Technical Report] [Code]

In-Context Prompting Track Runner Up

Team: MR UCAS

Members: Chenhao Zhou, Qianqian Xu, Minye Lei, Yihang Huang, Peisong Wen, Siran Dai, Yang Liu, Qingming Huang

[Dowload Technical Report] [Code]

Other Teams

Team: HUST-ORP

Members: Qile Miao, Wencan Pei, Zerui Xi, Ruijie Ma, Jianxin Lin, Yiping Gao

[Dowload Technical Report]

Team: XMU MAC

Members: Zhigang Chen, Xiawu Zheng, Rongrong Ji

[Dowload Technical Report]

Team: TAHAKOM

Members: Norah Alshammari, Dalal Alayban, Reema Alsugair

[Dowload Technical Report]

Team: ANMSPro

Members: Abu Noman Md Sakib

[Dowload Technical Report] [Code]