Git Product home page Git Product logo

ovdeval's Introduction

OVDEval

A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection

[Paper 📄] [Dataset 🗂️]


OVDEval is a new benchmark for OVD model, which includes 9 sub-tasks and introduces evaluations on commonsense knowledge, attribute understanding, position understanding, object relation comprehension, and more. The dataset is meticulously created to provide hard negatives that challenge models' true understanding of visual and linguistic input. Additionally, we identify a problem with the popular Average Precision (AP) metric when benchmarking models on these fine-grained label datasets and propose a new metric called Non-Maximum Suppression Average Precision (NMS-AP) to address this issue.

Check out Our AAAI24 paper [How to Evaluate the Generalization of Detection? A Benchmark for Comprehensive Open-Vocabulary Detection] for more details about the Inflated AP Problem and NMS-AP.

knowledge

OVDEval


Dataset Statistics

benchmark


Benchmark

radar benchmark


How To Download

See Our hugging face page for downloading OVDEval.


Evaluate With NMS-AP

OVDEval should be evaluated using NMS-AP to avoid the inflated AP problem. Please follow the evaluation instructions.

The "output" folder provides the final output JSON files obtained by applying NMS to the inference results of the GLIP model on the material test dataset.


Citations

Please consider citing our papers if you use the dataset:

@article{yao2023evaluate,
  title={How to Evaluate the Generalization of Detection? A Benchmark for Comprehensive Open-Vocabulary Detection},
  author={Yao, Yiyang and Liu, Peng and Zhao, Tiancheng and Zhang, Qianqian and Liao, Jiajia and Fang, Chunxin and Lee, Kyusong and Wang, Qing},
  journal={arXiv preprint arXiv:2308.13177},
  year={2023}
}

ovdeval's People

Contributors

p3ngliu avatar mary-0830 avatar snakeztc avatar nxf1111 avatar

Stargazers

 avatar  avatar  avatar So-Young avatar Size Wu (吴思泽) avatar  avatar Zilun Zhang avatar  avatar  avatar Yuqi Ma avatar Xinran Wang avatar  avatar  avatar morning.cheng avatar 夏吧吧 avatar wanghao avatar  avatar Kai avatar  avatar zhoutong avatar Sheep avatar  avatar  avatar Kelei Jiang avatar Paul Alan avatar zhang_lu avatar kingfly avatar Licong Guan avatar Ren Tianhe avatar Qing Jiang avatar  avatar  avatar  avatar

Watchers

 avatar Ruochen Xu avatar Kyusong Lee avatar

Forkers

zilunzhang

ovdeval's Issues

question

Hello, author, your work is excellent. I have a question. How is Figure 5 drawn? Can you tell me some details, especially Figure (b)?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.