STUPD Dataset

STUPD (Spatial and Temporal Understanding of Prepositions Dataset) is a synthetic dataset that aims to help vision-language models understand relations at a granular level. STUPD covers 30 distinct spatial relations and 10 distinct temporal relations.

Some examples from Spatial-STUPD

Spatial-STUPD examples: static

Spatial-STUPD examples: dynamic

How to access the dataset?

The STUPD dataset is available as zip files in this Google Drive link. The total size of the dataset is 959 GB. For convenience, the dataset has been divided into multiple zip files, each not exceeding 3 GB. Some categories (specifically the dynamic relations, 16 in number) are uploaded as multipart zip files in their respective directories. To unzip them, first concatenate the parts back into a single zip file, for example with: cat myfolder.part-* > myfolder.zip
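For users without a Unix shell, the same reassembly can be done in Python. The following is a minimal sketch, not part of the STUPD repository: the function names `join_parts` and `extract` are hypothetical, and it assumes the parts follow the `myfolder.part-*` naming shown above and sort correctly in lexicographic order (as the shell glob in the `cat` command also assumes).

```python
import glob
import shutil
import zipfile


def join_parts(prefix: str, out_zip: str) -> list[str]:
    """Concatenate files named '<prefix>.part-*' into out_zip.

    Hypothetical helper: mirrors `cat myfolder.part-* > myfolder.zip`.
    Parts are joined in lexicographic order, like a shell glob.
    """
    parts = sorted(glob.glob(glob.escape(prefix) + ".part-*"))
    if not parts:
        raise FileNotFoundError(f"no parts found for {prefix!r}")
    with open(out_zip, "wb") as out:
        for part in parts:
            with open(part, "rb") as src:
                # Stream the copy so a multi-GB part never has to fit in memory.
                shutil.copyfileobj(src, out)
    return parts


def extract(zip_path: str, dest: str) -> None:
    """Extract the reassembled archive into the dest directory."""
    with zipfile.ZipFile(zip_path) as zf:
        zf.extractall(dest)
```

A typical call would be `join_parts("myfolder", "myfolder.zip")` followed by `extract("myfolder.zip", "myfolder")`. Attempting to unzip a single part on its own is expected to fail, because each part is only a fragment of the archive.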

Reviewers, and anyone who wants a quick sense of the STUPD dataset, can view 50 examples from each category in this Google Drive link.

Generating the dataset

If you would rather generate the dataset yourself than use the one we provide, we supply all the Unity configuration scripts needed to generate the (spatial)-STUPD dataset. There are several reasons to generate the dataset on your local Unity setup: you can customize the logic, add more configuration options (more skins, backgrounds, and objects), and extract different types of metadata.

Running experiments and recreating baselines

In experiments, we provide PyTorch-based scripts to run the baselines reported in the paper.

Bibtex

If you find our dataset useful in your research, please use the following citation:

@article{agrawal2023stupd,
  title={STUPD: A Synthetic Dataset for Spatial and Temporal Relation Reasoning},
  author={Agrawal, Palaash and Azaman, Haidi and Tan, Cheston},
  journal={arXiv preprint arXiv:2309.06680},
  year={2023}
}

stupd's Issues

Problem with download dataset

Hi, excellent job.
I tried to download the entire dataset in parts, using the link for each file (against_part01.zip, against_part01.zip, etc.). When I try to unzip each file, I get this message:

Screenshot from 2024-05-06 14-36-21

Is there a way to download the dataset by part, for example by group such as "against", "along", etc.?

Thx
