MGID: A Social Media Based Multi-Modal Getty Image Depression and Emotion Dataset

Introduction

To support the development of social-media-based multi-modal joint recognition of depression and emotion, we collect textual and visual documents from Getty Images and create a weakly labeled multi-modal depression and emotion dataset called MGID. We define a list of keywords strongly associated with depression (e.g., "depressive patient", "depression") and with emotions (e.g., "happy", "scary"), use them to query Getty Images, and weakly label the multi-modal documents retrieved from the first thirty result pages with the labels of the corresponding keywords. From these documents we construct the multi-modal Getty Image depression and emotion (MGID) dataset. It contains 2500 depressive and 2500 non-depressive samples. For emotion, MGID is divided into six categories: happiness, depressed, neutral, fear, surprise (covering both positive and negative feelings), and anger. The dataset also provides train/dev/test splits.
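The weak-labeling step described above can be sketched as follows. This is only an illustration under stated assumptions, not the authors' collection script: the keyword table, the `Document` fields, and the `weak_label` function are hypothetical, and the assignment of emotion keywords to the non-depressive class is an assumption made for the example.

```python
from dataclasses import dataclass

# Hypothetical keyword table: query keyword -> (depression label, emotion label).
# The full MGID keyword list is not given here; these entries only reuse the
# examples from the text, and treating emotion keywords as non-depressive
# is an assumption.
KEYWORD_LABELS = {
    "depressive patient": ("depressive", "depressed"),
    "depression": ("depressive", "depressed"),
    "happy": ("non-depressive", "happiness"),
    "scary": ("non-depressive", "fear"),
}


@dataclass
class Document:
    title: str
    text: str
    image_url: str
    depression_label: str
    emotion_label: str


def weak_label(keyword, retrieved):
    """Give every document retrieved for `keyword` the labels of that keyword."""
    depression_label, emotion_label = KEYWORD_LABELS[keyword]
    return [
        Document(
            title=item["title"],
            text=item["text"],
            image_url=item["image_url"],
            depression_label=depression_label,
            emotion_label=emotion_label,
        )
        for item in retrieved
    ]
```

Because every retrieved document inherits the labels of its query keyword, the labels are noisy by construction, which is why the dataset is described as weakly labeled.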

Dataset Distribution

[Figure: dataset distribution]

Citation

All publications reporting on research that uses this database must acknowledge it by citing the following article:
To be announced.

How To Use

(1) Download Raw Data

*** Text: the title and textual content of each document are packaged in a .zip file and can be accessed directly.
*** Raw images: since Getty Images is a commercial platform, we do not upload the raw images directly. Instead, we provide the image URLs, together with imageDownloader.py, which downloads all raw images in MGID.
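For readers who want to see what downloading images from a list of URLs involves, here is a minimal sketch using only the Python standard library. It is not the repository's imageDownloader.py, whose interface and URL-file format are not described here; the function names, the file-naming scheme, and the default `.jpg` extension are all assumptions.

```python
import os
import urllib.request
from urllib.parse import urlparse


def filename_for_url(url, index):
    """Build a local file name from the URL path, prefixed with the item index."""
    name = os.path.basename(urlparse(url).path) or "image"
    if not os.path.splitext(name)[1]:
        name += ".jpg"  # assumption: fall back to .jpg when the URL has no extension
    return f"{index:05d}_{name}"


def fetch_bytes(url, timeout=30):
    """Download one URL and return its raw bytes."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read()


def download_all(urls, out_dir, fetch=fetch_bytes):
    """Save every URL into out_dir; return a list of (url, error) for failures."""
    os.makedirs(out_dir, exist_ok=True)
    failed = []
    for index, url in enumerate(urls):
        path = os.path.join(out_dir, filename_for_url(url, index))
        if os.path.exists(path):
            continue  # already downloaded, so the script can be re-run safely
        try:
            data = fetch(url)
        except OSError as exc:  # urllib's URLError and HTTPError derive from OSError
            failed.append((url, str(exc)))
            continue
        with open(path, "wb") as handle:
            handle.write(data)
    return failed
```

Skipping files that already exist and collecting failures instead of raising lets an interrupted run be resumed. Since the images remain subject to the platform's own terms, the license below still applies to anything downloaded this way.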

(2) User license

1. Commercial and academic use

MGID is made available for research purposes only. Any commercial use of this data is forbidden.

2. Redistribution

The user may not distribute the database or parts of it to any third party.

3. Publications

The use of data for illustrative purposes in publications is allowed. Publications include both scientific papers and presentations for scientific/educational purposes.

4. Changes

The author is allowed to change these terms of use at any time.

5. Warranty

We release the dataset solely to support the development of depression detection. The dataset comes without any warranty. In no event shall the provider be held responsible for any loss or damage caused by the use of this data.

Contributors

yzzhang2008
