Git Product home page Git Product logo

glami-1m's Introduction

GLAMI-1M: A Multilingual Image-Text Fashion Dataset

We introduce GLAMI-1M: the largest multilingual image-text classification dataset and benchmark. The dataset contains images of fashion products with item descriptions, each in 1 of 13 languages. Categorization into 191 classes has high-quality annotations: all 100k images in the test set and 75% of the 1M training set were human-labeled. The paper presents baselines for image-text classification showing that the dataset presents a challenging fine-grained classification problem: The best scoring EmbraceNet model using both visual and textual features achieves 69.7% accuracy. Experiments with a modified Imagen model show the dataset is also suitable for image generation conditioned on text.

GLAMI-1M Dataset Examples

GLAMI-1M Paper

If you use or reference the dataset, please use the following BibTex entry to cite the paper:

@inproceedings{Kosar_2022_BMVC,
author    = {Vaclav Kosar and Antonín Hoskovec and Milan Šulc and Radek Bartyzal},
title     = {GLAMI-1M: A Multilingual Image-Text Fashion Dataset},
booktitle = {33rd British Machine Vision Conference 2022, {BMVC} 2022, London, UK, November 21-24, 2022},
publisher = {{BMVA} Press},
year      = {2022},
url       = {https://bmvc2022.mpi-inf.mpg.de/0607.pdf}
}

Google Colab Notebook

Try hands-on exercise with the dataset in this Google Colab notebook. glami-1m-multilingual-image-text-dataset-text-vs-image-similarity

How to Download GLAMI-1M Manually

You can either manually download the dataset zip file yourself or use the repository scripts to download, extract, and load into a dataframe.

To manually download the dataset ZIP file(s):

GLAMI-1M Code

Together with the dataset with provide helper code and experiments described in the paper. The model weights will be uploaded soon.

Installation

Below are steps to get minimal installation to be able to download the dataset with Python

conda create -n g1m python=3.9
conda activate g1m

git clone https://github.com/glami/glami-1m.git
cd glami-1m
pip install -r requirements_minimal.txt

How to Download GLAMI-1M Programmatically

The 228x298 dataset version can be downloaded programmatically using repository Python code. The destination directory to download and extract the dataset can be configured with environmental variable EXTRACT_DIR. EXTRACT_DIR is by default configured as a Linux temporary directory, which is removed upon machine restarts. Before downloading the dataset, make sure you have enough space to download and unzip corresponding version of the dataset.

EXTRACT_DIR="/tmp/GLAMI-1M/" python -c 'import load_dataset; load_dataset.download_dataset())'

After the download, we can load the dataset into a dataframe.

EXTRACT_DIR="/tmp/GLAMI-1M/" python -c 'import load_dataset; print(load_dataset.get_dataframe("test").head())'

Baseline Weights

To fully produce all code in the repository install all requirements via:

pip install requirements.txt

The code is available in folders classification, image-to-text, and translation. Weights for the baseline models described in the paper are available here.

Examples

GLAMI-1M Dataset Examples Table

glami-1m's People

Contributors

sulc avatar thonic avatar vackosar avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

glami-1m's Issues

Missing "category" column in dataframe

First of all, thank you for open-sourcing this repository. I have just one question.
I recently downloaded the dataset and tried to run the eval_embracenet.py example.
However, I ran into a problem where the dataset raised an error while loading due to COL_NAME_CATEGORY ("category") not existing.

['item_id', 'image_id', 'name', 'description', 'geo', 'category_name', 'label_source', 'image_file'] seems to be what I get when I run df.keys() after line 19 in eval_embracenet.py.

Thank you in advance. I look forward to hearing from you soon.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.