Fitzpatrick17k


Overview

We annotated 16,577 clinical images sourced from two dermatology atlases, DermaAmin and Atlas Dermatologico, with Fitzpatrick skin type labels provided by two data annotation services: Scale AI and Centaur Labs. The Fitzpatrick labeling system, while not perfect, is a six-point scale originally developed for classifying the sun reactivity of skin phenotypes. The Fitzpatrick scale served as the basis for skin color in emojis and, more recently, has been used in computer vision applications to evaluate algorithmic fairness and model accuracy. The annotated images represent 114 skin conditions, with at least 53 and at most 653 images per skin condition.

We've included the original image sources, the training script train.py, and a notebook that compares Fitzpatrick annotations with individual typology angle (ITA) scores. You can read about the dataset and our analysis in our paper.

We thank Scale AI and Centaur Labs for providing Fitzpatrick Skin Type annotations for all images in this dataset for free.

Updates July 7 2022

Given the subjectivity of annotating Fitzpatrick Skin Type in images showing skin disease, we evaluated how well experts, crowds, and an algorithm compare with respect to inter-rater reliability. You can find replication files for this analysis in the annotation_evaluation folder.

Data Usage


Download the dataset

You can find the Fitzpatrick annotations in fitzpatrick17k.csv. You can download each image from its original source, which is listed in the url column of that file. Alternatively, fill out this form or contact us, and we can provide a link to all the images.
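As a rough illustration of the first option, the sketch below reads the url column of fitzpatrick17k.csv and saves each image locally. It is not part of the repository. The function names, the MD5-of-URL file naming, and the .jpg extension are assumptions made for this example, and some source sites may reject plain requests or need extra headers.

```python
import csv
import hashlib
from pathlib import Path
from urllib.request import urlopen


def read_image_urls(csv_path):
    """Return the non-empty values of the `url` column, in file order."""
    with open(csv_path, newline="", encoding="utf-8") as f:
        rows = csv.DictReader(f)
        return [row["url"].strip() for row in rows if (row.get("url") or "").strip()]


def fetch_bytes(url, timeout=30):
    """Download one URL and return its raw bytes."""
    with urlopen(url, timeout=timeout) as resp:
        return resp.read()


def download_images(csv_path, out_dir, fetch=fetch_bytes):
    """Save every image listed in the CSV; return (saved_paths, failures)."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    saved, failed = [], []
    for url in read_image_urls(csv_path):
        # Hypothetical naming scheme: MD5 of the URL plus an assumed .jpg suffix.
        name = hashlib.md5(url.encode("utf-8")).hexdigest() + ".jpg"
        target = out / name
        if target.exists():
            # Already downloaded on a previous run; skip the request.
            saved.append(target)
            continue
        try:
            target.write_bytes(fetch(url))
            saved.append(target)
        except (OSError, ValueError) as exc:
            # Network and HTTP errors subclass OSError; record and keep going.
            failed.append((url, str(exc)))
    return saved, failed
```

Calling download_images("fitzpatrick17k.csv", "images") would then populate an images directory and return the list of URLs that could not be fetched, which is useful because some atlas links may no longer resolve.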

Replicate our analysis

The results from our paper can be replicated using train.py and ita_fitzpatrick_analysis.ipynb.

After you download the dataset, edit train.py to point to the dataset's image directory, and then run python train.py 20 full, where 20 is the number of epochs and full selects the full dataset.

Our comparison of Fitzpatrick annotations and individual typology angle (ITA) scores is in the ita_fitzpatrick_analysis.ipynb notebook.
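For readers unfamiliar with the individual typology angle, it is conventionally computed from the CIELAB lightness L* and yellow-blue component b* of a skin pixel as ITA = arctan((L* - 50) / b*) × 180 / π, with larger angles corresponding to lighter skin. The sketch below shows only this standard formula. The function name is an assumption for illustration, and the notebook may select skin pixels and aggregate values differently.

```python
import math


def individual_typology_angle(l_star, b_star):
    """Return the ITA in degrees from CIELAB L* and b* values.

    atan2 is used instead of atan((L* - 50) / b*) so that b* == 0 does not
    raise a division error; the two agree whenever b* > 0.
    """
    return math.degrees(math.atan2(l_star - 50.0, b_star))
```

For example, a pixel with L* = 60 and b* = 10 gives an angle of 45 degrees, and raising L* while holding b* fixed increases the angle.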

How to cite this dataset and paper

@inproceedings{groh2021evaluating,
  title={Evaluating deep neural networks trained on clinical images in dermatology with the fitzpatrick 17k dataset},
  author={Groh, Matthew and Harris, Caleb and Soenksen, Luis and Lau, Felix and Han, Rachel and Kim, Aerin and Koochek, Arash and Badri, Omar},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={1820--1828},
  year={2021}
}

And the second paper:

@article{groh2022towards,
  title={Towards transparency in dermatology image datasets with skin tone annotations by experts, crowds, and an algorithm},
  author={Groh, Matthew and Harris, Caleb and Daneshjou, Roxana and Badri, Omar and Koochek, Arash},
  journal={arXiv preprint arXiv:2207.02942},
  year={2022}
}

Licensing

Original images collected from Atlas Dermatologico and DermaAmin

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
