Git Product home page Git Product logo

mitigatinglabelnoisedataambiguation's Introduction

Mitigating Label Noise through Data Ambiguation

This repository contains an implementation of Mitigating Label Noise through Data Ambiguation to be presented at AAAI-24. Please cite it as follows:

@misc{lienen2023mitigating,
      title={Mitigating Label Noise through Data Ambiguation}, 
      author={Julian Lienen and Eyke Hüllermeier},
      year={2023},
      eprint={2305.13764},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

Requirements

To install all required packages, you need to run

pip install -r requirements.txt

The code has been tested using Python 3.9 on Ubuntu 2*.* systems. We trained our models on machines with Nvidia GPUs (we tested CUDA 10.1, 11.1 and 11.6). Furthermore, we recommend to use Python virtual environments to get a clean Python environment for the execution without any dependency problems.

As a required prerequisite, the config.ini needs to be populated with parameters to set the output directory (BASE_PATH), a directory for temporary artifacts (TMP_PATH) and an output directory for plots (PLOT_DIR).

Datasets

All datasets except for CIFAR-10(0)N, WebVision and Clothing1M are downloaded automatically. Webvision is available here, whereas access to Clothing1M has to be explicitly granted by the owner. CIFAR-10(0)is available here. All data needs to be stored in the specified --data_dir given as parameter to the training script (see next section).

Training and Evaluation

For the training and evaluation, you have to call the following function (e.g., for CIFAR-10 with 25 % symmetric synthetic noise for our loss):

CUDA_VISIBLE_DEVICES=<the numeric ID(s) of your CUDA device(s)> python train.py --dataset=cifar10  --model resnet34 --seed 0 --loss RDA --adaptive_lrvar2 --adaptive_lrvar2_start_beta 0.75 --lrvar2_beta 0.6 --adaptive_lrvar2_type cosine --lr 0.02 --decay_type cosine --label_noise 0.25

--help allows for printing out all parameter options. All results presented in the paper were computed based on the training scripts train.py.

License

Our code uses the Apache 2.0 License, which we attached as LICENSE file in this repository.

Feel free to re-use our code. We would be happy to see our ideas put into practice.

mitigatinglabelnoisedataambiguation's People

Contributors

julilien avatar

Stargazers

 avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.