Git Product home page Git Product logo

sumn2u / deep-waste-app Goto Github PK

View Code? Open in Web Editor NEW
25.0 4.0 7.0 43.33 MB

Managing waste in fun and easy way with AI ♻️

Home Page: https://www.dwaste.live/

License: BSD 3-Clause "New" or "Revised" License

Kotlin 0.13% Ruby 1.50% Swift 0.44% Objective-C 0.04% Dart 96.26% HTML 1.63%
garbage-classification recycling flutter-apps image-classification flutter machine-learning community-project eco-friendly enviroment open-source

deep-waste-app's People

Contributors

sumn2u avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

deep-waste-app's Issues

Dataset Preprocessing

Waste Image Classification Dataset Preprocessing

Problem Statement

The dataset acquired for waste image classification contains variations in size, dimensions, and quality. To prepare the data for machine learning model training, a preprocessing step is necessary to standardize the data and address issues such as duplicates, missing values, and inconsistent labeling.

Dataset Link: https://www.kaggle.com/datasets/sumn2u/garbage-classification-v2/

Tasks

1. Data Cleaning

  • Identify and remove duplicate images to eliminate redundancy.
  • Detect and handle corrupted or incomplete images to ensure data integrity.
  • Check for and handle any missing or mislabeled data for accurate labeling.

2. Resizing

  • Standardize the size of all images to a common dimension suitable for model training.
  • Choose an appropriate size that balances computational efficiency and image quality.

3. Normalization

  • Normalize pixel values to a specific range, typically between 0 and 1.
  • Normalize color channels to have a mean of 0 and a standard deviation of 1 for improved model convergence.

4. Augmentation

  • Apply data augmentation techniques to artificially increase the dataset size.
  • Ensure augmentation parameters align with the characteristics of waste images.

5. Labeling

  • Validate and correct labels for each image to ensure accurate representation of classes.

6. Data Splitting

  • Divide the dataset into training, validation, and test sets for robust model evaluation.

7. File Format

  • Convert images to a common file format (e.g., JPEG, PNG).
  • Check and handle different color modes to ensure consistency.

8. Handling Class Imbalance

  • Implement techniques to address class imbalance, such as oversampling or undersampling, if necessary.

9. Metadata Extraction

  • Extract and store relevant metadata (e.g., timestamps, source information) for potential analysis or model enhancement.

10. Save Preprocessed Data

  • Save preprocessed images and corresponding labels in a structured format for easy access during model training.

Outcome

A well-preprocessed dataset ready for training machine learning models on waste image classification, addressing issues related to data quality, consistency, and standardization.

Expand Waste Database and Use Transfer Learning

Problem

This project utilizes the Trashnet dataset, comprising of 2527 images across six classes: glass, paper, cardboard, plastic, metal, and trash.

  • 501 glass
  • 594 paper
  • 403 cardboard
  • 482 plastic
  • 410 metal
  • 137 trash

Training an effective waste classification model requires more high-quality, diverse data.

Actions

Create a separate waste dataset by collecting from different sources and use transfer learning approach to add new knowledge.

References

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.