Git Product home page Git Product logo

testing-wine-quality's Introduction

Testing Wine Quality

The purpose of this machine learning project is to investigate the potential for using data-driven approaches to predict wine quality. The overall objective of this study consists in utilizing different machine learning techniques to predict the quality of wine based on various chemical properties such as acidity, pH, and alcohol content. A dataset comprising red or white wine samples and their corresponding quality ratings was exploited for model training and testing. The ultimate aim is to establish a reliable model for predicting the quality of new, previously unseen wines. In addition to demonstrating the feasibility of using machine learning for wine quality assessment, this project aims to provide valuable insights and practical applications for the wine industry.

Methodologies pursued:

1 . Use simple machine learning algorithms such as: Random Forest Classifier, Decision Three Classifier, SVC with different types of kernels

2 . Optimize the model’s performance with the best result by adding Stratified K Fold Model Selection

3 . Apply Outliers Detection Techniques (Isolation Forest, Elliptic Envelope, Local Outlier Factor or One Class SVM) to determine if the test accuracy would increase

These steps were applied on RED and WHITE datasets (RED = red wine, WHITE = white wine) and the results are presented in the following tables:

** for RED** RED

** for WHITE** WHITE

I observed that the model for predicting white wine quality did not perform as well as the one for predicting red wine quality. To improve this, we analyzed the feature importance and the correlation matrix to identify and eliminate features that were not contributing significantly to the model. As a result, we removed the density and total sulfur dioxide features from the dataset, which increased the accuracy from 0.87 to 0.88.

WHITE Correlation matrix Correlation_Matrix

Conclusion

After analyzing the data, it becomes evident that utilizing Stratified K Fold model selection in conjunction with Random Forest Classifier yields the most favorable results for predicting wine quality in this particular dataset. This suggests that a simpler approach, in this case, a combination of these two techniques, may be more effective for achieving the desired outcome.

testing-wine-quality's People

Contributors

anca-mt avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.