Git Product home page Git Product logo

scala-ml's Introduction

李航《统计学习方法》的scala版本实现

开发环境:spark-2.3.1

分类方法准确率对比

Iris Mnist Mnist备注 Iris Mnist Mnist备注
多层感知机 1.00 0.97 10Epoch,20分钟 0.97 0.96/0.98 784-300-10,5分钟
KNN分类 0.80 0.65 抽样,快
Bayes分类 0.43 OOM 0.94 0.83/0.82
决策树分类 0.80 0.79 抽样,仍慢 1.00 0.68/0.68
逻辑回归 0.97 0.99 均简化二分类 1.00/0.98 0.92/0.93 先交叉验证找超参,慢
SVM 1.00 简化二分类 0.93/0.84 0.91/0.91 OneVsRest

每行前一组为自己实现的算法,后一组为spark自带的算法

双值前面为测试集准确率,后者为训练集准确率

scala-ml's People

Contributors

hlstudio avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.