predictionio-template-text-classifier's Introduction

Text Classification Engine

See the tutorial for a Quick Start guide and implementation details.

Release Information

Version 6.0

  • Use Apache Lucene as tokenizer
  • Add stopwords filter
  • Rename Scala package name
  • Update SBT version

Version 5.0 First Apache Version

  • Major changes to namespace to reflect donation to the Apache Software Foundation.
  • Build changes to support modified Apache build mechanism

Version 4.0

Restructured and redesigned the preparator and algorithm: memory usage is lower and run time is faster. Moved the BIDMach, VW, and SPPMI algorithm changes to the bidmach branch temporarily.

Version 3.1

Fixed DataSource to read "content" and "e-mail", and to use the label "spam" for tutorial data. Fixed the default algorithm setting in engine.json.

Version 2.2

Modified PreparedData to use the MLlib hashing and tf-idf implementations.
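
As a rough illustration of what hashing plus tf-idf computes, here is a minimal, dependency-free sketch in plain Scala. It is not the MLlib code and not the template's PreparedData: the object name, the function names, and the use of `String.hashCode` for bucketing are assumptions made for this example. The IDF weight uses the smoothed formula log((m + 1) / (df + 1)) documented for MLlib's IDF, where m is the number of documents and df is the number of documents containing the term.

```scala
object HashingTfIdfSketch {
  // Hashing trick: map a term to a bucket in [0, numFeatures).
  // Assumption: String.hashCode with a non-negative modulus; MLlib's hash may differ.
  def bucket(term: String, numFeatures: Int): Int = {
    val raw = term.hashCode % numFeatures
    if (raw < 0) raw + numFeatures else raw
  }

  // Term frequencies of one tokenized document as a sparse map: bucket -> count.
  def termFrequencies(tokens: Seq[String], numFeatures: Int): Map[Int, Double] =
    tokens.groupBy(bucket(_, numFeatures)).map { case (b, ts) => b -> ts.size.toDouble }

  // Inverse document frequency per bucket: log((m + 1) / (df + 1)).
  def inverseDocFrequencies(tfs: Seq[Map[Int, Double]]): Map[Int, Double] = {
    val m = tfs.size.toDouble
    tfs.flatMap(_.keys).groupBy(identity).map { case (b, hits) =>
      b -> math.log((m + 1.0) / (hits.size + 1.0))
    }
  }

  // Scale each term frequency by its IDF weight.
  // Simplification of this sketch: buckets not seen while fitting get weight 0.
  def tfIdf(tf: Map[Int, Double], idf: Map[Int, Double]): Map[Int, Double] =
    tf.map { case (b, count) => b -> count * idf.getOrElse(b, 0.0) }

  def main(args: Array[String]): Unit = {
    val docs = Seq(
      Seq("free", "money", "now"),
      Seq("meeting", "at", "noon"),
      Seq("free", "lunch")
    )
    val tfs = docs.map(termFrequencies(_, 1 << 10))
    val idf = inverseDocFrequencies(tfs)
    tfs.map(tfIdf(_, idf)).foreach(println)
  }
}
```

Hashing keeps the feature space at a fixed size without storing a vocabulary, at the cost of occasional collisions between terms that land in the same bucket.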

Version 2.1

Fixed dot product implementation in the predict methods to work with batch predict method for evaluation.
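
To show the kind of computation this note refers to, here is a minimal sketch of a dot-product scorer whose single-query and batch paths share one implementation. It is an assumption-laden illustration, not the template's code: `LinearModel`, `predict`, `batchPredict`, and the sparse `Map[Int, Double]` feature representation are hypothetical names and types chosen for the example.

```scala
object DotProductPredictSketch {
  // Dot product of a sparse feature vector (index -> value) with a dense weight row.
  def dot(features: Map[Int, Double], weights: Array[Double]): Double =
    features.foldLeft(0.0) { case (acc, (i, v)) => acc + v * weights(i) }

  // Hypothetical linear model: one bias term and one weight row per label.
  final case class LinearModel(
      labels: Vector[String],
      bias: Vector[Double],
      weights: Vector[Array[Double]]) {

    // Score every label and return the one with the highest score.
    def predict(features: Map[Int, Double]): String = {
      val scores = labels.indices.map(k => bias(k) + dot(features, weights(k)))
      labels(scores.indexOf(scores.max))
    }

    // Batch prediction reuses predict, so both paths always agree.
    def batchPredict(queries: Seq[(Long, Map[Int, Double])]): Seq[(Long, String)] =
      queries.map { case (id, features) => id -> predict(features) }
  }

  def main(args: Array[String]): Unit = {
    val model = LinearModel(
      labels = Vector("spam", "not spam"),
      bias = Vector(0.0, 0.0),
      weights = Vector(Array(1.0, 0.0), Array(0.0, 1.0))
    )
    println(model.predict(Map(0 -> 2.0)))
    println(model.batchPredict(Seq(1L -> Map(1 -> 1.0), 2L -> Map(0 -> 3.0))))
  }
}
```

Routing the batch path through the same `predict` function is one way to keep evaluation results consistent with live queries.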

Version 2.0

Included three different data sets: e-mail spam, 20 newsgroups, and the Rotten Tomatoes sentiment analysis set. Added a Multinomial Logistic Regression algorithm for text classification.

Version 1.2

Fixed an import script bug occurring with Python 2.

Version 1.1 Changes

Changed the Python data import script to pull data directly from the 20 newsgroups page.

Contributors

chanlee514, dszeto, emergentorder, k4hoo, marcovivero, marevol, nlauchande, pferrel, ramyerubandi, sebas00, shimamoto, takezoe
