pushkar / abagail Goto Github PK

The library contains a number of interconnected Java packages that implement machine learning and artificial intelligence algorithms. These are artificial intelligence algorithms implemented for the kind of people that like to implement algorithms themselves.

License: BSD 3-Clause "New" or "Revised" License

Java 93.69% Python 5.60% Shell 0.08% HTML 0.54% Batchfile 0.08%

artificial-intelligence-algorithms neural-network java machine-learning optimization-algorithms

abagail's Introduction

ABAGAIL

Usage

*For discrete optimization problems see java examples /src/opt/test or jython versions /jython
*For jython | csv | python and grid search examples see /jython
*Also see Wiki, FAQ

Contributing

Fork it.
Create a branch (git checkout -b my_branch)
Commit your changes (git commit -am "Awesome feature")
Push to the branch (git push origin my_branch)
Open a Pull Request
Enjoy a refreshing Diet Coke and wait

Features

Hidden Markov Models

Baum-Welch reestimation algorithm, scaled forward-backward algorithm, Viterbi algorithm
Support for Input-Output Hidden Markov Models
Write your own output or transition probability distribution or use the provided distributions, including neural network based conditional probability distributions
Neural Networks

Feed-forward backpropagation neural networks of arbitrary topology

Configurable error functions with sum of squares, weighted sum of squares
Multiple activation functions with logistic sigmoid, linear, tanh, and soft max
Choose your weight update rule with standard update rule, standard update rule with momentum, Quickprop, RPROP
Online and batch training
Support Vector Machines

Fast training with the sequential minimal optimization algorithm

Support for linear, polynomial, tanh, radial basis function kernels
Decision Trees

Information gain or GINI index split criteria

Binary or all attribute value splitting
Chi-square signifigance test pruning with configurable confidence levels
Boosted decision stumps with AdaBoost
K Nearest Neighbors

Fast kd-tree implementation for instance based algorithms of all kinds

KNN Classifier with weighted or non-weighted classification, customizable distance function
Linear Algebra Algorithms

Basic matrix and vector math, a variety of matrix decompositions based on the standard algorithms

Solve square systems, upper triangular systems, lower triangular systems, least squares
Singular Value Decomposition, QR Decomposition, LU Decomposition, Schur Decomposition, Symmetric Eigenvalue Decomposition, Cholesky Factorization
Make your own matrix decomposition with the easy to use Householder Reflection and Givens Rotation classes
Optimization Algorithms

Randomized hill climbing, simulated annealing, genetic algorithms, and discrete dependency tree MIMIC

Make your own crossover functions, mutation functions, neighbor functions, probability distributions, or use the provided ones.
Optimize the weights of neural networks and solve travelling salesman problems
Graph Algorithms

Kruskals MST and DFS

Clustering Algorithms

EM with gaussian mixtures, K-means

Data Preprocessing

PCA, ICA, LDA, Randomized Projections

Convert from continuous to discrete, discrete to binary
Reinforcement Learning

Value and policy iteration for Markov decision processes

abagail's People

Contributors

Stargazers

Watchers

Forkers

chrisjstjohn thejenix willieowens msandt3 salil-pai psychovision sichinumi ymiyata rlobo3 nybblr joker23 dnuffer cgearhart mlee350 cdspace clementrobotics tim9996 dhalima3 dgritsko aschenoni jfalkson klittlepage makearl billmccord jhudgins skavanaugh shashir sterlinm sandizzle balajin-cse vamsijkrishna krishnatray bjella karun8880 prabhjotsl amritbhandari lauradhamilton jenevans33 daniel1124 gijigae brandon-o mwytock0812 david-wilson fernavid golemme hellofanengineer deepti1011 rsadek jenleong mlhales mongi3 oforero omtinez thedeetch anupradhan scivm opikalo nickrobinson cy-goh misterquinn joshuamorton adelenesim abrooke-forks mohit1007 dgonzalez7 jlas jnonon kaniska davidvondollen simkieu businessmeetsprogramming chipmandal pujun-ai seederekengineer danielrich vergenzt mkumar23 sdlin sshamid jgromeros ashaw596 idaho777 ahuynh rikinm sameersegal mafreihaut rajanchaudhari yiqic follower76 wenzheli adamacosta kingjuli fabriciotuosto jibaro danainschool yamolekula onaclovtech icarusalways nlscng hippohaha7

abagail's Issues

Bug in knapsack jython file

https://github.com/pushkar/ABAGAIL/blob/master/jython/knapsack.py

Somehow the GA and MIMIC functions get scores that exceed the knapsack volume. This shouldn't happen if we have the correct checks in place, right?

MIMIC outputs incorrect solution for NQueensTest

Clarification on Function

Any chance you understand what's happening on this line?
https://github.com/pushkar/ABAGAIL/blob/master/src/util/graph/KruskalsMST.java#L29

From what I can tell we are getting a cop of data (edges) then clearing it, but it appears that it is only clearing the returned list, which doesn't modify anything, nor does it save off the state in any way.

(I'm trying to convert to Python, but this is one of the very few lines that has me utterly baffled why it's there).

Useful dependencies: guava, apache commons, junit, functional java

@pushkar
Some of these will make our lives a lot easier. Perhaps have Maven manage dependencies?

Customize AdaBoostClassifier

Instead of taking a Class as an argument for the constructor of the AdaBoostClassifier class, perhaps take in a factory that creates FunctionApproximaters. This way the user has the ability to customize the internals.

https://github.com/pushkar/ABAGAIL/blob/master/src/func/AdaBoostClassifier.java#L46

License?

Is this library meant to be released under an open source style license? Without one it's unclear what uses of this code are acceptable - can this be used in a commercial context, for example? Strictly speaking, without a license, modifying and redistributing the code are copyright infringement.

I believe predicted and actual are switched in abalone_test

predicted = instance.getLabel().getContinuous() print "predicted ", predicted actual = networks[i].getOutputValues().get(0) print "actual ", actual
When I run that code, you see that predicted is either 0 or 1, and the actual is shown as a continuous value.

Since the data is loaded with all values less than 15 "washed" as a 1 and 0 if otherwise, these values are swapped in the code.

Not a big deal since the code is measuring absolute value of difference, but can be confusing

Unit tests and JUnit integration

We need this pretty badly.

Sum of Squared Errors

When porting to python the SumOfSquaredErrors.java file, you only loop through the output size and don't look at the rest of the examples. This doesn't seem like correct behavior.

https://github.com/pushkar/ABAGAIL/blob/master/src/shared/SumOfSquaresError.java#L22

I'm not certain but I think this might be the right way to do it
for (int i = 0; i < output.size(); i++) {
for (int j = 0; i < label.size(); i++) {
sum += (output.getContinuous(i) - label.getContinuous(j))
* (output.getContinuous(i) - label.getContinuous(j))
* example.getWeight(); // Not sure if weight should change or move out of the whole thing (as in multiply by weight at the end).
}

I just wanted to get your thoughts on this.

build fail with ant

Line 40 and 41 of build.xml should be:

    <target name="javadoc" depends="prepare" >
        <javadoc sourcepath="${src.dir}" destdir="${jdocs.dir}" additionalparam="subpackages"/>

the current version fails when run ant.

ArffDataSetReader initializes instances with null labels

The ArffDataSetReader initializes all instances with null labels when reading an arff file.

The error ends up looking like this

Running Random Hill ClimbingException in thread "main" java.lang.NullPointerException
	at shared.SumOfSquaresError.value(Unknown Source)
	at opt.example.NeuralNetworkEvaluationFunction.value(Unknown Source)
	at opt.example.NeuralNetworkOptimizationProblem.value(Unknown Source)
	at opt.RandomizedHillClimbing.<init>(Unknown Source)
	at tests.OptdigitsTest.main(Unknown Source)

Hello

jdk: oraclejdk8 causing travis builds to fail

Is it possible to remove this line or specify a newer version?

Error running example from wiki

If I run the example in the wiki:

cd ABAGAIL
ant
java -cp ABAGAIL.jar opt.test.XORTest

I get
Error: Could not find or load main class opt.test.XORTest

ABAGAIL.arrays fails search when sample is not found

The search method in ABAGAIL.arrays does not return a valid index when the value found is at the top end of the distribution. The start 'high' value is the length of the array instead of length - 1, which results in the 'high' value sometimes returning an index that is out of bounds from the array.