cswanghan / hama-kmeans Goto Github PK
View Code? Open in Web Editor NEWThis project forked from willmore/hama-kmeans
KMeans Implementation Using Hama
This project forked from willmore/hama-kmeans
KMeans Implementation Using Hama
Hama K-Means Clustering ------------------------------------------------------------------- This project implements k-means clustering using the Hama framework. ------------------------------------------------------------------- Building This is a Java Maven project. All dependencies and build configurations are located in the ./pom.xml file. Please consult your local environment / IDE on how to build a Maven project. The result of a successful build is hama-kmeans-<VERSION>.jar in the ./target directory. To run the provided Python display script, you will need to install Python 2.6 and the matplotlib module. ------------------------------------------------------------------- Running You must have a Hama and Hadoop system installation. Consult http://wiki.apache.org/hama/GettingStarted. To run: $HAMA_HOME/bin/hama jar hama-kmeans.jar ee.ut.cs.willmore.KMeansCluster The program will generate a random input set on the configured HDFS and launch the k-means solver. Output will be stored on HDFS and additionally be copied to a the local /tmp directory. The location of these files will be printed to stdout. The program support a variety of options for generating input data: -points Number of points (observations). Default value is 1000. -k Number of clusters. Default value is the number of BSPPeers. Value must not exceed number of BSPPeers. -display A display script that will be called immediately after clustering has completed. A sample Python script is supplied at ./python/graph_output.py -noise Number of extra random points scattered throughout the problem space. -pattern Format of <pattern>[:<size>] where pattern of sphere|cube|random and optional size of points for each cluster. Example is sphere:20 which makes spheres of radius 20. -size Point world size in X,Y,Z dimensions. Example: a value of 100 would create a world of 100 X 100 X 100.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.