This project forked from mindalpha/mindalpha

License: Apache License 2.0

CMake 1.68% Shell 0.12% C++ 65.21% C 0.10% Dockerfile 0.47% Python 26.75% Thrift 0.28% Jupyter Notebook 5.40%

MindAlpha

MindAlpha is a machine learning platform integrating PySpark, PyTorch and a parameter server implementation. The platform contains native support for sparse parameters, making it easy for users to develop large-scale models. Together with MindAlpha Serving, the platform provides a one-stop solution for data preprocessing, model training and online prediction.

Features

Efficient IO with PySpark. Minibatches read by PySpark as pandas DataFrames can be fed directly to models.
An API similar to PyTorch and Spark ML, so users familiar with PyTorch and PySpark can get started quickly.
Custom sparse layers are wrapped as PyTorch modules, making them easy to use. These sparse layers can contain billions of parameters.
Models can be developed interactively in Jupyter Notebook, and periodic model training can be scheduled with Airflow.
The trained model can be exported via one method call and loaded by MindAlpha Serving for online prediction.
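
To illustrate why sparse parameters scale to very large feature spaces, here is a minimal sketch of a lazily allocated embedding table. It is not MindAlpha's API or implementation: the class `SparseEmbeddingTable`, its methods, and the MD5-based key scheme are all hypothetical and use only the Python standard library. The idea shown is that rows exist only for features that have actually been seen, and an update touches only the rows used by the current minibatch.

```python
import hashlib
import random


class SparseEmbeddingTable:
    """Toy sparse embedding table (hypothetical, for illustration only)."""

    def __init__(self, dim, seed=0):
        self.dim = dim
        self.rows = {}  # 64-bit feature key -> embedding row (list of floats)
        self._rng = random.Random(seed)

    @staticmethod
    def feature_key(name):
        # Derive a stable 64-bit key from the feature string (assumed scheme).
        digest = hashlib.md5(name.encode("utf-8")).digest()
        return int.from_bytes(digest[:8], "little")

    def lookup(self, names):
        """Return one row per feature name, creating missing rows on demand."""
        vectors = []
        for name in names:
            key = self.feature_key(name)
            if key not in self.rows:
                self.rows[key] = [
                    self._rng.uniform(-0.01, 0.01) for _ in range(self.dim)
                ]
            vectors.append(self.rows[key])
        return vectors

    def sgd_update(self, names, grads, lr=0.1):
        """Apply a plain SGD step to the rows touched by this minibatch only."""
        for name, grad in zip(names, grads):
            row = self.rows[self.feature_key(name)]
            for i in range(self.dim):
                row[i] -= lr * grad[i]


table = SparseEmbeddingTable(dim=4)
batch = ["user:42", "item:7", "user:42"]
vectors = table.lookup(batch)  # allocates rows for two distinct features
```

In a real parameter-server setup the dictionary would be sharded across server nodes and the lookup and update would become pull and push requests, but the memory argument is the same: storage grows with the number of features observed, not with the size of the key space.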

Build

Use docker/Dockerfile to build a Docker image, launch a container from that image, and then run ./compile.sh inside the container.

Tutorials

Two tutorials are given:

MindAlpha Getting Started briefly introduces the basic API of MindAlpha.
MindAlpha Tutorial shows how to use MindAlpha in a production setting.
