anilsener
Name: Anil Sener (Anıl Şener)
Type: User
100 days of algorithms
Machine Learning Lectures at the European Space Agency (ESA) in 2018
Code to accompany Advanced Analytics with Spark from O'Reilly Media
Analysis of Air Transportation Statistics Data Case Study solutions for a Lead Data Engineering Position
Open source simulator based on Unreal Engine for autonomous vehicles from Microsoft AI & Research
Alluxio, formerly Tachyon, Unify Data at Memory Speed
The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this repo or by making proposed changes & submitting a pull request.
Apache Spark 2x Machine Learning Cookbook, published by Packt
Apache Spark Deep Learning Cookbook, published by Packt
The most cited deep learning papers
A curated list of awesome Machine Learning frameworks, libraries and software.
A community driven list of useful Scala libraries, frameworks and software.
In a few hours, quickly learn how to effectively leverage various AWS services to improve developer productivity and reduce the overall time to market for new product capabilities.
I developed this case study in only 7 days with PySpark (Spark 1.6.0) SQL & MLlib. I used a Databricks cluster and AWS. Random Forest achieves 90% AUC (without the Trip Matching-Repeated Trips feature). Ensembles of RF, GBT and Logistic Regression, together with outlier elimination, could be used to improve this result. There are two versions of my code (test execution and full execution). Because AWS costs exceeded my budget, I stopped training my model(s) on the full dataset. There is also a PPT that presents my outputs from the test execution. The full-data execution code is a slightly different, more production-ready version. In that version I had to use Databricks table caching on the TRAIN and TEST data tables to obtain acceptable performance.
Predicting Backorders in Inventory Management Context.
BayesDB on SQLite. A Bayesian database table for querying the probable implications of data as easily as SQL databases query the data itself.
BigDL: Distributed Deep Learning Library for Apache Spark
Business Data Analysis by HiPIC of CalStateLA
Natural language processing pipeline for book-length documents
AWS SDK for Python
Breeze is a numerical processing library for Scala.
Hive UDFs for the data warehouse
Course materials/Homework materials for the FREE MOOC course on "Creative Applications of Deep Learning w/ Tensorflow" #CADL
Analyzes resource usage and performance characteristics of running containers.
Cauldron Unnotebook Gallery
Coordinated (etcd, ...) cluster construction for dynamic (cloud, containers) environments
CSV Validation Tool and API (CSV Schema RI)