drroad Goto Github PK
Type: User
Type: User
'Easy' web-enabled Exploratory Data Analysis with Shiny :mag_right:
Web-based tool to monitor information about sensor quality and framework reliability. In the CityPulse Framework it provides a visualisation of the Monitoring Component.
EDA and Machine Learning Models in R (Regression, Classification, SVM, Decision Tree, Random Forest, Time-Series Analysis, Recommender System, XGBoost)
Notebooks to learn data science - Videos https://www.edyoda.com/course/1416
An open source book to learn data science, data analysis and machine learning, suitable for all ages!
A curated set of resources for data science, machine learning, artificial intelligence (AI), big data, and more. Includes links, notebooks, articles, books, cheat sheets, whitepapers, lists, and technical papers
AI and Machine Learning with Kubeflow, Amazon EKS, and SageMaker
Introductory tutorial to data science using R programming language and machine learning.
Data Science Projects Repository
data story tech-blog
A shiny app to drag and drop csv files to get fast data summaries of the data set.
Summary In this article, we are going to present the solution for the Women’s Health Risk Assessment data science competition on Microsoft’s Cortana Intelligence platform which was ranked among the top 5%. In this page, you can find the published Azure ML Studio experiment., a description of the data science process used, and finally a link to the R code (in GitHub). Competition Here is the description from the Microsoft Cortana Competition “To help achieve the goal of improving women's reproductive health outcomes in underdeveloped regions, this competition calls for optimized machine learning solutions so that a patient can be accurately categorized into different health risk segments and subgroups. Based on the categories that a patient falls in, healthcare providers can offer an appropriate education and training program to patients. Such customized programs have a better chance to help reduce the reproductive health risk of patients. This dataset used in this competition was collected via survey in 2015 as part of a Bill & Melinda Gates Foundation funded project exploring the wants, needs, and behaviors of women and girls with regards to their sexual and reproductive health in nine geographies. The objective of this machine learning competition is to build machine learning models to assign a young woman subject (15-30 years old) in one of the 9 underdeveloped regions into a risk segment, and a subgroup within the segment.” https://gallery.cortanaintelligence.com/Competition/Womens-Health-Risk-Assessment-1 Dataset The contains 9000 observations The original training dataset is in CSV format and can be found in the competition’s description. To submit a solution, two options are possible: build it in Azure ML Studio or build your solutions locally in R and then submit it through Azure ML Studio. An Azure ML’s solution, and a R script code where given as example. The two solutions are based on the use of a Generalized Linear Model is automatically downloaded. You can find a detailed description of the dataset, the R sample Code and a tutorial using Azure ML and R in the competition page Solution I started following the R tutorial for this competition. Then I have submitted the exact same R solution. The sample model has a 77% accuracy Pre-processing & Cleaning The first thing I did was changing the initial multinomial model (nnet package) for a random forest model (RandomForest package). All missing values have been replaced by 0 Feature selection Features have been selected using the function varImpPlot from the randomforest package Parameter tuning I have chosen (for educational matter) to use the module Tune Model Hyperparameters in Azure ML Studio. I could have also used the R Package Caret. Evaluation The final model has an accuracy of 86.36% (18 position over almost 500 participants) You can download the R code here
Tools to collect and analyze the nbi dataset
StreamSets Data Collector - Continuous big data and cloud platform ingest infrastructure
人人可用的开源数据可视化分析工具。
An R package for data screening
An R package to enable reproducible data processing, packaging and sharing.
A Shiny app for data preparation, meant primarily for use with ONDRI data as part of the ONDRI NIBS standards and outliers pipeline.
R package for data quality assessment
A tool to help improve data quality standards in observational data science.
:exclamation: This is a read-only mirror of the CRAN R package repository. datarobot — 'DataRobot' Predictive Modeling API
Churn Analysis of Telecom data from VDM
Data Science Course in a Box
Codes written in the course of a data science workshop at KIT in cooperation with FZI
Deploying pre-trained NLP and computer vision models in a single application
common data analysis tasks using R
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.