- Machine Learning Engineering
--
- Data Engineering
--
- Data Analytics
--
- Course Notes & Other Useful Material
--
- Contributions
Name: Kauvin Lucas
Type: User
Bio: Data Analytics / Data Engineering / Artificial Intelligence
Location: Bolivia
Blog: kauvinlucas.com
--
--
--
--
My notes of each module in Big Data Science, an online course offered by Semantix Brasil
Notebooks of Datacamp projects
Neste reposit贸rio apresentei os notebooks de analise explorat贸ria e visualiza莽茫o de dados feitos no Python com a ajuda das bibliotecas Pandas e Matplotlib. Este reposit贸rio responde ao desafio da plataforma Digital Innovation One.
Este reposit贸rio cont锚m os arquivos de contagem de palavras gerados no Google Cloud por meio de script de Python e dentro de um ecossistema de Big Data gerenciado em cloud chamado Google DataProc. O reposit贸rio em quest茫o responde ao desafio da plataforma Digital Innovation One.
Big Data Ecosystem Docker
A complete catalog of all the players in Fifa 18 and their complete statistics.
In this project, I analyzed the scores of the ENEM 2019, a standardized test used for admission in Brazilian colleges, in the context of existing socioeconomic disparities between participants. PySpark was used for data ingestion and transformation. Pandas, Statsmodels, Matplotlib/Seaborn/Folium, and Scikit-learn were used for descriptive analysis and data visualization.
This is a web app made with Python consisting of a dashboard that was used as submission for a visualization challenge called "Maven Unicorn Challenge" by Maven Analytics
The main goal of this project was to build and optimize an Azure ML pipeline using the Python SDK and a provided Scikit-learn Logistic Regression model to solve a classification problem. Hyperdrive was used to optimize the model. This was then compared to an Azure AutoML run to see which of these approaches returns the best tuned model.
Final project submission for the IBM Data Science Professional Certificate specialization
This is a simple project consisting of a pipeline of streaming processing with Apache Kafka, PySpark and Twitter Streaming API. This project is meant to understand the concepts behind stateful processing and event time processing with Spark Streaming
This repository contains files used to build images to deploy Spark clusters on Kubernetes
#DataEngineeringLATAM
A declarative, efficient, and flexible JavaScript library for building user interfaces.
馃枛 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 馃搳馃搱馃帀
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google 鉂わ笍 Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.