Git Product home page Git Product logo

Welcome!

Here is something about me

Data Engineering projects

Crawlers

  • News Crawler - a Scrapy spider that crawls news and run in a Docker Container
    Extract textual data relating to news. The output of this crawler is particularly useful for researches with NLP, Computational Linguistics and Analytical intelligence with grouped texts.

    • Runs on a Docker container
    • The items crawled from the website are saved as jsonlines, compressed with gzip and send to a bucket on AWS.
  • ANP Crawler - a Scrapy spider that crawls fuel prices
    This crawler extracts data from ANP research on fuels and has two quality layers implemented:

    • There is a Pipeline for data standardization, validate if the data are in the defined structure and typing, monitor the crawler statistics, such as minimum amount of extracted items, unwanted http status, errors, etc.
    • Alerts to a Slack channel if one or more monitors failed
    • All this process run in a EC2 instance on AWS and Circle CI is used to manage the deploy process from Github to the EC2 instance

    The extracted data is saved in a gzip-compressed jsonlines file and sent to an AWS S3 bucket.

Courses in progress or completed

Here I built an environment to practice the sql exercices proposed by the course, with docker compose to setup the Mysql Instance (the course database) and the Metabase instance (so I can run the queries over the Mysql database and build some analysis with charts and dashboards).

Various

  • Smart River - System for monitoring the level of rivers
    My college graduation capstone project: an android app for monitoring the level of rivers. The data is collected by sensors that communicate with a central through LoRa wireless technology, present on an Arduino MKR board. The central, an ESP8266 board, consolidates the data and sends it to the database via Wi-Fi, which is finally accessed by the android application.

Isabela Capetti's Projects

developer-roadmap icon developer-roadmap

Interactive roadmaps, guides and other educational content to help developers grow in their careers.

risco_fogo_prediction icon risco_fogo_prediction

Repository destined for the FRP Prediction modeling project, the radiative fire energy distributed through burning in Brazilian forests.

tcc_arduino icon tcc_arduino

A BPM280 pressure sensor reads and an Arduino MKR board transmits this data to another Arduino MKR board via the LoRa connection. This MKR acts as a central and sends this data via serial connection to an ESP board. ESP sends data via wi-fi connection to a database hosted on a server.

video-maker icon video-maker

Minha versão do Video Maker, um projeto que possui 4 robôs que criam vídeos para o YouTube com base no termo de busca indicado pelo usuário.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.