Git Product home page Git Product logo

scrapyproject's Introduction

Flask application to visualise scrapped data with scrapy

Table of contents

General info

The idea behind the project is to scrape data from IMDB and Amazon websites and then store this data in CSV files, on which we will apply some visualisations with the help of the plotly library. Then we will deploy a FLASK application with our visualisations on Heroku.

So to summarize, the steps are as follow :

1- Scrap data with Scrapy.

2- Store data in CSV files.

3- Clean the CSV files and apply visualisations with Plotly.

4- Deploy Flask application with the visualisations.

This guideline is about how to run the Flask application locally as well as deploying it to Heroku.

Technologies

Project is created with:

  • Python version: 3 or more
  • PyCharm : community edition
  • Scrapy
  • Plotly
  • Flask

Setup

To run this project locally follow the steps below :

1- Clone the repo.

2- Open the project in PyCharm.

3- Install the required libraries from the PyCharm terminal : $ pip install -r requirements.txt

4- Run FirstDashboard.py

Heroku

To deploy this project on Heroku follow the steps below :

1- Create a Heroku account if you don't have one.

2- Create a new application.

3- Choose Heroku Git as the deployment method.

4- Install Heroku CLI.

5- Inside the PyCharm terminal enter the following commands :

$ heroku login 
$ git init 
$ heroku git:remote -a yourAppName
$ git add .
$ git commit -am "commit message"
$ git push heroku master

Demo

Heroku link : https://web-semantique.herokuapp.com

alt text

scrapyproject's People

Contributors

moltomay avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.