Git Product home page Git Product logo

dynamicwebscraper's Introduction

Dynamic Web Scraper

Scrapes a Website that is taken from an inputted emails domain.

This project uses ECMAScript 6(2015)

This is a NodeJS project that is designed to get as much data as possible from an email address. It will grab the domain from an email address and visit the corresponding website. From the website it will scrape phone numbers and email addresses with a fantastic success rate. Then it will use GoogleMaps API to find out much more about the company that owns the website. It grabs the address, the company name, company GMaps rating and whether or not the company is currently open. All scraped data is saved in a json file, which is also passed to a HTML5 table, within a webapp, hosted on localhost:8000, to make the data much more readable.

Screenshots

Webapp view

CLI view

Installation

Windows & OS X & Linux:

git pull https://github.com/Crithane/dynamicWebScraper.git
npm install

Usage example

node app.js [email protected]

Features

  • Dynamically scrapes any domain's website from an inputted email address
  • Grabs all phone numbers from website
  • Grabs all email addresses from website
  • Utilizes GMaps API to grab companies address
  • Utilizes GMaps API to grab companies name
  • Utilizes GMaps API to grab companies rating
  • Utilizes GMaps API to see if company is open at the time of the request
  • Saves all scraped data to /domain/domain.json
  • Displays data in a table on a webapp hosted on localhost:8000

Dependencies

Authors

  • Shane Critchley-Kenyon

dynamicwebscraper's People

Contributors

crithane avatar

Stargazers

 avatar

Watchers

James Cloos avatar  avatar

Forkers

tawawhite

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.