All the Places

A project to generate point of interest (POI) data sourced primarily from major websites with 'store location' pages. The project uses scrapy, a popular Python-based web scraping framework, to write individual site spiders that retrieve POI data and publish the results in a standard format. There are various scrapy tutorials; this series on YouTube is reasonable.

Getting started

Development setup

Windows users may need to follow some extra steps; please see the scrapy docs for up-to-date details.

  1. Clone a copy of the project from the GitHub All The Places repo (or your own fork if you are considering contributing to the project):

    $ git clone git@github.com:alltheplaces/alltheplaces.git
    
  2. If you haven't done so already, install pipenv and check that it runs:

    $ pipenv --version
    pipenv, version 2022.8.30
    
  3. Use pipenv to install the project dependencies:

    $ cd alltheplaces
    $ pipenv install
    
  4. Test for successful project installation:

    $ pipenv run scrapy
    

    If the above runs without complaint, then you have a functional installation and are ready to run and write spiders.

Contributing code

Many of the sites provide their data in a standard format. Others export their data via simple APIs. The project has a number of guides to help you develop spiders.
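
As a rough illustration of the "standard format" case, the sketch below assumes the site embeds schema.org JSON-LD in its store pages, which is one common possibility and not necessarily the format every site uses. It relies only on the Python standard library rather than scrapy or the project's own helpers, and the names `JsonLdExtractor`, `extract_poi` and the output keys are hypothetical, chosen for this example only.

```python
import json
from html.parser import HTMLParser

# Hypothetical sample page; a real store page would be fetched by a spider.
SAMPLE_HTML = """
<html><head>
<script type="application/ld+json">
{"@type": "Store", "name": "Example Store",
 "geo": {"latitude": 51.5, "longitude": -0.12},
 "address": {"streetAddress": "1 High Street", "addressLocality": "London"}}
</script>
</head><body></body></html>
"""


class JsonLdExtractor(HTMLParser):
    """Collect the parsed contents of every application/ld+json script tag."""

    def __init__(self):
        super().__init__()
        self._in_json_ld = False
        self._buffer = []
        self.items = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_json_ld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_json_ld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_json_ld:
            self._in_json_ld = False
            try:
                self.items.append(json.loads("".join(self._buffer)))
            except json.JSONDecodeError:
                pass  # skip malformed blocks rather than abort the whole page


def extract_poi(html):
    """Return a list of simple POI dicts built from JSON-LD objects in html."""
    parser = JsonLdExtractor()
    parser.feed(html)
    pois = []
    for item in parser.items:
        if not isinstance(item, dict):
            continue
        geo = item.get("geo")
        geo = geo if isinstance(geo, dict) else {}
        address = item.get("address")
        address = address if isinstance(address, dict) else {}
        pois.append({
            "name": item.get("name"),
            "lat": geo.get("latitude"),
            "lon": geo.get("longitude"),
            "street_address": address.get("streetAddress"),
            "city": address.get("addressLocality"),
        })
    return pois


if __name__ == "__main__":
    for poi in extract_poi(SAMPLE_HTML):
        print(poi)
```

In a real spider the HTML would come from a scrapy response and the output would follow the project's own item format; the guides remain the authoritative reference for both.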

The weekly run

The output from running the project is published on a regular cadence to our website: alltheplaces.xyz. Please do not run all the spiders yourself just to obtain the output: the less the project "bothers" a website, the more we will be tolerated.

Contact us

Communication is primarily through tickets on the project GitHub issue tracker. Many contributors are also present on OSM US Slack; in particular, we watch the #poi channel.

License

The data generated by our spiders is provided on our website and released under Creative Commons’ CC-0 waiver.

The spider software that produces this data (this repository) is licensed under the MIT license.

