Git Product home page Git Product logo

github-scraper's Introduction

github-scraper

A Python script that outputs to a .txt file the clone URLs of each public repo of a given organization or user set at the top of the script.

This project is released under the Unlicense, meaning you can do whatever you want with it.

Usage

  1. Edit github_scraper.py and edit the input_url variable to be the organization or user you wish to scrape. Example:

    input_url = 'https://github.com/orgs/salesforce/repositories'

  2. Ensure you have installed the required dependences for the script:

    pip install requests pip install lxml

  3. Open an Administrator Powershell, Command Window, or other Terminal and run:

    ./github_scraper.py

When it is finished, you can see the results in the output.txt file created in the same folder. There will also be a log.txt file that contains the execution log of the script.

Setup

If you are behind a corporate proxy, ensure port 443 can reach GitHub.com.

Required Dependencies:

  • Python 3
  • requests (Python package)
  • lxml (Python package)

github-scraper's People

Contributors

zesix avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.