Git Product home page Git Product logo

bridgekeeper's Introduction

BridgeKeeper

Scrape employee names from search engine LinkedIn profiles. Convert employee names to a specified username format.

Usage

usage: bridgekeeper.py [-h] (-c COMPANY | -F FILE) [-f FORMAT] [-D DOMAIN]
                       [-a API] [-d DEPTH] [-t TIMEOUT] [-o OUTPUT]
                       [--cookie COOKIE] [--proxy PROXY] [--lower] [--upper]
                       [--debug]

Scrape employee names from search engine LinkedIn profiles. Convert employee
names to a specified username format.

optional arguments:
  -h, --help            show this help message and exit
  -c COMPANY, --company COMPANY
                        Target company to search for LinkedIn profiles (e.g.
                        'Example Ltd.').
  -F FILE, --file FILE  File containing names to be converted to usernames.
                        Name format: 'First Last'
  -f FORMAT, --format FORMAT
                        Specify username format. Valid format identifiers:
                        {first}, {middle}, {last}, {f}, {m}, {l}, [#] (For
                        trimming names)
  -D DOMAIN, --domain DOMAIN
                        Domain name of target company for Hunter.io email
                        format identification and email scraping.
  -a API, --api API     Hunter.io API key.
  -d DEPTH, --depth DEPTH
                        Number of pages to search each search engine. Default:
                        5
  -t TIMEOUT, --timeout TIMEOUT
                        Specify request timeout. Default: 25
  -o OUTPUT, --output OUTPUT
                        Directory to write username files to.
  --cookie COOKIE       File containing Google CAPTCHA bypass cookies
  --proxy PROXY         Proxy to pass traffic through: <ip:port>
  --lower               Force usernames to all lower case.
  --upper               Force usernames to all upper case.
  --debug               Enable debug output.

Examples

Gather employee names for a company, Example, and convert each name into an 'flast' username formatted email:
$ python3 bridgekeeper.py --company "Example Ltd." --format {f}{last}@example.com --depth 10 --output example-employees/ --debug

Gather employee names and email addresses from search engines and Hunter.io:
$ python3 bridgekeeper.py --company "Example Ltd." --domain example.com --api {API_KEY} --depth 10 --output example-employees/ --debug

Convert an already generated list of names to usernames:
$ python3 bridgekeeper.py --file names.txt --format {f}{last}@example.com --output example-employees/ --debug

Username format examples (BridgeKeeper supports middle names as well as character limited usernames - e.g. only 4 characters of a last name is used):

Name: John Adams Smith
{f}{last}                   > jsmith
{f}{m}.{last}               > ja.smith
{f}{last}[4]@example.com    > [email protected]

Features

  • Support for three major search engines: Google, Bing, and Yahoo
  • Name parsing to strip LinkedIn titles, certs, prefixes, etc.
  • Search engine blacklist evasion
  • Proxying
  • Username formatting with support for trickier username formats
    • Name trimming
      • e.g. If a username format has only the first 4 characters of the last name
    • Hyphenated last name handling
    • Duplicate username handling
      • Incrementing numbers appended to duplicate usernames
  • Use Hunter.io to identify the email format for a specified domain and pull down any known emails for that domain

Contributers

aslarchergore - Gather format and emails from Hunter.io

Acknowledgements

m8r0wn - CrossLinked
initstring - linkedin2username

bridgekeeper's People

Contributors

0xzdh avatar nromsdahl avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.