The Collector.py (lite) is a RedTeam Black-Python script that helps find BAD sites, meaning pages whose HTML content contains a specified keyword. It is part of a larger collection and should be used with caution. Some functions have been removed to prevent malicious use, but it remains a simple yet powerful script for scanning for bad sites.

Home Page: https://github.com/VolkanSah/The_Collector.py/tree/main

License: MIT License

Python 100.00%
abuse-detection offensive-scripts offensive-security python redteam-tools redteaming script security hacking information-gathering

the_collector.py's Introduction

The Collector.py (lite)

RedTeam Black-Python Scripts

Warning: This tool is only for educational and ethical hacking purposes. Do not use it for any malicious activities. Use at your own risk.

Installation

To use The Collector.py (lite), you need Python 3 installed on your machine, along with the following Python packages: requests, progress, and BeautifulSoup (published on PyPI as beautifulsoup4).

You can install the required packages by running the following command:

pip install requests progress BeautifulSoup4

How it works

The Collector.py (lite) works by scraping a given URL and all its links, then searching for a specified keyword within the HTML content. If the keyword is found, the URL is added to a list of bad sites.

The script uses the requests package to make HTTP requests, BeautifulSoup to parse the HTML content, and progress to display a progress bar during the scanning process.
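As a rough illustration of the flow described above, the sketch below scans a start page and every page it links to, one level deep, and marks a page as bad when the keyword appears in its HTML. It is not the author's code: it swaps in the standard library's html.parser and urljoin for BeautifulSoup so that it runs without extra packages, takes a hypothetical fetch callable in place of a requests call, and leaves out the progress bar. The names LinkParser, scan_page, and scan_site, the case-insensitive match, and the one-level depth are all assumptions made for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkParser(HTMLParser):
    """Collects the absolute targets of all <a href="..."> tags on a page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the URL of the page itself.
                self.links.append(urljoin(self.base_url, href))


def scan_page(html, base_url, keyword):
    """Return (is_bad, links) for one page of HTML.

    Assumption: the keyword match is a case-insensitive substring search
    over the raw HTML.
    """
    parser = LinkParser(base_url)
    parser.feed(html)
    return keyword.lower() in html.lower(), parser.links


def scan_site(start_url, keyword, fetch):
    """Scan the start page and each page it links to (one level deep).

    `fetch` is a hypothetical callable that maps a URL to its HTML text.
    Returns a dict of {url: is_bad}.
    """
    is_bad, links = scan_page(fetch(start_url), start_url, keyword)
    results = {start_url: is_bad}
    for link in links:
        if link not in results:  # skip pages that were already visited
            results[link], _ = scan_page(fetch(link), link, keyword)
    return results
```

In a real run, fetch could be something like lambda url: requests.get(url, timeout=10).text; passing it in as a parameter keeps the scanning logic testable without network access.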

Usage

To use The Collector.py (lite), simply run the script and provide the starting URL and the keyword you want to search for as command-line arguments:

python3 collector.py <starting_url> <keyword>

For example, to search for the keyword "child abuse" on the website "https://example.com", you would run:

python3 collector.py https://example.com "child abuse"
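For readers who want to see how a two-argument command line like the one above can be handled, here is a minimal sketch using argparse from the standard library. Whether collector.py itself uses argparse is not stated, so treat the parse_args helper and the argument names starting_url and keyword as assumptions taken from the usage line. The sketch covers only the two-argument form.

```python
import argparse


def parse_args(argv=None):
    """Parse the two positional arguments shown in the usage line.

    Passing argv=None makes argparse read the real command line.
    """
    parser = argparse.ArgumentParser(
        prog="collector.py",
        description="Scan a URL and its links for a keyword.",
    )
    parser.add_argument("starting_url", help="URL where the scan begins")
    parser.add_argument("keyword", help="keyword to search for in each page")
    return parser.parse_args(argv)
```

A keyword that contains spaces has to be quoted on the shell, as in the example above, so that it arrives as a single argument.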

For best results, adjust collector.py to your needs and then run it without arguments:

python3 collector.py

The script will then start scanning the website and all its links for the specified keyword. Any URLs that contain the keyword will be added to the list of bad sites.

The results are saved to a text file called collector_domainlist.txt in the same directory as the script. The file contains a list of all the visited URLs, along with a label indicating whether the site is a bad site or not.
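The exact layout of collector_domainlist.txt is not documented here, so the following is only a sketch of one plausible format: one line per visited URL, followed by a tab and a label. The write_results helper, the BAD and OK labels, and the tab separator are assumptions, and the default path is relative to the current working directory, not necessarily the script's directory.

```python
from pathlib import Path


def write_results(results, path="collector_domainlist.txt"):
    """Write one 'URL<TAB>label' line per visited URL and return the lines.

    `results` maps each visited URL to True (bad site) or False.
    """
    lines = [
        f"{url}\t{'BAD' if is_bad else 'OK'}"
        for url, is_bad in results.items()
    ]
    Path(path).write_text("\n".join(lines) + "\n", encoding="utf-8")
    return lines
```

A tab-separated layout keeps the file easy to filter afterwards, for example by selecting only the lines that end in BAD.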

Disclaimer

The Collector.py (lite) is only intended for ethical hacking and educational purposes. Do not use this tool for any illegal activities. The author of this tool is not responsible for any damages caused by the misuse of this tool.

Use at your own risk! Scanning onion sites works better with traffic routed through Tor.

Thank you for your support!

  • If you appreciate my work, please consider becoming a 'Sponsor', giving a ⭐ to my projects, or following me.

Copyright

License

This project is licensed under the MIT License; see the LICENSE file for details.
