Git Product home page Git Product logo

advanced-gui-based-web-scrappers-in-pyqt's Introduction

A Collection of web-scrappers with GUI written in Pyside6/PyQt6

This would have multiple GUI programs to scrap pages using PySide6 and BeautifulSoup.

Description

GUI based advanced web-scrappers. Scrapping can be applied to many use cases for instance, downloading images from a given web-page.

List of Programs

1. GUI Based Scrapper created in Python using PySide6

This you can find in the file named 'gui_pyqt6_scrapper.py'. You can scrap a given page using multiple web-agents. It is important to keep switching agents to avoid getting blocked by websites under suspicion. This scrapper allows you to search for given tags in the page for example, you can search for all occurrences of h1 tag, p tag and so on. Additionally, you can download all the images from a scrapped page in a given folder which you get to choose.

Getting Started

  • Create a Virtual Environment using Python
  • Install packages in requirements.txt

That's it, ready to run the program. Pyside6/PyQt6 are bulky packages which can consume upto 1 GB on your system but they're like very feature rich and advanced compared to Tkinter.

Dependencies

  • Any operating system with Python 3 installed should work.
  • Pyside6/PyQt6
  • BeautifulSoup
  • Requests
  • Generic familiarity with web scrapping

Executing program

  • Simply run the file once virtual environment is activated and relevant packages are installed.
python gui_pyqt6_scrapper.py

Authors

Contributors names and contact info

ex. Apfirebolt - @amitprafull

License

This project is licensed under the MIT License - see the LICENSE.md file for details

advanced-gui-based-web-scrappers-in-pyqt's People

Contributors

apfirebolt avatar

Stargazers

 avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.