Git Product home page Git Product logo

julkhami's Projects

api-downloader icon api-downloader

This tool uses Beautiful Soup 4 to download the useful reference information from API documentation.

deepreader icon deepreader

DeepReader is a simple command line reading app which presents text in small units. Just import a segmented text file from the command line, navigate backwards and forwards and save comments.

documentation-downloader icon documentation-downloader

This program uses Beautiful Soup 4 to download all of the pages of a webpage and produce a single, complete text.

fast-googler icon fast-googler

The is meant to be a command line tool that can retrieve Google (or any kind of search) results in an extremely fast time, ideally less than 0.1 seconds.

free-movie icon free-movie

This hosts a universal database of high-quality free movies and provides an effective tool for viewing and downloading them.

textextractor icon textextractor

Text extractor is an application and tool for extracting the relevant text content from (ideally) any format of document. It comes with two parts: an interactive application which is itself designed to help facilitate the process of a custom extraction (you can more easily inspect source HTML, and point and click to manually specify what it is you'd like to keep), plus an auto-magic tool whose function is to guess, and make things easier.

webpage-scraper icon webpage-scraper

Drop in a url to the home page of a website. This effectively crawls all relevant pages of that website. It launches a GUI window to help you easily cherry-pick which sites you want to discard and which to keep. It also has a second round with helpful tools for trying to teach the script which text content to keep and which to exclude. Finally, it crawls, and scrapes/text extracts the entire website in accordance with the rules you put in place, and returns it as plaintext. Lastly, it performs a keyword extraction on that data - but that's possible with the separate tool, Term Extractor. However, I might also make the term extraction happen on the fly, while it is crawling - another possible variant.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.