Git Product home page Git Product logo

pdffoldersearch's Introduction

pdffoldersearch

About 1.5 years ago I quickly hacked a little tool together that allows to search folders containing PDF documents for search terms in those PDFs.

It is currently being used to search a manual all at once that comprises multiple PDF documents.

There are quite some TODOs left since it was a really quick hack and not everything is optimal but it is usuable. Feel free to clone it and use it and adapt it for whatever purposes you want. If you do, a little message is of course always appreciated.

Some instructions on getting it set up:

You need all the qt4 dev as well as libpoppler-dev.
In ubuntu, looking in the packetmanager for qt4 dev should be quite easy 
and for libpoppler you need the packages: libpoppler-qt4 and libpoppler-qt4-dev

After installing those dependencies you should be able to simply open the .pro file
in QT creator and compile the program

To run it: You can specify up to 3 arguments via command line.

1. (obligatory) The path to the folder containing the PDFs
2. (voluntary) The path to a copy of Adobe Reader
3. (voluntary) The maximal number of possible hits (in order to prevent searches that take forever)

Here are binaries for windows and linux:

Windows: Please get the binary here

Just extract the zip file. I provided a sample *.bat file for how to run the application. Simply adjust the *.bat file to your computer and execute it.

Linux: Please get the binary here

You need to have libpoppler-qt4 and qt4 installed on your system. I have compiled this binary on Kubuntu 13.04.

AND here is the corresponding blog entry: http://myvirtualbrain.blogspot.de/2013/05/a-pdf-search-tool.html

Enjoy!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.