
PDF ToC

A CLI tool to easily extract and edit the ToC (table of contents, or bookmarks) of a PDF file.


Requirements

  • Python >= 3.7
    • for Python 3.6 support, use pdf-toc == 1.1.4
  • PyMuPDF >= 1.19

Installation

Install from PyPI

pip install pdf-toc

Install from the repo directly

pip install git+https://github.com/HareInWeed/pdf-toc.git@master#egg=pdf-toc

Install a specific version (replace <version> with the tag you need)

pip install git+https://github.com/HareInWeed/pdf-toc.git@<version>#egg=pdf-toc

Usage

usage: pdf-toc [-h] [--version] [--show-toc {json,toc}] [-t TOC] [-d DEST]
               [-T {json,toc}] [-f] [-m]
               source

pdf ToC modifier.

positional arguments:
  source                source pdf file directory

optional arguments:
  -h, --help            show this help message and exit
  --version             show program's version number and exit
  --show-toc {json,toc}
                        print the toc info of the source file and exit
  -t TOC, --toc TOC     toc info used to embed in the result file. leave it
                        empty to read toc from stdin
  -d DEST, --dest DEST  destination directory for result file
  -T {json,toc}, --type {json,toc}
                        specify format of ToC file. leave it empty to let the
                        tool determine the format, (from file suffix)
  -f, --force           overwrite dist file if it exist
  -m, --modify          modified the original file instead of create a new one

Examples

Show the ToC of file.pdf in toc format

pdf-toc --show-toc toc file.pdf

Generate a new file with the content of file.pdf and the ToC from toc.txt

pdf-toc -t toc.txt -d new-file.pdf file.pdf

Replace the ToC of file.pdf with the one in toc.txt

pdf-toc -m -t toc.txt file.pdf
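
The commands above can also be driven from a script, for example to process many files in a batch. The following is a minimal sketch, not part of pdf-toc itself: the helper names (`show_toc_cmd`, `embed_toc_cmd`, `export_toc`) are hypothetical, only flags listed in the usage text are used, and it assumes that `--show-toc` prints the ToC to standard output.

```python
import subprocess
from typing import List, Optional


def show_toc_cmd(source: str, fmt: str = "toc") -> List[str]:
    """Build the argument list for printing the ToC of `source`."""
    if fmt not in ("json", "toc"):
        raise ValueError("fmt must be 'json' or 'toc'")
    return ["pdf-toc", "--show-toc", fmt, source]


def embed_toc_cmd(source: str, toc_file: str, dest: Optional[str] = None,
                  force: bool = False) -> List[str]:
    """Build the argument list for embedding `toc_file` into `source`.

    With `dest` set, a new file is written (-d); otherwise the source
    file is modified in place (-m).
    """
    cmd = ["pdf-toc", "-t", toc_file]
    if dest is not None:
        cmd += ["-d", dest]
        if force:
            cmd.append("-f")  # overwrite an existing destination file
    else:
        cmd.append("-m")
    cmd.append(source)
    return cmd


def export_toc(source: str, toc_file: str, fmt: str = "toc") -> None:
    """Run pdf-toc and save the printed ToC to `toc_file`.

    Assumption: the ToC is written to standard output.
    """
    result = subprocess.run(show_toc_cmd(source, fmt), check=True,
                            capture_output=True, text=True)
    with open(toc_file, "w", encoding="utf-8") as fh:
        fh.write(result.stdout)
```

A typical round trip would call `export_toc("file.pdf", "toc.txt")`, edit toc.txt by hand, and then run `subprocess.run(embed_toc_cmd("file.pdf", "toc.txt", dest="new-file.pdf"), check=True)`.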

ToC file

Two types of ToC file are supported.

One is JSON, which mostly follows the PyMuPDF specification, but in JSON format. See the PyMuPDF docs and toc_json.md for details.

The other is a special data format that is easier to edit and provides additional functionality. See toc.md for details.
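
For orientation on the JSON variant, here is a rough sketch of the simple ToC shape that PyMuPDF uses in `Document.get_toc()` and `Document.set_toc()`: a list of `[level, title, page]` entries, where the first entry has level 1 and each level is at most one deeper than the previous entry. The exact JSON layout accepted by pdf-toc is defined in toc_json.md and may differ from this; the sample titles and the `check_levels` helper are made up for illustration.

```python
import json
from typing import List, Union

TocEntry = List[Union[int, str]]  # [level, title, page]


def check_levels(toc: List[TocEntry]) -> None:
    """Raise ValueError if the outline levels cannot form a valid tree.

    The first entry must be level 1, and a level may be at most one
    greater than the level of the entry before it.
    """
    previous = 0
    for index, (level, _title, _page) in enumerate(toc):
        if level < 1 or level > previous + 1:
            raise ValueError(
                f"entry {index}: level {level} cannot follow level {previous}"
            )
        previous = level


# Hypothetical sample outline in PyMuPDF's simple [level, title, page] form.
toc = [
    [1, "Preface", 1],
    [1, "Chapter 1", 5],
    [2, "Section 1.1", 6],
    [2, "Section 1.2", 9],
    [1, "Chapter 2", 14],
]

check_levels(toc)
toc_json = json.dumps(toc, indent=2, ensure_ascii=False)
print(toc_json)
```

Checking the levels before embedding catches the most common hand-editing mistake, a nested entry with no parent, before PyMuPDF rejects the outline.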

Licence

MIT
