Export to image support about pdfboxing HOT 3 OPEN

werenall commented on June 9, 2024

Export to image support

from pdfboxing.

Comments (3)

dotemacs commented on June 9, 2024

First of all - kudos for this library! It proves to be very useful to our project in Magnet.

Thank you, I'm glad that you're finding it useful.

However we need an export to image functionality that Apache's PDFbox provides.

OK. Is this functionality already present in any of the Java examples here:
https://svn.apache.org/viewvc/pdfbox/trunk/examples/src/main/java/org/apache/pdfbox/examples/

I'm asking because I'm trying to understand what exactly are you trying to do: extract images out of a PDF or ...?

We fought that it would be nice if your library has it as well.
We'd be happy to make a PR with this.

OK, but let me understand what you're trying to do first. Then if you're willing to do the work, then that would be great.

from pdfboxing.

werenall commented on June 9, 2024

We have pdfs (possibly multi-paged) that we need thumbnails for. In our case, each page gets converted into an image. Something like with Google Drive - they don't display a pdf in the preview. Just an image with its thumbnail.

from pdfboxing.

avocade commented on June 9, 2024

We have a use case where we want to extract all images from the entire document so we can then do ML on each image. Extracting the text is done separately. PDFBox looks like the right tool for it:

https://docs.aspose.com/pdf/java/extract-images-from-pdf-file/

Similar use case with the nodeJS pdf-lib (the extract-images.zip example which seems to work well):
Hopding/pdf-lib#83 (comment)

from pdfboxing.

Recommend Projects

Export to image support about pdfboxing HOT 3 OPEN

Comments (3)

Related Issues (19)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent