[Suggestion] Reporting the byte location of images #161

I have tested many PDF-to-text programs, and this one is the most robust. Handling images, however, is always a trade-off, since they are heavy objects and usually unnecessary. If I am correct, GROBID dropped the option of extracting images starting from version 0.7.

I suggest adding an option to report the byte location (offset and length) of each image element instead of saving the image to disk. The image could then be read directly from the PDF file whenever it is needed, rather than storing every image on disk.

Implementing this feature should be trivial since the location and length of the image objects are already known to pdfalto.
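
To illustrate the consumer side, here is a minimal Python sketch of reading an image back from the PDF using a reported offset and length. Everything named here (`ImageRef`, its `offset`, `length`, and `filter` fields, and `read_image_bytes`) is hypothetical; pdfalto does not emit such a record today, and the real output format would be up to the maintainers. The sketch also assumes the offset points at the start of the image stream data and that the stream's filter name is reported alongside it.

```python
import zlib
from dataclasses import dataclass
from typing import Optional


@dataclass
class ImageRef:
    """Hypothetical record that an extractor could report per image."""
    offset: int                   # byte offset of the stream data in the PDF
    length: int                   # number of encoded bytes in the stream
    filter: Optional[str] = None  # PDF stream filter name, e.g. "DCTDecode"


def read_image_bytes(pdf_path: str, ref: ImageRef) -> bytes:
    """Read one image stream from the PDF without extracting it beforehand."""
    with open(pdf_path, "rb") as f:
        f.seek(ref.offset)
        data = f.read(ref.length)
    if len(data) != ref.length:
        raise ValueError("offset/length fall outside the file")
    if ref.filter == "FlateDecode":
        # Flate streams are zlib-compressed raw samples, not an image file.
        return zlib.decompress(data)
    # DCTDecode streams are JPEG data and can be used as-is.
    return data
```

A `DCTDecode` stream can be written straight to a `.jpg` file. A `FlateDecode` stream only yields raw samples, so the width, height, and color space would also need to be reported to rebuild the image.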
