Skip to content

Can't properly pull images from archive.org .pdf #116

@iconoclasthero

Description

@iconoclasthero

Setup

  • [ubuntu 22.04 ] OS (for example macOS)
  • [Google Chrome 119.0.6045.105] Browser version (for example Chrome 90.0.4430.93)
  • [Image Downloader v3.4.0] Extension version (for example 3.2.2)

Describe the bug

I'm trying to download images from a [borrowed] pdf book on archive.org (below) and it isn't loading all the images (not a surprise, but not desirable) and when I select the few images that populate and click download it tries to save them as .txt. In the attached screenshot, I clicked the download arrow and it pops up with the filename.txt.

URL

(https://archive.org/details/lostchanceinchin0000serv/page/9/mode/1up)

Screenshots

Screenshot from 2023-11-08 10-51-31

I would really like to be able to scrape the entire book at one go so I can tesseract > piper > ffmpeg > opus audiobook

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions