Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
26 commits
Select commit Hold shift + click to select a range
855b7b8
chore: update requirements to current versions
dhdaines Sep 6, 2023
5e33cfd
fix: remove unsupported argument in pydantic2
dhdaines Sep 6, 2023
1f1596f
fix: use `use_text_flow` for pdfplumber
dhdaines Sep 6, 2023
dc1cb6b
chore: rebuild
dhdaines Sep 6, 2023
0f8b84b
Merge branch 'py-pdf:main' into main
dhdaines Feb 20, 2025
a640d1e
feat: add PLAYA-PDF
dhdaines Feb 20, 2025
4592803
chore: update all versions
dhdaines Feb 20, 2025
ff365e1
fix: nope! postprocess not compatible with playa (yet)
dhdaines Feb 20, 2025
2c2da35
chore: rerun with playa
dhdaines Feb 20, 2025
f624925
fix: use playa main branch with xobject fix
dhdaines Feb 20, 2025
6508f30
feat: use 2 CPUs for PLAYA
dhdaines Feb 20, 2025
6622f6c
fix: correct pdfplumber version
dhdaines Feb 20, 2025
1871f18
feat: update to use new PLAYA extract_text (and improve accuracy)
dhdaines Feb 20, 2025
a4fdbba
fix: update release dates
dhdaines Feb 20, 2025
75e22bd
chore: update for playa 0.3.0
dhdaines Feb 21, 2025
f4bb730
fix: update release dates
dhdaines Feb 21, 2025
3df099c
fix: outpath no longer used
dhdaines Feb 21, 2025
c928275
chore: update to 0.4.1
dhdaines Apr 1, 2025
055d287
chore: update for playa 0.5.0
dhdaines May 15, 2025
8e76000
fix: argh! 2201.00022 was the WRONG VERSION
dhdaines May 21, 2025
e366fa1
feat: remove borb, use uv
dhdaines Jan 28, 2026
59841aa
chore: new outputs
dhdaines Jan 28, 2026
81f1cba
chore: make it main.py
dhdaines Jan 28, 2026
af5abf5
chore: rerun
dhdaines Jan 28, 2026
6230675
feat: update for latest and fastest playa
dhdaines Feb 5, 2026
6fcb38c
fix: remove bogus release dates
dhdaines Feb 5, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -131,3 +131,4 @@ dmypy.json
*.pdf
pdfs/
pdf_cache/
/image_extraction
94 changes: 48 additions & 46 deletions README.md

Large diffs are not rendered by default.

Loading