Add PDF to Markdown Python sample #122

Merged
datalogics-dliang merged 2 commits into datalogics:main from datalogics-cgreen:pdfcloud-5252-markdown-complex-sample
Oct 23, 2025
datalogics-cgreen (Contributor) commented Oct 21, 2025

Adds a sample program written in Python that performs "preprocessing" (Flatten Forms or OCR Text) on documents before converting them to Markdown.
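A minimal sketch of the flow described above, not the sample's actual code: choose at most one preprocessing step (Flatten Forms or OCR Text), then convert to Markdown. The names `plan_steps`, `run_pipeline`, and the step labels are hypothetical, and the handlers are placeholders standing in for whatever API calls the real sample makes.

```python
from typing import Callable, Dict, List

# A step takes document bytes and returns processed bytes.
# In a real program each handler would call the conversion service;
# here they are placeholders supplied by the caller (assumption).
Step = Callable[[bytes], bytes]


def plan_steps(has_forms: bool, needs_ocr: bool) -> List[str]:
    """Return the ordered step names for one document.

    At most one preprocessing step is chosen; a document that needs
    neither goes straight to Markdown conversion.
    """
    steps: List[str] = []
    if has_forms:
        steps.append("flatten_forms")  # hypothetical step label
    elif needs_ocr:
        steps.append("ocr_text")       # hypothetical step label
    steps.append("to_markdown")        # conversion always runs last
    return steps


def run_pipeline(data: bytes, steps: List[str], handlers: Dict[str, Step]) -> bytes:
    """Apply each named step in order, feeding each output to the next step."""
    for name in steps:
        data = handlers[name](data)
    return data


if __name__ == "__main__":
    # Stub handlers that only tag the payload, to show the ordering.
    stubs: Dict[str, Step] = {
        "flatten_forms": lambda b: b + b"|flattened",
        "ocr_text": lambda b: b + b"|ocr",
        "to_markdown": lambda b: b + b"|markdown",
    }
    print(run_pipeline(b"doc", plan_steps(has_forms=True, needs_ocr=False), stubs))
```

Keeping the step selection separate from the step execution makes the "skip preprocessing" case explicit: a document with no forms and no OCR need gets a plan containing only the Markdown conversion.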

datalogics-cgreen marked this pull request as ready for review on October 23, 2025, 14:31
datalogics-dliang (Contributor) left a comment

Documents containing forms are flattened as expected, documents without forms simply skip that step, and starter keys also work as expected.

datalogics-dliang merged commit efb1eb6 into datalogics:main on Oct 23, 2025
1 check passed
datalogics-cgreen deleted the pdfcloud-5252-markdown-complex-sample branch on October 24, 2025, 14:33