Question 1

Why is the extracted text blank or garbled?

Accepted Answer

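There are two common causes. Either the PDF is a scan, so it holds page images but no text layer and needs OCR, or it uses a custom font encoding that maps character codes to glyphs in a non-standard way, so the extracted characters do not match what is displayed. OCR also works as a fallback in the second case, because it reads the rendered page rather than the stored character codes.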
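To tell which case applies, it can help to inspect the extracted text itself. The sketch below is a rough heuristic and not part of the tool: the name `diagnose_extraction` and the 0.3 threshold are assumptions chosen for illustration.

```python
import unicodedata


def diagnose_extraction(text: str, max_odd_ratio: float = 0.3) -> str:
    """Classify extracted text as 'blank', 'garbled', or 'ok' (rough heuristic)."""
    visible = [ch for ch in text if not ch.isspace()]
    if not visible:
        # Nothing but whitespace: most likely a scan with no text layer.
        return "blank"

    def is_odd(ch: str) -> bool:
        # Replacement characters, private-use code points, control characters,
        # and unassigned code points are typical symptoms of a broken encoding.
        return ch == "\ufffd" or unicodedata.category(ch) in {"Co", "Cc", "Cn"}

    odd = sum(1 for ch in visible if is_odd(ch))
    return "garbled" if odd / len(visible) > max_odd_ratio else "ok"
```

A result of "blank" points to a scan, and "garbled" points to a font encoding problem. The check cannot catch text that was mapped to the wrong but otherwise ordinary letters, so a result of "ok" still deserves a quick visual check.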

Question 2

Does it preserve tables?

Accepted Answer

Not as tables. Text comes out in reading order, but the column alignment of tables is lost, because a PDF does not store a table as a table, only as separately positioned pieces of text. For table-shaped data, try the PDF to Excel tool instead.

Question 3

Can I extract text from just one page?

Accepted Answer

Not directly. Either extract everything and keep the section you need, or run the PDF through the Split PDF tool first and then extract text from the single page it produces.
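If the extracted text marks page boundaries, keeping one page takes only a few lines. The sketch below assumes pages are separated by a form feed character (`\f`), which some extractors emit. Whether this tool's output does so is an assumption, and `page_text` is a hypothetical helper.

```python
def page_text(extracted: str, page_number: int) -> str:
    """Return the text of one page (1-based) from form-feed separated output."""
    # Assumption: each page break is a single form feed character.
    pages = extracted.split("\f")
    if not 1 <= page_number <= len(pages):
        raise ValueError(f"page {page_number} is out of range (1..{len(pages)})")
    return pages[page_number - 1].strip()
```

If the output contains no form feeds, the whole text counts as page 1, and splitting the PDF before extraction is the more reliable route.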

Question 4

What encoding is the TXT file?

Accepted Answer

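UTF-8 with no BOM. That is the modern standard and opens correctly in current text editors. Some spreadsheet import dialogs do not detect the encoding automatically, so choose UTF-8 there if accented or non-Latin characters look wrong.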
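When reading the file in code, naming the encoding explicitly avoids relying on the platform default. This is a minimal Python sketch. The helper names are illustrative, and the `utf-8-sig` codec is used so that the same code also tolerates a file that does carry a BOM.

```python
import codecs
from pathlib import Path


def read_extracted_text(path: str) -> str:
    """Read a UTF-8 text file; 'utf-8-sig' strips a BOM if one is present."""
    return Path(path).read_text(encoding="utf-8-sig")


def has_bom(path: str) -> bool:
    """Return True if the file starts with the UTF-8 byte order mark."""
    with open(path, "rb") as f:
        return f.read(3) == codecs.BOM_UTF8
```

For a file produced as described above, `has_bom` should return False, and `read_extracted_text` returns the text unchanged.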

PDF to Text
