Some test text!

PDF Parsing & Content Extraction Library

Access all PDF bits and pieces including images, fonts, structured text and tables, bookmarks, and metadata for advanced content repurposing & indexing in your web, mobile, desktop, and server applications.

Text extraction

Convert PDFs into readable Unicode text, regardless of language or font. Extract characters, words, fonts, and form fields. Populate a full-text search engine to search across a set of documents.
See Documentation

Metadata extraction

Analyze PDFs at a low level. Grab the PDF version, author information, timestamps, and anything else hidden away in the file.

Annotation extraction

Serialize annotations into the industry-standard XFDF format (compatible with most PDF viewers). Enable users to edit annotations without modifying the underlying document. Share annotations with other users to enable real-time collaboration. Create a summary of all annotations.

Table data extraction

Detect tables, and programmatically extract the information as XML or HTML.
Launch Table Extraction Demo

Image extraction

Extract individual images or graphics embedded within a PDF, or convert pages into images.

3D data extraction

Unwrap U3D, PRC, or STEP files embedded within PDF documents for display in a 3D viewer.

Font extraction

Retrieve Type1, OpenType, TrueType, Type3, and CID fonts embedded in the PDF. Find font names, font sizes, and the path data for individual glyphs.

Form field extraction

Serialize forms in the industry-standard XFDF format to extract, edit, or insert form field data.
See Documentation

Search multiple documents

Programmatically search across multiple documents at predefined locations. Extract information and metadata from a set of documents.

Powered by the PDFTron SDK

Easy to Integrate

Code samples, familiar package managers, and a Docker image make it easy to get up and running.

Consistent and Predictable

Our core document engine has been perfected by 20 years of knowledge, innovation, and real-world testing.

Fully Customizable

An open source UI gives you complete freedom to match your look & feel, and optimize the user experience.

Truly Cross-Platform

A single API with consistent function calls across platforms means a shorter learning curve and easier maintenance.

Try our SDK for free today

Upcoming Webinar: PDFTron SDK Tech Review | Nov 29, 2022 at 2 pm ET


The Platform


© 2022 PDFTron Systems Inc. All rights reserved.


Terms of Use