Some test text!
Access all PDF bits and pieces including images, fonts, structured text and tables, bookmarks, and metadata for advanced content repurposing & indexing in your web, mobile, desktop, and server applications.
Convert PDFs into readable Unicode text, regardless of language or font. Extract characters, words, fonts, and form fields. Populate a full-text search engine to search across a set of documents.
Analyze PDFs at a low level. Grab the PDF version, author information, timestamps, and anything else hidden away in the file.
Serialize annotations into the industry-standard XFDF format (compatible with most PDF viewers). Enable users to edit annotations without modifying the underlying document. Share annotations with other users to enable real-time collaboration. Create a summary of all annotations.
Detect tables, and programmatically extract the information as XML or HTML.
Launch Table Extraction Demo
Extract individual images or graphics embedded within a PDF, or convert pages into images.
Unwrap U3D, PRC, or STEP files embedded within PDF documents for display in a 3D viewer.
Retrieve Type1, OpenType, TrueType, Type3, and CID fonts embedded in the PDF. Find font names, font sizes, and the path data for individual glyphs.
Serialize forms in the industry-standard XFDF format to extract, edit, or insert form field data.
Programmatically search across multiple documents at predefined locations. Extract information and metadata from a set of documents.
Code samples, familiar package managers, and a Docker image make it easy to get up and running.
Our core document engine has been perfected by 20 years of knowledge, innovation, and real-world testing.
An open source UI gives you complete freedom to match your look & feel, and optimize the user experience.
A single API with consistent function calls across platforms means a shorter learning curve and easier maintenance.