- ProductsGreat pdf developer solutions
- SupportDeveloper 2 developer support
- ResourcesCommunity & developer resources
- Why PDFTronTrusted pdf experts with great solutions
- About UsThe story behind the company
Easy and accurate PDF text extraction.
PDF2Text is a stand-alone software for high-quality and efficient text extraction from PDF documents. PDF2Text can be used to extract text from any PDF document as Unicode or as structured XML.
PDF2Text is offered as an easy-to-use command-line application and as a software development component that can be used as a building block for other client and server-based applications.
For developers who are looking for a software development component to integrate into their application, PDFTron also offers PDF2Text SDK, an easy-to-use, yet powerful software component for extracting text from PDF documents. PDF2Text SDK is available as a plain "C DLL" and can be easily accessed from any programming language (including C#, VB.NET, C/C++, Java, VB6, Perl, Python, Ruby, Delphi, etc).
PDF2Text is based on PDFNet SDK, PDFTron's own comprehensive PDF library. If you require rasterization or additional PDF functionality than what is provided as part of PDF2Text SDK for embedding in your own applications, please check out PDFNet SDK PDFNet SDK or contact a PDFTron representative for more information.