Some test text!

Loading...
Guides
IRIS OCR

Using the IRIS OCR engine in Python

In addition to the default Tesseract 4 based OCR module, PDFTron offers the IRIS OCR Module based on the IRIS iDRS engine. This package is licensed separately from the default OCR module and may provide better results in some cases, especially when considering multiple disconnected text snippets on a page, as might occur in documents such as magazine covers or a CAD documents. The IRIS module is currently available on Windows and Linux Platforms.

Installing the IRIS OCR Module

The IRIS OCR module requires no special installation step, beyond downloading the appropriate package, and extracting it overtop of a previously expanded PDFTron SDK package.

Using the IRIS OCR Module

Generally speaking, the IRIS module behaves the same as the default OCR module. In particular, any of the operations using the OCRModule class described within the OCR workflow page will work without any modification. The one exception to this is the case where there are multiple OCR modules present and accessible by the SDK. In this case the IRIS module must be selected by setting the OCREngine option on the OCROptions object: OCROptions.setEngine("iris")

Get the answers you need: Support

Upcoming Webinar: PDFTron SDK Tech Review | Nov 29, 2022 at 2 pm ET

PDFTron SDK

The Platform

NEW

© 2022 PDFTron Systems Inc. All rights reserved.

Privacy

Terms of Use