Class: OCROptions

PDFNet.OCRModule. OCROptions


new OCROptions()

An object containing options for OCRModule functions

Methods


addDPI(value)

Knowing proper image resolution is important, as it enables the OCR engine to translate pixel heights of characters to their respective font sizes. We do our best to retrieve resolution information from the input's metadata, however it occasionally can be corrupt or missing. Hence we allow manual override of source's resolution, which supersedes any metadata found (both explicit as in image metadata and implicit as in PDF).
Parameters:
Name Type Description
value number image resolution
Returns:
this object, for call chaining
Type
PDFNet.OCRModule.OCROptions

addIgnoreZonesForPage(regions, page_num)

Adds a collection of ignorable regions for the given page, an optional list of page areas not to be included in analysis
Parameters:
Name Type Description
regions Array.<PDFNet.Rect> the zones to be added to the ignore list
page_num number the page number the added regions belong to
Returns:
this object, for call chaining
Type
PDFNet.OCRModule.OCROptions

addLang(lang)

Adds a language to the list of to be considered when processing this document
Parameters:
Name Type Description
lang string the new language to be added to the language list
Returns:
this object, for call chaining
Type
PDFNet.OCRModule.OCROptions

addTextZonesForPage(regions, page_num)

Adds a collection of text regions of interest for the given page, an optional list of known text zones that will be used to improve OCR quality
Parameters:
Name Type Description
regions Array.<PDFNet.Rect> the zones to be added to the text region list
page_num number the page number the added regions belong to
Returns:
this object, for call chaining
Type
PDFNet.OCRModule.OCROptions