Processing options that can be passed in Begin() method to direct the flow of content recognition algorithms.
public enum ProcessingFlags : int
Disables removing duplicated text that is frequently used to achieve visual effects of drop shadow and fake bold.
Enables removing text that uses rendering mode 3 (i.e. invisible text). Invisible text is usually used in 'PDF Searchable Images' (i.e. scanned pages with a corresponding OCR text). As a result, invisible text will be extracted by default.
Disables expanding of ligatures using a predefined mapping. Default ligatures are: fi, ff, fl, ffi, ffl, ch, cl, ct, ll, ss, fs, st, oe, OE.
Enables removal of text that is marked as part of a Watermark layer
Treat punctuation (e.g. full stop, comma, semicolon, etc.) as word break characters.
Enables removal of text that is obscured by images or rectangles. Since this option has small performance penalty on performance of text extraction, by default it is not enabled.