Class: HTMLOutputOptions

PDFNet.Convert.HTMLOutputOptions


new HTMLOutputOptions()

An object containing options common to the ToHtml and ToEpub functions.

Members


<static> ContentReflowSetting

Type:
  • number
Properties:
Name Type Description
e_fixed_position number Content uses fixed positioning (default).
e_reflow_paragraphs number Text flows within paragraphs.

<static> SearchableImageSetting

Type:
  • number
Properties:
Name Type Description
e_ocr_image_text number Indicates that both text and image are going to be visible (default).
e_ocr_image number Indicates that only the images are going to be visible.
e_ocr_text number Indicates that only the text is going to be visible.

Methods


setConnectHyphens(connect)

Specifies whether hyphens in the PDF should be connected. This only works with English words. Default is false. Note: This option is only available for e_reflow_paragraphs mode.
Parameters:
Name Type Description
connect boolean if true, hyphens in the PDF will be connected.
Returns:
this object, for call chaining
Type
PDFNet.Convert.HTMLOutputOptions

setContentReflowSetting(reflow)

Switch between fixed (pre-paginated) and reflowable HTML generation. Default is e_fixed_position. In e_reflow_paragraphs mode, conversions require that the optional PDFTron pdf2html add-on module is available.
Parameters:
Name Type Description
reflow number
PDFNet.Convert.HTMLOutputOptions.ContentReflowSetting = {
	e_fixed_position : 0,
	e_reflow_paragraphs : 1
}
the generated HTML will be either fixed or reflowable.
Returns:
this object, for call chaining
Type
PDFNet.Convert.HTMLOutputOptions
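
A minimal sketch of selecting the mode, assuming `options` is an already constructed HTMLOutputOptions instance. The enum values are copied from the listing above; `chooseReflowSetting` and its `wantReflow` argument are hypothetical names used only for illustration.

```javascript
// Values copied from the ContentReflowSetting listing in this reference.
const ContentReflowSetting = {
  e_fixed_position: 0,
  e_reflow_paragraphs: 1,
};

// Hypothetical helper: applies the chosen mode and returns whatever the
// setter returns (the options object), so further setters can be chained.
function chooseReflowSetting(options, wantReflow) {
  const mode = wantReflow
    ? ContentReflowSetting.e_reflow_paragraphs
    : ContentReflowSetting.e_fixed_position;
  return options.setContentReflowSetting(mode);
}

// Usage (assumes PDFNet is loaded and initialized elsewhere):
//   const options = new PDFNet.Convert.HTMLOutputOptions();
//   chooseReflowSetting(options, true).setTitle('Annual report');
```

Because each setter returns the options object, the helper can sit at the start of a longer chain of calls.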

setDisableVerticalSplit(disable)

Specifies whether to disable the detection of section columns. Default is false. Enable this if your tables are coming out as section columns. Note: This option is only available for e_reflow_paragraphs mode.
Parameters:
Name Type Description
disable boolean if true, the detection of section columns is disabled.
Returns:
this object, for call chaining
Type
PDFNet.Convert.HTMLOutputOptions

setDPI(dpi)

The output resolution, from 1 to 1000, in Dots Per Inch (DPI) at which to render elements which cannot be directly converted. Default is 140. Note: This option is only available for e_fixed_position mode.
Parameters:
Name Type Description
dpi number the resolution in Dots Per Inch
Returns:
this object, for call chaining
Type
PDFNet.Convert.HTMLOutputOptions
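
The reference does not say what happens when a value outside 1 to 1000 is passed, so one cautious approach is to clamp the value before calling setDPI. The helper name `applyDpi` is hypothetical, and `options` is assumed to be an HTMLOutputOptions instance.

```javascript
// Hypothetical helper: clamps the requested resolution to the documented
// 1..1000 range before passing it to setDPI, and rejects non-numbers.
function applyDpi(options, requestedDpi) {
  if (!Number.isFinite(requestedDpi)) {
    throw new TypeError('requestedDpi must be a finite number');
  }
  const dpi = Math.min(1000, Math.max(1, requestedDpi));
  return options.setDPI(dpi);
}

// Usage: applyDpi(options, 5000) ends up calling options.setDPI(1000).
```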

setEmbedImages(embed)

Specifies whether images are embedded in the HTML without having to link to external files. Default is true. Note: This option is only available for e_reflow_paragraphs mode.
Parameters:
Name Type Description
embed boolean if true, images are embedded in the HTML; otherwise, images are saved as external files.
Returns:
this object, for call chaining
Type
PDFNet.Convert.HTMLOutputOptions

Enable the conversion of external URL navigation. Default is false.
Parameters:
Name Type Description
enable boolean if true, links that specify external URLs are converted into HTML. Note: This option is only available for e_fixed_position mode.
Returns:
this object, for call chaining
Type
PDFNet.Convert.HTMLOutputOptions

setFileConversionTimeoutSeconds(seconds)

Specifies the amount of time in seconds after which the conversion fails. Default is 300. Very long files need more time to convert. Note: This option is only available for e_reflow_paragraphs mode.
Parameters:
Name Type Description
seconds number the timeout in seconds.
Returns:
this object, for call chaining
Type
PDFNet.Convert.HTMLOutputOptions

setImageDPI(dpi)

Specifies the output image resolution, from 8 to 600, in Pixels Per Inch (PPI). The higher the PPI, the larger the image. Default is 192. Note: This option is only available for e_reflow_paragraphs mode.
Parameters:
Name Type Description
dpi number the resolution in Pixels Per Inch.
Returns:
this object, for call chaining
Type
PDFNet.Convert.HTMLOutputOptions
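
To see why a higher PPI gives a larger image, the pixel dimensions can be estimated as physical size in inches multiplied by PPI. This is a sketch of that arithmetic only; `imagePixelSize` is a hypothetical helper, and the converter's own rounding is not documented here.

```javascript
// Estimates output pixel dimensions for an image of a given physical size.
// The default of 192 and the 8..600 range come from the description above.
function imagePixelSize(widthInches, heightInches, ppi = 192) {
  if (!(ppi >= 8 && ppi <= 600)) {
    throw new RangeError('ppi must be between 8 and 600');
  }
  return {
    width: Math.round(widthInches * ppi),
    height: Math.round(heightInches * ppi),
  };
}

// imagePixelSize(4, 3)     -> { width: 768, height: 576 }
// imagePixelSize(4, 3, 96) -> { width: 384, height: 288 }
```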

Enable the conversion of internal document navigation. Default is false.
Parameters:
Name Type Description
enable boolean if true, links that specify page jumps are converted into HTML. Note: This option is only available for e_fixed_position mode.
Returns:
this object, for call chaining
Type
PDFNet.Convert.HTMLOutputOptions

setJPGQuality(quality)

Specifies the compression quality to use when generating JPEG images.
Parameters:
Name Type Description
quality number the JPEG compression quality, from 0 (highest compression) to 100 (best quality).
Returns:
this object, for call chaining
Type
PDFNet.Convert.HTMLOutputOptions

setMaximumImagePixels(max_pixels)

Specifies the maximum image slice size in pixels. Default is 2000000. Note: This setting no longer reduces the total number of image pixels; a lower value produces more slices, and a higher value produces fewer. Note: Because image compression works better with more pixels, a larger value should generally create smaller files. Note: This option is only available for e_fixed_position mode.
Parameters:
Name Type Description
max_pixels number the maximum number of pixels an image can have
Returns:
this object, for call chaining
Type
PDFNet.Convert.HTMLOutputOptions
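
The relationship between the slice limit and the number of slices can be illustrated with a lower bound: since no slice may exceed max_pixels, an image needs at least ceil(total pixels / max_pixels) slices. The helper `minSliceCount` is hypothetical, and the converter's actual slicing strategy is not documented here and may produce more slices.

```javascript
// Lower bound on the number of slices for one rendered image: every slice
// holds at most maxPixels pixels, so at least ceil(total / maxPixels)
// slices are needed. The default mirrors the documented 2000000.
function minSliceCount(widthPx, heightPx, maxPixels = 2000000) {
  if (!(maxPixels > 0)) {
    throw new RangeError('maxPixels must be positive');
  }
  return Math.ceil((widthPx * heightPx) / maxPixels);
}

// minSliceCount(3000, 2000)          -> 3
// minSliceCount(3000, 2000, 1000000) -> 6
// minSliceCount(1000, 1000)          -> 1
```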

setNoPageWidth(enable)

Determines whether to flow contents across the entire browser window. Default is false. Note: This option is only available for e_reflow_paragraphs mode.
Parameters:
Name Type Description
enable boolean if true, content will flow across the entire browser window.
Returns:
this object, for call chaining
Type
PDFNet.Convert.HTMLOutputOptions

setPages(pageFrom, pageTo)

Specifies a range of pages to be converted. By default all pages are converted. The first page has the page number of 1. Use a negative value to specify the last page in the PDF. Note: This option is only available for e_reflow_paragraphs mode.
Parameters:
Name Type Description
pageFrom number the first page to be converted.
pageTo number the last page to be converted (inclusive).
Returns:
this object, for call chaining
Type
PDFNet.Convert.HTMLOutputOptions
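
A sketch of validating a page range before passing it on. `applyPageRange` is a hypothetical helper; as a deliberate simplification it accepts a negative value only for pageTo (meaning the last page), which is stricter than the description above requires.

```javascript
// Hypothetical helper: checks a 1-based, inclusive page range and then
// forwards it to setPages. A negative pageTo stands for the last page.
function applyPageRange(options, pageFrom, pageTo) {
  if (!Number.isInteger(pageFrom) || pageFrom < 1) {
    throw new RangeError('pageFrom must be an integer >= 1');
  }
  if (!Number.isInteger(pageTo) || pageTo === 0) {
    throw new RangeError('pageTo must be a non-zero integer');
  }
  // Only a positive pageTo can be compared against pageFrom.
  if (pageTo > 0 && pageTo < pageFrom) {
    throw new RangeError('pageTo must not precede pageFrom');
  }
  return options.setPages(pageFrom, pageTo);
}

// Usage: applyPageRange(options, 3, -1) requests page 3 through the last page.
```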

setPDFPassword(password)

Specifies the password if the PDF requires one. Note: This option is only available for e_reflow_paragraphs mode.
Parameters:
Name Type Description
password string the PDF password, if required; an empty string otherwise.
Returns:
this object, for call chaining
Type
PDFNet.Convert.HTMLOutputOptions

setPreferJPG(prefer_jpg)

Use JPG files rather than PNG. This will apply to all generated images. Default is true.
Parameters:
Name Type Description
prefer_jpg boolean if true, JPG images will be used whenever possible.
Returns:
this object, for call chaining
Type
PDFNet.Convert.HTMLOutputOptions
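
setPreferJPG and setJPGQuality are naturally set together. The sketch below assumes `options` is an HTMLOutputOptions instance; `applyImageFormat` is a hypothetical helper, and skipping the quality call when JPG is not preferred is a design choice of this sketch, not documented behavior.

```javascript
// Hypothetical helper: sets the preferred image format and, when JPG is
// preferred and a quality is given, validates it against the documented
// 0..100 range before calling setJPGQuality.
function applyImageFormat(options, preferJpg, jpgQuality) {
  options.setPreferJPG(preferJpg);
  if (preferJpg && jpgQuality !== undefined) {
    if (!Number.isFinite(jpgQuality) || jpgQuality < 0 || jpgQuality > 100) {
      throw new RangeError('jpgQuality must be between 0 and 100');
    }
    options.setJPGQuality(jpgQuality);
  }
  return options;
}

// Usage: applyImageFormat(options, true, 85);
```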

setReportFile(path)

Generates an XML file that contains additional information about the conversion process. By default no report is generated.
Parameters:
Name Type Description
path string the file path to which the XML report is written. Note: This option is only available for e_fixed_position mode.
Returns:
this object, for call chaining
Type
PDFNet.Convert.HTMLOutputOptions

setScale(scale)

Set an overall scaling of the generated HTML pages. Default is 1.0.
Parameters:
Name Type Description
scale number a number greater than 0 which is used as a scale factor. For example, calling setScale(0.5) will reduce the HTML body of the page to half its original size, whereas setScale(2) will double the HTML body dimensions of the page and will rescale all page content appropriately. Note: This option is only available for e_fixed_position mode.
Returns:
this object, for call chaining
Type
PDFNet.Convert.HTMLOutputOptions
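
When a specific output width is wanted, the scale factor is the ratio of the target width to the unscaled width. The sketch below shows only that arithmetic; `scaleForTargetWidth` is a hypothetical helper, and it assumes both widths are known and expressed in the same unit.

```javascript
// Returns the factor to pass to setScale so that content of width
// unscaledWidth ends up targetWidth wide (both in the same unit).
function scaleForTargetWidth(unscaledWidth, targetWidth) {
  if (!(unscaledWidth > 0) || !(targetWidth > 0)) {
    throw new RangeError('widths must be greater than 0');
  }
  return targetWidth / unscaledWidth;
}

// scaleForTargetWidth(800, 400)  -> 0.5
// scaleForTargetWidth(800, 1600) -> 2
// Usage: options.setScale(scaleForTargetWidth(800, 400));
```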

setSearchableImageSetting(setting)

Specifies how image + hidden text pages should be converted. Default is e_ocr_image_text. Note: This feature does not perform OCR. Note: This option is only available for e_reflow_paragraphs mode.
Parameters:
Name Type Description
setting number
PDFNet.Convert.HTMLOutputOptions.SearchableImageSetting = {
	e_ocr_image_text: 0,
	e_ocr_image: 1,
	e_ocr_text: 2
}
the searchable image setting.
Returns:
this object, for call chaining
Type
PDFNet.Convert.HTMLOutputOptions
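
The three settings correspond to which layers stay visible, so the value can be derived from two booleans. The enum values are copied from the listing above; `searchableImageSettingFor` is a hypothetical helper.

```javascript
// Values copied from the SearchableImageSetting listing in this reference.
const SearchableImageSetting = {
  e_ocr_image_text: 0,
  e_ocr_image: 1,
  e_ocr_text: 2,
};

// Hypothetical helper: maps "which layers should be visible" to the
// matching setting value.
function searchableImageSettingFor(showImage, showText) {
  if (showImage && showText) return SearchableImageSetting.e_ocr_image_text;
  if (showImage) return SearchableImageSetting.e_ocr_image;
  if (showText) return SearchableImageSetting.e_ocr_text;
  throw new RangeError('at least one of image or text must be visible');
}

// Usage:
//   options.setSearchableImageSetting(searchableImageSettingFor(false, true));
```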

setSimpleLists(enable)

Determines whether to use <li> tags for list items. Default is false. Note: This option is only available for e_reflow_paragraphs mode.
Parameters:
Name Type Description
enable boolean if true, <li> tags are used for list items.
Returns:
this object, for call chaining
Type
PDFNet.Convert.HTMLOutputOptions

setSimplifyText(enable)

Controls whether the converter optimizes the DOM or preserves text placement accuracy. Default is false.
Parameters:
Name Type Description
enable boolean if true, the converter will try to reduce DOM complexity at the expense of text placement accuracy. Note: This option is only available for e_fixed_position mode.
Returns:
this object, for call chaining
Type
PDFNet.Convert.HTMLOutputOptions

setTitle(title)

Specifies the title for the output HTML. Note: This option is only available for e_reflow_paragraphs mode.
Parameters:
Name Type Description
title string the title of the output HTML.
Returns:
this object, for call chaining
Type
PDFNet.Convert.HTMLOutputOptions
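
Many setters in this class are marked as available in only one mode. The sketch below collects those notes into two lists and reports which planned calls do not match a chosen mode. `findModeMismatches` is a hypothetical helper; the lists cover only the named methods documented above, and the reference does not say what the converter does when a mismatched option is set.

```javascript
// Methods noted above as available only in e_reflow_paragraphs mode.
const REFLOW_ONLY = [
  'setConnectHyphens', 'setDisableVerticalSplit', 'setEmbedImages',
  'setFileConversionTimeoutSeconds', 'setImageDPI', 'setNoPageWidth',
  'setPages', 'setPDFPassword', 'setSearchableImageSetting',
  'setSimpleLists', 'setTitle',
];

// Methods noted above as available only in e_fixed_position mode.
const FIXED_ONLY = [
  'setDPI', 'setMaximumImagePixels', 'setReportFile', 'setScale',
  'setSimplifyText',
];

// Returns the method names that belong to the other mode. Methods without
// a mode note (for example setPreferJPG) are never reported.
function findModeMismatches(mode, methodNames) {
  let otherModeOnly;
  if (mode === 'e_reflow_paragraphs') {
    otherModeOnly = FIXED_ONLY;
  } else if (mode === 'e_fixed_position') {
    otherModeOnly = REFLOW_ONLY;
  } else {
    throw new RangeError('unknown mode: ' + mode);
  }
  return methodNames.filter((name) => otherModeOnly.includes(name));
}

// findModeMismatches('e_fixed_position', ['setDPI', 'setTitle', 'setPreferJPG'])
//   -> ['setTitle']
```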