Convert PDF to multiple file types in Python

The sample below converts a PDF document to several other formats.

An internet connection is not required for conversion.
Convert PDF to DOCX, DOC, HTML, SVG, TIF, PNG, JPEG, XPS, EPUB, TXT, and many other formats.
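
# Assumes the PDFNetPython3 package is installed and that
# PDFNet.Initialize() has already been called.
from PDFNetPython3 import *

filename = "input.pdf"      # placeholder: path to the source PDF
output_filename = "output"  # placeholder: output path without extension

doc = PDFDoc(filename)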

# Convert PDF document to SVG
Convert.ToSvg(doc, output_filename + ".svg")

# Convert PDF document to XPS
Convert.ToXps(filename, output_filename + ".xps")

# Convert PDF document to multipage TIFF
tiff_options = Convert.TiffOutputOptions()
Convert.ToTiff(filename, output_filename + ".tiff", tiff_options)

# Convert PDF to XOD
Convert.ToXod(filename, output_filename + ".xod")

# Convert PDF to HTML
Convert.ToHtml(filename, output_filename + ".html")
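
When the target format is only known at run time, calls like the ones above can sit behind a small lookup keyed by file extension. The sketch below is an illustration only: `pick_converter` and the `converters` mapping are hypothetical names, not part of the PDFTron SDK, and the stand-in lambdas take the place of real calls such as `Convert.ToXps` so that the snippet runs without the SDK installed.

```python
import os


def pick_converter(output_path, converters):
    """Return the converter registered for output_path's extension.

    `converters` maps a lowercase extension (e.g. ".svg") to a callable
    taking (input_path, output_path). Both names are hypothetical.
    """
    ext = os.path.splitext(output_path)[1].lower()
    try:
        return converters[ext]
    except KeyError:
        raise ValueError(f"no converter registered for {ext!r}") from None


# Stand-ins for SDK calls; in real use each entry would wrap a call such as
# Convert.ToXps(input_path, output_path) from the sample above.
calls = []
converters = {
    ".xps": lambda src, dst: calls.append(("xps", src, dst)),
    ".html": lambda src, dst: calls.append(("html", src, dst)),
}

# The extension match is case-insensitive, so ".XPS" resolves to the ".xps" entry.
pick_converter("report.XPS", converters)("report.pdf", "report.XPS")
```

Keeping the mapping as plain data makes it easy to see which formats an application exposes, and an unsupported extension fails early with a clear error instead of producing a mislabeled file.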

PDF Converter (SVG, XPS, TIFF, JPG, RTF, TXT, More)
Full sample code which shows how to use PDFNet Convert for direct, high-quality conversion between PDF, XPS, EMF, SVG, TIFF, PNG, JPEG, and other image formats.

About converting from PDF

The PDFTron SDK also supports converting from PDF to other formats like EMF, EPUB, XOD, HTML and XPS.

In addition to document formats, exporting to image formats such as TIFF, SVG, PNG, and JPEG is also supported.

Semantic structure information such as tables, headers, footers, and paragraphs is not required by the PDF specification and is missing from most PDFs. To extract this type of data, a conversion or extraction tool needs to understand the document well enough to tell tables from paragraphs. As part of our efforts at PDFTron to provide cutting-edge document tools, we have created PDFTron.AI, a utility for extracting tables and text from existing PDF documents as HTML or XML.
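
To see why this takes inference, consider that a PDF page typically stores positioned text fragments rather than rows or cells. The toy heuristic below groups fragments into rows by vertical position. It is a deliberately naive sketch and not how PDFTron.AI works; the `(x, y, text)` fragment tuples, the `group_into_rows` name, and the tolerance value are all assumptions made for illustration.

```python
def group_into_rows(fragments, tolerance=2.0):
    """Group (x, y, text) fragments whose y values lie within `tolerance`.

    Rows are returned top to bottom (larger y first, as in PDF page
    coordinates), and each row is ordered left to right.
    """
    rows = []
    for x, y, text in sorted(fragments, key=lambda f: -f[1]):
        # Compare against the first fragment placed in the current row.
        if rows and abs(rows[-1]["y"] - y) <= tolerance:
            rows[-1]["cells"].append((x, text))
        else:
            rows.append({"y": y, "cells": [(x, text)]})
    return [[text for _, text in sorted(row["cells"])] for row in rows]


# Hypothetical fragments: two table-like rows followed by a sentence.
fragments = [
    (72, 700, "Item"), (300, 700.5, "Price"),
    (72, 680, "Widget"), (300, 680, "4.00"),
    (72, 640, "Prices exclude tax."),
]
rows = group_into_rows(fragments)
```

Rows with several cells at consistent x positions hint at a table, while single-cell rows hint at running text. Real documents break this quickly (wrapped cells, multi-column layouts, rotated text), which is why heuristic or learned models are needed.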

About PDF to HTML

Depending on your use case, PDF to HTML conversion can target either high-fidelity rendering or content extraction. The output can therefore be displayed to users or fed into data analysis workflows.

Here are the different options for PDF to HTML conversion depending on your requirements:

PDF to HTML for the highest rendering accuracy

Here are the options for maintaining the original PDF layout and visual accuracy.

To convert PDF to HTML canvas in real-time client-side.

PDF to HTML/ePub
To convert PDF to fixed layout HTML/ePub where one PDF page becomes one HTML file.

To convert PDF to SVG to create a vector based image that can be embedded in an HTML file.

To convert PDF to Image (PNG, JPG, TIFF, Raw) to create a raster based image that can be embedded in an HTML file.
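
For the SVG and raster image options, the converted pages still need an HTML wrapper. As a rough sketch, assuming each page has already been exported to its own image file, a plain HTML page could reference the files in order. The function name `build_html_page` and the file names are made up for illustration and are not part of the SDK.

```python
from html import escape


def build_html_page(title, image_paths):
    """Return an HTML document with one <img> element per converted page."""
    body = "\n".join(
        f'  <img src="{escape(path, quote=True)}" alt="Page {number}">'
        for number, path in enumerate(image_paths, start=1)
    )
    return (
        "<!DOCTYPE html>\n"
        f"<html>\n<head><title>{escape(title)}</title></head>\n"
        f"<body>\n{body}\n</body>\n</html>\n"
    )


# Hypothetical per-page SVG files produced by an earlier conversion step.
page = build_html_page("Report", ["report_1.svg", "report_2.svg"])
```

The same wrapper works for PNG or JPEG pages. SVG keeps text and line art sharp at any zoom level, while raster images have a fixed resolution.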

PDF to HTML for extracting semantic content

Here are the options when the goal is to extract semantic content.

To convert PDF to a single HTML file that preserves the PDF content using a custom heuristic method.

To convert PDF to HTML where tables are identified using a custom deep learning model.
