Some test text!
To convert PDF documents to different format types.
doc = PDFDoc(filename) # Convert PDF document to SVG Convert.ToSvg(doc, output_filename + ".svg") # Convert PDF document to XPS Convert.ToXps(filename, output_filename + ".xps") # Convert PDF document to multipage TIFF tiff_options = Convert.TiffOutputOptions() tiff_options.SetDPI(200) tiff_options.SetDither(true) tiff_options.SetMono(true) Convert.ToTiff(filename, output_filename + ".tiff", tiff_options) # Convert PDF to XOD Convert.ToXod(filename, output_filename + ".xod") # Convert PDF to HTML Convert.ToHtml(filename, output_filename + ".html")
PDF Converter (SVG, XPS, TIFF, JPG, RTF, TXT, More)
Full sample code which shows how to use PDFNet Convert for direct, high-quality conversion between PDF, XPS, EMF, SVG, TIFF, PNG, JPEG, and other image formats.
The PDFTron SDK also supports converting from PDF to other formats like EMF, EPUB, XOD, HTML and XPS.
In addition to the document formats, exporting to image formats like TIFF, SVG, PNG and JPEG are supported too.
Semantic structure information like tables, headers, footers, paragraphs are not part of the PDF specification and do not exist in PDFs. To extract this type of data, any type of conversion or extraction tool will need to have a good document understanding to differentiate between tables or paragraphs. As part of our efforts at PDFTron to provide cutting edge document tools, we have created PDFTron.AI - a utility for extracting tables and text from existing PDF documents as HTML or XML.
Depending on your use case, PDF to HTML can be used for rendering with high fidelity and accuracy or to primarily be used in content extraction. This means our tools can help you to display the output or be used in data analysis workflows.
Here are the different options for PDF to HTML conversion depending on your requirements:
Here are the options for maintaining the original PDF layout and visual accuracy.
To convert PDF to HTML canvas in real-time client-side.
PDF to HTML/ePub
To convert PDF to fixed layout HTML/ePub where one PDF page becomes one HTML file.
To convert PDF to SVG to create a vector based image that can be embedded in an HTML file.
To convert PDF to Image (PNG, JPG, TIFF, Raw) to create a raster based image that can be embedded in an HTML file.
Here are the options for extracting semantic content from the output.
To convert PDF to a single HTML file that preserves the PDF content using a custom heuristic method.
To convert PDF to HTML where tables are identified using a custom deep learning model.
Get the answers you need: Support
Get unlimited trial usage of PDFTron SDK to bring accurate, reliable, and fast document processing capabilities to any application or workflow.
Select a platform to get started with your free trial.
Unlimited usage. No email address required.