PDFTron logo

PDFNet SDK - What's New?

Version 10.9.0 - May 1st, 2024
See the 10.9.0 changelog on apryse.com

New In This Release

Version 10.9 is primarily a bugfix release.

Support for MHT and EML

  • If the HTML2PDF module is available, Convert.ToPdf() and HTML2PDF.Convert() will now automatically use this module to convert .mht/.mhtml and .eml files to PDF.

New Digital Signature Utilities

  • Added support for digitally signing using the RSASSA-PSS signature algorithm. (DigitalSignatureField.GenerateCMSSignature() that takes AlgorithmIdentifier, RSASSAPSSParams class)

New Options

  • Added an option to save incrementally to a file when converting from Office to PDF. This saving leads to a significant reduction in memory usage when processing documents with many pages (particularly large Excel documents), but may slow the conversion down if the filesystem is slow. (OfficeToPDFOptions.SetIncrementalSave()).
  • Added options to force or disable OCR processing when converting from PDF to PowerPoint or Excel using the Structured Output Module. These options bring these formats in line with the existing option for conversion to Word. (PowerPointOutputOptions.SetSearchableImageSetting() and ExcelOutputOptions.SetSearchableImageSetting()).
  • Added an option to exclude bookmarks when converting office to PDF (OfficeToPDFOptions.SetIncludeBookmarks()).

Improvements:

  • [html] Added support for HTML2PDF.WebPageSettings.SetAllowJavaScript() when using html2pdf_chromium. This method is no longer deprecated.
  • [pdf] Fixed occasional excessive highlights with semantic text comparison.
  • [pdf] Exposed PDFDoc.FDFExtract() with a PageSet argument on Java and all versions of .Net.
  • [all] Improved connection error messaging for network-enabled consumption license keys.
  • [pdf] Adjusted cloudy FreeText annotation text padding and alignment to more closely match other PDF viewers.
  • [ocr] Improved OCR quality for documents containing low resolution bitmaps that are not aligned with the crop box.
  • [pdf] Improved support for extracting text from corrupt PDF documents with mismatched q/Q operator pairs.
  • [pdf] Improved resilience of digital signature validation against corrupt files containing free references. Previously these could cause an exception to be thrown.

Bugfixes:

  • [node.js] Added missing e_ccitt enum value in PDFNet.Optimizer.MonoImageSettings.CompressionMode for Node.js.
  • [html] Fixed issue when converting HTML to PDF that could lead to unexpectedly large PDF output.
  • [html] Fixed invalid PDF structure tree generated by HTML to PDF conversion.
  • [pdf] Fixed an issue with DataExtractionModule.DetectAndAddFormFieldsToPDF() when a page range is specified.
  • [pdf] Fixed an issue with potentially incorrect text extraction in the case where a symbolic TrueType font is used for Latin text.
  • [pdf] Fixed an issue with GeometryCollection where processing of Form XObjects could fail thereby preventing snapping to objects described within.
  • [pdf] Fixed PDF rendering issues with a rare type of tiling pattern.
  • [pdfa] Fixed a PDF/A validation error after converting a PDF that uses device CMYK colorspace with no default and has overprint enabled in the graphics state.
  • [pdf] Fixed an issue with calling Annot.RefreshAppearance(RefreshOptions) in Python, Ruby, PHP and Go.
  • [xod] Fixed an issue where some content could be missing after converting certain XOD files to PDF.
  • [pdf] Fixed an issue where disabling digest verification would make TimestampingConfiguration.TestConfiguration() report a failure.
  • [pdfa] Fixed an issue with validation and conversion to PDF/A where "/Interpolate true", which is not allowed in PDF/A, could be ignored in the "Mask" entry of images and in the page thumbnail.
  • [pdfa] Fixed an issue when converting some documents to PDF/A-1, where the output could cause errors in other PDF consumers. This occurred when processing corrupt PDF files containing image masks that also have a colorspace.

Office Fidelity:

  • [office] Fixed multiple element placement issues in stacked column charts.
  • [docx] Added clickable hyperlinks to tab leaders in the table of contents.
  • [docx] Fixed an issue where the text within a textbox could appear upside down in Word 2013 documents.
  • [docx] Enabled heading structure tags for PDF output.
  • [xlsx] Reduced peak memory usage for Excel document conversions.
  • [docx] Tuned line height and text baseline placement in a number of different scenarios.
  • [xls] Fixed an issue with incorrect width of some shapes in XLS documents.
  • [docx] Fixed a rare issue with infinite pages caused by floating elements pushed outside page margins.
  • [docx] Fixed a rare "pure virtual method called" error on Linux.
  • [xlsx] Fixed an issue with incorrect handling of Excel documents with more than 65,535 rows.
  • [office] Various fixes for office to PDF accessibility tags for tables.
  • [pdf] Changed font subsetting to also subset glyph widths, reducing PDF file size.
  • [pdf] Improved font processing performance for office PDFs.
  • [docx] Fixed an issue where a floating image could incorrectly overlap a line of text.
  • [xls] Fixed a rare issue with unexpected (extra or missing) cell borders in Excel documents.
  • [office] Added office file metadata to PDF output in office to PDF conversion (title, author. etc.).
  • [docx] Improved text indentation accuracy within paragraphs and lists.
  • [docx] Fixed an issue where floating images could be placed on the wrong page.
  • [office] Fixed an issue with certain Thai accent characters, causing the Adobe "Character encoding" accessibility check to fail.
  • [xlsx] Fixed an issue that could produce incorrect Excel cell margins for merged cells.
  • [docx] Fixed a bug where vertically justified table cell contents were mispositioned.
  • [doc] Fixed incorrect table header styles in some .doc files.
  • [docx] Implemented clipping of floating elements inside table cells.
  • [xls] Fixed an issue with too narrow column widths in some .xls documents.
  • [docx] Fixed issues with incorrect application of kerning in Word documents.
  • [docx] Improved the accuracy of paragraph spacing in Word documents.
  • [office] Added accessibility tags for link annotations.
  • [docx] Fixed a rare issue where an extra line could be added to the end of a table cell.
  • [xlsx] Improved page breaking for large sheets when the ApplyPageBreaksToSheet option is not set.
  • [docx] Improved text layout in justified paragraphs to better match Word's behavior.
  • [xls] Added support for color scale conditional formatting for .xls files.
  • [docx] Fixed an issue with text wrapping around images in table cells.
  • [xls] Improved reading of shapes from malformed .xls documents.
  • [xlsx] Added support for East Asian date formats in Excel.
  • [xlsx] Fixed an issue with the display of numbers with rounding errors in Excel.
  • [pptx] Fixed a rare issue with incorrect shape fill style inherited from the slide master.
  • [docx] Fixed a rare issue where extra table rows were repeated on every page.
  • [office] Fixed incorrect scaling of some SVG images.
  • [xlsx] Fixed a bug where an Excel sheet was extended to its maximum height due to a full-column merged cell.
  • [office] Improved font substitution for multi-language documents.
  • [office] Improved typesetting of Arabic text.
  • [office] Improved font substitution by preferring an exact font match if available in documents containing characters that are not supported by the font.
  • [docx] Added accessibility tags for table header cells.
  • [docx] Fixed a bug where a table row could be be incorrectly interpreted as a repeated header.

Fixes and improvements for the Structured Output Module

  • [docx] Fixed an issue preventing successful conversion of a file.
  • [docx] Fixed an issue preventing one image of many from being correctly rendered.
  • [docx] Fixed an issue preventing successful conversion of a file on Linux operating systems only.
  • [docx] Fixed an issue preventing the detection of a Table of Contents due to the text order of the file.
  • [docx] Improved detection of breaks on scanned documents containing Arabic text.
  • [office] Streamlined optical character recognition workflow of large documents containing non-standard encoded text.
  • [office] Allowed page snapshot deletion where annotations exist.
  • [office] Improved processing of non-standard encoded characters to unicode.
  • [office] Improved detection of combined characters.
  • [office] Improved detection of Arabic diacritic characters.
  • [office] Improved detection of transparent watermarks over scanned pages.
  • [docx] Improved detection of Table of Contents.
  • [docx] Improved detection of shapes when converting to DOCX.
  • [docx] Improved detection of serial images that contain underlines.
  • [office] A limited number of third-party libraries have been updated to include the latest security fixes.

Version 10.8.0 - March 20th, 2024
Version 10.7.0 - February 7th, 2024
Version 10.6.0 - December 6th, 2023
Version 10.5.0 - October 25th, 2023
Version 10.4.0 - September 13th, 2023
Version 10.3.0 - Aug 2nd, 2023
Version 10.2.0 - June 21st, 2023
Version 10.1.0 - May 10th, 2023
Version 10.0.0 - March 29th, 2023
Version 9.5.0 - February 17th, 2023
Version 9.4.2 - December 19th, 2022
Version 9.4.0 - October 12th, 2022
Version 9.3.0 - July 5th, 2022
Version 9.2.0 - January 27th, 2022
Version 9.1.0 - August 24th, 2021
Version 9.0.0 - April 9th, 2021
Version 8.1.0 - November 30th, 2020
Version 8.0.0 - September 4th, 2020
Version 7.1.0 - February 18th, 2020
Version 7.0.1 - August 20th, 2019
Version 7.0.0 - July 25th, 2019


Older Changelogs