Some test text!

Loading...
Guides
Optimize

Optimize & compress PDFs in Python

To optimize a PDF with default settings.

doc = PDFDoc(filename)
Optimizer.Optimize(doc)

Compress & optimize PDF files
Full code sample which shows how to use 'pdftron.PDF.Optimizer' to reduce PDF file size by removing redundant information and compressing data streams using the latest in image compression technology.

About optimize and compress

Compression as a subset of optimization represents encoding specific data using fewer bits than the original content by reducing the size of the data. This is distinct from optimize (which modifies all images) because you have the ability to choose individual images and to selectively choose the compression type for each.

PDFTron SDK supports all basic and advanced compression filters allowed in PDF including:

  • JPEG2000
  • JBIG2
  • CCITT Fax
  • Flate/PNG
  • JPEG/DCT
  • Crypt Filters

Compress images in a PDF document

To compress images using JBIG2 compression inside a PDF.

doc = PDFDoc(filename)
cos_doc = pdf_doc.GetSDFDoc()
num_objs = cos_doc.XRefSize()

i = 1
while i < num_objs:
	obj = cos_doc.GetObj(i)
	if obj is None or obj.IsFree() or not obj.IsStream():
		i = i + 1
		continue

	# Process only images
	itr = obj.Find("Subtype")
	if not itr.HasNext() or not itr.Value().GetName() == "Image":
		i = i + 1
		continue
		
	input_image = Image(obj)

	# Process only gray-scale images
	if input_image.GetComponentNum() != 1:
		i = i + 1
		continue
		
	# Skip images that are already compressed using JBIG2
	itr = obj.Find("Filter")
	if (itr.HasNext() and itr.Value().IsName() and itr.Value().GetName() == "JBIG2Decode"):
		i = i + 1
		continue
		
	filter = obj.GetDecodedStream()
	reader = FilterReader(filter)
	
	hint_set = ObjSet()     # hint to image encoder to use JBIG2 compression
	hint = hint_set.CreateArray()
	
	hint.PushBackName("JBIG2")
	hint.PushBackName("Lossless")
	
	new_image = Image.Create(
		cos_doc, 
		reader, 
		input_image.GetImageWidth(), 
		input_image.GetImageHeight(), 
		1, 
		ColorSpace.CreateDeviceGray(), 
		hint
	)
	cos_doc.Swap(i, new_img.GetSDFDoc().GetObjNum())
	i = i + 1

PDF image JBIG2 compression
Full sample code which illustrates how to recompress bitonal (black and white) images in existing PDF documents using JBIG2 compression.

Get the answers you need: Support