Welcome to PyMuPDF

PyMuPDF is a high-performance Python library for data extraction, analysis, conversion, and manipulation of PDF (and other) documents.

PyMuPDF is hosted on GitHub and registered on PyPI.

This documentation covers all versions up to 1.27.2.3.

About

User Guide

How to Guide

API Reference

Command line interface
Classes
- Annot
- Archive
- Colorspace
- DisplayList
- Document
- DocumentWriter
- Font
- Identity
- IRect
- Link
- linkDest
- Matrix
- Outline
- Page
- Pixmap
- Point
- Quad
- Rect
- Shape
- Story
- TextPage
- TextWriter
- Tools
- Widget
- Xml
The PyMuPDF4LLM API
Operator Algebra for Geometry Objects
Low Level Functions and Classes
Glossary
- coordinate
- matrix_like
- rect_like
- irect_like
- point_like
- quad_like
- inheritable
- MediaBox
- CropBox
- catalog
- trailer
- contents
- resources
- dictionary
- page
- pagetree
- object
- stream
- unitvector
- xref
- fontsize
- resolution
- OCPD
- OCCD
- OCG
- OCMD
- ligature
Constants and Enumerations
Color Database
- Function getColor()
- Printing the Color Database

This software is provided AS-IS with no warranty, either express or implied. This software is distributed under license and may not be copied, modified or distributed except as expressly authorized under the terms of that license. Refer to licensing information at artifex.com or contact Artifex Software Inc., 39 Mesa Street, Suite 108A, San Francisco CA 94129, United States for further information.