Working with PDFs in Python: Merging and Splitting Pages

ByAhmed Nabil February 4, 2026March 18, 2026

3D isometric visualization of a machine merging loose PDF pages into a book and splitting a book into pages using Python.

Handling PDFs is a daily task for many, but software to edit them can be expensive. Python PDF Automation tools can manage this task for free using the <a href="https://pypi.org/project/pypdf/" type="link" id="https://pypi.org/project/pypdf/">pypdf</a> library.

Step 1: Install the Library

pip install pypdf

Task 1: Merging Multiple PDFs

Imagine you have report_part1.pdf and report_part2.pdf and you want to combine them using Python for automated PDF processing.

from pypdf import PdfWriter

merger = PdfWriter()

# List of PDF files to merge, in order
pdf_files = ["report_part1.pdf", "report_part2.pdf"]

for pdf in pdf_files:
    merger.append(pdf)

# Write the combined file
merger.write("merged_report.pdf")
merger.close()
print("PDFs merged successfully!")

Task 2: Splitting a PDF (Extracting Pages)

What if you only want page 3 from a 100-page document? Python PDF automation simplifies this by extracting specific pages.

from pypdf import PdfReader, PdfWriter

# Open the big file
reader = PdfReader("big_document.pdf")
writer = PdfWriter()

# Get page 3 (Remember, Python is 0-indexed, so page 3 is index 2!)
page_3 = reader.pages[2]
writer.add_page(page_3)

# Save it as a new file
with open("page_3_only.pdf", "wb") as output_file:
    writer.write(output_file)

print("Page extracted successfully!")

Note the "wb" mode when opening the file. This stands for “Write Binary”, which is required for non-text files like PDFs.

Key Takeaways

Editing PDFs can be expensive, but Python PDF Automation tools like the pypdf library offer a free solution.
To start, install the pypdf library for managing PDFs easily.
You can merge multiple PDFs, such as report_part1.pdf and report_part2.pdf, using Python.
Additionally, you can extract specific pages from a PDF, like page 3 from a 100-page document.
Remember to open files in ‘wb’ mode for writing binary data when working with PDFs.

Ahmed Nabil

Python Engineer and the founder of Python Pro Hub. With a focus on modern data science (Polars), backend architecture (FastAPI/Django), and automation, builds production-grade tutorials designed to take developers from absolute beginners to advanced software engineers.

Automation
Automating Excel with Python: Reading and Writing .xlsx Files (openpyxl)
ByAhmed Nabil February 21, 2026February 4, 2026
Do you need to edit an Excel file but keep its style and formulas? While Pandas is good for checking data, it often ignores the…
Read More Automating Excel with Python: Reading and Writing .xlsx Files (openpyxl)
Python Basics
An Advanced Guide to Python f-strings (Beyond the Basics)
ByAhmed Nabil June 3, 2026April 30, 2026
You already use f-strings to put variables in text: f"Hello, {name}!" But f-strings are far more powerful than that. In this article, we’ll look at…
Read More An Advanced Guide to Python f-strings (Beyond the Basics)
Python Basics
Inheritance in Python: How to Re-use Code in Classes
ByAhmed Nabil January 24, 2026March 17, 2026
In our Guide to OOP, we created a Dog class. But what if we also need a Cat class? They both have names and ages….
Read More Inheritance in Python: How to Re-use Code in Classes
Python Projects
Intermediate Python Project: Build an Amazon Price Tracker
ByAhmed Nabil January 28, 2026March 17, 2026
Following up on our Web Scraping 101, let’s build something useful: a Python Price Tracker script that checks a product price and tells us if…
Read More Intermediate Python Project: Build an Amazon Price Tracker
Automation
Browser Automation with Selenium: Controlling Chrome with Python
ByAhmed Nabil February 16, 2026March 18, 2026
Sometimes requests and BeautifulSoup aren’t enough. For a comprehensive solution, look no further than this Selenium Python Guide. Modern websites use JavaScript to load data,…
Read More Browser Automation with Selenium: Controlling Chrome with Python
Advanced Python
Python yield Explained: A Deep Dive into Generators
ByAhmed Nabil March 14, 2026March 21, 2026
If you’ve ever worked with huge files or infinite sequences, you’ve needed a generator. The keyword that powers them is yield. In this article, you’ll…
Read More Python yield Explained: A Deep Dive into Generators

Step 1: Install the Library

Task 1: Merging Multiple PDFs

Task 2: Splitting a PDF (Extracting Pages)

Key Takeaways

Similar Posts

Leave a Reply Cancel reply