Python Automation: How to Read Text from Images (OCR)

ByAhmed Nabil March 16, 2026February 4, 2026

3D visualization of a laser scanner extracting glowing text from a physical photo, representing Python OCR.

OCR (Optical Character Recognition) is the process of “reading” the text out of an image file. This is perfect for automating data entry from scanned invoices, receipts, or old documents.

Python’s most popular tool for this is pytesseract.

⚠️ Step 1: Installation (Crucial!)

This is a 2-part process. 1. Install the Python library:

pip install pytesseract pillow

2. Install Google’s Tesseract Engine: pytesseract is just a “wrapper.” It needs the actual Tesseract program to be installed on your computer.

On Mac: brew install tesseract
On Windows: Download and run the official Tesseract installer. You MUST add it to your system PATH.

Step 2: The Python Code

Let’s assume you have an image file named receipt.png that contains text.

from PIL import Image
import pytesseract

# On Windows, you might need to point to the .exe file
# pytesseract.pytesseract.tesseract_cmd = r'C:\Program Files\Tesseract-OCR\tesseract.exe'

try:
    # 1. Open the image using Pillow
    img = Image.open('receipt.png')
    
    # 2. Use tesseract to convert the image to text
    text = pytesseract.image_to_string(img)
    
    # 3. Print the result!
    print("--- Text found in image: ---")
    print(text)
    
except FileNotFoundError:
    print("Error: Could not find 'receipt.png'.")
except Exception as e:
    print(f"An error occurred. Did you install Tesseract? Error: {e}")

You can now write scripts that scan a folder of images, extract the text, and save it to a .txt file or an Excel sheet.

Ahmed Nabil

Python Engineer and the founder of Python Pro Hub. With a focus on modern data science (Polars), backend architecture (FastAPI/Django), and automation, builds production-grade tutorials designed to take developers from absolute beginners to advanced software engineers.