Working with JSON in Polars: The .json Namespace (2026 Guide)

ByAhmed Nabil April 22, 2026April 14, 2026

3D visualization of a machine converting a messy tree structure into flat table blocks, representing Polars JSON handling.

It’s very common to have a column in your data that contains a JSON string. In Pandas, this is slow and difficult to work with. In Polars, it’s fast and easy, thanks to the .json namespace.

This allows you to query inside the JSON string without having to parse the whole thing first.

The Setup

Let’s create a DataFrame with a messy JSON string column.

import polars as pl

df = pl.DataFrame({
    "id": [1, 2],
    "json_data": [
        '{"name": "Alice", "age": 30, "city": "New York"}',
        '{"name": "Bob", "age": 45, "city": "London"}'
    ]
})

1. `json_path_match`: Check for Data

Let’s find all rows where the person’s name is “Alice”. We use a “JSONPath” expression ($.name) to look inside the string.

# Find rows where the 'name' key inside 'json_data' is "Alice"
result = df.filter(
    pl.col("json_data").json.json_path_match("$.name") == "Alice"
)
print(result)

This is incredibly fast and efficient.

2. `json_extract`: Pulling Data Out

What if you just want to get the “age” and “city” out into their own columns? Use json_extract().

df.with_columns(
    # Parse the JSON string into a Polars "Struct" (like a dict)
    pl.col("json_data").str.json_decode().alias("parsed_json")
).unnest("parsed_json")

This single command uses .str.json_decode() to parse the string, and then .unnest() to split the JSON keys (name, age, city) into their own separate columns.

Output:

shape: (2, 4)
┌─────┬───────────────────────────┬───────┬─────┬──────────┐
│ id  ┆ json_data                 ┆ name  ┆ age ┆ city     │
│ --- ┆ ---                       ┆ ---   ┆ --- ┆ ---      │
│ i64 ┆ str                       ┆ str   ┆ i64 ┆ str      │
╞═════╪═══════════════════════════╪═══════╪═════╪══════════╡
│ 1   ┆ {"name": "Alice", "age":… ┆ Alice ┆ 30  ┆ New York │
│ 2   ┆ {"name": "Bob", "age": 4… ┆ Bob   ┆ 45  ┆ London   │
└─────┴───────────────────────────┴───────┴─────┴──────────┘

Ahmed Nabil

Python Engineer and the founder of Python Pro Hub. With a focus on modern data science (Polars), backend architecture (FastAPI/Django), and automation, builds production-grade tutorials designed to take developers from absolute beginners to advanced software engineers.

Data Science
How to Find and Remove Duplicate Rows in Polars (2026 Guide)
ByAhmed Nabil May 16, 2026April 22, 2026
Duplicate data is a silent killer for analysis and machine learning. Polars provides high-speed, easy-to-use methods for finding and removing duplicate rows. The Setup Let’s…
Read More How to Find and Remove Duplicate Rows in Polars (2026 Guide)
Data Science
Interactive Polars: Plotting with hvplot (2026 Guide)
ByAhmed Nabil May 30, 2026April 25, 2026
Matplotlib and Seaborn create static, non-interactive images. In 2026, data exploration is interactive. hvplot is a library that provides a .hvplot() method for Pandas and…
Read More Interactive Polars: Plotting with hvplot (2026 Guide)
Data Science | Python Projects
AI Project: Object Detection with Hugging Face (DETR)
ByAhmed Nabil April 15, 2026April 7, 2026
We’ve taught our AI to classify an image (e.g., “This is a cat”). Now let’s teach it to find the cat. Object Detection is a…
Read More AI Project: Object Detection with Hugging Face (DETR)
Data Science
Polars Performance: String Caching (Categorical Type)
ByAhmed Nabil May 4, 2026April 22, 2026
Let’s say you have a 10GB file with a “Country” column. The string “United States of America” might appear 50 million times, using a massive…
Read More Polars Performance: String Caching (Categorical Type)
Data Science
Polars Window Functions: shift() and rank() Explained
ByAhmed Nabil June 5, 2026May 5, 2026
Today we’re covering two powerful Polars Expressions: shift and rank. These are essential for financial analysis, ranking, and finding trends. In this article you’ll learn…
Read More Polars Window Functions: shift() and rank() Explained
Data Science | Python Projects
AI Project: Fill-in-the-Blank with Hugging Face (BERT)
ByAhmed Nabil May 8, 2026April 22, 2026
This is one of the original tasks that made models like BERT famous. A “Masked Language Model” (MLM) is trained by having words randomly “masked”…
Read More AI Project: Fill-in-the-Blank with Hugging Face (BERT)

The Setup

1. json_path_match: Check for Data

2. json_extract: Pulling Data Out

Similar Posts

Leave a Reply Cancel reply

1. `json_path_match`: Check for Data

2. `json_extract`: Pulling Data Out