AI Project: Object Detection with Hugging Face (DETR)

By Ahmed Nabil | April 15, 2026 | April 7, 2026

3D isometric illustration of a street scene with glowing bounding boxes around objects, representing AI object detection with DETR.

We’ve taught our AI to classify an image (e.g., “This is a cat”). Now let’s teach it to find the cat.

Object Detection is a computer vision task that identifies what is in an image and where it is by drawing a “bounding box” around it.

Step 1: Installation

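You’ll need Pillow to handle images, timm for the backbone network that the DETR model relies on, and requests to download the sample image.

pip install transformers torch pillow timm requests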

Step 2: The Code

We’ll use the object-detection pipeline with DETR, a popular model from Facebook AI.

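from transformers import pipeline
from PIL import Image
import requests  # To get an image from the web

# 1. Load the pipeline
# Naming the checkpoint makes the download explicit and repeatable.
# facebook/detr-resnet-50 is the DETR model this task falls back to
# when no model is given.
detector = pipeline("object-detection", model="facebook/detr-resnet-50")

# 2. Get an image
# Let's use a sample image URL
url = "http://images.cocodataset.org/val2017/000000039769.jpg"
response = requests.get(url, stream=True, timeout=30)
response.raise_for_status()  # Stop early if the download failed
img = Image.open(response.raw)

# 3. Run the detector!
# Returns a list of dicts with 'label', 'score', and 'box' keys
results = detector(img)

# 4. Print the results
print("--- Objects Found ---")
for obj in results:
    print(f"Label: {obj['label']}")
    print(f"Confidence: {obj['score']:.4f}")
    print(f"Location: {obj['box']}")
    print("-----")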
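
Detectors often return low-confidence guesses alongside the good ones, so it can help to filter the list before using it. Here is a minimal sketch of that idea in plain Python. The helper name filter_detections, the min_score cutoff, and the sample_results data are all made up for illustration; only the 'label', 'score', and 'box' keys come from the pipeline output shown in this article.

```python
def filter_detections(detections, min_score=0.9, labels=None):
    """Return detections scoring at least min_score, optionally limited to labels."""
    wanted = set(labels) if labels is not None else None
    return [
        det for det in detections
        if det["score"] >= min_score and (wanted is None or det["label"] in wanted)
    ]


# Made-up detections in the same shape the pipeline returns
sample_results = [
    {"label": "cat", "score": 0.996, "box": {"xmin": 30, "ymin": 19, "xmax": 289, "ymax": 375}},
    {"label": "remote", "score": 0.62, "box": {"xmin": 42, "ymin": 74, "xmax": 176, "ymax": 118}},
    {"label": "couch", "score": 0.35, "box": {"xmin": 0, "ymin": 1, "xmax": 639, "ymax": 473}},
]

# Keep anything the model is at least 50% sure about
confident = filter_detections(sample_results, min_score=0.5)

# Same cutoff, but only cats
only_cats = filter_detections(sample_results, min_score=0.5, labels=["cat"])

print([det["label"] for det in confident])
print([det["label"] for det in only_cats])
```

With real output you would pass results instead of sample_results. The pipeline call itself also takes a threshold argument (for example detector(img, threshold=0.9)) if you would rather filter at the source.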

Step 3: The Result

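The output is a list with one entry per object the model found. Each entry has a label, a confidence score, and a bounding box. Your exact scores and coordinates may differ, but the output looks like this: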

--- Objects Found ---
Label: remote
Confidence: 0.9982
Location: {'ymin': 74, 'xmin': 42, 'ymax': 118, 'xmax': 176}
-----
Label: cat
Confidence: 0.9960
Location: {'ymin': 19, 'xmin': 30, 'ymax': 375, 'xmax': 289}
-----
Label: cat
Confidence: 0.9952
Location: {'ymin': 12, 'xmin': 255, 'ymax': 375, 'xmax': 640}
-----
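
Numbers are hard to picture, so one natural next step is to draw the boxes. The sketch below uses Pillow’s ImageDraw for that. To stay self-contained it draws on a blank white canvas and reuses the box values printed above. The draw_boxes helper and the canvas are assumptions for illustration, not part of the pipeline.

```python
from PIL import Image, ImageDraw


def draw_boxes(image, detections, color="red", width=3):
    """Return a copy of image with a labeled rectangle for each detection."""
    annotated = image.copy()  # Leave the original image untouched
    draw = ImageDraw.Draw(annotated)
    for det in detections:
        box = det["box"]
        # Pillow expects [left, top, right, bottom] in pixels
        corners = [box["xmin"], box["ymin"], box["xmax"], box["ymax"]]
        draw.rectangle(corners, outline=color, width=width)
        caption = f"{det['label']} {det['score']:.2f}"
        draw.text((box["xmin"] + 5, box["ymin"] + 5), caption, fill=color)
    return annotated


# Blank stand-in for the real photo (assumed size: 640x480)
canvas = Image.new("RGB", (640, 480), "white")

# Box values copied from the example output above
detections = [
    {"label": "remote", "score": 0.9982, "box": {"ymin": 74, "xmin": 42, "ymax": 118, "xmax": 176}},
    {"label": "cat", "score": 0.9960, "box": {"ymin": 19, "xmin": 30, "ymax": 375, "xmax": 289}},
    {"label": "cat", "score": 0.9952, "box": {"ymin": 12, "xmin": 255, "ymax": 375, "xmax": 640}},
]

annotated = draw_boxes(canvas, detections)
```

In the real script you would call draw_boxes(img, results) and then annotated.save("detections.png") or annotated.show() to look at the result.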

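It found the remote and both cats! Each box gives the pixel coordinates of its top-left corner (xmin, ymin) and its bottom-right corner (xmax, ymax). You can use these results to count items, track objects across video frames, and more.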
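
Counting is the simplest of those uses. As a rough sketch, the standard library’s Counter can tally the labels. The count_labels helper is a made-up name, and the detections list just repeats the labels and scores from the example output (boxes left out for brevity).

```python
from collections import Counter


def count_labels(detections):
    """Tally how many times each label appears in a list of detections."""
    return Counter(det["label"] for det in detections)


# Labels and scores from the example output; boxes are not needed for counting
detections = [
    {"label": "remote", "score": 0.9982},
    {"label": "cat", "score": 0.9960},
    {"label": "cat", "score": 0.9952},
]

counts = count_labels(detections)
for label, total in counts.most_common():
    print(f"{label}: {total}")
```

A Counter returns 0 for labels it never saw, so counts["dog"] is safe to read even when no dog was detected.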

Key Takeaways

This article shows how to use Hugging Face Transformers for object detection, which means locating objects in images.
Object detection identifies what is in an image and where it is, using bounding boxes.
Installation requires Pillow for image handling and timm for the DETR model’s backbone.
Use the object-detection pipeline with the DETR model from Facebook AI to find objects.
The output is a list of detected objects, each with a label, a confidence score, and a box, which you can use for counting items and tracking objects in videos.

Ahmed Nabil

Python engineer and founder of Python Pro Hub. Ahmed focuses on modern data science (Polars), backend architecture (FastAPI/Django), and automation, and builds production-grade tutorials designed to take developers from absolute beginners to advanced software engineers.
