AI Project: Build a Recommender System with Hugging Face datasets

ByAhmed Nabil June 1, 2026May 1, 2026

3D isometric illustration of a robot drawing a gold connection line between a user and a recommended movie, representing a recommender system.

Recommender systems are the engine of the modern internet (Netflix, Amazon, Spotify). In this post, we’ll introduce a Hugging Face Recommender System and explore how it works. The simplest type is “content-based filtering,” which asks: “If you like this item, what other items are most similar to it?”

We can do this by:

Loading a dataset (e.g., of movies with descriptions).
Converting all descriptions into Text Embeddings.
Using a special library called FAISS to find the “nearest neighbors” (closest vectors) to a movie you like.

Step 1: Installation

pip install datasets transformers sentence-transformers faiss-cpu
# 'faiss-cpu' is Facebook's fast similarity search library

Step 2: Prepare the Data

We’ll load a movie dataset, get the embeddings for all descriptions, and add them to a FAISS “index.”

from datasets import load_dataset
from sentence_transformers import SentenceTransformer

# 1. Load a small movie dataset
ds = load_dataset("all-movies-from-1990s-TMDb", split='train').select(range(1000))

# 2. Load an embedding model
model = SentenceTransformer('all-MiniLM-L6-v2')

# 3. Create embeddings for all movie overviews (This takes time!)
print("Creating embeddings...")
ds = ds.map(lambda x: {
    "embedding": model.encode(x["overview"])
})

# 4. Add embeddings to a FAISS index for fast search
ds.add_faiss_index(column="embedding")
print("FAISS index created!")

Step 3: Make a Recommendation!

Now, let’s pick a movie and find the 5 most similar movies.

# Let's find movies similar to 'Pulp Fiction' (ID=680)
query_movie = ds[20] # (Index 20 is Pulp Fiction in this dataset slice)
query_embedding = query_movie["embedding"]

# 5. Search the index
# It finds the 5 closest embeddings to our query
scores, similar_movies = ds.get_nearest_examples("embedding", query_embedding, k=5)

# 6. Print results
print(f"--- Movies similar to: {query_movie['title']} ---")
for movie in similar_movies['title']:
    print(movie)

Output:

--- Movies similar to: Pulp Fiction ---
Pulp Fiction
Reservoir Dogs
Four Rooms
Natural Born Killers
From Dusk Till Dawn

The AI has “understood” the vibe of Pulp Fiction and found other 90s crime films by Quentin Tarantino.

Key Takeaways

Recommender systems power platforms like Netflix, Amazon, and Spotify by suggesting similar items.
Content-based filtering identifies items that are most similar to those users already like.
To build a Hugging Face Recommender System, load a dataset, convert descriptions into Text Embeddings, and use FAISS for finding nearest neighbours.
The process involves installation, data preparation, and then making movie recommendations based on user preferences.

Ahmed Nabil

Python Engineer and the founder of Python Pro Hub. With a focus on modern data science (Polars), backend architecture (FastAPI/Django), and automation, builds production-grade tutorials designed to take developers from absolute beginners to advanced software engineers.

Python Projects
Beginner Project: Build a Weather App with Python (Using APIs)
ByAhmed Nabil January 21, 2026March 17, 2026
Real-world applications don’t just use data you type in; they fetch data from the internet. A Python Weather App, for instance, would do this using…
Read More Beginner Project: Build a Weather App with Python (Using APIs)
Data Science
Your First Machine Learning Model: Linear Regression with Scikit-Learn
ByAhmed Nabil January 26, 2026March 17, 2026
Machine Learning (ML) often sounds like magic, but at its core, it is just math. It is about finding patterns in data and using them…
Read More Your First Machine Learning Model: Linear Regression with Scikit-Learn
Data Science | Web Development
How to Host Your AI App for Free: Deploying Gradio to Hugging Face Spaces
ByAhmed Nabil July 1, 2026May 17, 2026
We built a Gradio app to demo our AI models. But it only ran on your local computer (localhost) so How do you show it…
Read More How to Host Your AI App for Free: Deploying Gradio to Hugging Face Spaces
Data Science | Python Projects
AI Project: Zero-Shot Audio Classification (Hugging Face)
ByAhmed Nabil May 13, 2026April 22, 2026
This is one of the most incredible “2026 Vision” projects. You’ve used Zero-Shot for text, but what about sound? Zero-Shot Audio Classification opens up fascinating…
Read More AI Project: Zero-Shot Audio Classification (Hugging Face)
Automation | Python Projects
Automate Your Desktop: Create a Custom Wallpaper Changer with Python
ByAhmed Nabil February 6, 2026March 18, 2026
Why manually change your wallpaper when Python can do it for you? In this project, we’ll write a script that picks a random image from…
Read More Automate Your Desktop: Create a Custom Wallpaper Changer with Python
Data Science | Python Projects
Polars Project: A-to-Z Data Cleaning (The 2026 Guide)
ByAhmed Nabil June 1, 2026May 1, 2026
You’ve learned all the individual Polars methods. Now, let’s put them together in one “A-to-Z” project to clean a messy dataset and look at effective…
Read More Polars Project: A-to-Z Data Cleaning (The 2026 Guide)

Step 1: Installation

Step 2: Prepare the Data

Step 3: Make a Recommendation!

Key Takeaways

Similar Posts

Leave a Reply Cancel reply