AI Project: How to Deploy a Hugging Face Model as a REST API (with Flask)

ByAhmed Nabil April 11, 2026March 22, 2026

3D isometric illustration of a Hugging Face robot on a Flask server rack handing data cubes to client devices, representing API deployment.

You’ve built amazing AI models with Hugging Face, but they’re stuck in your script. Want to know how to deploy Hugging Face API so other applications (like a website or a mobile app) can use them?

wrap them in an API. We’ll use Flask to create a simple web server that runs your AI model.

Step 1: Install Libraries

pip install flask transformers torch

Step 2: The Flask Server (`app.py`)

This script will:

Load the AI model (only once, when the server starts).
Create a Flask “route” (a URL) that can accept POST requests.
Run the model on the data sent to it and return the result as JSON.

from flask import Flask, request, jsonify
from transformers import pipeline

# 1. Initialize the Flask app
app = Flask(__name__)

# 2. Load the AI model ONCE at startup
# We'll use the sentiment analyzer
print("Loading AI model...")
classifier = pipeline("sentiment-analysis")
print("Model loaded!")

# 3. Define the API endpoint
@app.route("/analyze", methods=['POST'])
def analyze_text():
    # 4. Get the JSON data from the request
    data = request.json
    if not data or 'text' not in data:
        return jsonify({"error": "Missing 'text' key"}), 400
    
    text_to_analyze = data['text']
    
    # 5. Run the model and return the result
    result = classifier(text_to_analyze)
    return jsonify(result)

# 6. Run the app
if __name__ == "__main__":
    app.run(debug=True, port=5000)

Step 3: Run It and Test It

Run your script: python app.py
Your server is now running at http://127.0.0.1:5000.
You can’t test this in a browser (it’s a POST request). Use a tool like Insomnia/Postman or another Python script to send it data!

You now have a real, working AI microservice.

Key Takeaways

To deploy your Hugging Face AI models, wrap them in an API using Flask.
First, install the necessary libraries for your project.
Create a Flask server with a route that accepts POST requests and runs the AI model.
After running the server with python app.py, test the API using tools like Insomnia or Postman.
You will successfully create a working AI microservice.

Ahmed Nabil

Python Engineer and the founder of Python Pro Hub. With a focus on modern data science (Polars), backend architecture (FastAPI/Django), and automation, builds production-grade tutorials designed to take developers from absolute beginners to advanced software engineers.

Data Science
Cleaning Text in Polars: The .str Expression Namespace
ByAhmed Nabil April 10, 2026March 22, 2026
Text data is almost always messy. One of the most efficient ways to tackle this is with Polars string manipulation. In Pandas, you use .str…
Read More Cleaning Text in Polars: The .str Expression Namespace
Data Science
Time-Series in Polars: Filling Gaps with upsample and interpolate
ByAhmed Nabil May 11, 2026April 22, 2026
Real-world data is often “sparse.” You might have sales data for Monday and Friday, but nothing for Tuesday, Wednesday, or Thursday. This is where polars…
Read More Time-Series in Polars: Filling Gaps with upsample and interpolate
Data Science
Getting Data from the Web: Using APIs for Data Science (JSON to Pandas)
ByAhmed Nabil February 2, 2026June 13, 2026
In our Pandas Guide, we loaded data from CSV files. But modern data often lives on the web, accessible via APIs. For data scientists, understanding…
Read More Getting Data from the Web: Using APIs for Data Science (JSON to Pandas)
Web Development
Django Forms: How to Automatically Set the Author on Save
ByAhmed Nabil April 24, 2026April 14, 2026
You’ve secured your views. Now you have a new problem. When a user creates a new post, how do you save who that user was?…
Read More Django Forms: How to Automatically Set the Author on Save
Data Science
Working with JSON in Polars: The .json Namespace (2026 Guide)
ByAhmed Nabil April 22, 2026April 14, 2026
It’s very common to have a column in your data that contains a JSON string. In Pandas, this is slow and difficult to work with….
Read More Working with JSON in Polars: The .json Namespace (2026 Guide)
Data Science
Interactive Polars: Plotting with hvplot (2026 Guide)
ByAhmed Nabil May 30, 2026April 25, 2026
Matplotlib and Seaborn create static, non-interactive images. In 2026, data exploration is interactive. hvplot is a library that provides a .hvplot() method for Pandas and…
Read More Interactive Polars: Plotting with hvplot (2026 Guide)

Step 1: Install Libraries

Step 2: The Flask Server (app.py)

Step 3: Run It and Test It

Key Takeaways

Similar Posts

Leave a Reply Cancel reply

Step 2: The Flask Server (`app.py`)