AI-PEDIA

AI Inference

Running a trained model to get an output for a real request.

Ops BasicsModel serving

What it is

AI inference is the moment a live model takes input, processes it, and returns a prediction or generation.

A few adjacent definitions to lock in the concept.

Measuring how an AI system performs so you can improve quality, cost, and safety.

The process of serving a trained model so real users and systems can call it.

The learned function that maps inputs to predictions, decisions, or generated content.