Glossary entry

Inference

The process of running a trained AI model to produce an output from a prompt, query, or input signal.

Why it matters

Inference matters because most enterprise AI cost is incurred not when a model is trained, but when it is repeatedly used in production at scale.

In many enterprises, the economics of AI are really the economics of inference: how often models are called, how efficiently they run, and what those calls enable in real workflows.

Explore next

Continue exploring

Follow the threads that connect AI cost, value, governance, and operating discipline.

Glossary index

Browse the full alphabetized library of AI economics terms.

AI TCO Framework

See how cost structure affects the meaning of the terms on this page.

FinOps & AI

Connect vocabulary to the operating practices shaping AI cost control.