Context window

The maximum number of tokens an LLM can consider at once (input + output).

What it is

The context window is the LLM's working memory for a single request: the system prompt, user messages, tool results, and whatever the model generates in response.
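
To get a rough sense of how much of the window a request consumes, you can count tokens before sending. A minimal sketch in Python, assuming the tiktoken library; cl100k_base is one of its built-in encodings, and the right encoding depends on the model you actually call:

    import tiktoken

    # Token counts are model-specific; cl100k_base is used here
    # purely for illustration.
    enc = tiktoken.get_encoding("cl100k_base")

    prompt = "You are a helpful assistant.\n\nSummarize the attached report."
    print(f"Prompt uses {len(enc.encode(prompt))} tokens")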

Why it matters

  • It limits how much text you can send in one go; input and output share the same window (see the budget check after this list).
  • Longer contexts can increase cost and latency, since most providers price per token and processing time grows with sequence length.
  • Not everything in the context gets equal attention: models often recall the start and end of a long prompt more reliably than the middle.
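
Because input and output share one window, a pre-flight check has to budget for both. A minimal sketch, assuming a hypothetical 8,192-token window and a fixed reservation for the reply:

    CONTEXT_WINDOW = 8_192   # assumed limit; check your model's documentation
    RESERVED_OUTPUT = 1_024  # tokens set aside for the model's reply

    def fits(prompt_tokens: int) -> bool:
        # Input and output share one window, so both must fit together.
        return prompt_tokens + RESERVED_OUTPUT <= CONTEXT_WINDOW

    print(fits(7_000))  # True:  7,000 + 1,024 <= 8,192
    print(fits(7_500))  # False: 7,500 + 1,024 >  8,192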

Practical tips

  • Summarize and compress old context so long-running conversations stay within budget (a trimming sketch follows this list).
  • Retrieve only the most relevant chunks (RAG) instead of dumping everything into the prompt.
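
One common way to compress history is to keep the system message, drop the oldest turns first, and retain the most recent ones. A minimal sketch, assuming a rough 4-characters-per-token estimate and a made-up 6,000-token budget; real systems would summarize dropped turns rather than discard them:

    from typing import TypedDict

    class Message(TypedDict):
        role: str
        content: str

    def estimate_tokens(text: str) -> int:
        # Rough heuristic: about 4 characters per token for English text.
        return max(1, len(text) // 4)

    def trim_history(messages: list[Message], budget: int = 6_000) -> list[Message]:
        system, turns = messages[0], messages[1:]
        total = estimate_tokens(system["content"]) + sum(
            estimate_tokens(m["content"]) for m in turns
        )
        # Drop the oldest non-system turns first; recent context is
        # usually the most relevant to the next reply.
        while turns and total > budget:
            dropped = turns.pop(0)
            total -= estimate_tokens(dropped["content"])
        return [system] + turns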