Mixture of Experts (MoE)

An architecture that routes each part of a query to the specialized 'expert' sub-networks best suited to handle it.

What it means

Mixture of Experts is like having a team of specialized doctors rather than one general practitioner. When you ask a question, a 'router' sends your query to the specific parts of the model (experts) best suited to handle it.
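For readers who want to see the mechanics, here is a minimal, illustrative sketch of an MoE layer in PyTorch: a small linear 'router' scores the experts for each token, and only the top-scoring experts are actually run. The class names, sizes, and the top-2 choice are assumptions for illustration, not details of any particular model.

```python
# A minimal sketch of a Mixture-of-Experts layer with top-2 routing.
# All names and sizes are illustrative, not taken from any specific model.
import torch
import torch.nn as nn
import torch.nn.functional as F


class MoELayer(nn.Module):
    def __init__(self, dim, num_experts=8, top_k=2):
        super().__init__()
        self.top_k = top_k
        # The "router": a linear layer that scores every expert for each token.
        self.router = nn.Linear(dim, num_experts)
        # The "experts": independent feed-forward networks.
        self.experts = nn.ModuleList(
            [nn.Sequential(nn.Linear(dim, 4 * dim), nn.ReLU(), nn.Linear(4 * dim, dim))
             for _ in range(num_experts)]
        )

    def forward(self, x):                                  # x: (num_tokens, dim)
        scores = self.router(x)                            # (num_tokens, num_experts)
        weights, chosen = scores.topk(self.top_k, dim=-1)  # keep only the best experts per token
        weights = F.softmax(weights, dim=-1)               # normalise the chosen experts' weights
        out = torch.zeros_like(x)
        for slot in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = chosen[:, slot] == e                # tokens routed to expert e in this slot
                if mask.any():
                    out[mask] += weights[mask, slot, None] * expert(x[mask])
        return out


tokens = torch.randn(16, 64)      # 16 tokens, hidden size 64
layer = MoELayer(dim=64)
print(layer(tokens).shape)        # torch.Size([16, 64])
```

Each token only passes through the experts the router picked for it; the remaining experts stay idle for that token, which is where the efficiency comes from.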

Why it matters

It makes massive models (GPT-4 is widely reported to use this design) faster and cheaper to run, because only a small fraction of the total parameters needs to be activated for any given request.
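A rough, made-up illustration of the saving: with 8 experts per layer and top-2 routing, only a quarter of the expert parameters do any work for a given token.

```python
# Back-of-the-envelope illustration; all numbers are hypothetical.
num_experts = 8                 # experts in each MoE layer
active_per_token = 2            # experts the router actually runs (top-2 routing)
expert_params = 1_000_000_000   # parameters per expert

total_params = num_experts * expert_params
active_params = active_per_token * expert_params
print(f"Active fraction of expert parameters: {active_params / total_params:.0%}")  # 25%
```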