AI Dictionary

Plain-English definitions for agents, context, tools, RAG, evals, safety, models, and AI-native product language.

Terms

94

Categories

8

Research date

May 2026

Showing 24 of 94 matches

AI terms

Agents & Autonomy: 21
Context & Memory: 9
Tools & Protocols: 18
Retrieval & Grounding: 9
AA

AI Agent

Also called Agent, LLM agent

Agents & Autonomy
Established

A system that uses a model, tools, context, and control logic to pursue a goal and take actions.

More context>

Plain English

A chatbot answers. An agent can decide what to do next, use tools, check results, and keep working toward an outcome.

Example

A coding agent reads an issue, edits files, runs tests, and opens a pull request.

Why it matters

It is the umbrella term behind most new AI product claims, but the actual autonomy can range from a scripted workflow to a long-running worker.

Related

Agentic AITool UseLong-Horizon Task
AA

Agentic AI

Also called Agentic system, Compound AI system

Agents & Autonomy
Established

AI that plans, uses tools, receives feedback from the environment, and can act with partial autonomy.

More context>

Plain English

The product is no longer only generating content; it is participating in a workflow.

Example

A support agent classifies a ticket, fetches account data, drafts a refund, and asks a human to approve the action.

Why it matters

Agentic systems change product, security, and UX design because the software can affect external systems.

Related

AI AgentHuman-in-the-LoopGuardrails
AH

Agent Harness

Also called Harness, Agent runtime

Agents & Autonomy
Rising

The runtime around a model that supplies tools, state, prompts, permissions, logs, and execution loops.

More context>

Plain English

The model is the brain, but the harness is the workbench, clipboard, tool belt, and supervisor.

Example

A coding harness gives an agent shell access, file editing, browser QA, progress logs, and rules for ending each session cleanly.

Why it matters

Many agent improvements now come from better harness design rather than only stronger models.

Related

Agent LoopLong-Running AgentCompaction
AL

Agent Loop

Also called Run loop, Think-act-observe loop

Agents & Autonomy
Established

The repeated cycle where an agent reasons, calls a tool, observes the result, and decides the next step.

More context>

Plain English

It is the heartbeat of an agent: decide, act, look at what happened, repeat.

Example

Search the docs, read a result, update the plan, run a command, inspect the error, and try a fix.

Why it matters

Loops make AI useful for open-ended work, but they also create compounding cost and error risk.

Related

Tool UseStopping ConditionTrace
LH

Long-Horizon Task

Also called Long-running task, Extended task

Agents & Autonomy
Rising

A goal that requires many dependent steps, decisions, tool calls, and course corrections before completion.

More context>

Plain English

It is work that cannot be solved in one neat prompt, such as migrating an app or researching a market.

Example

An agent spends hours modernizing a codebase, running tests, fixing regressions, and documenting the result.

Why it matters

Long-horizon ability is becoming a major benchmark for whether agents can handle valuable real-world work.

Related

Agent HarnessLong-Running AgentCompaction
LR

Long-Running Agent

Also called Persistent agent, Background agent

Agents & Autonomy
Emerging

An agent designed to keep making progress over many sessions, context windows, or wall-clock hours.

More context>

Plain English

It can leave notes for its future self, resume after interruption, and avoid starting from scratch.

Example

A research agent checks sources every morning, updates findings, and keeps a running evidence log.

Why it matters

The next useful agents are less like chat tabs and more like durable workers with memory and accountability.

Related

Agent HarnessMemoryCheckpoint
AS

Agent Swarm

Also called Swarm, Agent fleet

Agents & Autonomy
Emerging

A group of agents that coordinate or compete to solve pieces of a larger goal.

More context>

Plain English

Instead of one general agent, several specialists work in parallel or hand work between themselves.

Example

One agent researches vendors, another estimates cost, and a third audits the final recommendation.

Why it matters

Swarms promise speed and specialization, but they need orchestration, shared state, and safety boundaries.

Related

Multi-Agent SystemSupervisor AgentA2A
GM

Goal Mode

Also called /goal, Goal command, Persistent goal

Agents & Autonomy
Watch

An emerging coding-agent workflow where a persistent objective stays active across progress checks, continuation, pause, and resume.

More context>

Plain English

Instead of asking once, you pin the outcome the agent should keep pursuing until it finishes or hits a stop rule.

Example

A developer starts `/goal reduce bundle size by 20% and verify with build output`, then lets the coding agent continue until it can prove the result.

Why it matters

Goal-style commands turn agent work from a chat request into a bounded task contract with acceptance criteria.

Related

Long-Running AgentCheckpointCoding Agent
DE

Durable Execution

Also called Resumable workflow, Persistent execution

Agents & Autonomy
Rising

Saving workflow progress so an agent or graph can pause, fail, or wait for a person and later resume without redoing completed steps.

More context>

Plain English

The agent gets a reliable save system for long work, not just a long chat history.

Example

A procurement agent pauses for budget approval, stores its state, and resumes from the same task after approval arrives days later.

Why it matters

Durability is becoming essential for production agents that run beyond a single request-response cycle.

Related

CheckpointInterruptLong-Running Agent
I

Interrupt

Also called Pause point, Mid-task approval

Agents & Autonomy
Rising

A deliberate pause inside an agent workflow that waits for external input before continuing.

More context>

Plain English

The agent stops at the important moment, asks for the missing decision, then continues with the answer.

Example

A refund agent pauses before issuing credit and asks a manager to approve the amount.

Why it matters

Interrupts make human guidance possible during the work, not only after the agent has already acted.

Related

Human-in-the-LoopDurable ExecutionApproval Gate
MA

Multi-Agent System

Also called MAS, Agent team

Agents & Autonomy
Rising

An architecture where multiple specialized agents collaborate through messages, tools, or a shared controller.

More context>

Plain English

A team of narrow agents can be easier to steer than one giant all-purpose agent.

Example

A planning agent delegates to a browser agent, a coding agent, and a QA agent.

Why it matters

Multi-agent patterns are becoming common when context grows too large or expertise needs to be separated.

Related

Agent SwarmHandoffSupervisor Agent
SA

Supervisor Agent

Also called Manager agent, Orchestrator agent

Agents & Autonomy
Rising

A coordinating agent that decomposes work, invokes specialists, and combines their outputs.

More context>

Plain English

It is the project manager of a multi-agent setup.

Example

A supervisor decides whether the legal, billing, or technical sub-agent should handle a customer question.

Why it matters

Supervisor patterns make complex systems debuggable by keeping a clear owner for routing and synthesis.

Related

Orchestrator-WorkersSub-AgentHandoff
SA

Sub-Agent

Also called Worker agent, Specialist agent

Agents & Autonomy
Rising

A focused agent invoked by another agent or workflow to handle a narrower task.

More context>

Plain English

A specialist you call when the main agent should not carry every detail in its own context.

Example

A QA sub-agent opens the app in a browser and reports visual or functional regressions.

Why it matters

Sub-agents reduce context clutter and make ownership clearer in larger agent systems.

Related

Supervisor AgentA2ATool Use
AA

Agent-as-Tool

Also called Agent tool, Callable agent

Agents & Autonomy
Emerging

A pattern where one agent exposes a specialist agent as a callable tool instead of handing over the whole conversation.

More context>

Plain English

The main agent can ask a specialist for one job and keep ownership of the final answer.

Example

A travel planner calls a visa-checking agent as a tool, receives the result, and continues planning the trip.

Why it matters

It offers specialization without losing the thread of who owns the user-facing response.

Related

Sub-AgentTool UseSupervisor Agent
AT

Agent Team

Also called Teammates, Peer agents

Agents & Autonomy
Emerging

Multiple agent sessions coordinated around shared work, often with peer messaging, task assignment, or separate context.

More context>

Plain English

It is less one helper and more a small crew of AI workers, each with its own workspace.

Example

A product agent drafts requirements while a coding agent implements and a review agent checks risks.

Why it matters

Agent-team vocabulary is replacing vague swarm language when systems need clearer roles and accountability.

Related

Agent SwarmMulti-Agent SystemSub-Agent
H

Handoff

Also called Transfer, Agent transfer

Agents & Autonomy
Rising

A controlled transfer of a task or conversation from one agent to another.

More context>

Plain English

The AI equivalent of routing a case from support to billing, with context carried along.

Example

A triage agent hands a refund request to a billing specialist after collecting the order ID.

Why it matters

Good handoffs prevent lost context, repeated questions, and specialists acting without the right permissions.

Related

Sub-AgentA2AContext
OW

Orchestrator-Workers

Also called Manager-worker workflow, Delegation workflow

Agents & Autonomy
Established

A pattern where a central model dynamically breaks work into subtasks and delegates them to worker models.

More context>

Plain English

One agent figures out the plan; other agents do the pieces.

Example

A codebase migration agent identifies affected packages and assigns each package to a worker.

Why it matters

This pattern handles tasks where the steps are not known ahead of time.

Related

Supervisor AgentParallelizationAgent Swarm
EO

Evaluator-Optimizer

Also called Critic loop, Review-and-revise loop

Agents & Autonomy
Established

A workflow where one model produces work and another model evaluates it, creating an improvement loop.

More context>

Plain English

One AI writes; another AI reviews; the first AI revises.

Example

A drafting agent writes a policy summary, then an evaluator checks completeness and flags unsupported claims.

Why it matters

It can raise quality when the success criteria are clear enough to evaluate repeatedly.

Related

EvalsSelf-ReflectionJudge Model
HI

Human-in-the-Loop

Also called HITL, Human approval

Agents & Autonomy
Established

A design where a person reviews, approves, or edits model output before a consequential action happens.

More context>

Plain English

The AI can prepare the action, but a human still signs off.

Example

An agent drafts a customer refund, but a manager approves it before money moves.

Why it matters

It is one of the simplest ways to limit harm while still getting automation benefits.

Related

Human-on-the-LoopGuardrailsApproval Gate
HO

Human-on-the-Loop

Also called Oversight, Supervisory control

Agents & Autonomy
Rising

A design where humans monitor autonomous systems and intervene when risk, uncertainty, or policy requires it.

More context>

Plain English

The system can act, but people can see what it is doing and take the wheel.

Example

A security operations agent remediates low-risk alerts automatically and escalates unusual cases.

Why it matters

As agents run for longer, oversight has to be continuous rather than one approval at the end.

Related

Human-in-the-LoopTraceAutonomy Level
AL

Autonomy Level

Also called Autonomy tier, Agency level

Agents & Autonomy
Emerging

A way to describe how much freedom an agent has to decide, act, spend money, or change systems.

More context>

Plain English

Not every agent deserves the same keys.

Example

Level 1 drafts recommendations, level 2 acts after approval, and level 3 acts within a budgeted sandbox.

Why it matters

Clear autonomy levels help teams align product promises, permissions, risk, and user trust.

Related

GuardrailsApproval GateTool Permissions
CW

Context Window

Also called Context length, Token window

Context & Memory
Established

The amount of text, images, tool results, or other tokens a model can consider in one request.

More context>

Plain English

It is the model's short-term workspace.

Example

A long-context model can inspect a whole repository or a large batch of documents before answering.

Why it matters

Large context helps, but long tasks still need retrieval, memory, and summarization discipline.

Related

TokenCompactionContext Engineering
CE

Context Engineering

Also called Context design, Context assembly

Context & Memory
Rising

The practice of selecting, ordering, compressing, and updating the information a model sees.

More context>

Plain English

Prompt engineering asks the model well. Context engineering gives it the right room to think in.

Example

A coding tool includes the active file, related tests, recent errors, project rules, and only the relevant docs.

Why it matters

For agents, context quality often matters more than clever wording.

Related

Context WindowRetrievalCompaction
M

Memory

Also called Agent memory, Long-term memory

Context & Memory
Rising

Information stored outside the immediate prompt so an AI system can recall preferences, facts, or progress later.

More context>

Plain English

Memory is what lets the system remember beyond one chat or one run.

Example

An agent stores that the user prefers concise summaries and that the project uses Bun, not npm.

Why it matters

Memory improves continuity, but it also creates new risks such as stale, sensitive, or poisoned context.

Related

Episodic MemorySemantic MemoryMemory Poisoning

How this dictionary is maintained

The source set is documented in docs/ai-terminology-research.md. The page favors terms that are useful across AI product, engineering, and safety conversations, with maturity labels to separate settled vocabulary from phrases still in motion.

Agents & Autonomy

agent patterns, delegation, and long-running work

Context & Memory

what the model sees, stores, and resumes from

Tools & Protocols

how models call software and connect to systems

Retrieval & Grounding

search, evidence, citations, and trusted context

Reasoning & Inference

how systems spend effort at answer time

Evaluation & Safety

testing, security, observability, and control

Models & Training

model families, customization, and compression

AI-Native Product

new product language around AI-powered workflows