AI Dictionary

Plain-English definitions for agents, context, tools, RAG, evals, safety, models, and AI-native product language.

Terms

AI terms

Agents & Autonomy: 21

Context & Memory: 9

Tools & Protocols: 18

Retrieval & Grounding: 9

AI Agent

Also called Agent, LLM agent

Agents & Autonomy

Established

A system that uses a model, tools, context, and control logic to pursue a goal and take actions.

More context>

Plain English

A chatbot answers. An agent can decide what to do next, use tools, check results, and keep working toward an outcome.

Example

A coding agent reads an issue, edits files, runs tests, and opens a pull request.

Why it matters

It is the umbrella term behind most new AI product claims, but the actual autonomy can range from a scripted workflow to a long-running worker.

Agentic AITool UseLong-Horizon Task

Sources

Anthropic: Building Effective Agents Stanford HAI: 2026 AI Index

Agentic AI

Also called Agentic system, Compound AI system

Agents & Autonomy

Established

AI that plans, uses tools, receives feedback from the environment, and can act with partial autonomy.

More context>

Plain English

The product is no longer only generating content; it is participating in a workflow.

Example

A support agent classifies a ticket, fetches account data, drafts a refund, and asks a human to approve the action.

Why it matters

Agentic systems change product, security, and UX design because the software can affect external systems.

AI AgentHuman-in-the-LoopGuardrails

Sources

Anthropic: Building Effective Agents OWASP Agentic AI Top 10

Agent Harness

Also called Harness, Agent runtime

Agents & Autonomy

Rising

The runtime around a model that supplies tools, state, prompts, permissions, logs, and execution loops.

More context>

Plain English

The model is the brain, but the harness is the workbench, clipboard, tool belt, and supervisor.

Example

A coding harness gives an agent shell access, file editing, browser QA, progress logs, and rules for ending each session cleanly.

Why it matters

Many agent improvements now come from better harness design rather than only stronger models.

Agent LoopLong-Running AgentCompaction

Sources

Anthropic: Effective Harnesses Anthropic: Building Effective Agents

Agent Loop

Also called Run loop, Think-act-observe loop

Agents & Autonomy

Established

The repeated cycle where an agent reasons, calls a tool, observes the result, and decides the next step.

More context>

Plain English

It is the heartbeat of an agent: decide, act, look at what happened, repeat.

Example

Search the docs, read a result, update the plan, run a command, inspect the error, and try a fix.

Why it matters

Loops make AI useful for open-ended work, but they also create compounding cost and error risk.

Tool UseStopping ConditionTrace

Sources

Anthropic: Building Effective Agents OpenAI: Function Calling

Long-Horizon Task

Also called Long-running task, Extended task

Agents & Autonomy

Rising

A goal that requires many dependent steps, decisions, tool calls, and course corrections before completion.

More context>

Plain English

It is work that cannot be solved in one neat prompt, such as migrating an app or researching a market.

Example

An agent spends hours modernizing a codebase, running tests, fixing regressions, and documenting the result.

Why it matters

Long-horizon ability is becoming a major benchmark for whether agents can handle valuable real-world work.

Agent HarnessLong-Running AgentCompaction

Sources

Anthropic: Effective Harnesses Stanford HAI: 2026 AI Index

Long-Running Agent

Also called Persistent agent, Background agent

Agents & Autonomy

Emerging

An agent designed to keep making progress over many sessions, context windows, or wall-clock hours.

More context>

Plain English

It can leave notes for its future self, resume after interruption, and avoid starting from scratch.

Example

A research agent checks sources every morning, updates findings, and keeps a running evidence log.

Why it matters

The next useful agents are less like chat tabs and more like durable workers with memory and accountability.

Agent HarnessMemoryCheckpoint

Sources

Anthropic: Effective Harnesses LangChain: Long-Term Memory

Agent Swarm

Also called Swarm, Agent fleet

Agents & Autonomy

Emerging

A group of agents that coordinate or compete to solve pieces of a larger goal.

More context>

Plain English

Instead of one general agent, several specialists work in parallel or hand work between themselves.

Example

One agent researches vendors, another estimates cost, and a third audits the final recommendation.

Why it matters

Swarms promise speed and specialization, but they need orchestration, shared state, and safety boundaries.

Multi-Agent SystemSupervisor AgentA2A

Sources

AutoGen: Multi-Agent Chat LangChain: Multi-Agent

Goal Mode

Also called /goal, Goal command, Persistent goal

Agents & Autonomy

Watch

An emerging coding-agent workflow where a persistent objective stays active across progress checks, continuation, pause, and resume.

More context>

Plain English

Instead of asking once, you pin the outcome the agent should keep pursuing until it finishes or hits a stop rule.

Example

A developer starts `/goal reduce bundle size by 20% and verify with build output`, then lets the coding agent continue until it can prove the result.

Why it matters

Goal-style commands turn agent work from a chat request into a bounded task contract with acceptance criteria.

Long-Running AgentCheckpointCoding Agent

Sources

Community: Codex /goal Guide Codex CLI: Slash Commands

Durable Execution

Also called Resumable workflow, Persistent execution

Agents & Autonomy

Rising

Saving workflow progress so an agent or graph can pause, fail, or wait for a person and later resume without redoing completed steps.

More context>

Plain English

The agent gets a reliable save system for long work, not just a long chat history.

Example

A procurement agent pauses for budget approval, stores its state, and resumes from the same task after approval arrives days later.

Why it matters

Durability is becoming essential for production agents that run beyond a single request-response cycle.

CheckpointInterruptLong-Running Agent

Sources

LangGraph: Durable Execution Anthropic: Effective Harnesses

Interrupt

Also called Pause point, Mid-task approval

Agents & Autonomy

Rising

A deliberate pause inside an agent workflow that waits for external input before continuing.

More context>

Plain English

The agent stops at the important moment, asks for the missing decision, then continues with the answer.

Example

A refund agent pauses before issuing credit and asks a manager to approve the amount.

Why it matters

Interrupts make human guidance possible during the work, not only after the agent has already acted.

Human-in-the-LoopDurable ExecutionApproval Gate

Sources

LangGraph: Interrupts OpenAI: Agents SDK

Multi-Agent System

Also called MAS, Agent team

Agents & Autonomy

Rising

An architecture where multiple specialized agents collaborate through messages, tools, or a shared controller.

More context>

Plain English

A team of narrow agents can be easier to steer than one giant all-purpose agent.

Example

A planning agent delegates to a browser agent, a coding agent, and a QA agent.

Why it matters

Multi-agent patterns are becoming common when context grows too large or expertise needs to be separated.

Agent SwarmHandoffSupervisor Agent

Sources

LangChain: Multi-Agent AutoGen: Multi-Agent Chat Google ADK: A2A

Supervisor Agent

Also called Manager agent, Orchestrator agent

Agents & Autonomy

Rising

A coordinating agent that decomposes work, invokes specialists, and combines their outputs.

More context>

Plain English

It is the project manager of a multi-agent setup.

Example

A supervisor decides whether the legal, billing, or technical sub-agent should handle a customer question.

Why it matters

Supervisor patterns make complex systems debuggable by keeping a clear owner for routing and synthesis.

Orchestrator-WorkersSub-AgentHandoff

Sources

Anthropic: Building Effective Agents LangChain: Multi-Agent

Sub-Agent

Also called Worker agent, Specialist agent

Agents & Autonomy

Rising

A focused agent invoked by another agent or workflow to handle a narrower task.

More context>

Plain English

A specialist you call when the main agent should not carry every detail in its own context.

Example

A QA sub-agent opens the app in a browser and reports visual or functional regressions.

Why it matters

Sub-agents reduce context clutter and make ownership clearer in larger agent systems.

Supervisor AgentA2ATool Use

Sources

Google ADK: A2A LangChain: Multi-Agent

Agent-as-Tool

Also called Agent tool, Callable agent

Agents & Autonomy

Emerging

A pattern where one agent exposes a specialist agent as a callable tool instead of handing over the whole conversation.

More context>

Plain English

The main agent can ask a specialist for one job and keep ownership of the final answer.

Example

A travel planner calls a visa-checking agent as a tool, receives the result, and continues planning the trip.

Why it matters

It offers specialization without losing the thread of who owns the user-facing response.

Sub-AgentTool UseSupervisor Agent

Sources

OpenAI Agents SDK: Guardrails LangChain: Multi-Agent

Agent Team

Also called Teammates, Peer agents

Agents & Autonomy

Emerging

Multiple agent sessions coordinated around shared work, often with peer messaging, task assignment, or separate context.

More context>

Plain English

It is less one helper and more a small crew of AI workers, each with its own workspace.

Example

A product agent drafts requirements while a coding agent implements and a review agent checks risks.

Why it matters

Agent-team vocabulary is replacing vague swarm language when systems need clearer roles and accountability.

Agent SwarmMulti-Agent SystemSub-Agent

Sources

Claude Code: Extensions LangChain: Multi-Agent

Handoff

Also called Transfer, Agent transfer

Agents & Autonomy

Rising

A controlled transfer of a task or conversation from one agent to another.

More context>

Plain English

The AI equivalent of routing a case from support to billing, with context carried along.

Example

A triage agent hands a refund request to a billing specialist after collecting the order ID.

Why it matters

Good handoffs prevent lost context, repeated questions, and specialists acting without the right permissions.

Sub-AgentA2AContext

Sources

LangChain: Multi-Agent OpenAI: Function Calling

Orchestrator-Workers

Also called Manager-worker workflow, Delegation workflow

Agents & Autonomy

Established

A pattern where a central model dynamically breaks work into subtasks and delegates them to worker models.

More context>

Plain English

One agent figures out the plan; other agents do the pieces.

Example

A codebase migration agent identifies affected packages and assigns each package to a worker.

Why it matters

This pattern handles tasks where the steps are not known ahead of time.

Supervisor AgentParallelizationAgent Swarm

Sources

Anthropic: Building Effective Agents

Evaluator-Optimizer

Also called Critic loop, Review-and-revise loop

Agents & Autonomy

Established

A workflow where one model produces work and another model evaluates it, creating an improvement loop.

More context>

Plain English

One AI writes; another AI reviews; the first AI revises.

Example

A drafting agent writes a policy summary, then an evaluator checks completeness and flags unsupported claims.

Why it matters

It can raise quality when the success criteria are clear enough to evaluate repeatedly.

EvalsSelf-ReflectionJudge Model

Sources

Anthropic: Building Effective Agents

Human-in-the-Loop

Also called HITL, Human approval

Agents & Autonomy

Established

A design where a person reviews, approves, or edits model output before a consequential action happens.

More context>

Plain English

The AI can prepare the action, but a human still signs off.

Example

An agent drafts a customer refund, but a manager approves it before money moves.

Why it matters

It is one of the simplest ways to limit harm while still getting automation benefits.

Human-on-the-LoopGuardrailsApproval Gate

Sources

Anthropic: Building Effective Agents OWASP Agentic AI Top 10

Human-on-the-Loop

Also called Oversight, Supervisory control

Agents & Autonomy

Rising

A design where humans monitor autonomous systems and intervene when risk, uncertainty, or policy requires it.

More context>

Plain English

The system can act, but people can see what it is doing and take the wheel.

Example

A security operations agent remediates low-risk alerts automatically and escalates unusual cases.

Why it matters

As agents run for longer, oversight has to be continuous rather than one approval at the end.

Human-in-the-LoopTraceAutonomy Level

Sources

OWASP Agentic AI Top 10 Anthropic: Building Effective Agents

Autonomy Level

Also called Autonomy tier, Agency level

Agents & Autonomy

Emerging

A way to describe how much freedom an agent has to decide, act, spend money, or change systems.

More context>

Plain English

Not every agent deserves the same keys.

Example

Level 1 drafts recommendations, level 2 acts after approval, and level 3 acts within a budgeted sandbox.

Why it matters

Clear autonomy levels help teams align product promises, permissions, risk, and user trust.

GuardrailsApproval GateTool Permissions

Sources

OWASP Agentic AI Top 10 Anthropic: Building Effective Agents

Context Window

Also called Context length, Token window

Context & Memory

Established

The amount of text, images, tool results, or other tokens a model can consider in one request.

More context>

Plain English

It is the model's short-term workspace.

Example

A long-context model can inspect a whole repository or a large batch of documents before answering.

Why it matters

Large context helps, but long tasks still need retrieval, memory, and summarization discipline.

TokenCompactionContext Engineering

Sources

Google Cloud Generative AI Glossary Anthropic: Effective Harnesses

Context Engineering

Also called Context design, Context assembly

Context & Memory

Rising

The practice of selecting, ordering, compressing, and updating the information a model sees.

More context>

Plain English

Prompt engineering asks the model well. Context engineering gives it the right room to think in.

Example

A coding tool includes the active file, related tests, recent errors, project rules, and only the relevant docs.

Why it matters

For agents, context quality often matters more than clever wording.

Context WindowRetrievalCompaction

Sources

Anthropic: Effective Harnesses Google Cloud Generative AI Glossary

Memory

Also called Agent memory, Long-term memory

Context & Memory

Rising

Information stored outside the immediate prompt so an AI system can recall preferences, facts, or progress later.

More context>

Plain English

Memory is what lets the system remember beyond one chat or one run.

Example

An agent stores that the user prefers concise summaries and that the project uses Bun, not npm.

Why it matters

Memory improves continuity, but it also creates new risks such as stale, sensitive, or poisoned context.

Episodic MemorySemantic MemoryMemory Poisoning

Sources

LangChain: Long-Term Memory OWASP Agentic AI Top 10

How this dictionary is maintained

The source set is documented in docs/ai-terminology-research.md. The page favors terms that are useful across AI product, engineering, and safety conversations, with maturity labels to separate settled vocabulary from phrases still in motion.

Agents & Autonomy

agent patterns, delegation, and long-running work

Context & Memory

what the model sees, stores, and resumes from

Tools & Protocols

how models call software and connect to systems

Retrieval & Grounding

search, evidence, citations, and trusted context

Reasoning & Inference

how systems spend effort at answer time

Evaluation & Safety

testing, security, observability, and control

Models & Training

model families, customization, and compression

AI-Native Product

new product language around AI-powered workflows