02
Docs

Core Concepts: Shared Vocabulary

Definitions, the agent spectrum, context windows, and key terminology

What is an Agent?

Before diving into tools, the team needs shared language.

Working Definition:

Definition

An agent is an AI system that can autonomously take actions to achieve a goal, using tools, making decisions, and iterating based on feedback — without requiring step-by-step human instruction.

Key Characteristics:

CharacteristicDescriptionExample
AutonomyDecides what to do next without explicit instructionClaude Code choosing which files to edit
Tool UseInvokes external tools/APIs to accomplish tasksRunning terminal commands, querying databases
Goal-OrientedWorks toward a defined objective, not just responding“Build a login page” vs. “What is React?”
IterationEvaluates results and tries again if neededFixing errors after running tests
Context AwarenessUnderstands environment and adapts behaviorReading codebase before making changes

What an Agent is NOT:

The Agent Spectrum

Where most developers are vs. where we want them to be:

Less Agentic ─────────────────────────────────────────────────▶ More Agentic Autocomplete → Chat Assistant → Copilot → Agent Mode → Autonomous Agent (Tab) (Q&A only) (Inline) (Multi-step) (Background) ▲ ▲ ▲ │ │ │ Most devs Some devs Workshop start here are here goal
LevelDescriptionExample Tools
AutocompletePredicts next characters/linesBasic IDE completion
Chat AssistantAnswers questions, explains codeChatGPT, basic Claude
CopilotSuggests code inline, you accept/rejectGitHub Copilot inline
Agent ModeMulti-step tasks, uses tools, iteratesClaude Code, Cursor Composer
Autonomous AgentWorks in background, minimal supervisionClaude Code background, Codex async

Context Windows by Model

Understanding context limits helps you choose the right model and manage long sessions:

ModelContext WindowApprox. Lines of CodeBest For
Claude Sonnet 4200K tokens~150,000 linesDaily coding tasks, fast responses
Claude Opus 4200K tokens~150,000 linesComplex reasoning, architecture
GPT-4o128K tokens~96,000 linesGeneral tasks, multimodal
GPT-4 Turbo128K tokens~96,000 linesLong documents, analysis
Gemini 2.0 Pro2M tokens~1.5M linesEntire codebases, massive context

Practical implications:

Token estimation rules of thumb:

Key Terminology

Skills

Definition: Discrete capabilities an agent can perform, often tied to specific tools.

SkillWhat It DoesEnabled By
File editingRead, write, modify code filesBuilt-in
Terminal executionRun shell commandsBuilt-in
Web browsingFetch and parse web contentPlaywright MCP
Issue managementCreate/read/update ticketsJira MCP, GitHub MCP
Database queriesRead/write to databasesPostgreSQL MCP
Design extractionRead Figma filesFigma MCP

Tools vs. Skills

MCP (Model Context Protocol)

Analogy: “USB for AI” — a standard way to connect AI models to external tools.

AspectDescription
What it isProtocol for exposing tools to AI agents
Why it mattersStandardized connections = portable workflows
Key benefitSame integration works across Claude, Cursor, etc.

The Agentic Loop

How agents work internally:

┌─────────────────────────────────────────────────────┐ │ AGENTIC LOOP │ │ │ │ ┌─────────┐ ┌─────────┐ ┌─────────┐ │ │ │ Observe │ → │ Think │ → │ Plan │ │ │ └─────────┘ └─────────┘ └─────────┘ │ │ ▲ │ │ │ │ ▼ │ │ ┌─────────┐ ┌─────────┐ │ │ │Evaluate │ ←──────────────── │ Act │ │ │ └─────────┘ └─────────┘ │ │ │ └─────────────────────────────────────────────────────┘
  1. Observe: Read the current state (files, errors, output)
  2. Think: Reason about what’s needed
  3. Plan: Decide on approach
  4. Act: Execute using tools
  5. Evaluate: Check results
  6. Repeat: Continue until goal is met

Guardrails

Constraints that limit what an agent can do:

TypeDescriptionExample
Permission-basedRequires approval for certain actions“Ask before deleting files”
Approval-basedHuman reviews before executionPlan mode in Claude Code
Rule-basedHard limits on behavior“Never modify production configs”