Agents and Tool UseModern AI systems are no longer just “text generators”.
They are increasingly built as
agent systems that can plan, call tools, store state, and execute actions outside the model.
The core shift is:
From: text prediction
To: decision + action systems
---
1. Tool CallingTool calling allows a model to request external functions instead of generating text only.
Example tools:
- Search engines
- Calculators
- Databases
- Code execution
- APIs
Flow:
User query → model → tool request → tool output → model response
Example:
"What's the weather?"
→ call weather API
→ return structured result
→ format response
So the model becomes a controller, not just a responder.
---
2. External APIsAPIs extend the model’s capabilities beyond training data.
They provide:
- Real-time information
- Structured data access
- System integration
Examples:
- Flight booking systems
- Financial data feeds
- Web search APIs
Key idea:
Model = reasoning layer
API = real-world interface
---
3. Planning LoopsAgents can break tasks into steps and execute them iteratively.
Example loop:
1. Understand goal
2. Break into sub-tasks
3. Execute step
4. Observe result
5. Adjust plan
6. Repeat
This creates goal-directed behaviour instead of single-shot responses.
---
4. Chain-of-Thought OrchestrationChain-of-thought (CoT) is internal reasoning over multiple steps.
In agent systems, this becomes structured orchestration:
- Decomposition of problems
- Intermediate reasoning steps
- Decision checkpoints
Example:
Problem → sub-problems → intermediate outputs → final answer
Important distinction:
- CoT = internal reasoning structure
- Orchestration = system-level control flow around it
---
5. Autonomous AgentsAn autonomous agent is a system that can:
- Perceive input
- Plan actions
- Use tools
- Execute steps
- Iterate without constant human prompting
Architecture:
Goal → Planner → Tool use → Feedback loop → Goal completion
Examples:
- Coding assistants that write and run code
- Research agents that browse and summarise
- Task automation systems
---
6. Memory StoresAgents often require memory beyond the context window.
Memory systems include:
- Short-term working memory (context window)
- Long-term vector databases
- Summarised conversation logs
How it works:
New input → retrieve relevant memories → inject into context → reason
This allows continuity across sessions and tasks.
---
7. Execution EnvironmentsExecution environments allow agents to actually run code or actions.
Examples:
- Python interpreters
- Shell environments
- Containerised systems
- Sandboxed runtimes
This turns AI from:
"describing actions"
into:
"performing actions"
Key benefit:
- Verifiable outputs
- Real computation
- Tool-verified reasoning
---
Key InsightAgents are not a new model type — they are a system design layer built on top of LLMs.
Core structure:
LLM (reasoning engine)
+ Tools (external capabilities)
+ Memory (state persistence)
+ Planner (task decomposition)
+ Execution (real-world actions)
So instead of a static text generator:
You get a goal-directed system that can interact with the world