Course Chapter 06

Scaling

Take agentic systems from prototype to production — managing cost, reliability, and complexity as usage grows.

From working demo to production workload

Chapter framing

A prototype that works for 10 users is not a system that works for 10,000. This chapter covers what changes when you scale agentic applications — from cost management and performance optimization to coordinating multiple agents on complex tasks.

You'll learn where the bottlenecks actually are (hint: it's rarely the model), how to keep costs from spiraling, and how to design multi-agent systems that cooperate instead of collide.