Module 15 Wrap-up: The Observability Suite
Hands-on: Create a CloudWatch dashboard that tracks your agent's success rate and token spend.
Module 15 Wrap-up: The Watchtower
You have reached "Production Maturity." You know that an AI system is only as good as your ability to measure it. You have learned how to use Traces for logic debugging, CloudWatch for long-term audit logs, and Alarms to protect your company's credit card.
Hands-on Exercise: The Budget Shield
1. The Scenario
You are worried that a bad prompt could cause your agent to loop forever between two tools.
2. The Task
- Go to CloudWatch Metrics.
- Find the Invocations metric for your agent.
- Create an Anomaly Detection alarm.
- Configure it to email you if the number of invocations per hour falls outside the normal range (for example, a sudden spike).
- This is one of the most effective ways to catch "runaway agents" before they run up a bill in the thousands.
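The console steps above can also be sketched with boto3. This is a minimal sketch under stated assumptions, not the course's reference solution: the `AWS/Bedrock` namespace and the `ModelId` dimension are assumptions (check the namespace and dimensions your agent's Invocations metric actually appears under in CloudWatch Metrics), and the alarm name, model ID, and SNS topic ARN are hypothetical placeholders. The email itself would come from an SNS topic with an email subscription.

```python
def build_anomaly_alarm(model_id, sns_topic_arn, band_width=2):
    """Build keyword arguments for CloudWatch put_metric_alarm.

    The alarm fires when hourly invocations rise above the upper edge
    of the anomaly detection band.
    """
    return {
        # Hypothetical naming scheme, pick whatever fits your account.
        "AlarmName": f"bedrock-invocations-anomaly-{model_id}",
        "AlarmDescription": "Hourly invocations above the expected range",
        "ComparisonOperator": "GreaterThanUpperThreshold",
        "EvaluationPeriods": 1,
        # Points at the band expression below instead of a fixed threshold.
        "ThresholdMetricId": "band",
        "TreatMissingData": "notBreaching",
        "AlarmActions": [sns_topic_arn],
        "Metrics": [
            {
                "Id": "invocations",
                "ReturnData": True,
                "MetricStat": {
                    "Metric": {
                        # Assumption: namespace and dimension may differ
                        # for your agent; verify in the console first.
                        "Namespace": "AWS/Bedrock",
                        "MetricName": "Invocations",
                        "Dimensions": [{"Name": "ModelId", "Value": model_id}],
                    },
                    "Period": 3600,  # one hour, in seconds
                    "Stat": "Sum",
                },
            },
            {
                "Id": "band",
                "Expression": f"ANOMALY_DETECTION_BAND(invocations, {band_width})",
                "Label": "Expected range",
                "ReturnData": True,
            },
        ],
    }


def create_alarm(params):
    """Send the alarm definition to CloudWatch (requires AWS credentials)."""
    import boto3  # imported here so the builder above works without boto3

    boto3.client("cloudwatch").put_metric_alarm(**params)
```

A wider `band_width` makes the alarm less sensitive. Starting around 2 and adjusting after a week or so of real traffic is a reasonable first pass.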
Module 15 Summary
- Tracing: Essential for step-by-step logic debugging.
- CloudWatch Logs: The long-term source of truth for all AI inputs/outputs, subject to your log retention settings.
- Token Metrics: Track token usage, the primary driver of your Amazon Bedrock bill.
- Alarms: Automated gatekeepers that alert you to cost spikes before they grow.
- Anomaly Detection: The modern way to monitor unpredictable AI behavior.
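To make the link between token metrics and cost concrete, here is a small arithmetic sketch. The per-1,000-token prices are hypothetical placeholders, not real Bedrock rates; substitute the current prices for your model.

```python
def estimate_cost(input_tokens, output_tokens, price_in_per_1k, price_out_per_1k):
    """Estimate spend in dollars from token counts and per-1,000-token prices."""
    return (input_tokens / 1000) * price_in_per_1k + (output_tokens / 1000) * price_out_per_1k


# Hypothetical prices, for illustration only.
PRICE_IN = 0.003   # dollars per 1,000 input tokens
PRICE_OUT = 0.015  # dollars per 1,000 output tokens

# One normal request: 2,000 input tokens and 500 output tokens.
single = estimate_cost(2000, 500, PRICE_IN, PRICE_OUT)

# The same request repeated 200 times by an agent stuck in a tool loop.
runaway = 200 * single

print(f"single request: ${single:.4f}")
print(f"runaway loop:   ${runaway:.2f}")
```

The per-request cost looks negligible on its own. The multiplier from an unbounded loop is what alarms are meant to catch.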
Coming Up Next...
In Module 16, we return to Security and Governance. We will look at deep-layer security: protecting your secrets, preventing Prompt Injection at the API level, and ensuring your AI complies with enterprise standards.
Module 15 Checklist
- I can describe the 3 stages of a Bedrock Agent Trace.
- I have viewed my AI logs in CloudWatch.
- I have set a CloudWatch Alarm for token usage.
- I understand the difference between a ThrottlingException and a ValidationException in the logs.
- I can find the total InputTokenCount for a specific agent request.
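The practical difference between the two exceptions in the checklist is how your code should react: a ThrottlingException means the request rate was too high and the call is worth retrying after a pause, while a ValidationException means the request itself was malformed and retrying it unchanged will fail again. A minimal sketch of that decision follows; the helper names, base delay, and cap are illustrative assumptions.

```python
# Error codes worth retrying. With boto3, the code is typically read from
# a botocore ClientError via err.response["Error"]["Code"].
RETRYABLE_CODES = {"ThrottlingException"}


def should_retry(error_code):
    """Return True if the error is transient and a retry may succeed."""
    return error_code in RETRYABLE_CODES


def backoff_delays(max_attempts, base=0.5, cap=8.0):
    """Exponential backoff schedule in seconds, capped at `cap`."""
    return [min(cap, base * 2 ** attempt) for attempt in range(max_attempts)]


for code in ("ThrottlingException", "ValidationException"):
    action = "retry with backoff" if should_retry(code) else "fix the request"
    print(f"{code}: {action}")
```

In a real client you would usually add random jitter to each delay so that many callers do not retry at the same instant.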