
Early Exits
Taking the shortcut.
Early Exits
Most user queries are simple. "Hi", "Help", "What is your refund policy?"
These do not need a 5-step agentic reasoning loop. Design your graph with an "Express Lane".
- Router: Checks "Is this a canned response?"
- Yes: Returns string from DB -> END.
- No: Passes to Agent -> ...
This keeps average latency low while preserving power for complex queries.