Every time you review an agent you'll find things. The trick is not to fix at random: it's to sort each finding into one of three buckets and tackle them with the logic each one calls for. That turns QA from firefighting into a system.
Behaviors that are wrong. The agent isn't doing what it should. High priority: this loses money today.
"My agent sends the payment link too early. Fix it without touching the rest and test it first."
The agent already works well. Now the game is raising conversion: getting more bookings and more sales from the same leads.
"At which funnel stage do I lose the most people, and what change would you try to improve conversion this week?"
The agent performs and is measured. Nothing's broken or urgent to optimize: it's ready for more. Here you don't tinker, you multiply.
"This agent performs. Build me a second one for another channel reusing its voice, and leave it measured from day 1."