
Agent Failures Don't Throw
An agent loops, a user gives up, a jailbreak slips past, and the trace still says 200 OK. The failures that matter to agents are semantic, and your logs can't see them. Reflexes are small classifiers that read every turn and label the ones that broke: frustration, jailbreaks, loops, policy violations. Eight ship out of the box, one API call, under 90ms. Train your own in an hour.






































