ai_agents_test
AI Agent Test is a module that lets you test an AI agent's decision-making process repeatedly, using a known input context, against any provider and model combination.
When you try an agent in the Agent Explorer on a few random samples of data, it is easy to be tricked into thinking that you have created a production-ready agent, but the real world is much more complex. If you develop an agent, you have the domain knowledge to instruct it correctly, so your inputs will be heavily biased toward the correct vocabulary and context compared to what real users write.
The AI Agent Test module lets you create a test suite of actual prompts written by testers and end users and run it against any agent, so you can be confident that the agent will work in production.
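The idea can be illustrated with a small conceptual sketch. It is not the module's API: `AgentTestCase`, `run_suite`, `stub_agent`, the tool names, and the provider and model names are all hypothetical, and the stub stands in for a real LLM call. It shows a fixed set of prompts being replayed several times against each provider and model combination, with a pass rate reported for each.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass(frozen=True)
class AgentTestCase:
    """One known input plus the decision we expect (hypothetical structure)."""
    prompt: str         # a prompt a tester or end user actually wrote
    expected_tool: str  # the tool we expect the agent to pick


# Hypothetical agent signature: (provider, model, prompt) -> chosen tool name.
AgentFn = Callable[[str, str, str], str]


def run_suite(
    agent: AgentFn,
    cases: List[AgentTestCase],
    combos: List[Tuple[str, str]],
    repeats: int = 3,
) -> Dict[Tuple[str, str], float]:
    """Replay every case `repeats` times per combo and return pass rates."""
    results: Dict[Tuple[str, str], float] = {}
    for provider, model in combos:
        passed = 0
        total = 0
        for case in cases:
            for _ in range(repeats):
                total += 1
                if agent(provider, model, case.prompt) == case.expected_tool:
                    passed += 1
        results[(provider, model)] = passed / total
    return results


def stub_agent(provider: str, model: str, prompt: str) -> str:
    """Stand-in for a real agent: it only reacts to 'developer' vocabulary."""
    text = prompt.lower()
    if "create" in text or "add" in text:
        return "create_content_type"
    return "answer_question"


cases = [
    # Phrased the way the agent's developer would phrase it.
    AgentTestCase("Please add a blog content type", "create_content_type"),
    # Phrased the way an end user might phrase the same request.
    AgentTestCase("make me a place to post news", "create_content_type"),
]

results = run_suite(
    stub_agent,
    cases,
    [("provider_a", "model_x"), ("provider_b", "model_y")],
)
print(results)  # each combination scores 0.5: the end-user phrasing fails
```

The second case fails because the stub only recognizes the developer's vocabulary, which is the kind of gap that a suite built from real prompts is meant to expose.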
Getting Started
To see the tests in action quickly, along with a few more example tests, try the demo here: Drupal Forge AI Agent Test Demo
The Git repository linked to on that page has more Drupal CMS Agent Tests.
To read more about how it works, see the AI Documentation page about AI Agents Test.