promptfoo/promptfoo

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, DeepSeek, and more. Simple declarative configs with command line and CI/CD integration. Used by OpenAI and Anthropic.

last commit: May 24, 2026
stars: 21,544 (+151 in 7d, +978 in 30d, +4,390 in 90d)