promptfoo/promptfoo
Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration. Used by OpenAI and Anthropic.
last commit: Apr 2, 2026
stars: 19,110 (7d: +433, 30d: +1,927, 90d: +2,277)
## star history