We rebuilt 3 real vulnerable apps in a lab and let our agent loose. It found all three CVEs.
We rebuilt Lunary, LibreChat, and Gradio at their vulnerable versions, told Vibehacker nothing about what to look for, and let it scan each target blackbox. It rediscovered all three CVEs — not by magic, but through trial and error and a self-improvement loop that uses LLMs to rewrite the swarm's knowledge between runs.
#case-study#ai#pentest#cve#owasp