New Security Plugin for AI Agents

Adversa AI has launched SecureClaw, described as the first open-source, OWASP-aligned security plugin and skill for OpenClaw AI agents. The release is timed as OpenAI expands its investment in personal AI agents, highlighting a growing need for purpose-built security solutions in the space. SecureClaw aims to provide a security platform for this emerging technology.

- Adversa AI, founded in 2021, is an Israeli cybersecurity company that specializes in identifying vulnerabilities in AI systems through services like AI Red Teaming. The company's founder, Alex Polyakov, is also known for co-inventing the Server-Side Request Forgery (SSRF) web vulnerability class over a decade ago. - The plugin is aligned with the OWASP Top 10 for Large Language Models, a framework that identifies critical security risks in AI applications. Key risks addressed include prompt injection, where attackers manipulate inputs to trigger unintended actions, and excessive agency, where an AI performs actions beyond its intended scope. - OpenClaw is an open-source AI assistant that runs on a user's own hardware, connecting with messaging apps like Slack or WhatsApp to execute tasks. Unlike chatbots, it can access local files, run shell commands, and control a web browser, which creates new security challenges. - The core security risk with autonomous agents like OpenClaw is that they can be given broad access to a user's systems and data. If compromised through a technique like indirect prompt injection, an agent's legitimate access to APIs and databases can be hijacked by an attacker to exfiltrate data or execute malicious commands. - Because AI agents are designed to be autonomous, they can create new attack vectors not found in traditional software. These include AI tool poisoning, where an attacker compromises a plugin the agent relies on, and multi-step attacks where a chain of seemingly harmless operations becomes malicious. - Other open-source security tools for AI agents are also emerging, focusing on different parts of the security lifecycle. These include sandboxing tools like SandboxAI for isolating AI-generated code and red teaming frameworks like Microsoft's PyRIT for automating adversarial attacks.

New Security Plugin for AI Agents

Get your own daily briefing