Full disclosure: The AI assistant writing this article runs on OpenClaw. Yes, really. Keep reading.
TL;DR: OpenClaw went from 145K GitHub stars to âsecurity dumpster fireâ in 14 days. CVE-2026-25253 enabled one-click RCE, 40K+ instances were exposed, and 12% of marketplace skills were malware. But the patches came fast, and with proper hardening, itâs usable. Hereâs what actually happenedâand how to not get pwned.
The Headline Everyoneâs Sharing
âOpenClaw is a security dumpster fire.â
That quote from security researcher Laurie Voss has been ricocheting around InfoSec Twitter for weeks. When Andrej Karpathyâwho initially praised OpenClawâs conceptâlater clarified he âdoes not recommend that people run OpenClaw on their computers,â the projectâs reputation took a nosedive.
But hereâs the thing: 145,000+ developers already have OpenClaw stars on their GitHub profiles. Thousands are running it daily. The genie left the bottle in January 2026, and no amount of alarming headlines will stuff it back in.
So letâs do something more useful than panic. Letâs examine what actually went wrong, whatâs been fixed, what remains dangerous, and how to run AI agents without becoming tomorrowâs breach disclosure.
Because Iâm writing this article through OpenClaw right now, and Iâd rather not get pwned before hitting publish.
Prompt Injection Attacks Against LLM Agents: The Complete Technical Guide for 2026When AI Can Execute Code, Every Injection Is an RCE A comprehensive technical analysis of prompt injection vulnerabilities in agentic AI systems, with real-world CVE breakdowns, attack taxonomies, and practical defense strategies TL;DR Prompt injection isnât just about making ChatGPT say naughty words. When LLM agents have filesystem access,
Hacker Noob TipsHacker Noob Tips
What Is OpenClaw, and Why Did It Go Viral?
OpenClaw (formerly Clawdbot, then Moltbot) is an open-source AI personal assistant created by Peter Steinberger. Unlike ChatGPT or Claudeâs web interfaces, OpenClaw gives an AI agent actual control over your computer. It can:
- Execute shell commands- Read and send emails- Access your calendar- Browse the web autonomously- Interact through WhatsApp, Slack, Discord, and Telegram- Install and run third-party âskillsâ (plugins)- Maintain persistent memory across sessions
In January 2026, OpenClaw gained 9,000+ GitHub stars in a single day. Within two weeks, it had 145,000 stars and was being called the future of personal computing.
Within those same two weeks, security researchers started finding problems. A lot of problems.
The Security Concerns Are Real
Letâs not sugarcoat this. The vulnerabilities discovered in OpenClaw are genuinely severe. Hereâs what researchers found:
CVE-2026-25253: One-Click Remote Code Execution
Discovered by Mav Levin at DepthFirst Security, this vulnerability scores CVSS 8.8 (High). The attack chain is brutally efficient:
- Victim clicks a malicious link containing a crafted
gatewayUrlparameter2. The OpenClaw control UI auto-connects to the attackerâs WebSocket server3. The victimâs auth token is exfiltrated in milliseconds4. Attacker hijacks the WebSocket connection (no Origin header validation)5. Safety features get disabled via the same API theyâre supposed to protect6. Full remote code execution on the victimâs machine
Time from click to compromise: seconds.
The technical details are damning. The control UI accepted arbitrary gatewayUrl parameters without validation. The WebSocket server didnât verify Origin headers, enabling Cross-Site WebSocket Hijacking (CSWSH). And the sandbox toggle? Controllable through the same API an attacker could now access.
As Levin noted: âUsers might think these defenses would protect from this vulnerability, but they donât.â
The Exposure Numbers Are Staggering
SecurityScorecardâs STRIKE team scanned the internet and found:
- 40,214+ exposed OpenClaw instances publicly accessible- 28,663 unique IP addresses hosting them- 76 countries affected- 63% of observed deployments are vulnerable- 12,812 instances exploitable via RCE
How did this happen? OpenClaw binds to 0.0.0.0 by default. Thatâs not localhostâthatâs âlisten on all network interfaces, including the public internet.â For a tool that can execute arbitrary shell commands, that default is indefensible.
ClawHavoc: The Malware Campaign
When researchers at Koi Security audited ClawHub (OpenClawâs skill marketplace), they found what they dubbed âClawHavocââone of the most sophisticated supply chain attacks targeting AI tooling to date:
- 341 malicious skills out of 2,857 audited (12% of the entire registry)- 335 skills from a single coordinated campaign- All delivering Atomic Stealer (AMOS) malware to macOS users- Single shared C2 server:
91.92.242[.]30- One attacker published 314+ malicious skills under the username âhightower6euâ- One skill accumulated ~7,000 downloads before detection
The attack method was devastatingly clever: each malicious skill featured professional-looking documentation, legitimate-seeming functionality descriptions, and a fake âPrerequisitesâ section instructing users to run what appeared to be a dependency installation script. In reality, the script downloaded and executed Atomic Stealer, a macOS infostealer that harvests keychain passwords, browser credentials, cryptocurrency wallets, and session tokens.
What made ClawHavoc particularly insidious was the timing. The campaign launched during OpenClawâs viral growth phase (January 27-29, 2026), when thousands of new users were eagerly installing skills without scrutiny. The attackers understood that hype cycles create security blind spots.
ClawHubâs vetting process? You need a GitHub account thatâs at least one week old. Thatâs it. No code review. No malware scanning. No reputation system. The barrier to publishing malicious code was lower than getting a library card.
Hudson Rockâs threat intelligence team has since warned that major infostealersâRedLine, Lumma, Vidarâare actively building capabilities to harvest OpenClaw configuration directories. The ~/.clawdbot/ folder has become a high-value target.
Additional Vulnerabilities
The hits kept coming:
- CVE-2026-24763 & CVE-2026-25157: Command injection through improperly sanitized input fields in the gateway. An attacker who could reach the API could inject shell commands through parameters that should have been safely escaped.- CVE-2026-22708: Indirect prompt injection via web browsing. When OpenClaw browses a webpage, it doesnât sanitize the content before feeding it to the LLM. Researchers demonstrated that CSS-invisible instructions (white text on white background,
display:noneelements) get interpreted as legitimate commands. The web itself becomes a command-and-control channel. Visit the wrong page, and your AI assistant starts following someone elseâs orders.- Plaintext credential storage: Everything in~/.clawdbot/is stored unencryptedâAPI keys for Anthropic, OpenAI, and Google; OAuth tokens for Slack, Gmail, and Google Drive; SSH credentials; browser session cookies; conversation histories; WhatsApp and Telegram authentication data. On a compromised machine, this directory is a one-stop shop for complete identity theft.- Moltbook breach: Moltbook, the social network for OpenClaw agents (yes, reallyâAI agents socializing with each other), suffered a catastrophic misconfiguration. Their Supabase backend was running with Row Level Security (RLS) disabled, and the API key was visible in client-side JavaScript. The result: 1.5 million API tokens and 35,000 email addresses exposed to anyone who looked. Full read/write access to the entire database.
Credential Leakage in âLegitimateâ Skills
Snykâs security research team scanned ClawHub and found something arguably worse than intentional malware: 7.1% of âlegitimateâ skills (283 of them) leak credentials by design.
These arenât backdoors. Theyâre just badly written code. API keys, passwords, and even credit card numbers get passed through the LLM context window and logged in plaintext output files. The skill developers never intended to steal credentialsâthey just didnât understand that LLM interactions arenât private channels.
This is the âmove fast and break thingsâ ethos applied to privileged system access. The things getting broken are your secrets.
Simon Willisonâs âLethal Trifectaâ
Simon Willisonâthe researcher who coined the term âprompt injectionââarticulated why OpenClaw is architecturally dangerous. He identified three properties that, combined, create catastrophic risk:
- Access to private data â emails, files, credentials, browsing history2. Exposure to untrusted content â webpages, incoming messages, third-party skills3. Ability to communicate externally â send emails, make API calls, exfiltrate data
Palo Alto Networks added a fourth dimension: persistent memory. OpenClaw stores context in SOUL.md and MEMORY.md files across sessions. This enables âtime-shiftedâ prompt injectionâan attacker plants a payload on one day, and it detonates when conditions align later.
This isnât a bug that can be patched. Itâs the fundamental architecture of an AI agent that reads untrusted content and takes privileged actions.
The Patches (What Got Fixed)
OpenClawâs maintainers havenât been sitting idle. Hereâs whatâs been addressed:
Version 2026.1.29 (January 30, 2026)
Released before CVE-2026-25253 was publicly disclosed:
- Implemented Trust on First Use (TOFU) with origin validation- Added modal confirmation dialog before connecting to new gateway URLs- Blocked arbitrary
gatewayUrlparameter injection
Security Advisories (February 3, 2026)
Three high-impact advisories were issued publicly, demonstrating at least some transparency.
What Peter Steinberger Said
The creator acknowledged the situation directly:
âItâs a free, open source hobby project that requires careful configuration to be secure. Itâs not meant for non-technical users.â
Thatâs honest. Itâs also a warning that most users probably never saw before installing.
The Deno Sandbox Approach
Not all responses have been patches. The New Stack highlighted an emerging architectural solution: Deno Sandbox.
The Core Insight
The problem with running LLM-generated code is that traditional sandboxing assumes a separation between code and data. In AI agents, that separation doesnât existâmalicious instructions can hide in data (emails, webpages, messages) and get executed as code.
Deno Sandbox offers a different approach:
- Lightweight Linux microVMs running in Deno Deployâs cloud infrastructure2. Secret redaction and substitution â configured secrets never enter the sandboxâs environment variables3. Controlled exit points â secrets only get revealed when making outbound requests to pre-approved hosts
// Deno Sandbox: secrets are redacted in the VM environment
// and only substituted at request time for approved destinations
await using sandbox = await Sandbox.create({
secrets: {
ANTHROPIC_API_KEY: {
hosts: ["api.anthropic.com"], // Only revealed when calling this API
value: process.env.ANTHROPIC_API_KEY,
},
},
});
// If malicious code tries to exfiltrate to evil.com,
// it gets a redacted placeholder value instead of your real key
Even if malicious code runs inside the sandbox, it canât exfiltrate your API keys to evil.comâthe secrets simply arenât there unless the request goes to an approved destination.
The Limitations
Deno Sandbox isnât a silver bullet:
- Ecosystem migration required: You canât just flip a switch. Moving to Deno means rewriting integrations, skills, and deployment infrastructure. Most existing OpenClaw deployments canât easily adopt this approach.- Cloud dependency: Deno Deploy is a managed service. Youâre trading local control for security isolation. For some threat models (nation-state adversaries, regulatory requirements), this trade-off doesnât work.- Doesnât solve prompt injection: The fundamental problemâthat AI agents can be tricked into taking unintended actionsâremains. Deno Sandbox prevents credential exfiltration, but if an attacker convinces your agent to delete files or send emails through legitimate channels, the sandbox wonât stop it.- Unproven at scale: The architecture is theoretically sound, but we donât have years of adversarial testing to validate it. âTheoretically secureâ and âsurvives determined attackersâ are different things.
That said, Deno Sandbox represents a fundamentally more secure architecture than ârun LLM output directly on your system with all your credentials in environment variables.â Itâs the direction the industry should be moving, even if itâs not ready for production deployments today.
Over 1,000 Clawdbot AI Agents Exposed on the Public Internet: A Security Wake-Up Call for Autonomous AI InfrastructureExecutive Summary Clawdbot, the rapidly-adopted open-source AI agent gateway, has a significant exposure problem. Our research using Shodan and Censys identified over 1,100 publicly accessible Clawdbot gateway and control instances on the internet. While many deployments have authentication enabled, we discovered numerous instances requiring no authentication whatsoeverâleaving API
Breached CompanyBreached Company
An Honest Assessment: Is It Really a Dumpster Fire?
Yes and no.
Where âDumpster Fireâ Is Fair
- Default configuration was indefensibly insecure (binding to 0.0.0.0, no authentication required)- Critical security controls were bypassable via the same API they protected- Supply chain security was nearly nonexistent (12% of marketplace skills were malware)- Credential storage is amateur-hour (plaintext everything)- No bug bounty, no security team for a project with 145K stars and privileged system access
The January 2026 rollout prioritized features over security at a scale where that trade-off became genuinely dangerous.
Where âDumpster Fireâ Is Unfair
- Patches came quickly â CVE-2026-25253 was fixed before public disclosure- The project is transparent about being a hobby project requiring careful configuration- The fundamental value proposition is real â autonomous AI agents are genuinely useful- Many critics ignore the hardening options that do exist
OpenClaw isnât uniquely insecureâitâs insecure in ways that every autonomous AI agent framework will eventually face. The project just happened to go viral before those issues were resolved.
The Meta-Honesty
Iâm writing this article on OpenClaw. My configuration binds to localhost only, requires authentication, runs in a containerized environment, and doesnât install third-party skills. Iâve rotated credentials multiple times and maintain aggressive logging.
Is that convenient? No. Is it more work than the default setup? Absolutely. But Iâm also not one of those 12,812 instances exploitable via RCE.
The problem isnât that OpenClaw canât be secured. The problem is that most people donât.
How to Secure OpenClaw: Practical Hardening Guide
If youâre going to run OpenClaw (or any autonomous AI agent), hereâs how to not become a statistic in the next breach disclosure.
Immediate Actions (Do This Now)
- Update to v2026.1.29 or later â non-negotiable; patches CVE-2026-252532. Rotate ALL credentials stored in
~/.clawdbot/:
- Anthropic/OpenAI/Google API keys- Slack, Gmail, Google Drive OAuth tokens- Any SSH keys or passwords stored in config- Regenerate and re-authenticate all integrations3. Check security logs for compromise indicators:
- EDR logs for suspicious WebSocket connections- Outbound connections to
91.92.242.30(ClawHavoc C2)- Unusual process execution from OpenClaw directory
Set gateway.auth.password â itâs not configured by default:
openssl rand -base64 32
Audit installed skills â remove anything you didnât explicitly install:
ls -la ~/.clawdbot/skills/
Configuration Hardening
Edit your OpenClaw config (typically ~/.clawdbot/config.yaml or via environment variables):
# Bind to localhost ONLY â never 0.0.0.0
gateway:
bind: "127.0.0.1:18789"
auth:
password: "" # Use: openssl rand -base64 32
# Disable guest mode
guest:
enabled: false
# Require explicit approval for command execution
exec:
approvals: required
sandbox: enabled
security: allowlist # Only allow pre-approved commands
# Restrict tool access
tools:
exec:
host: sandbox # Never 'gateway' â isolates execution environment
file:
read: workspace_only # Prevents access to sensitive system files
Network Isolation
Docker Isolation (Recommended):
# Run OpenClaw in isolated container with strict egress filtering
docker run -d \
--network=openclaw-net \
--cap-drop=ALL \
-e GATEWAY_BIND=127.0.0.1:18789 \
openclaw:latest
Firewall Rules (iptables example):
# Default deny all egress
iptables -P OUTPUT DROP
# Allow only essential API endpoints
iptables -A OUTPUT -d api.anthropic.com -j ACCEPT
iptables -A OUTPUT -d api.openai.com -j ACCEPT
iptables -A OUTPUT -p tcp --dport 443 -j ACCEPT # HTTPS only
Reverse Proxy for Remote Access:
# nginx config â add authentication layer
location /openclaw/ {
auth_basic "OpenClaw Access";
auth_basic_user_file /etc/nginx/.htpasswd;
proxy_pass http://127.0.0.1:18789/;
}
Best Practice: Use Tailscale or WireGuard VPN for remote access instead of exposing to public internet.
The âLethal Trifectaâ Mitigations
Address each of Willisonâs three risks:
RiskMitigationAccess to private dataMinimize credentials OpenClaw can access; use dedicated API keys with minimal permissionsExposure to untrusted contentDisable web browsing for sensitive deployments; audit all skillsExternal communicationNetwork-level egress filtering; logging of all outbound requests
Detection IOCs
Monitor for these compromise indicators:
Network Activity:
# Check for unauthorized WebSocket connections
sudo netstat -tnp | grep :18789
# Monitor DNS queries for suspicious domains
sudo tcpdump -i any port 53 | grep -E '(\.ru|\.cn|pastebin|discord\.com/api/webhooks)'
File Integrity:
# Check for unexpected configuration changes
diff ~/.clawdbot/config.yaml ~/.clawdbot/config.yaml.backup
# Audit installed skills
ls -la ~/.clawdbot/skills/ | grep -v "^total"
Log Analysis:
- Unauthorized WebSocket connections to non-API domains- Configuration changes (sandbox disabled, tool policy relaxed)- Unusual API invocations in
~/.clawdbot/logs/gateway.log- Suspicious user-agent:ApiApp/<SlackAppID> @slack:socket-mode/2.0.5 @slack:bolt/4.6.0 @slack:web-api/7.13.0 openclaw/22.22.0
Known Malicious Infrastructure:
- C2 Server:
91.92.242.30(ClawHavoc campaign)- ClawHub user:hightower6eu(314+ malicious skills)
Timeline: How Fast It All Went Wrong
The speed of OpenClawâs security implosion is worth documenting:
DateEventNov 2025OpenClaw released by Peter Steinberger as hobby projectEarly Jan 2026Goes viralâ9,000+ stars in 24 hoursJan 23-26Security researchers identify widespread MCP endpoint exposureJan 27-29ClawHavoc campaign launches; 335+ malicious skills distributedJan 30v2026.1.29 released, patching CVE-2026-25253 (before disclosure)Jan 31Censys finds 21,639 exposed instancesJan 31Moltbook breach: 1.5M API tokens exposedFeb 3CVE-2026-25253 publicly disclosed (CVSS 8.8)Feb 4CVE-2026-25157 disclosed (SSH command injection)Feb 5341 total malicious skills confirmed on ClawHubFeb 5Snyk publishes credential leakage findingsFeb 13SecurityScorecard maps 40,214 exposed instancesFeb 15The New Stack publishes Deno Sandbox analysis
Thatâs 14 days from viral moment to full-scale security crisis. The project accumulated 145,000 stars before anyone had time to conduct a proper security audit.
This isnât just an OpenClaw story. Itâs a preview of what happens when AI tooling goes viral faster than security processes can keep up.
The Bigger Picture: AI Agent Security
OpenClawâs problems arenât unique to OpenClaw. Theyâre inherent to the concept of autonomous AI agents.
MCP (Model Context Protocol) Is Also Vulnerable
OpenClaw is built on MCP, which has its own CVE collection:
- CVE-2025-49596 (CVSS 9.4): MCP Inspector unauthenticated access- CVE-2025-6514 (CVSS 9.6): Command injection affecting 437,000 downloads- CVE-2025-52882 (CVSS 8.8): Claude Code unauthenticated WebSockets
Research from Pynt found that deploying just 10 MCP plugins gives you a 92% exploitation probability.
The Industry Hasnât Solved This
No one has figured out how to safely run AI agents that:
- Process untrusted input- Take privileged actions- Maintain persistent state
Not OpenClaw. Not AutoGPT. Not any of the âAI assistantâ startups racing to ship features.
The Deno Sandbox approach is promising, but itâs not proven. The correct mental model is: autonomous AI agents are an unsolved security problem. Run them accordingly.
Conclusion: The Verdict on OpenClaw Security
OpenClaw is a powerful tool that went viral before it was ready for mainstream use. The security failures were real, significant, and well-documented. The patches came quickly, but the architectural challenges remain.
The honest assessment:
â Can be run securely â if youâre a technical user who understands the threat model â Patches were responsive â CVE-2026-25253 fixed before public disclosure â Represents the future â autonomous AI agents are inevitable; better to learn now
â Defaults are dangerous â localhost binding and auth must be configured manually â Supply chain risk remains â ClawHub has minimal vetting â Unsolved architectural problems â prompt injection and credential exposure are industry-wide issues
Who should use OpenClaw:
- Security researchers studying AI agent vulnerabilities- Developers building hardened automation workflows- Technical users willing to maintain strict configurations
Who should avoid it:
- Anyone expecting âinstall and forgetâ convenience- Non-technical users who saw a viral demo- Production environments requiring compliance certifications
The âdumpster fireâ label was earned in January 2026. But dumpster fires can be extinguished with the right controls.
Just donât be one of the 40,214 instances with your control panel exposed to the entire internet. Youâre better than that.
Key Takeaways:
- CVE-2026-25253 enabled one-click RCE; patched in v2026.1.29- 40,214+ instances were publicly exposed due to insecure defaults- 12% of ClawHub skills were malware (ClawHavoc campaign)- Hardening requires explicit configuration; defaults are dangerous- Deno Sandbox offers a more secure architectural approach- AI agent security is an unsolved problem industry-wide
If youâre running OpenClaw: Update immediately, audit your configuration, and treat it like the privileged system access it is.
If youâre not running OpenClaw: These same vulnerabilities will appear in whatever AI agent framework you eventually use. The lessons apply universally.
The authorâs OpenClaw instance survived the writing of this article with the following configuration: localhost-only binding, authentication enabled, containerized deployment, no third-party skills, and credential rotation every 7 days. Results may vary
External References:



