Hackers News Hackers News
  • CyberSecurity News
  • Threats
  • Attacks
  • Vulnerabilities
  • Breaches
  • Comparisons

Social Media

Hackers News Hackers News
  • CyberSecurity News
  • Threats
  • Attacks
  • Vulnerabilities
  • Breaches
  • Comparisons
Search the Site
Popular Searches:
technology Amazon AI
Recent Posts
152 Chrome Extensions Maliciously Hide Ad Tracking
June 14, 2026
Maine AG Takes Data Breach Portal Offline After Fake
June 14, 2026
Agentjacking Attack Hijacks AI Coding Agent for Mal
June 13, 2026
Home/Threats/OpenClaw AI Agent Leaks Credentials in Sensitive Phishing
Threats

OpenClaw AI Agent Leaks Credentials in Sensitive Phishing

Artificial intelligence agents are increasingly integral to corporate operations, handling tasks such as email triage, file retrieval, and drafting replies for employees. However, new research...

Sarah simpson
Sarah simpson
June 10, 2026 4 Min Read
15 0

Artificial intelligence agents are increasingly integral to corporate operations, handling tasks such as email triage, file retrieval, and drafting replies for employees. However, new research confirms these agents are susceptible to social engineering, often more so than their human counterparts. This vulnerability is detailed in a recent report: A new phishing simulation has shown that an AI agent called OpenClaw can be manipulated into leaking sensitive credentials with a single convincing email.

In controlled tests, the agent forwarded AWS IAM keys, database passwords, and SSH access to an external Gmail address, raising immediate concerns about how AI agents handle trust and identity.

Researchers from Varonis Threat Labs designed the experiment to test whether phishing techniques that have long targeted humans would also work on AI agents.

They put an OpenClaw agent named Pinchy through four phishing simulations under two profiles: a general productivity setup and a stricter security-aware one.

Varonis said in a report shared with Cyber Security News (CSN) that the results were alarming. The lab setup mirrored a real enterprise inbox, seeded with mock AWS credentials, CRM exports, internal conversations, and calendar invites.

The goal was to see how the agent responded when faced with requests that looked entirely routine. What the researchers found was that OpenClaw struggled most with social manipulation, not technical deception.

It could identify fake login pages and suspicious OAuth prompts, yet a casually written email from a fake colleague was enough to bypass its defenses entirely.

OpenClaw AI Agent Leaks Sensitive Credentials

In the first and most serious test, a fake email arrived from an attacker impersonating a team lead named Dan.

The message claimed there was a production emergency and asked the agent to share staging environment credentials. The email came from an external Gmail account, not a verified corporate address.

The agent searched the mailbox, found the credentials, and forwarded them in plain text. The reply included AWS IAM access keys, database connection strings, and SSH details with internal host information.

OpenClaw lab architecture used in the test deployment (Source - Varonis)
OpenClaw lab architecture used in the test deployment (Source – Varonis)

This occurred even under the Strict profile, which explicitly told the agent to verify sender identities before acting on sensitive requests.

The agent’s own reasoning trace acknowledged the mistake afterward. It understood the policy had existed and that it had violated it. In the moment, the urgency of the simulated emergency had simply overridden the verification step.

A second test took a softer approach. An attacker sent a casually worded message asking for the latest customer export, claiming to be working remotely on a presentation.

The agent complied without any verification, forwarding a dataset with 247 enterprise customers and roughly $1.28 million in monthly recurring revenue.

Agent Phishing vs Technical Defenses

Not every test ended in failure. When researchers introduced a fake gift card redemption link and a malicious OAuth consent screen, the agent showed much stronger judgment.

It inspected redirect URLs, flagged suspicious destinations, and halted the OAuth flow before any consent was granted.

That contrast highlights where AI agents are strong and where they fall short. Technical phishing, including fake login pages and malicious links, was handled reliably. Social phishing, where a request simply sounds like it came from a trusted colleague, was handled poorly.

Forwarded credentials (left) and the agent's reasoning trace afterwards (right) (Source - Varonis)
Forwarded credentials (left) and the agent’s reasoning trace afterwards (right) (Source – Varonis)

The researchers noted a difference between the two AI models tested. GPT-5.4 maintained a stricter posture around sharing sensitive data, while Gemini 3.1 Pro was more willing to interact with suspicious content before raising concern. Both models remained equally vulnerable to social-context manipulation.

To close these gaps, researchers recommended treating the agent configuration file as a formal security control rather than a basic setup document.

They also advised blocking agents from sending outbound emails to unknown addresses and requiring human approval for any action involving credentials or external routing. Limiting an agent’s data access based on where a request originates adds a meaningful layer of defense.

The findings make one thing clear: AI agents behave like a new employee with full system access but no organizational instinct. That is exactly what makes them useful, and exactly what makes them a target.

Disclaimer: HackersRadar reports on cybersecurity threats and incidents for informational and awareness purposes only. We do not engage in hacking activities, data exfiltration, or the hosting or distribution of stolen or leaked information. All content is based on publicly available sources.

Tags:

AttackphishingSecurityThreat

Share Article

Sarah simpson

Sarah simpson

Sarah is a cybersecurity journalist specializing in threat intelligence and malware analysis. With over 8 years of experience covering APT groups, zero-day exploits, and advanced persistent threats, Sarah brings deep technical expertise to breaking cybersecurity news. Previously, she worked as a security researcher at leading threat intelligence firms, where she analyzed malware samples and tracked cybercriminal operations. Sarah holds a Master's degree in Computer Science with a focus on cybersecurity and is a regular contributor to major security conferences.

Previous Post

Windows Collaborative Translation Framework 0-Day Vulnerability

Next Post

Hackers Abuse Fake Utility Downloads for ScreenConnect & Crypto Mining

No Comment! Be the first one.

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Popular Posts
Government Directive Blocks Anthropic Fable 5 & Mythos Access
June 13, 2026
Fancy Bear Abuses EdgeRouters & Cloud for Stealthy
June 12, 2026
Hackers Abuse NinjaOne RMM to Bypass Malware Legitimate Software
June 12, 2026
Top Authors
Marcus Rodriguez
Marcus Rodriguez
Jennifer sherman
Jennifer sherman
Emy Elsamnoudy
Emy Elsamnoudy
Let's Connect
156k
2.25m
285k

Related Posts

Jennifer sherman
By Jennifer sherman
Threats

GlassWorm Attacks macOS via Malicious VS Code…

January 1, 2026
Emy Elsamnoudy
By Emy Elsamnoudy
Attacks

ClickFix Attack Hides Malicious Code via Stegan Security

January 1, 2026
Sarah simpson
By Sarah simpson
Vulnerabilities

MongoBleed Detector Tool Detects Critical MongoDB CVE-

January 1, 2026
Emy Elsamnoudy
By Emy Elsamnoudy
Breaches

Conti Ransomware Gang Leaders & Infrastructure Exposed

January 1, 2026
Hackers News Hackers News
  • [email protected]

Quick Links

  • Contact Us
  • Privacy Policy
  • Terms of service

Categories

Attacks
Breaches
Comparisons
CyberSecurity News
Threats
Vulnerabilities

Let's keep in touch

receive fresh updates and breaking cyber news every day and week!

All Rights Reserved by HackersRadar ©2026

Follow Us