OpenAI Launches AI Safety Bug Bounty for AI Vulnerabilities

Jennifer Sherman
March 26, 2026

OpenAI has launched a public Safety Bug Bounty program specifically designed to identify AI abuse and safety risks across its product offerings.

Hosted on Bugcrowd, the new initiative marks a significant step in the company’s efforts to address issues that fall outside the scope of traditional security flaws but still carry potential for real-world harm.

The Safety Bug Bounty program is designed to complement OpenAI’s existing Security Bug Bounty program by accepting submissions that carry meaningful abuse and safety risks even when those issues don’t qualify as conventional security vulnerabilities.

Submissions will be triaged jointly by OpenAI’s Safety and Security Bug Bounty teams and may be rerouted between the two programs depending on scope and ownership.

AI-Specific Risk Categories in Focus

The program targets several distinct categories of AI-specific safety scenarios:

Agentic Risks Including MCP — This covers third-party prompt injection and data exfiltration scenarios where attacker-controlled text can reliably hijack a victim’s AI agent, including Browser, ChatGPT Agent, and similar agentic products, to perform harmful actions or leak sensitive user data.

To qualify, the behavior must be reproducible at least 50% of the time. Reports involving agentic products performing disallowed or potentially harmful actions at scale are also in scope.

OpenAI Proprietary Information — Researchers can report model generations that inadvertently expose reasoning-related proprietary information, as well as vulnerabilities that leak other confidential OpenAI data.

Account and Platform Integrity — This category targets weaknesses in account and platform integrity signals, including bypassing anti-automation controls, manipulating account trust signals, and evading account restrictions, suspensions, or bans.
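The 50% reproducibility bar mentioned for agentic risks can be checked with a simple repeated-trial count before submitting. The following is a minimal sketch, not part of the program itself: the function names, the default of 20 trials, and the `attempt` callback (a stand-in for whatever a researcher uses to run one reproduction attempt) are all assumptions made for illustration.

```python
from typing import Callable

# "At least 50% of the time", per the program description quoted in the article.
REPRO_THRESHOLD = 0.5


def reproduction_rate(attempt: Callable[[], bool], trials: int = 20) -> float:
    """Run `attempt` `trials` times and return the fraction that succeeded.

    `attempt` is a hypothetical callback that performs one reproduction
    attempt and returns True if the behavior was observed.
    """
    if trials <= 0:
        raise ValueError("trials must be positive")
    successes = sum(1 for _ in range(trials) if attempt())
    return successes / trials


def meets_threshold(rate: float, threshold: float = REPRO_THRESHOLD) -> bool:
    """Return True if the observed rate is at or above the threshold."""
    return rate >= threshold


# Demo with canned outcomes instead of a real agent run (6 of 10 succeed).
outcomes = iter([True, False, True, True, False, True, False, True, True, False])
rate = reproduction_rate(lambda: next(outcomes), trials=10)
print(rate, meets_threshold(rate))  # 0.6 True
```

A small number of trials gives a noisy estimate, so a rate hovering near 0.5 is worth re-measuring with more attempts before relying on it.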

OpenAI has been explicit about what is out of scope: generic jailbreaks that result in rude language or surface publicly available information will not be considered.

General content-policy bypasses without demonstrable safety or abuse impact are also excluded. However, OpenAI periodically runs private bug bounty campaigns targeting specific harm types, such as biorisk content issues in ChatGPT Agent and GPT-5, and invites researchers to apply when those campaigns become available.

For vulnerabilities that enable unauthorized access to features, data, or functionality beyond what a user is permitted, researchers are directed to the existing Security Bug Bounty program instead.
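The scope rules described above can be summarized as a small decision function. This is only an illustrative sketch of the rules as reported here: the actual triage is performed by OpenAI's teams, and the `Report` fields, the `Route` names, and the order of the checks are hypothetical simplifications.

```python
from dataclasses import dataclass
from enum import Enum


class Route(Enum):
    SAFETY = "Safety Bug Bounty"
    SECURITY = "Security Bug Bounty"
    OUT_OF_SCOPE = "out of scope"


@dataclass
class Report:
    # Hypothetical self-assessment flags; not fields of any real submission form.
    unauthorized_access: bool = False         # access beyond permitted features or data
    only_rude_or_public_output: bool = False  # generic jailbreak with no further impact
    demonstrable_safety_impact: bool = False  # concrete abuse or safety consequence


def route(report: Report) -> Route:
    """Map a report to a destination using the scope rules in the article."""
    if report.unauthorized_access:
        return Route.SECURITY
    if report.only_rude_or_public_output or not report.demonstrable_safety_impact:
        return Route.OUT_OF_SCOPE
    return Route.SAFETY


print(route(Report(demonstrable_safety_impact=True)).value)  # Safety Bug Bounty
```

The unauthorized-access check comes first in this sketch because the article says such findings belong to the Security program regardless of any safety angle; that ordering is an interpretation, not a documented rule.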

The launch signals a growing recognition that AI systems introduce an entirely new attack surface, one that traditional security frameworks weren’t built to address.

By incentivizing safety-focused research alongside conventional vulnerability disclosure, OpenAI is effectively establishing a structured framework for AI-specific threat modeling.

Researchers interested in participating can apply directly through OpenAI’s Safety Bug Bounty page on Bugcrowd.

Disclaimer: HackersRadar reports on cybersecurity threats and incidents for informational and awareness purposes only. We do not engage in hacking activities, data exfiltration, or the hosting or distribution of stolen or leaked information. All content is based on publicly available sources.

Tags: Attack, Security, Threat, Vulnerability

Jennifer Sherman

Jennifer is a cybersecurity news reporter covering data breaches, ransomware campaigns, and dark web markets. With a background in incident response, Jennifer provides unique insights into how organizations respond to cyber attacks and the evolving tactics of threat actors. Her reporting has covered major breaches affecting millions of users and has helped organizations understand emerging threats. Jennifer combines technical knowledge with investigative journalism to deliver in-depth coverage of cybersecurity incidents.
