Researcher, Automated Red Teaming
OpenAI - San Francisco, California, United States
Posted Feb 23, 2026
Benefits
- Parental leave
- Not verified
- Non-birth-parent leave
- Not verified
- Family-building benefits
-
- Fertility benefits: Not verified
- Adoption assistance: Not verified
- Surrogacy assistance: Not verified
- Mental health support
- Not verified
- Relocation assistance
- Not verified
- Childcare support
- Not verified
- Learning budget
- Not verified
- Verification
- Not verified
- Salary
- Not verified
- 401(k) match
- Not verified
Was this benefit information wrong? Tell us.
Schedule
- Shift type
- Not verified
- Weekend work
- Not verified
Application
- Cover letter
- Not verified
- Assessment
- Not verified
- Deadline
- Not stated
Where they hire
State eligibility is not yet verified.
About this role
Researcher, Automated Red Teaming San Francisco, California, United States About the team Preparedness is a critical Safety Research team at OpenAI, which is focused on mitigating AI threats to global security that could scale to an extreme level of severity. Our work involves: - Measurement. Monitoring and predicting the evolving capabilities of frontier AI systems. - Mitigation. Keeping misuse safeguards, alignment tools, and security measures on track to adequately address extreme threats that might arise in the future. - Coordination. Setting mitigation targets by maintaining OpenAI's preparedness framework , and partnering with other staff to achieve these targets. This is urgent, fast-paced work that has far-reaching implications for the company and for society. About the role This role leads the Automated Red Teaming (ART) effort: building scalable, research-driven systems that continuously uncover failure modes in our models and safeguards, and translate those findings into actionable, production-facing improvements. The goal is to reduce expected harm by finding the highest-leverage, least-covered weaknesses early and reliably. In this role, you'll: - Own the research and technical direction for automated red teaming across catastrophic risk areas, with an initial emphasis on: - Automated classifier jailbreak discovery (cyber and bio). - Automated bio threat-development elicitation (worst-feasible planning uplift). - CoT monitoring evasion probing (and adjacent loss-of-control evaluations). - Partner closely with: - Vertical risk teams (Cyber, Bio, Loss of Control) to define threat models, prioritize targets, and land mitigations. - The Classifiers team to turn discovered attacks into training data, evals, and measurable robustness gains.
Read the full description at jobs.ashbyhq.com. FewerJobs shows a source-linked preview and links to the original posting.
Apply link not verified; last-live date unavailable.
What verified means
Verified means a displayed claim has a recorded source field, a source URL when available, and a timestamp showing when FewerJobs checked or enriched the evidence.
Related jobs
-
Security Coordinator 4 (12675-1. 15471-1. 13771-1)
Northrop Grumman - United States-Utah-Roy
-
Loan Servicing Representative
AXOS Financial INC - Las Vegas, NV
-
Staff Test Conductor
Northrop Grumman - United States-California-Palmdale
-
Off Premise Specialist
Constellation Brands - 2 Locations