Researcher, Automated Red Teaming

OpenAI - San Francisco, California, United States

Posted Feb 23, 2026

Benefits

Parental leave: Not verified
Non-birth-parent leave: Not verified
Family-building benefits:
Fertility benefits: Not verified
Adoption assistance: Not verified
Surrogacy assistance: Not verified
Mental health support: Not verified
Relocation assistance: Not verified
Childcare support: Not verified
Learning budget: Not verified
Verification: Not verified
Salary: Not verified
401(k) match: Not verified


Schedule

Shift type: Not verified
Weekend work: Not verified

Application

Cover letter: Not verified
Assessment: Not verified
Deadline: Not stated

Where they hire

State eligibility is not yet verified.

About this role

About the team

Preparedness is a critical Safety Research team at OpenAI, which is focused on mitigating AI threats to global security that could scale to an extreme level of severity. Our work involves:

- Measurement. Monitoring and predicting the evolving capabilities of frontier AI systems.
- Mitigation. Keeping misuse safeguards, alignment tools, and security measures on track to adequately address extreme threats that might arise in the future.
- Coordination. Setting mitigation targets by maintaining OpenAI's preparedness framework, and partnering with other staff to achieve these targets.

This is urgent, fast-paced work that has far-reaching implications for the company and for society.

About the role

This role leads the Automated Red Teaming (ART) effort: building scalable, research-driven systems that continuously uncover failure modes in our models and safeguards, and translate those findings into actionable, production-facing improvements. The goal is to reduce expected harm by finding the highest-leverage, least-covered weaknesses early and reliably.

In this role, you'll:

- Own the research and technical direction for automated red teaming across catastrophic risk areas, with an initial emphasis on:
  - Automated classifier jailbreak discovery (cyber and bio).
  - Automated bio threat-development elicitation (worst-feasible planning uplift).
  - CoT monitoring evasion probing (and adjacent loss-of-control evaluations).
- Partner closely with:
  - Vertical risk teams (Cyber, Bio, Loss of Control) to define threat models, prioritize targets, and land mitigations.
  - The Classifiers team to turn discovered attacks into training data, evals, and measurable robustness gains.

Read the full description at jobs.ashbyhq.com. FewerJobs shows a source-linked preview and links to the original posting.

Apply at jobs.ashbyhq.com

Apply link not verified; last-live date unavailable.

What verified means

Verified means a displayed claim has a recorded source field, a source URL when available, and a timestamp showing when FewerJobs checked or enriched the evidence.

Related jobs

Security Coordinator 4 (12675-1. 15471-1. 13771-1)
Northrop Grumman - United States-Utah-Roy

Loan Servicing Representative
AXOS Financial INC - Las Vegas, NV

Staff Test Conductor
Northrop Grumman - United States-California-Palmdale

Off Premise Specialist
Constellation Brands - 2 Locations
