Security Researcher, Agentic AI Threats
OpenAI
ABOUT THE TEAM
Preparedness is a critical Safety Research team at OpenAI, which is focused on mitigating AI threats to global security https://openai.com/index/updating-our-preparedness-framework/ that could scale to an extreme level of severity.
Our work involves:
1. Measurement. Monitoring and predicting the evolving capabilities of frontier AI systems.
2. Mitigation. Keeping misuse safeguards, alignment tools, and security measures on track to adequately address extreme threats that might arise in the future.
3. Coordination. Setting mitigation targets by maintaining OpenAIβs preparedness framework https://openai.com/index/updating-our-preparedness-framework/, and partnering with other staff to achieve these targets.
This is urgent, fast-paced work that has far-reaching implications for the company and for society.
ABOUT THE ROLE
As AI agents become more capable at software engineering, and automate more of our internal work, they could become a dangerous cyber threat. People in this role will help OpenAI prepare for security threats from advanced AI agent insiders.
IN THIS ROLE, YOU WILL:
- Identify paths by which capable future internal AI agents could compromise OpenAI.
- Design security controls - focusing on measures with long lead times that benefit from advanced preparation.
- Stress-test defenses with AI agent evaluations and penetration tests
YOU MIGHT THRIVE IN THIS ROLE IF YOU:
- Are deeply technical across security and modern infrastructure, and are comfortable digging into the details of operating systems, cloud, containers, CI/CD, or distributed systems.
- Have strong software engineering skills and enjoy building prototypes yourself.
- Are interested in engaging with stakeholders and can do so effectively.
- Bonus: have experience securing cloud infrastructure, and are deeply familiar with core components of the AI stack.
Compensation Range: $293K - $405K USD
About OpenAI
OpenAI is an AI research and deployment co...
Share this job: