Model Policy

OpenAI

🌍 North America 🏠 Remote ⏱ Part-time 💼 Mid-level 🗓 6 days ago

About the Team

Our Safety Systems team (https://openai.com/safety/safety-systems) is at the forefront of OpenAI's mission to build and deploy safe AGI, driving our commitment to AI safety and fostering a culture of trust and transparency.

Within Safety Systems, the Model Policy team aligns model behavior with desired human values and norms. We co-design policy with models and for models by driving rapid policy taxonomy iteration based on data and defining evaluation criteria for foundational models’ ability to reason about safety.

About the Role

If you have specific expertise or a specialty related to this work, please note it in your resume, cover letter, or application note.

Frontier AI systems are expanding what people can do across domains, creating both enormous opportunities and difficult safety questions: when should a model help, when should it refuse, and how do we make those boundaries clear enough to train, evaluate, and enforce?

In this role, you will help define how OpenAI’s models should behave in high-risk or high-ambiguity contexts, such as agentic systems, multimodal systems, user safety, privacy, and other emerging risk domains.

This is an ideal role for someone who can move across unfamiliar topics, reason from first principles, and turn ambiguity into practical model behavior. You will work closely with research, engineering, product, preparedness, and operations teams to build policies that are technically grounded, measurable, and responsive to real-world risk.

In this role, you will:

- Design and maintain model policies across safety-relevant domains, including dual-use, agentic, and emerging frontier-risk areas.

- Translate risk and harm models into clear behavioral specifications, evaluation criteria, grading guidance, and system-level safeguards.

- Define practical boundaries between beneficial uses of AI and assistance that could materially enable harm, exploitation, misuse, or unsafe outcomes.

- Build po...
