Member of technical staff - Research - Agent
H Company
About H:
H exists to push the boundaries of superintelligence with agentic AI. By automating complex, multi-step tasks typically performed by humans, AI agents will help unlock full human potential.
H is hiring the world’s best AI talent, seeking those who are dedicated as much to building safely and responsibly as to advancing disruptive agentic capabilities. We promote a mindset of openness, learning, and collaboration, where everyone has something to contribute.
About the Team: The Agent team defines new learning algorithms and agent paradigms to push the frontiers of agentic systems. We build upon foundation models and reinforcement learning to develop new approaches to train artificial general agents and work closely with the LLM/VLM and Safety teams to explore new directions.
This is a heavily engineering-focused role embedded within the research team. You will be responsible for defining the architecture and building the robust, scalable systems that underpin our research efforts. Your work will translate cutting-edge research concepts into high-performance, production-quality platforms, enabling the next generation of agentic AI.
Key Responsibilities:
- Research & Leadership: Design and develop new agents and propose new research directions, e.g., combining state-of-the-art RL with foundation models (LLMs/VLMs).
- Algorithm & Systems Design: Design, implement, and scale complex, high-performance systems for training large-scale agents. This covers the foundational infrastructure as well as novel algorithms, reward models, and sophisticated training environments.
- Research-to-Production: Collaborate closely with researchers and engineers to implement, test, and productionize new agent logic, learning algorithms, and system architectures.
- Evaluation & Reliability: Create, manage, and scale massive benchmarks and evaluation systems to rigorously track agent capabilities. You will own system reliability, scalability, and observability for our ...