Forward Deployed Researcher
Surge
Posted on Jul 22, 2025
Forward Deployed Researcher
About Us
We’re building a platform that powers the biggest AI groups in the world — including OpenAI, Anthropic, Meta, and Google — with human feedback data for evaluating and training their models. Surge was founded by former ML engineers focused on providing the highest quality data in the industry. Instead of outsourcing to call centers overseas, we’ve built an elite workforce based in the US, custom annotation tools, and sophisticated quality control systems. Our product has been a “game-changer” for ML teams, and we’ve run a profitable business from day one without raising venture funding.
The Role
As a Forward Deployed Researcher, you’ll embed directly with leading AI labs and frontier model teams to explore how their models behave in the wild — and what it takes to make them work reliably at scale. You’ll lead hands-on research to evaluate capabilities, surface subtle failure modes, and design interventions that shape model behavior and deployment outcomes.
You won’t just run tests — you’ll uncover insights that inform the next generation of model development. This is a role for someone who thrives on close collaboration, lives for iteration, and loves turning real-world complexity into actionable breakthroughs. Your work will shape how some of the most advanced AI systems interface with the world.
What We’re Looking for
Curiosity about Model Behavior – Interest in how models reason, generalize, and fail when applied to complex real-world tasks
Experimental Rigor – Strong instincts for research design and insight generation in ambiguous settings
Customer-Centered Mindset – Excitement to embed deeply with partners, understand their goals, and build towards meaningful impact
Examples of Work You Might Do
Designing and running evaluations to probe model generalization and behavior under deployment constraints
Building data pipelines, fine-tuning workflows, or interventions to steer and improve model outputs
Partnering with internal stakeholders or external AI labs to operationalize custom frontier model use cases
Exploring and addressing failure modes in long-context reasoning, tool use, or real-time feedback loops
How to Apply
To apply, please email careers@surgehq.ai with your background and interest in collaborating with Surge. Please include the name of the role you’re applying for in the subject line or email body. We welcome personal projects and writings!