Forward Deployed Researcher

About Us

We’re building a platform that powers the biggest AI groups in the world (including OpenAI, Anthropic, Meta, and Google) with human feedback data for evaluating and training their models. Surge was founded by former ML engineers focused on providing the highest-quality data in the industry. Instead of outsourcing to call centers overseas, we’ve built an elite workforce based in the US, custom annotation tools, and sophisticated quality control systems. Our product has been a “game-changer” for ML teams, and we’ve run a profitable business from day one without raising venture funding.

The Role

As a Forward Deployed Researcher, you’ll embed directly with leading AI labs and frontier model teams to explore how their models behave in the wild — and what it takes to make them work reliably at scale. You’ll lead hands-on research to evaluate capabilities, surface subtle failure modes, and design interventions that shape model behavior and deployment outcomes.

You won’t just run tests — you’ll uncover insights that inform the next generation of model development. This is a role for someone who thrives on close collaboration, lives for iteration, and loves turning real-world complexity into actionable breakthroughs. Your work will shape how some of the most advanced AI systems interface with the world.

What We’re Looking For

Curiosity about Model Behavior – Interest in how models reason, generalize, and fail when applied to complex real-world tasks

Experimental Rigor – Strong instincts for research design and insight generation in ambiguous settings

Customer-Centered Mindset – Excitement to embed deeply with partners, understand their goals, and build towards meaningful impact

Examples of Work You Might Do

Designing and running evaluations to probe model generalization and behavior under deployment constraints

Building data pipelines, fine-tuning workflows, or interventions to steer and improve model outputs

Partnering with internal stakeholders or external AI labs to operationalize custom frontier model use cases

Exploring and addressing failure modes in long-context reasoning, tool use, or real-time feedback loops

How to Apply

To apply, please email careers@surgehq.ai with your background and interest in collaborating with Surge. Please include the name of the role you’re applying for in the subject line or email body. We welcome personal projects and writings!
