Come build the core of Microsoft Copilot for enterprise with Microsoft Turing Team, where you'll join a collaborative group of applied scientists and engineers pushing the frontier of large language models (LLMs) to improve the productivity of hundreds of millions of users around the world.
Our team is responsible for the core systems that power Microsoft 365 Copilot Chat delivering intelligence, quality, and transformative new features. We work at the intersection of research and engineering—advancing orchestrator reasoning, training next-generation models, and shipping impactful, model-driven experiences in Microsoft 365 Copilot.
Shape the way the world measures and trusts AI. In the Experimentation & Evaluation area, you’ll define what “good” means for Copilot, by working on metrics, scorecards and eval pipelines across offline, shadow, and online experiments. You’ll work at the intersection of science and engineering, gaining a front-row seat to how AI impacts millions of users and help steer one of Microsoft’s most important products forward. As a Senior Applied Scientist in the Experimentation & Evaluation area, you will lead the advancement of Copilot’s measurement and experimentation science. You will design and evolve evaluation frameworks across offline, shadow, and online scorecards, setting new standards for rigor, reliability, and scalability. By driving innovation in statistical analysis, machine learning, and experimentation methodology, you will generate trustworthy insights that shape product direction, influence leadership decisions, and accelerate Copilot’s impact at scale.
Our team is responsible for the core systems that power Microsoft 365 Copilot Chat delivering intelligence, quality, and transformative new features. We work at the intersection of research and engineering—advancing orchestrator reasoning, training next-generation models, and shipping impactful, model-driven experiences in Microsoft 365 Copilot.
Shape the way the world measures and trusts AI. In the Experimentation & Evaluation area, you’ll define what “good” means for Copilot, by working on metrics, scorecards and eval pipelines across offline, shadow, and online experiments. You’ll work at the intersection of science and engineering, gaining a front-row seat to how AI impacts millions of users and help steer one of Microsoft’s most important products forward. As a Senior Applied Scientist in the Experimentation & Evaluation area, you will lead the advancement of Copilot’s measurement and experimentation science. You will design and evolve evaluation frameworks across offline, shadow, and online scorecards, setting new standards for rigor, reliability, and scalability. By driving innovation in statistical analysis, machine learning, and experimentation methodology, you will generate trustworthy insights that shape product direction, influence leadership decisions, and accelerate Copilot’s impact at scale.
This position is based out of either Redmond, WA or Mountain View, CA with 3 days per week work in the office and 2 days per week work from home. Living local to the Greater Seattle Area or Bay Area is required, and relocation assistance is available.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond

