Research Intern - Cloud Reliability & Efficiency

Microsoft hybrid • Redmondintern

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment.

 

In M365 Research, we are dedicated to pioneering advancements in Artificial Intelligence (AI) and Systems, driving the transfer of innovative technologies into our products, establishing Microsoft’s leadership in technical domains and enhancing community engagement. We closely collaborate with multiple research teams and product groups across the globe who bring a multitude of technical expertise in machine learning, cloud systems and software engineering. We communicate our research both internally and externally through peer-reviewed scientific publications, open-source releases, blog posts, patents, and industry conferences. 

 

For this position, you will have a background in Systems research and the motivation and ambition to apply this to production systems. Some of the research problems we are currently working on are: improving reliability and observability of Agentic Systems, workload-aware placement of compute resources, holistic characterization and modelling of workloads, fault injection for improving workload reliability and mining dependency graphs for web-scale systems.