Microsoft’s vision is to democratize AI to enable every person and every organization on the planet to achieve more. With a growth mindset, we innovate to empower others, collaborate toward shared goals, and uphold respect, integrity, and accountability—fostering a culture of inclusion where everyone can thrive.
The Azure MultiModal Intelligence (MMI) team drives the development of advanced cognitive services across documents, video, image, and audio, powering scenarios such as retrieval-augmented generation (RAG), robotic automation processing (RAP), knowledge retrieval, agentic services and many more. Within this broad charter, we are seeking engineers to focus on advancing document intelligence—building best-in-class cloud and on-premises solutions that leverage deep learning and Large Language Models (LLMs) to help businesses automate document processing intelligently with AI.
Come join a creative and dedicated team of engineers! You’ll gain first-hand experience building AI products that are compliant, secure, reliable, and high-performing—serving millions of requests worldwide.
To learn more about our team, check out this document.

