This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Security represents the most critical priorities for our customers in a world awash in digital threats, regulatory scrutiny, and estate complexity. Microsoft Security aspires to make the world a safer place for all. We want to reshape security and empower every user, customer, and developer with a security cloud that protects them with end to end, simplified solutions. The Microsoft Security organization accelerates Microsoft’s mission and bold ambitions to ensure that our company and industry is securing digital technology platforms, devices, and clouds in our customers’ heterogeneous environments, as well as ensuring the security of our own internal estate. Our culture is centered on embracing a growth mindset, a theme of inspiring excellence, and encouraging teams and leaders to bring their best each day. In doing so, we create life-changing innovations that impact billions of lives around the world. The Microsoft AI Red Team is an interdisciplinary group of security experts, adversarial ML researchers, and software engineers with the mission of proactively identifying failures in Microsoft’s AI systems before they impact customers. Within the AI Red Team, our Tooling group builds platforms and developer experiences that enable teams across Microsoft to evaluate AI-powered systems at scale. Our team is the home of PyRIT (https://github.com/Azure/PyRIT ), an open-source framework used for AI risk identification and evaluation. We are expanding this space by building a new platform that enables product teams to run system-level evaluations of agentic applications and model-enabled experiences inside normal engineering workflows. We are looking for a Senior Software Engineer to help build this new system-level evaluation platform for agentic applications and model-enabled experiences. These systems increasingly involve multi-step reasoning, tool and API use, multimodal inputs, and memory. In this role, you will lead architecture and technical direction, drive key design decisions, and deliver a platform that integrates into engineering workflows and pipelines to test and measure a broad set of risks and harms.
Job Responsibility:
Lead system design and architecture
author and drive design reviews to ensure solutions meet security, privacy, compliance, scalability, and reliability requirements.
Establish engineering best practices and standards for code quality, testing, reproducibility, performance, and operational excellence.
Lead incident retrospectives and drive systemic improvements through root cause analysis, prevention mechanisms, and reliability investments.
Define success and guardrail metrics
drive instrumentation and feedback loops enabling continuous improvement and high-quality ship decisions.
Foster cross-team alignment with partner engineering teams and stakeholders to ensure broad adoption and clear integration paths.
Requirements:
Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR equivalent experience.
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role.
Microsoft Cloud Background Check: This position will be required to pass the Microsoft background and Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Nice to have:
Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR equivalent experience.
2+ years of experience designing, building, and operating scalable, highly available cloud services or distributed systems on platforms such as Azure, AWS, GCP, or comparable cloud environments, with production ownership and CI/CD pipeline integration.
1+ years of experience leading design and architecture for a product area or subsystem, including authoring design documents, driving design reviews, and ensuring solutions meet security, reliability, and performance requirements.
3+ years of experience designing, developing, or maintaining secure software systems, with applied knowledge of authentication, data protection, access control, and secure coding practices.
1+ years experience with generative AI or agentic systems, such as LLM-based applications, tools/function calling, retrieval/memory, or multimodal pipelines, especially in the context of evaluation, testing, or safety/security.
Experience applying distributed systems concepts such as concurrency, conflict resolution, and consensus algorithms to the design of resilient and maintainable back-end architectures.
Demonstrated experience designing and delivering platforms or frameworks that are adopted by multiple independent product teams (e.g., shared services, SDKs, internal developer platforms, or evaluation frameworks).