This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
The Azure Core Insights team is a growing Agile team seeking a passionate candidate who possesses both machine learning and data science expertise, along with development skills, and a strong interest in building Artificial Intelligence for IT Operations (AIOps) solutions to address the unique challenges in cloud environments and drive the next generation of cloud infrastructure. As a member of our team, you will help design and implement anomaly detection, automatic triaging and correlation, and causal inference models to deliver preventive insights that improve the availability, reliability, and efficiency of the Azure cloud system.
Job Responsibility:
Share accountability of a wide array of assets and be comfortable with learning a broad array of technologies
Independently design and implement anomaly detection, auto-triaging/correlation, and causal inference model to deliver preventive insights to improve Azure cloud system availability, reliability, and efficiency
Work with partner teams to integrate the Insights into Azure daily dev operations and Azure system for automatic mitigation and repairs
Contribute towards driving visibility into customer impacting on Virtual Machines or Containers or higher-level Azure services built on top of Virtual Machines
Assist with building an automated data quality solution to detect problems in downstream dependencies and take automated action to correct them
Look for opportunities to share learnings and tools broadly within Microsoft and beyond
Requirements:
Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR equivalent experience
4+ years in anomaly detection algorithm design or implementations experience
2+ years of familiarity with open source machine learning library such as Scikit-Learn, Pandas, Seaborn, and/or similar
2+ years of experience with AI Agent framework and machine learning models or LLM models such as linear/nonlinear regression, Bagged and Boosted Trees, Bayes methods, Transformer, and/or similar
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
Nice to have:
Bachelor's Degree in Computer Science OR related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, OR Python
OR Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR equivalent experience
4+ years of basic compute science knowledge (Concepts such as Central Processing Unit (CPU), memory, Top of Rack (ToR) switch, load balancer (LB), virtual network (VNet), and virtual local area network (VLAN), etc.), and serverless architectures and other cloud architectural patterns
4+ years of expertise in Prompt Engineering in AI Agent Framework, AI Agent deployment, large language model and GPT
4+ years of proficiency in data visualization tools such as PowerBI, Networkx or similar