Software Engineer II
Microsoft
Multiple Locations, United States, U.S.
Full-Time
Posted Sep 30, 2025
0 days / week in-office - remote
Compensation
Loading salary analysis...
About the role
The Azure Compute team builds a fault-tolerant, distributed system on top of commodity datacenter hardware to deliver infrastructure for hosting cloud applications in virtual machines (VMs). The team creates the illusion that resources are limitless, infinitely elastic, and always available.
Responsibilities
- Partners with appropriate stakeholders spanning across teams and orgs to determine project requirements
- Leads the design and architecture of change management features and services in Azure Compute
- Leverages expertise with appropriate stakeholders to develop project plans, release plans, and work items.
- Develops high quality, extensible, maintainable code and coaches others to do the same.
- Supports livesite as Designated Responsible Individual (DRI), mentoring engineers across products/solutions, working on-call to monitor system/product/service for degradation, downtime, or interruptions.
- Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale and shares knowledge with other engineers.
- Collaborates with data scientists and ML engineers to design and integrate predictive models that proactively detect hardware anomalies and trigger live migrations, improving VM uptime and SLA compliance.
- Leads initiatives to embed AI-driven diagnostics and root cause analysis into availability services, reducing time-to-resolution for incidents and improving operational efficiency.
- Drives the adoption of generative AI tools to automate documentation, incident summaries, and engineering workflows, enhancing team productivity and knowledge sharing.
- Partners with platform teams to build intelligent observability pipelines that leverage anomaly detection and trend analysis for early warning systems.
- Evaluates and integrates large-scale AI models into control plane services to enable smarter, context-aware repair decisions across millions of Azure VMs.
Requirements
- Bachelor's Degree in Computer Science or related technical field AND 3+ years technical engineering experience with coding in languages including, but not limited to, Rust, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
Benefits
- Discounts on products and services
- Educational resources
- Generous time away
- Giving programs
- Industry leading healthcare
- Maternity and paternity leave
- Opportunities to network and connect
- Savings and investments
About the Company
Microsoft’s mission is to empower every person and every organization on the planet to achieve more.
Job Details
Salary Range
Salary not disclosed
Location
Multiple Locations, United States, U.S.
Employment Type
Full-Time
Original Posting
View on company website