Cloud Network Engineer II
Microsoft
Compensation
About the role
The High Performance Computing and Artificial Intelligence (HPC and AI) team is focused on building the next-generation distributed artificial intelligence supercomputer. Our goal is to enable breakthroughs in artificial intelligence by delivering unmatched computational power, scalability, and reliability. We design and develop advanced infrastructure that supports high-performance model training at scale, laying the groundwork for innovations that expand the boundaries of what artificial intelligence can achieve.
Responsibilities
- Network Design & Implementation: Architect and deploy high-throughput, low-latency physical network topologies (e.g., Clos, FatTree) using technologies such as InfiniBand and Ethernet to support AI model training and HPC workloads.
- Infrastructure Automation: Develop and maintain automation frameworks for provisioning, validating, and monitoring physical network infrastructure at scale, ensuring consistency and reliability across data centres.
- Operational Readiness: Serve as a Designated Responsible Individual (DRI) for physical network systems—monitoring health, responding to incidents, performing root-cause analysis, and driving improvements in availability and observability.
- Tooling & Instrumentation: Build and integrate tooling for telemetry, diagnostics, and performance tuning of physical network components, enabling real-time visibility into link health, congestion, and jitter.
- Cross-Functional Collaboration: Partner with hardware engineering, DataCentre operations, and software-defined networking teams to ensure seamless integration of physical and logical network layers.
- Documentation & Standards: Own the documentation of physical network designs, cabling standards, and deployment procedures. Lead design reviews and ensure alignment with compliance and safety standards.
- Innovation & Research: Stay current with advancements in optical networking, high-speed interconnects, and AI/HPC fabric technologies. Evaluate and integrate emerging solutions to improve scalability, efficiency, and performance.
Requirements
- Master's Degree in Electrical Engineering, Optical Engineering, Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in network design, development, and automation
- Bachelor's Degree in Electrical Engineering, Optical Engineering, Computer Science, Information Technology, or related field AND 2+ years technical experience in network design, development, and automation
- 1+ year of experience designing, deploying, and supporting data center and backbone networks for distributed computing platforms such as artificial intelligence and machine learning clusters, high-performance computing systems, or hyperscale data centers
- 1+ year of experience with network performance tuning (latency, jitter, throughput optimization) and hands-on experience with telemetry and observability tools for physical infrastructure
- 1+ year of experience with Optical networking, high-speed interconnects (e.g., InfiniBand, Ethernet, NVLink), and fabric orchestration in large-scale environments and
- Network automation frameworks, structured cabling standards, and tools for link validation, diagnostics, and monitoring
Benefits
- 401k matching
- Health insurance
- Flight privileges
- Discounts on products and services
- Savings and investments
- Maternity and paternity leave
- Generous time away
- Giving programs
- Opportunities to network and connect
About the Company
Microsoft is committed to empowering every person and every organization on the planet to achieve more. As employees, we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day, we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Job Details
Salary Range
Salary not disclosed
Location
Multiple, United States, U.S.
Employment Type
Full-Time
Original Posting
View on company website