Back to all jobs
Microsoft logo

Senior ML Research Engineer – LLM Quantization & Model Optimization

Microsoft

Mountain View, California, U.S.
Full-Time
Posted Sep 03, 2025
Up to 50% work from home

Compensation

Loading salary analysis...

About the role

Join the Strategic Planning and Architecture (SPARC) team within Microsoft’s Azure Hardware Systems and Infrastructure (AHSI) organization, the team behind Microsoft’s expanding Cloud Infrastructure and for powering Microsoft’s “Intelligent Cloud” mission.

Responsibilities

  • Design and develop novel quantization techniques to enable efficient deployment of LLM inference and training in Microsoft’s Azure production environments.
  • Drive software development and model optimization tooling proof-of-concept effort to streamline deployment of quantized models.
  • Analyze performance bottlenecks in state-of-the-art LLM architectures and drive performance improvements.
  • Prototype and evaluate emerging low-precision data formats through proof-of-concept implementations.
  • Co-design model architecture optimized for low-precision deployment in close collaboration with companywide AI teams.
  • Work cross-functionally with data scientists and ML researchers/engineers to align on model accuracy and performance goals.
  • Partner with hardware architecture and AI software framework teams to ensure end-to-end system efficiency.

Requirements

  • Doctorate in relevant field OR equivalent experience.
  • 4+ years of combined experience, including 2+ years of industry experience in low-precision model optimization and quantization for LLM workloads.

Benefits

  • Health insurance
  • 401k matching
  • Flight privileges
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect

About the Company

Microsoft delivers more than 200 online services to more than one billion individuals worldwide and AHSI is the team behind our expanding cloud infrastructure.

Job Details

Salary Range

$119,800 - $234,700/yearly

Location

Mountain View, California, U.S.

Employment Type

Full-Time

Original Posting

View on company website
Create resume for this position