Senior ML Research Engineer – LLM Quantization & Model Optimization
Microsoft
                            
                            Mountain View, California, U.S.
                        
                        
                            
                            Full-Time
                        
                        
                            
                            
                                Posted Sep 03, 2025
                            
                        
                    Up to 50% work from home
                        
                    Compensation
                                    
                                    Loading salary analysis...
                                
                            About the role
Join the Strategic Planning and Architecture (SPARC) team within Microsoft’s Azure Hardware Systems and Infrastructure (AHSI) organization, the team behind Microsoft’s expanding Cloud Infrastructure and for powering Microsoft’s “Intelligent Cloud” mission.
Responsibilities
- Design and develop novel quantization techniques to enable efficient deployment of LLM inference and training in Microsoft’s Azure production environments.
- Drive software development and model optimization tooling proof-of-concept effort to streamline deployment of quantized models.
- Analyze performance bottlenecks in state-of-the-art LLM architectures and drive performance improvements.
- Prototype and evaluate emerging low-precision data formats through proof-of-concept implementations.
- Co-design model architecture optimized for low-precision deployment in close collaboration with companywide AI teams.
- Work cross-functionally with data scientists and ML researchers/engineers to align on model accuracy and performance goals.
- Partner with hardware architecture and AI software framework teams to ensure end-to-end system efficiency.
Requirements
- Doctorate in relevant field OR equivalent experience.
- 4+ years of combined experience, including 2+ years of industry experience in low-precision model optimization and quantization for LLM workloads.
Benefits
- Health insurance
- 401k matching
- Flight privileges
- Discounts on products and services
- Savings and investments
- Maternity and paternity leave
- Generous time away
- Giving programs
- Opportunities to network and connect
About the Company
Microsoft delivers more than 200 online services to more than one billion individuals worldwide and AHSI is the team behind our expanding cloud infrastructure.
Job Details
Salary Range
$119,800 - $234,700/yearly
Location
Mountain View, California, U.S.
Employment Type
Full-Time
Original Posting
View on company website