Senior Cloud Site Reliability Engineer - SMTS
athenahealth
Bengaluru, Karnataka, India
Full-time
Posted Oct 22, 2025
About the role
Join us as we work to create a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for all.
Responsibilities
- Define, measure, and maintain SLOs and SLIs for cloud services and infrastructure.
- Lead efforts to improve system availability, fault tolerance, and disaster recovery.
- Ensure proactive incident detection, root cause analysis, and timely resolution.
- Participate in a 24x7 on-call rotation.
- Drive automation to reduce manual intervention in cloud infrastructure management.
- Implement IaC using tools like Terraform, AWS CloudFormation, and Ansible.
- Automate deployment, scaling, and monitoring processes.
- Design and implement monitoring, logging, and alerting solutions.
- Use observability tools (e.g., Prometheus, Grafana, CloudWatch) for performance insights.
- Identify and resolve performance bottlenecks.
- Build cloud infrastructure with security best practices and compliance in mind.
- Collaborate with security teams to implement controls and mitigate risks.
- Conduct regular audits for vulnerabilities and compliance gaps.
- Partner with development, DevOps, and operations teams to align infrastructure with business needs.
- Mentor junior engineers and promote a culture of operational excellence.
- Serve as a technical point of contact for infrastructure-related issues.
- Lead incident response for cloud infrastructure issues.
- Conduct post-incident reviews and implement preventive measures.
- Continuously improve incident management processes.
Requirements
- 5–9 years of hands-on experience with cloud automation and configuration tools (e.g., Terraform, CloudFormation, Ansible) in a hybrid cloud setup.
- 4+ years in SRE, Infrastructure Engineering, or DevOps roles.
- Deep expertise in AWS services (e.g., EC2, S3, Lambda) and Kubernetes.
- Proficiency in scripting/programming (e.g., Python, Go, Bash).
- Experience with observability tools (e.g., Prometheus, Grafana, Datadog, ELK).
- Familiarity with CI/CD pipelines and cloud-native development practices.
- Strong experience managing production environments in AWS, GCP, or Azure.
- Knowledge of cloud-native architectures, microservices, and containerization (Kubernetes, Docker).
- Proven ability to build scalable, fault-tolerant systems.
- Solid understanding of cloud networking, storage, compute, and security best practices.
Benefits
- 401k matching
- Health insurance
- Commuter support
- Employee assistance programs
- Tuition assistance
- Employee resource groups
- Collaborative workspaces
About the Company
athenahealth is a progressive & innovative U.S. health-tech leader, delivering cloud-based solutions that improve clinical and financial performance across the care continuum.
Job Details
Salary Range
Salary not disclosed
Location
Bengaluru, Karnataka, India
Employment Type
Full-time