About the Role
Lead SRE-driven operations, ITIL governance, and mission-critical platform reliability for enterprise cloud systems.
What You'll Do
- Own 24×7 production operations
- Define SLIs, SLOs, and error budgets
- Lead major incident management
- Implement monitoring, alerting & tracing standards
- Improve MTTR & MTTD
- Drive automation-first operations
- Implement Incident, Problem, and Change Management processes
- Define SLAs and ensure compliance
- Maintain service catalogues & SOPs
- Build and lead production support teams
- Define KPIs and shift models
- Drive continuous improvement
What We're Looking For
- 15+ years in IT Operations / SRE / Service Management
- Cloud expertise (AWS / Azure / GCP)
- Microservices & SaaS platform experience
- Strong ITIL process knowledge
Nice to Have
- ITIL v4 certification
- Cloud certifications (AWS / Azure / GCP)
- SRE / DevOps certifications
Apply for this Role
DepartmentEngineering
LocationHybrid – India
TypeFull-time
Experience15+ years
Or email careers@aeternoconsulting.com