SRE Engineer - Production Operations
Resume ready? Build an ATS-optimized one — free.
Try JotCV →Resume ready for this application?
Build an ATS-optimized resume free on JotCV →
Disney+ Hotstar is hiring an SRE Engineer to help maintain the reliability of one of the world's highest-throughput streaming platforms. During IPL matches, Hotstar sets global records for concurrent streaming, requiring extreme engineering precision under load.
You will work on Kubernetes-based infrastructure, observability systems, and automated incident response tooling that keeps 300 million users streaming seamlessly. This is a high-stakes, high-growth role in a team that has solved unprecedented scale challenges.
- Maintain SLOs for Hotstar's streaming infrastructure
- Build and improve auto-scaling systems for massive traffic spikes
- Develop chaos engineering and game day scenarios
- 4-7 years of SRE or infrastructure engineering experience
- Expert Kubernetes skills with large production clusters
- Experience designing auto-scaling systems for unpredictable traffic patterns
- Strong observability skills (Prometheus, Grafana, Datadog, Jaeger)
- Programming skills in Go or Python for tool development
- Experience with chaos engineering (LitmusChaos, Gremlin) is a plus
- Own SLOs and error budgets for critical streaming services
- Design and implement proactive auto-scaling and capacity planning
- Lead blameless post-mortems and drive long-term reliability improvements
- Build observability and alerting systems for production services
- Execute load tests and chaos engineering exercises before major events
Job Overview
Stay Ahead in the
Jobspri Market
Join 50,000+ candidates receiving weekly job alerts, interview tips, and salary insights directly from top recruiters.
By subscribing, you agree to ourTerms of ServiceandPrivacy Policy.
