Role Purpose
Ensure stable, reliable, and efficient operation of cloud environments by managing production workloads, monitoring, incident response, and continuous operational improvement.
Key Responsibilities
- Operate and support cloud environments including monitoring, incident management, and problem resolution.
- Ensure backups, recovery processes, and operational controls align with business requirements.
- Perform capacity planning, performance management, and cost optimization.
- Automate operational tasks to reduce manual effort and improve consistency.
- Support application teams operating distributed cloud workloads.
- Maintain operational documentation, runbooks, and reporting.
Required Skills & Experience
- Experience in cloud operations or production support roles.
- Strong troubleshooting and incident management capabilities.
- Familiarity with monitoring, alerting, and operational tooling in cloud environments.
Behavioral Expectations
- Remains calm, structured, and decisive during incidents.
- Demonstrates strong customer and service orientation.
- Collaborates effectively across technical and business teams.
- Continuously seeks to reduce operational toil through automation.