The role will play a pivotal role in ensuring the operational excellence of containerized services within a service provider environment. The position is responsible for leading the deployment, management, and optimization of OpenShift and OpenShift AI platforms, including the implementation of enterprise container platforms. The successful candidate will drive continuous improvement in service delivery, automation, and high availability, directly impacting customer satisfaction and overall organizational performance.
Key Responsibilities:
- Led deployment, administration, and optimization of OpenShift, OpenShift AI, and Kubernetes platforms, ensuring operational availability and performance.
- Oversee implementation and lifecycle management of LLMs on OpenShift AI environments, including upgrades and enhancements.
- Fulfill service requests, manage platform upgrades, and drive automation initiatives for enhanced service delivery and efficiency.
- Monitor compliance with strict service level agreements (SLAs), proactively identifying and mitigating risks to operational continuity.
- Collaborate with senior administrators, IT support teams, and security personnel to resolve complex incidents and optimize platform performance.
- Establish escalation paths for issues beyond standard troubleshooting, ensuring timely resolution and adherence to SLA commitments.
- Documentation (operational guides, process improvements, and architecture).
- The position requires close collaboration with customers, solution architects, senior administrators, IT support teams, and security personnel to achieve operational objectives.
- The Container Team Lead will guide team members, foster knowledge sharing, and ensure alignment with organizational goals for service excellence.
Experience and Job Specific Skills:
- Bachelors degree in Information Technology or related field.
- Should have 8 to 10 years of experience with OpenShift and OpenShift AI, including advanced container orchestration and platform maintenance.
- Demonstrated ability to deploy, manage, and optimize large language models (LLMs) on containerized platforms.
- Strong understanding of automation tools and methodologies for operational optimization.
- Proven track record in service provider environments with stringent SLAs and high-availability requirements.
- Excellent leadership, communication, and problem-solving skills, with a focus on operational excellence and cross-functional collaboration.
- Relevant certifications in OpenShift, container technologies, or cloud-native platforms are highly desirable.
- This role operates within a service provider background, where strict SLAs and operational availability are paramount.
- The Container Team Lead will be instrumental in maintaining and improving service quality through proactive platform management, incident resolution, and continuous automation.