Senior DevOps Engineer
Location: Remote
Type: Full-time
Role Overview
We are seeking a Senior DevOps Engineer to design, deploy, and operate enterprise-grade DevOps and platform engineering solutions.
The role includes ownership of the architecture, deployment, and lifecycle management of GitLab platforms, as well as the design and operation of Kubernetes platforms, including Google Kubernetes Engine (GKE).
The engineer will apply Infrastructure as Code principles to deliver consistent, repeatable deployments across on-premises, cloud, and hybrid environments, and will support large-scale enterprise and multinational organizations with high availability, security, and operational excellence requirements.
Key Responsibilities
Gitlab Platform Engineering
- Design, deploy, and operate GitLab platforms across on-premises, cloud, and hybrid environments.
- Own the end-to-end lifecycle of GitLab installations, including installation, configuration, upgrades, scaling, and maintenance.
- Design resilient and scalable architectures to support large development teams and CI/CD workloads.
- Integrate GitLab with identity providers, external databases, storage systems, and other enterprise services.
- Define backup, disaster recovery, availability, and operational standards for GitLab platforms.
- Define availability models, backup strategies, and disaster recovery approaches.
Kubernetes & GKE Platform Engineering
- Design, deploy, and operate Kubernetes platforms, including Google Kubernetes Engine (GKE).
- Implement Kubernetes best practices such as namespace isolation, RBAC, workload identity, autoscaling, and resource governance.
- Design and manage service exposure, ingress/egress, and traffic management.
- Support production workloads running on Kubernetes and lead troubleshooting efforts.
- Establish operational standards, monitoring, and upgrade strategies for Kubernetes platforms.
Infrastructure as Code & Automation
- Design and maintain Infrastructure as Code using Terraform.
- Standardize platform deployments through reusable modules and automation.
- Ensure consistency across development, staging, and production environments.
- Detect and remediate configuration drift and operational risks.
Reliability, Observability & Operations
- Implement monitoring, logging, and alerting for platforms and workloads.
- Participate in incident response, root-cause analysis, and continuous improvement activities.
- Contribute to capacity planning, performance tuning, and cost optimization initiatives.
- Maintain runbooks, operational documentation, and platform standards.
Security & Best Practices
- Apply security best practices across platforms, infrastructure, and automation.
- Enforce least-privilege access, secrets management, and secure configuration baselines.
- Support security reviews, audits, and remediation activities.
- Collaborate with security teams to align platforms with enterprise policies.
Collaboration & Leadership
- Work closely with developers, cloud architects, and security teams.
- Support enterprise and multinational customers with diverse technical and regulatory requirements.
- Mentor junior engineers and contribute to team knowledge sharing.
- Participate in architecture reviews and technical decision-making.
What is Expected from the Candidate
- Strong ownership and accountability for platform stability and quality.
- Ability to work independently across multiple engagement types.
- Confidence operating in complex, enterprise-scale environments.
- Proactive approach to automation, reliability, and risk reduction.
- Clear and professional communication with technical and non-technical stakeholders.
- Comfort working across different cultures, regions, and time zones.
Required Qualifications
- Bachelor's degree in Computer Science, Engineering, or equivalent practical experience.
- 7+ years of experience in DevOps, Platform Engineering, or SRE roles.
- Strong hands-on experience with:
- GitLab platform deployments
- Kubernetes platforms, including GKE
- Infrastructure as Code, including Terraform
- Linux systems and networking fundamentals
- Proven experience supporting production environments at enterprise scale.
Preferred Certifications
- GitLab Professional Services Engineer (PSE)
- Google Cloud Professional Cloud Architect
- Google Cloud Professional DevOps Engineer or equivalent
- Kubernetes certifications (CKA, CKAD, or equivalent)
Work Arrangement & Travel
- This role is remote-based, with the flexibility to work from any location, subject to business and customer requirements.
- The position requires periodic travel to customer sites
- Travel frequency may vary depending on project needs and customer engagement scope, including regional and international travel.