We are searching for a senior DevOps Engineer who will be responsible for the architectural design, implementation, and maintenance of our cloud-native infrastructure. You will lead the transition toward a GitOps model, ensuring that our Kubernetes ecosystems are resilient, our CI/CD pipelines are seamless, and our services are backed by proactive monitoring and observability.
Key Responsibilities
1. Kubernetes & Orchestration
- Cluster Management: Architect and manage production-grade K8s clusters (EKS, GKE, or On-prem).
- Service Mesh & Networking: Implement and maintain service meshes (e.g., Istio, Linkerd) to manage service-to-service communication, security, and observability.
- Workload Optimization: Manage Helm charts, Kustomize configurations, and resource limits to ensure cost-effective scaling.
2. Advanced Linux Administration
- System Hardening: Secure Linux environments through kernel tuning, SSH hardening, and regular patching.
- Performance Tuning: Deep dive into system bottlenecks using tools like eBPF, top, iostat, and netstat.
- Automation: Replace manual sysadmin tasks with high-level Python or Go automation.
3. Pipeline Integration (CI/CD)
- End-to-End Automation: Integrate security (DevSecOps), automated testing, and artifact management (JFrog/Artifactory) into Jenkins, GitLab CI, or GitHub Actions.
- GitOps Delivery: Drive deployment automation using ArgoCD or Flux to ensure the cluster state matches the Git repository.
- Release Strategies: Implement Blue/Green and Canary deployment patterns to minimize downtime during service updates.
4. Monitoring & Observability
- Full-Stack Visibility: Build and maintain the O11y stackPrometheus for metrics, Grafana for visualization, and ELK/Loki for logs.
- Proactive Alerting: Define SLIs/SLOs and configure smart alerting to reduce alert fatigue and catch issues before they impact users.
- Tracing: Integrate distributed tracing (e.g., Jaeger) to debug complex microservice interactions