Job Summary
We are seeking a skilled Linux System Administrator to manage and scale our enterprise Linux estate. You will ensure stability, security, and performance of RHEL-based platforms, automate operational tasks, and support business-critical workloads. Candidates with advanced certifications (RHCE/RHCA) will additionally drive architecture, automation at scale, and platform engineering initiatives.
Key Responsibilities
Core Responsibilities
Install, configure, and maintain RHEL 9/10 and related Linux distributions.
- Manage users, groups, permissions, and centralized authentication (IdM/LDAP/AD via SSSD/realmd).
- Configure and support network services (DNS, DHCP, NTP, SSH, firewall) and troubleshoot TCP/IP issues.
- Implement patch management, vulnerability remediation, and security hardening (CIS/SCAP, SELinux).
- Manage storage (LVM, XFS/ext4, iSCSI, NFS, SMB) and backup/restore procedures.
- Provision systems via Kickstart and Red Hat Satellite; maintain content views and lifecycle environments.
- Monitor and tune performance (CPU, memory, IO, network) using native tools (top, vmstat, sar, ss, iostat).
- Automate tasks with Bash and Ansible (playbooks, roles, inventories).
- Maintain documentation, runbooks, and SOPs.
- Participate in on-call rotation and incident/problem management (ITIL).
Advanced Responsibilities
- Design secure, scalable RHEL platform architecturesHA, DR, and multi-site patterns.
- Architect multi-location Satellite + Capsule topology; optimize content sync, PXE/Kickstart flows, and remote execution at scale.
- Build enterprise Ansible stacks (Collections, AWX/Automation Controller/Tower); implement config-as-code, policy-as-code, and GitOps workflows.
- Lead hardening baselines (STIG/CIS), SELinux policy tuning, file integrity, certificate automation (IdM/ACME), and vulnerability mgmt integration.
- Define SLIs/SLOs, implement logging/metrics/tracing (e.g., RHEL Performance Co-Pilot, Prometheus), and conduct capacity planning.
- Design secure container hosts (Podman/OpenShift edge), image pipelines, and hybrid integrations (VMware/KVM, AWS/Azure).
- Implement access control (HBAC), sudo policies, MFA, and periodic access reviews, champion audit readiness.
- Own DR runbooks, chaos/game days, backup strategy validation, and RTO/RPO adherence.
- Establish coding standards, reusable Ansible roles, golden images, guardrails, and mentor L1/L2 engineers.
Required Skills & Qualifications
- 510 years (Senior) Linux administration experience.
- Strong knowledge of system, SELinux, SSSD/realmd, and firewall.
- Proven experience with Ansible, Satellite, Kickstart, LVM, and networking.
- Familiarity with virtualization (KVM/VMware) and exposure to cloud (AWS/Azure/OpenStack).
- Scripting: Bash (required), Python (preferred).
- Certifications: RHCSA/RHCE preferred; RHCA or Red Hat Specialist exams are a plus.