Mission
The SysOps/SRE team is responsible for provisioning, operating, and administering the infrastructure of the Valeo tools and applications used by R&D engineers across all Valeo sites worldwide (hosted on AWS or any other cloud provider).
The team is also responsible for implementing and configuring the monitoring solutions required for these tools and applications.
Responsibilities
- Build the highly secure and scalable infrastructure and applications required for the R&D tool chain at Valeo
- Operate the previously mentioned tool chain applications (Infrastructure & Application Operations)
- Promote, document, and implement systems infrastructure best practices
- Assist in solving technical problems when they arise
- Ensure the implementation of agreed architecture and infrastructure
- Address technical concerns, ideas and suggestions
- Build the required monitoring for infrastructure and networking metrics, as well as the required functional monitoring (on Prometheus & Grafana) for all tool chain applications
- Act on the alerts raised by this monitoring to meet the committed level of operational excellence
- Fine-tune and configure the system by adding or removing infrastructure resources to achieve the best cost efficiency for the provisioned infrastructure
- Identify weak areas in the running CI pipeline and provide solutions
- Raise a flag on any production-down incident and take the needed actions when applicable
- Approve new CI/CD deployments to production after verifying the level of standardization, and ensure CI/CD compliance with security policies
- Respond to issues through the ticketing system and provide end-to-end resolution within the defined SLA
- Report a set of defined KPIs to assess platform health and service level
- Contribute effectively to the continuous improvement of his/her project, team, and work environment by submitting improvement proposals whenever possible
- Participate effectively in all standard team meetings
- Show a can-do attitude and provide the needed support to his/her colleagues
Qualifications/Technical Skills Required
- Very good knowledge of Linux administration
- Very good knowledge of AWS and cloud operations
- Very good knowledge of Terraform
- Very good knowledge of Docker
- Very good knowledge of Jenkins (from the administrative perspective, not pipeline development)
- Knowledge of Ansible, Windows administration, and K8s is preferred
- Knowledge of any monitoring or log analysis tools (e.g. Prometheus & Grafana, Splunk, ELK, Datadog) is preferred
- Knowledge of any scripting language is preferred
- English language proficiency is a must
- 3-4 years of professional experience is required
- Candidates should mainly be graduates of computer engineering, computer science, or communication engineering