Safaricom is looking for an Engineer - Infrastructure SRE to ensure the reliability, availability, and performance of infrastructure platforms. This role involves implementing Site Reliability Engineering (SRE) principles to manage production infrastructure through automation and code. You will design, build, and operate hybrid cloud environments while maintaining strict adherence to company safety and integrity policies.
Responsibilities
- Uphold the company code of conduct, policies, and procedures, ensuring integrity and accountability.
- Adhere to safety, health, and wellbeing policies, guidelines, and procedures in all actions and decisions.
- Implement and support SLIs, SLOs, and error budgets for assigned platforms and services.
- Monitor platform health, availability, latency, and error rates; participate in on-call rotations, incident response, and major incident recovery.
- Design and implement end-to-end infrastructure automation across on-premise data centers, private cloud, and public cloud environments.
- Build and maintain Infrastructure as Code (IaC) using Terraform, Ansible, and Helm. Automate infrastructure provisioning, scaling, patching, recovery, and decommissioning.
- Develop scripts and tooling (Bash, Python, PowerShell) to reduce manual operational tasks and contribute to self-healing and auto-remediation workflows.
- Engineer and operate on-premise infrastructure including virtualization, compute, storage, backup, and network platforms.
- Engineer and operate hybrid cloud environments, ensuring seamless integration between data centers and public cloud platforms.
- Engineer and operate infrastructure across AWS, Azure, GCP, and OCI under defined enterprise standards.
- Support Kubernetes platforms (EKS/AKS/GKE/OpenShift), including upgrades, scaling, and reliability tuning.
- Support DevSecOps practices by integrating security checks into pipelines.
- Assist with DR/BCP testing, backup validation, and recovery procedures.
Qualifications and Requirements
- Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related technical field.
- Proven hands-on experience supporting production infrastructure and cloud platforms.
- Strong automation mindset with a demonstrated reduction of manual operational tasks.
- Experience working within ITIL / DevOps / SRE operating models.
Preferred Certifications (Added Advantage)
- Cloud Associate or Professional certifications (AWS, Azure, or GCP).
- Kubernetes certifications (CKA / CKAD).
- Linux certifications (RHCSA / RHCE).
- DevOps or SRE-related certifications.
How to Apply
Interested and qualified candidates should apply online via the Safaricom recruitment portal on Oracle Cloud by clicking the following link: https://www.myjobmag.co.ke/apply-now/1175736. Ensure your application is submitted by the deadline on March 17, 2026.