Cloud Site Reliability Engineer (Cloud SRE)
2025-07-04T10:37:41+00:00
World Vision Kenya
https://cdn.greatkenyanjobs.com/jsjobsdata/data/employer/comp_8131/logo/wv.png
https://www.wvi.org/kenya
FULL_TIME
Nairobi
Nairobi
00100
Kenya
Nonprofit, and NGO
Computer & IT
2025-05-17T17:00:00+00:00
Kenya
8
JOB PURPOSE:
The Cloud Site Reliability Engineer (SRE) ensures the availability, latency, performance, and scalability of cloud services. This role combines software engineering and operations expertise to create reliable systems and improve incident response and observability.
KEY RESPONISBILITIES:
- Preference will be given to candidates with experience in Terraform and Azure DevOps. Familiarity with GitHub Actions, Ansible, or scripting tools such as PowerShell or Python is highly desirable.
- Security-first mindset is essential in all aspects of cloud infrastructure and operations.
- Build and maintain automation for deployment, monitoring, and self-healing infrastructure.
- Design service-level objectives (SLOs), indicators (SLIs), and monitoring dashboards.
- Respond to incidents and participate in on-call rotations.
- Continuously improve system reliability through chaos engineering, testing, and automation.
- Collaborate with developers to design for operability and scalability.
- Provide support, guidance, and collaboration to field office IT staff across all regions to ensure consistent service delivery and alignment with global standards.
- Work collaboratively within Agile teams, embracing iterative delivery, continuous improvement, and adaptive planning as part of the organization's new ways of working.
KNOWLEDGE/QUALIFICATIONS FOR THE ROLE:
- Bachelor’s degree in Computer Science, Information Systems, or a related field.
- Demonstrated proficiency in written and verbal communication in English.
- Knowledge of cloud observability tools and incident management practices.
- Strong scripting or programming skills (e.g., Python, Bash, Go).
- Problem-solving with an engineering mindset and focus on operational excellence.
- Customer-centric—ensuring systems deliver consistent, high-quality experiences globally.
KEY RESPONSIBILITIES: Provide support, guidance, and collaboration to field office IT staff across all regions to ensure consistent service delivery and alignment with global standards. Manage and maintain cloud-based databases (e.g., Azure SQL, Amazon RDS, PostgreSQL, MySQL) across production and non-production environments. Implement and monitor backup, high availability, and disaster recovery strategies. Optimize database performance through indexing, tuning, and automation. Ensure database security and compliance with organizational and regulatory requirements. Collaborate with developers, architects, and infrastructure teams to support application performance and scalability. Implement Infrastructure as Code (IaC) practices for consistent and automated database deployments. Conduct proactive monitoring and capacity planning using ITOM tools. Work collaboratively within Agile teams, embracing iterative delivery, continuous improvement, and adaptive planning as part of the organization's new ways of working.
KNOWLEDGE/QUALIFICATIONS FOR THE ROLE: Bachelor’s degree in programming/systems, computer science, information systems, or mathematics, specialized training, certification or equivalent work experience. Has a broad level of understanding surrounding information systems and application and/or web standards. Preference will be given to candidates with experience in Terraform and Azure DevOps. Familiarity with GitHub Actions, Ansible, or scripting tools such as PowerShell or Python is highly desirable. Security-first mindset is essential in all aspects of cloud infrastructure and operations. Strong understanding of IT Operations Management (ITOM) and database observability. Experience with public cloud platforms (Azure, AWS) and cloud-native database services. Problem-solving skills with a value-driven mindset, focusing on service optimization and reliability. Customer-centric approach—understanding how database performance impacts end-user experience and business outcomes. Strong scripting and automation skills (e.g., PowerShell, Terraform, Python) for DevOps integration.
KNOWLEDGE/QUALIFICATIONS FOR THE ROLE: Bachelor’s degree in programming/systems, computer science, information systems, or mathematics, specialized training, certification or equivalent work experience. Has a broad level of understanding surrounding information systems and application and/or web standards. Preference will be given to candidates with experience in Terraform and Azure DevOps. Familiarity with GitHub Actions, Ansible, or scripting tools such as PowerShell or Python is highly desirable. Security-first mindset is essential in all aspects of cloud infrastructure and operations. Strong understanding of IT Operations Management (ITOM) and database observability. Experience with public cloud platforms (Azure, AWS) and cloud-native database services. Problem-solving skills with a value-driven mindset, focusing on service optimization and reliability. Customer-centric approach—understanding how database performance impacts end-user experience and business outcomes. Strong scripting and automation skills (e.g., PowerShell, Terraform, Python) for DevOps integration.
No Requirements
JOB-6867aef551085
Vacancy title:
Cloud Site Reliability Engineer (Cloud SRE)
[Type: FULL_TIME, Industry: Nonprofit, and NGO, Category: Computer & IT]
Jobs at:
World Vision Kenya
Deadline of this Job:
Saturday, May 17 2025
Duty Station:
Nairobi | Nairobi | Kenya
Summary
Date Posted: Friday, July 4 2025, Base Salary: Not Disclosed
Similar Jobs in Kenya
Learn more about World Vision Kenya
World Vision Kenya jobs in Kenya
JOB DETAILS:
JOB PURPOSE:
The Cloud Site Reliability Engineer (SRE) ensures the availability, latency, performance, and scalability of cloud services. This role combines software engineering and operations expertise to create reliable systems and improve incident response and observability.
KEY RESPONISBILITIES:
- Preference will be given to candidates with experience in Terraform and Azure DevOps. Familiarity with GitHub Actions, Ansible, or scripting tools such as PowerShell or Python is highly desirable.
- Security-first mindset is essential in all aspects of cloud infrastructure and operations.
- Build and maintain automation for deployment, monitoring, and self-healing infrastructure.
- Design service-level objectives (SLOs), indicators (SLIs), and monitoring dashboards.
- Respond to incidents and participate in on-call rotations.
- Continuously improve system reliability through chaos engineering, testing, and automation.
- Collaborate with developers to design for operability and scalability.
- Provide support, guidance, and collaboration to field office IT staff across all regions to ensure consistent service delivery and alignment with global standards.
- Work collaboratively within Agile teams, embracing iterative delivery, continuous improvement, and adaptive planning as part of the organization's new ways of working.
KNOWLEDGE/QUALIFICATIONS FOR THE ROLE:
- Bachelor’s degree in Computer Science, Information Systems, or a related field.
- Demonstrated proficiency in written and verbal communication in English.
- Knowledge of cloud observability tools and incident management practices.
- Strong scripting or programming skills (e.g., Python, Bash, Go).
- Problem-solving with an engineering mindset and focus on operational excellence.
- Customer-centric—ensuring systems deliver consistent, high-quality experiences globally.
Work Hours: 8
Experience: No Requirements
Level of Education: bachelor degree
Job application procedure
Interested and qualified? Click here to apply
All Jobs | QUICK ALERT SUBSCRIPTION