Senior Site Reliability Engineer job at Diamond Trust Bank
5 Days Ago
Linkedid Twitter Share on facebook
Senior Site Reliability Engineer
2025-11-19T07:18:55+00:00
Diamond Trust Bank
https://cdn.greatkenyanjobs.com/jsjobsdata/data/employer/comp_6560/logo/DTB.png
FULL_TIME
 
Nairobi
Nairobi
00100
Kenya
Banking
Computer & IT, Engineering, Management
KES
 
MONTH
2025-11-28T17:00:00+00:00
 
Kenya
8

Key Responsibilities:

  • Define and enforce SLOs and Error Budgets for mission-critical banking channels, ensuring compliance with CBK and business continuity directives.
  • Implement, maintain, and enhance observability stacks for traceability across inter-bank transactions and payment APIs.
  • Automate operational workflows, infrastructure provisioning, and recovery processes using Terraform, Crossplane, and ArgoCD.
  • Integrate anomaly detection insights with SIEM platforms (e.g., Sentinel) to support unified reliability-security monitoring.
  • Conduct chaos engineering and resilience testing to validate RTO/RPO and high-availability commitments.
  • Lead and document incident post-mortems, ensuring corrective actions inform continuous improvement and regulatory audit readiness.

Skills & Qualifications:

  • Bachelor’s degree in computer science, Engineering, or related field
  • 5+ years of experience managing large-scale production systems in cloud environments
  • Proven experience maintaining uptime and latency SLAs for digital banking or financial systems
  • Expert proficiency in observability tools (Dynatrace, Grafana, OTEL, Jaeger)
  • Familiarity with CBK ICT Risk Management Guidelines, Basel III Operational Risk Principles, and PCI DSS
  • Hands-on experience with Terraform, Crossplane, GitOps workflows, and automated deployment pipelines
  • Define and enforce SLOs and Error Budgets for mission-critical banking channels, ensuring compliance with CBK and business continuity directives.
  • Implement, maintain, and enhance observability stacks for traceability across inter-bank transactions and payment APIs.
  • Automate operational workflows, infrastructure provisioning, and recovery processes using Terraform, Crossplane, and ArgoCD.
  • Integrate anomaly detection insights with SIEM platforms (e.g., Sentinel) to support unified reliability-security monitoring.
  • Conduct chaos engineering and resilience testing to validate RTO/RPO and high-availability commitments.
  • Lead and document incident post-mortems, ensuring corrective actions inform continuous improvement and regulatory audit readiness.
  • Expert proficiency in observability tools (Dynatrace, Grafana, OTEL, Jaeger)
  • Familiarity with CBK ICT Risk Management Guidelines, Basel III Operational Risk Principles, and PCI DSS
  • Hands-on experience with Terraform, Crossplane, GitOps workflows, and automated deployment pipelines
  • Bachelor’s degree in computer science, Engineering, or related field
  • 5+ years of experience managing large-scale production systems in cloud environments
  • Proven experience maintaining uptime and latency SLAs for digital banking or financial systems
bachelor degree
60
JOB-691d6f5f94f26

Vacancy title:
Senior Site Reliability Engineer

[Type: FULL_TIME, Industry: Banking, Category: Computer & IT, Engineering, Management]

Jobs at:
Diamond Trust Bank

Deadline of this Job:
Friday, November 28 2025

Duty Station:
Nairobi | Nairobi | Kenya

Summary
Date Posted: Wednesday, November 19 2025, Base Salary: Not Disclosed

Similar Jobs in Kenya
Learn more about Diamond Trust Bank
Diamond Trust Bank jobs in Kenya

JOB DETAILS:

Key Responsibilities:

  • Define and enforce SLOs and Error Budgets for mission-critical banking channels, ensuring compliance with CBK and business continuity directives.
  • Implement, maintain, and enhance observability stacks for traceability across inter-bank transactions and payment APIs.
  • Automate operational workflows, infrastructure provisioning, and recovery processes using Terraform, Crossplane, and ArgoCD.
  • Integrate anomaly detection insights with SIEM platforms (e.g., Sentinel) to support unified reliability-security monitoring.
  • Conduct chaos engineering and resilience testing to validate RTO/RPO and high-availability commitments.
  • Lead and document incident post-mortems, ensuring corrective actions inform continuous improvement and regulatory audit readiness.

Skills & Qualifications:

  • Bachelor’s degree in computer science, Engineering, or related field
  • 5+ years of experience managing large-scale production systems in cloud environments
  • Proven experience maintaining uptime and latency SLAs for digital banking or financial systems
  • Expert proficiency in observability tools (Dynatrace, Grafana, OTEL, Jaeger)
  • Familiarity with CBK ICT Risk Management Guidelines, Basel III Operational Risk Principles, and PCI DSS
  • Hands-on experience with Terraform, Crossplane, GitOps workflows, and automated deployment pipelines

 

Work Hours: 8

Experience in Months: 60

Level of Education: bachelor degree

Job application procedure

Are You Interested? Click Here to Apply Now

 

All Jobs | QUICK ALERT SUBSCRIPTION

Job Info
Job Category: Engineering jobs in Kenya
Job Type: Full-time
Deadline of this Job: Friday, November 28 2025
Duty Station: Nairobi | Nairobi | Kenya
Posted: 19-11-2025
No of Jobs: 1
Start Publishing: 19-11-2025
Stop Publishing (Put date of 2030): 10-10-2076
Apply Now
Notification Board

Join a Focused Community on job search to uncover both advertised and non-advertised jobs that you may not be aware of. A jobs WhatsApp Group Community can ensure that you know the opportunities happening around you and a jobs Facebook Group Community provides an opportunity to discuss with employers who need to fill urgent position. Click the links to join. You can view previously sent Email Alerts here incase you missed them and Subscribe so that you never miss out.

Caution: Never Pay Money in a Recruitment Process.

Some smart scams can trick you into paying for Psychometric Tests.