Khaled Al-Hafnawi

Site Reliability Engineer specializing in cloud infrastructure, incident management, and self-healing systems that guarantee 99%+ availability.

Areas of Expertise

Cloud Infrastructure

AWS, GCP, Terraform, VMware

CI/CD & Automation

GitHub Actions, Jenkins, Python/Bash

Container Orchestration

Docker, Kubernetes, Service Mesh

SRE & Observability

Grafana, SLI/SLO, Incident Management

Certifications

Google Cloud Kubernetes (Jan 2025)

Cluster management & distributed systems

GitHub Foundations (2025-2028)

Advanced CI/CD workflows