Hi, I'm Vivek Goel, a passionate Site Reliability Engineer with experience building scalable, resilient, and secure infrastructure in production. I specialize in:
- 🔧 Automating infrastructure using Terraform
- ☁️ Designing cloud-native solutions on Azure & AWS
- 🐧 Mastering Linux for performance and security
- 💾 Optimizing database reliability with MariaDB replication and deployment automation
- 🔍 Building observability with Prometheus, Grafana, and alerting pipelines
- 🏗️ Designed and implemented a secure, scalable Hub‑Spoke architecture with integrated disaster recovery capabilities, optimizing network traffic and ensuring business continuity for 10+ production applications using firewalls, private endpoints, and automated failover mechanisms.
- ⚙️ Automated infrastructure provisioning with Terraform by creating reusable modules to standardize deployments across multiple environments, reducing setup time by 50%.
- 🔐 Configured the Azure Virtual Network Gateway (VNG) to establish secure connectivity between on‑premises data centers and Azure networks.
- 📊 Deployed and scaled observability stacks—Prometheus, Grafana, and Loki—enabling 24/7 monitoring, real‑time alerting, and faster incident resolution across production workloads.
- 🚦 Configured and managed NGINX for high‑performance web traffic handling, optimizing load balancing and security, and built custom dashboards for real‑time analytics on 4xx/5xx status codes.
- 💾 Automated the setup of a multi‑master MariaDB Galera Cluster via SaltStack and implemented a robust failover strategy to ensure high availability and data consistency.
- 🤝 Collaborated with development teams to define and implement SLIs, SLOs, and SLA alerts, increasing service reliability and operational transparency.
- 🚨 Participated in on‑call rotations, performed incident response and root cause analysis, and guided restoration efforts for critical service‑impacting events.
- 🔍 SRE practices at scale — error budgets, SLIs, SLOs
- 🤖 AI for infrastructure monitoring and log analysis
- 📦 Kubernetes & service mesh (Istio)
- 🧩 I love chess, traveling, and cooking new dishes
- 🎮 I unwind with online games and Linux CLI experiments
- 🧭 My goal: Become a top-tier SRE and give back to the tech community
Want to collaborate or just say hi? Feel free to reach out:
📧 vkg78854@gmail.com
💬 DM on LinkedIn
"Systems break. What defines an SRE is what happens next."

