its-vivek-goel/README.md

Vivek Kumar Goel 👋

Site Reliability Engineer | Cloud Infrastructure | Automation Enthusiast

LinkedIn Portfolio Email YouTube




👨‍💻 About Me

Hi, I'm Vivek Goel, a passionate Site Reliability Engineer with experience building scalable, resilient, and secure infrastructure in production. I specialize in:

  • 🔧 Automating infrastructure using Terraform
  • ☁️ Designing cloud-native solutions on Azure & AWS
  • 🐧 Mastering Linux for performance and security
  • 💾 Optimizing database reliability with MariaDB replication and deployment automation
  • 🔍 Building observability with Prometheus, Grafana, and alerting pipelines

🛠️ Tech Toolbox

My Skills


🚧 Recent Highlights

  • 🏗️ Designed and implemented a secure, scalable Hub‑Spoke architecture with integrated disaster recovery for 10+ production applications, using firewalls, private endpoints, and automated failover to optimize network traffic and ensure business continuity.
  • ⚙️ Automated infrastructure provisioning with Terraform by creating reusable modules to standardize deployments across multiple environments, reducing setup time by 50%.
  • 🔐 Configured the Azure Virtual Network Gateway (VNG) to establish secure connectivity between on‑premises data centers and Azure networks.
  • 📊 Deployed and scaled observability stacks (Prometheus, Grafana, and Loki), enabling 24/7 monitoring, real‑time alerting, and faster incident resolution across production workloads.
  • 🚦 Configured and managed NGINX for high‑performance web traffic handling, optimizing load balancing and security, and built custom dashboards for real‑time analytics on 4xx/5xx status codes.
  • 💾 Automated the setup of a multi‑master MariaDB Galera Cluster via SaltStack and implemented a robust failover strategy to ensure high availability and data consistency.
  • 🤝 Collaborated with development teams to define and implement SLIs, SLOs, and SLA alerts, increasing service reliability and operational transparency.
  • 🚨 Participated in on‑call rotations, performed incident response and root cause analysis, and guided restoration efforts for critical service‑impacting events.

🧠 Currently Exploring

  • 🔍 SRE practices at scale: error budgets, SLIs, SLOs
  • 🤖 AI for infrastructure monitoring and log analysis
  • 📦 Kubernetes & service mesh (Istio)

✨ Fun Facts

  • 🧩 I love chess, traveling, and cooking new dishes
  • 🎮 I unwind with online games and Linux CLI experiments
  • 🧭 My goal: Become a top-tier SRE and give back to the tech community

📬 Reach Me

Want to collaborate or just say hi? Feel free to reach out:

📧 vkg78854@gmail.com
💬 DM on LinkedIn


📌 Let's Build Something That Never Fails

"Systems break. What defines an SRE is what happens next."

Popular repositories

  1. its-vivek-goel.github.io (Public, CSS)
  2. its-vivek-goel (Public)
  3. jarvis (Public, Python)
  4. node-express-realworld-example-app (Public, JavaScript). Forked from gothinkster/node-express-realworld-example-app
  5. react-redux-realworld-example-app (Public, JavaScript). Forked from gothinkster/react-redux-realworld-example-app. Exemplary real world application built with React + Redux
  6. django-realworld-example-app (Public, Python). Forked from gothinkster/django-realworld-example-app