Β
Site Reliability Engineer | SRE Team Lead | Cloud Native Specialist
- π Currently working as Site Reliability Engineer at Alyssum Global Services (Herzliya, Israel)
- π₯ Leading 24x7 SRE support team of 5-6 members across multi-national enterprise projects
- π Managing zero-downtime operations for:
- GigaSpaces (Israeli AI RAG Platform - Enterprise ChatGPT alternative)
- JITPS (French E-Commerce Platform - SFR-like application)
- π Expert in eBPF-based Observability, Prometheus, Grafana, and ClickHouse
- β‘ Achieving 99.9% uptime for mission-critical applications serving 1M+ daily users
- π§ Specialized in AWS, Kubernetes, Terraform, and DevSecOps
- Site Reliability Engineer - Alyssum Global Services (Feb 2025 - Present)
- DevOps Engineer - Karix Mobile Pvt. Ltd. (Jan 2023 - Aug 2024)
- AWS Certified Cloud Practitioner (Score: 755/1000)
- π¬ Ask me about SRE Practices, Kubernetes, AWS, Observability, and CI/CD
- π§ Email: sanskargupta966@gmail.com
- β‘ Fun fact: I am passionate about Fitness and building resilient systems
- π― 99.9% Uptime - Maintained across 20+ Kubernetes clusters
- π 60% MTTR Reduction - Through eBPF-based observability
- π₯ Leading 24x7 SRE Teams - Managing 5-6 engineers across global projects
- π Multi-National Projects - GigaSpaces (Israel) & JITPS (France)
- π° $250K+ Cost Savings - Via ML-enhanced alerting and resource optimization
- π 10M+ Metrics/Min - Processing with ClickHouse and Prometheus
- π€ 95% Prediction Accuracy - AI-powered failure prediction system


