Skip to content
View manupanand's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report manupanand

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
manupanand/README.md

πŸ‘‹ Hi there, I'm Manu P Anand

🎯 DevOps / MLOps / AIOps Engineer with over 7 years of total technical experience (2+ years dedicated DevOps & Cloud). I transitioned from a Mechanical R&D and HPC background to DevOps and AI Infrastructure engineering, blending deep analytical skills with modern platform engineering practices.


πŸš€ About Me

  • πŸ€– Hands-on MLOps and LLM infrastructure engineer β€” fine-tuned open-source LLMs (Qwen, Gemma), performed NVFP4 and Gemma4 model quantization, and deployed production AI workloads on NVIDIA DGX Blackwell (GB10) GPU servers.
  • ☸️ Expert in Kubernetes platform engineering β€” self-managed on-premises clusters (kubeadm), EKS, GKE, DMZ cluster architecture, RBAC, network policies, and production incident resolution.
  • πŸ”§ Passionate about automating everything β€” from provisioning infrastructure to setting up monitoring, CI/CD pipelines, and ML model lifecycle management.
  • 🐧 Expert in Linux systems (RHEL & Debian) with experience automating complex batch processes, HPC server management, and GPU compute resource scheduling.
  • ☁️ Skilled across cloud platforms: AWS, GCP, and Azure β€” with hands-on Terraform IaC across all three.
  • πŸ” Designed robust CI/CD pipelines using Jenkins, GitLab CI, and GitHub Actions, reducing deployment time by 60%+ and enabling 100% automated rollouts.
  • πŸ“¦ Built and deployed applications using Docker, Kubernetes (self-managed and EKS), Helm, and ArgoCD GitOps.
  • πŸ” Advocate of DevSecOps β€” integrating Vault, Trivy, SonarQube, and Istio mTLS into delivery pipelines; authored security architecture documents for enterprise clients.
  • πŸ“ˆ Built production observability stacks (Prometheus, Grafana, ELK) with SLO/SLI dashboards β€” reducing MTTD by 40% and MTTR by 35%.
  • πŸš€ Forward deployed engineer β€” visited client sites to deliver infrastructure upgrades, lead technical discussions with client leadership, and architect on-premises AI platforms.
  • πŸ› οΈ Tools I love: Terraform, Ansible, Python, Go, Bash, ArgoCD, Helm, MLflow, DVC, vLLM, Ollama

🧰 Tech Stack

  • Languages: Python, Go, Bash, Shell, JavaScript, C
  • AI / MLOps: LLM Fine-tuning (Qwen, Gemma), NVFP4 Quantization, Gemma4, vLLM, Ollama, MLflow, DVC, ClickHouse, NVIDIA GPU Scheduling, DGX Blackwell (GB10)
  • Infra as Code: Terraform, Ansible, Bicep (learning), GitOps
  • Containers: Docker, Kubernetes (On-Prem kubeadm, EKS, GKE), Helm, containerd
  • CI/CD: Jenkins, GitHub Actions, GitLab CI, ArgoCD, Cloud Build, Spinnaker
  • Monitoring / SRE: Prometheus, Grafana, ELK Stack, New Relic, AWS CloudWatch, GCP Monitoring, SLO/SLI design
  • Security: HashiCorp Vault, Istio (mTLS), Trivy, SonarQube, AWS/GCP IAM, Kubernetes RBAC, firewalld
  • Networking: HAProxy, Nginx, Calico CNI, VPC design, DNS (Route 53), Load Balancing
  • Cloud Platforms: AWS, GCP, Azure, IBM Cloud, OCI, DigitalOcean
  • Data / Streaming: Apache Kafka, MongoDB, PostgreSQL, Redis, BigQuery, ClickHouse
  • Build Tools: Maven, NPM, Uvicorn
  • Version Control: Git, GitHub, GitLab

πŸ€– MLOps / AIOps Stack

  • LLM Serving: vLLM, Ollama
  • LLM Fine-tuning: Qwen series, Gemma series (supervised fine-tuning for domain adaptation)
  • Quantization: NVFP4, Gemma4 quantization (VRAM optimization)
  • GPU Infra: NVIDIA DGX Blackwell (GB10), GPU Kubernetes node scheduling
  • ML Lifecycle: MLflow (experiment tracking), DVC (data versioning)
  • Metadata Store: ClickHouse
  • Orchestration: Kubernetes GPU workloads, resource limits and requests, node affinity
  • Frameworks: LangChain, LangGraph (learning)
  • Vision AI: NVIDIA DeepStream (hands-on learning), CCTV-based analytics platforms

"Automate what you can. Monitor what you can't. Improve what matters."


Tech-Stack

C Python Dart TypeScript Go Rust Node.js HTML CSS React Next.js Express MongoDB TailwindCSS React Native Docker Kubernetes PostgreSQL Prisma SQLAlchemy GORM Ansible AWS Linux RHEL Terraform Jenkins SonarQube JFrog Artifactory Argo CD Istio GitHub Actions HashiCorp Vault MLflow Prometheus Grafana Helm GitLab CI Ollama NVIDIA GraphQL LaTeX Cloudflare Vercel Bootstrap Django FastAPI Flask Flutter jQuery JWT Socket.io Nginx AmazonDynamoDB Neo4J SQLite Canva Figma Pandas NumPy Postman ElasticSearch Notion Jira LangChain LangGraph FOSS


Certifications

AWS Certified DevOps Engineer Professional AWS Certified Cloud Practitioner IBM DevOps Certified Google SRE


πŸ“« Let's Connect!

🌐 Portfolio Website

LinkedIn

Outlook Twitter

Feel free to connect with me on LinkedIn.

Fun Fact

I enjoy exploring new technologies and building cool projects in my free time.

Hobbies

Web development, Robotics, Aquarist.

πŸ“˜ My Resume

Resume

πŸ“Š GitHub Stats


πŸ“ˆ Contribution Graph

Activity Graph


πŸ† GitHub Trophies


Thanks for visiting my GitHub profile! 😊


Sponsor

Pinned Loading

  1. manupanand-freelance-developer/kubernetes-cluster-infra-aws manupanand-freelance-developer/kubernetes-cluster-infra-aws Public

    Self-hosted Kubernetes on AWS EC2 using Terraform, Ansible, and Vault with kubeadm for secure and automated cluster provisioning.

    HCL

  2. manupanand-freelance-developer/kubernetes-cluster-selfmanged manupanand-freelance-developer/kubernetes-cluster-selfmanged Public

    Self-hosted Kubernetes on AWS EC2 using Terraform, Ansible, and Vault with kubeadm for secure and automated cluster provisioning.

    HCL

  3. gcp-personal-infra/multi-cloud-kubernetes-infra-automation gcp-personal-infra/multi-cloud-kubernetes-infra-automation Public

    Infrastructure-as-Code project to provision and manage Kubernetes clusters across AWS (EKS),Azure (AKS),IBM Cloud,OCI and GCP (GKE) using Terraform, with Python-based automation scripts, containeri…

    Shell

  4. manupanand-freelance-developer/mlops-pipeline-prjt-20-2025 manupanand-freelance-developer/mlops-pipeline-prjt-20-2025 Public

    This repository provisions the foundational cloud infrastructure and deploys the core open-source MLOps tool stack on AWS Elastic Kubernetes Service (EKS) and EC2 instances.

    HCL

  5. manupanand-freelance-developer/seclm-log-threat-detection manupanand-freelance-developer/seclm-log-threat-detection Public

    Fine-tuned Qwen3-8B for cybersecurity log analysis and threat detection. Classifies security events, maps to MITRE ATT&CK, extracts IOCs, and recommends response actions. LoRA/QLoRA training on a s…

  6. llm-quantization-playbook llm-quantization-playbook Public

    Production-grade LLM quantization and deployment pipeline for NVIDIA GPU infrastructure. Covers FP8 (L4, H100) and NVFP4 (GB200) precision targets using TensorRT-LLM and NVIDIA ModelOpt.

    Python