Specializing in Scalable AI Agents, Multi-Agent Systems, and Production MLOps.
I am a Senior Machine Learning Engineer focused on bridging the gap between cutting-edge research and scalable production systems. My expertise lies in designing robust Multi-Agent Architectures (BMAD), deploying Vision-Language-Action (VLA) models, and implementing enterprise-grade MLOps pipelines.
I specialize in transitioning prototypes to production:
- Methodology: Pioneering Spec-Driven Development for AI Agents and Multi-Agent Design Patterns.
- Inference Scale: Migrating from prototypes (Ollama) to high-throughput product serving (vLLM, Continuous Batching, PagedAttention).
- Infrastructure: Architecting resilient microservices with Spring AI, Spring Cloud Gateway, and Kubernetes.
| Domain | Advanced Technologies |
|---|---|
| Agentic AI | Spec-Driven Development, Multi-Agent Design Patterns, BMAD, LangGraph |
| Model Serving | vLLM (Production), Continuous Batching, PagedAttention, TensorRT-LLM |
| Vision & Action | VLA Models, SSD Architectures, Computer Vision Ops |
| Microservices | Spring AI, Spring Cloud Gateway, PGVector, Confluence API Integration |
| MLOps & Infra | Kubeflow, Terraform, Prometheus, Grafana, AIOps, Docker, Kubernetes |
| Core AI Stack | PyTorch, TensorFlow, ROS2, LangChain, Ollama (Prototyping) |
I actively engage with the AI community to stay at the bleeding edge of technology.
| Platform | Communities / Channels |
|---|---|
| r/MachineLearning, r/LocalLLaMA, r/DataEngineering, r/Robotics | |
| Medium | Towards Data Science, Towards AI, Analytics Vidhya |
| Foundations | Hugging Face Open Source, LangChain Community, EleutherAI |
| Project | Architectural Highlights | Tech Stack |
|---|---|---|
| Medical Fundus RAG | Explainable Medical AI. Eye disease diagnosis using RAG with BiomedCLIP & Qdrant. | BiomedCLIP, Qdrant, Gemini Pro, Gradio |
| Awesome Agent RAG LMMs Apps | Agentic RAG System. Designed to orchestrate complex retrieval workflows. | Python, LangChain, RAG, Agents |
| Ollama Chat App | Private LLM Deployment. Local inference optimization for privacy-first AI. | Python, Streamlit, Ollama, LocalLLM |
| ROS Based Package | Robotics Middleware. Modular ROS2 packages for autonomous navigation. | C++, ROS2, Gazebo, SLAM |
| ChatGPT Discord Bot | Scalable Microservice. Event-driven bot architecture. | Go, OpenAI API, async/await |
Let's build something amazing together! π¦Ύ
