Skip to content
View ABL4Z3's full-sized avatar

Highlights

  • Pro

Block or report ABL4Z3

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ABL4Z3/README.md

Hi there, I'm Aayush Kumar πŸ‘‹

AI/ML Engineer β€’ LLM Systems β€’ Efficient AI Infrastructure

I am an AI/ML engineer and researcher focused on building efficient, scalable, and practical AI systems. My interests lie at the intersection of Large Language Models (LLMs), transformer optimization, retrieval systems, and real-world AI infrastructure.

Recently, I have been exploring how modern AI systems can become more compute-efficient and accessible under constrained hardware environments. My research project, NeuroCache, focuses on memory-efficient transformer training through budget-constrained activation offloading and system-level optimization techniques.

Beyond research, I build applied AI systems involving:

  • Retrieval-Augmented Generation (RAG)
  • LangChain & Agentic Workflows
  • AI Automation Pipelines
  • Speech & Multimodal AI Systems
  • Scalable FastAPI-based AI backends

Portfolio web:- Link


πŸ›  Technical Stack

AI / Machine Learning

  • PyTorch
  • TensorFlow
  • Scikit-learn
  • Hugging Face Transformers
  • LangChain
  • LangGraph
  • RAG Pipelines

Systems & Backend

  • FastAPI
  • Docker
  • REST APIs
  • Async Python
  • MongoDB
  • Vector Databases

Research Interests

  • Efficient Transformer Training
  • LLM Optimization
  • Memory-Aware AI Systems
  • Quantization & PEFT
  • Scalable AI Infrastructure
  • Multimodal AI

πŸ“‚ Selected Projects

🧠 NeuroCache

Research project focused on memory-efficient transformer training under low-VRAM GPU environments using controlled activation offloading strategies.

Research Paper:- DOI: 10.13140/RG.2.2.11793.39526

πŸ€– Trading Swarm

Production-oriented AI trading infrastructure with modular strategy execution, risk-aware automation, and Railway deployment support.

πŸ’Ό Interview AI

LLM-powered interview platform capable of question generation, response evaluation, and structured AI-driven feedback workflows.

Live:- https://interview-ai-frontend-e5o26ydjv-aayush-kumars-projects-1ae66488.vercel.app/

πŸŽ™ Voice Emotion Assistant

Multilingual speech emotion detection and empathetic response system integrating speech processing and LLM workflows.


⚑ Current Focus

  • Efficient training systems for Large Language Models
  • Transformer optimization under constrained hardware
  • Research in scalable and practical AI infrastructure
  • Exploring next-generation AI system architectures

Pinned Loading

  1. SmartMLPipeline SmartMLPipeline Public