Skip to content
View Emart29's full-sized avatar
☺️
+2349018909603
☺️
+2349018909603

Block or report Emart29

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Emart29/README.md

Hey, I'm Emmanuel Nwanguma 👋

Machine Learning Engineer · Data Scientist · AI Systems Builder

I don't just build models — I build systems that make decisions.

LinkedIn Medium Email


🧭 What I'm About

Most ML projects die in notebooks. Mine don't. I sit at the intersection of machine learning engineering and product thinking — taking data from raw to reliable, whether that means a RAG pipeline answering questions over complex documents, a ML model predicting health risk, fraud, or churn, or an LLM evaluation framework catching reasoning failures before they reach users. I build things that actually work outside the lab.

"A model that can't be deployed is just a very expensive experiment."


🔬 Current Focus

  • 🏗️ Building production-grade ML pipelines with observability, evaluation, and monitoring baked in
  • 🧠 Exploring LLM reasoning diagnostics — understanding why models fail, not just when
  • 🔍 Developing RAG systems that go beyond naive retrieval — semantic chunking, hybrid search, real-time ingestion
  • 📊 Automating ML quality gates to catch prompt regressions before they ship

🚀 Featured Projects

End-to-end ML system · 88.5% accuracy · FastAPI + Streamlit + SHAP + MLflow + Docker

A production-ready healthcare ML system built the right way — not just a notebook, but a deployable service with explainability (SHAP), experiment tracking (MLflow), a REST API, an interactive dashboard, and full Docker support. Built for trust, not just performance.


Research-grade diagnostics for multi-step reasoning failures in LLMs

Most LLM evals tell you a model is wrong. This framework tells you where the reasoning broke down. Designed to surface failure modes in chain-of-thought reasoning — useful for anyone building reliable LLM-powered products.


Production RAG system · FastAPI + React + ChromaDB + LLM observability

A full-stack document Q&A system with integrated LLM observability and monitoring. Goes beyond basic retrieval with real observability into how the system answers — not just what it answers.


Automated prompt regression detection · CI/CD for LLMs

A production-grade evaluation pipeline that automatically catches quality regressions before prompts reach users. Think CI/CD, but for LLM behavior — because shipping a broken prompt is just as bad as shipping broken code.


🛠️ Tech Stack

Languages     │ Python · SQL
ML & AI       │ PyTorch · scikit-learn · Hugging Face · LLMs · RAG · AI Agents
MLOps         │ FastAPI · MLflow · Docker · GitHub Actions
Data & BI     │ Pandas · NumPy · Tableau · Amazon QuickSight
Cloud         │ AWS · GCP · BigQuery
Explainability│ SHAP

📈 GitHub Activity

GitHub Stats Top Languages


🤝 Let's Build Something

I'm open to collaboration on:

  • Production ML systems — from model to API to deployment
  • AI-powered data products — real-time insights, dashboards, automation
  • LLM evaluation & reliability — making AI systems you can actually trust
  • Real-world automation — turning workflows into intelligent pipelines

If you're building something ambitious and need an ML engineer who thinks beyond the notebook, let's talk.

📬 nwangumaemmanuel29@gmail.com


"Data is the input. Decisions are the output. Everything in between is engineering."

Pinned Loading

  1. phi4-finance-finetuning phi4-finance-finetuning Public

    Fine-tuning Microsoft Phi-4 Mini 3.8B on SEC 10-K financial Q&A using QLoRA — +69% ROUGE-L over base model. Live demo on Hugging Face.

    Jupyter Notebook 2

  2. llm-quality-gate llm-quality-gate Public

    pytest for LLMs — automated quality gate that catches prompt regressions and model degradations before they reach production. CI/CD integration, multi-provider support, and a live monitoring dashbo…

    Python 1

  3. emartai/remembr emartai/remembr Public

    Persistent memory infrastructure for AI agents — semantic search, multi-tenant isolation, 8 framework adapters, Python + TypeScript SDKs.

    Python 1

  4. ecommerce-product-classifier ecommerce-product-classifier Public

    Production-ready NLP classifier: fine-tuned DistilBERT across 19 e-commerce categories with FastAPI serving, real-time drift detection via Evidently AI, and a React analytics dashboard. Fully conta…

    Jupyter Notebook 1

  5. ragwell ragwell Public

    A production-ready Retrieval-Augmented Generation (RAG) pipeline with multiple search strategies, semantic chunking, and real-time document processing.

    Python 1

  6. Heart-Disease-Prediction Heart-Disease-Prediction Public

    🫀 Production-ready ML system for heart disease risk assessment with 88.5% accuracy. Features FastAPI REST API, Streamlit dashboard, SHAP explainability, MLflow tracking, and Docker deployment. Demo…

    Jupyter Notebook 2