| Project | Focus | Technical Highlights |
|---|---|---|
| Agent Gatsby | Local-first GenAI pipeline | Deterministic state machine, quote verification, bounded expansion loops, hallucination control, multilingual output |
| Labels On Tap | AI verification & compliance | Local-first ML compliance engine using OCR/NLP, deterministic matching, FastAPI, Docker, and human-in-the-loop review |
| LexiChess | LLM/agent benchmarking | Reproducible chess-agent evaluation with legal-move validation, SQLite storage, ratings, PGN exports, and live broadcast tooling |
| TopicMiner | Deterministic NLP classification | Classical NLP and ML toolkit for high-volume, auditable email classification and topic discovery |
| AdaptiveLASSO | Statistical computing | Python package implementing adaptive LASSO with a Statsmodels-style workflow, tests, documentation, and demo notebook |
| iSignify | Bioinformatics & private AI | Local/on-device AI interpretation with custom k-mer comparison for genomic signature discovery |
A local-first AI pipeline built around deterministic workflow control. The project demonstrates how GenAI systems can be constrained, verified, and structured to reduce hallucination risk while still producing useful multilingual output.
Key themes: GenAI orchestration, deterministic verification, quote validation, local-first AI, bounded generation loops.
A local-first AI verification and compliance system that combines deterministic string matching, OCR/NLP, and human-in-the-loop review. The project demonstrates how AI can assist regulatory triage while preserving auditability.
Key themes: compliance automation, OCR/NLP, FastAPI, Docker, deterministic matching, human review.
A chess tournament and evaluation platform for benchmarking LLMs, chess agents, and custom engines. The project emphasizes reproducibility, legal-move validation, rating updates, persistent storage, and live interaction.
Key themes: LLM evaluation, agent benchmarking, SQLite, FastAPI, PGN exports, live broadcast tooling.
A modular Python toolkit for deterministic, high-volume email classification using classical NLP and machine learning methods. It is designed as an auditable alternative to LLM-based classification for narrow-taxonomy tasks.
Key themes: LDA, Word2Vec, Doc2Vec, K-Means, Random Forest, XGBoost, deterministic NLP.
A statistical computing project implementing adaptive LASSO in Python with a clean workflow inspired by Statsmodels. The project shows applied statistical modeling, packaging, testing, documentation, and reproducibility.
Key themes: regression, shrinkage, statistical computing, Python packaging, reproducible modeling.
A bioinformatics and AI pipeline focused on local/on-device interpretation and custom k-mer comparison for identifying unique genomic signatures across large datasets.
Key themes: private AI, bioinformatics, k-mer algorithms, local inference, computational efficiency.
Contributing to production-grade Generative AI systems and building modular Python pipeline components for enterprise AI delivery.
Recent work includes object-oriented pipeline architecture, complex data ingestion, Pydantic validation, production Python development, unit testing, and peer-reviewed code delivery.
Led and supported projects involving LLM/RAG interfaces, email classification, audit automation, executive briefings, forecasting, and Python ETL systems.
This work combined statistical modeling, automation, technical leadership, and communication with senior federal stakeholders.
Built forecasting systems, workforce planning models, Python statistical tooling, and automated executive reporting workflows.
Work included time-series forecasting, statistical learning, internal Python tool development, and applied statistics training.
Developed computer vision models, real-time predictive modeling systems, API normalization workflows, and statistical process analysis tools.
Work included CNN image classification, sensor-data modeling, API-driven normalization, ANOVA/Tukey testing, SQL, and time-series forecasting.
Python · SQL · Linux · GitHub · GitLab · VS Code · JupyterLab
Pandas · NumPy · Scikit-Learn · Statsmodels · PyTorch · XGBoost · Random Forests · LDA · Word2Vec · Doc2Vec
LLMs · RAG · Agentic Workflows · FastAPI · Docker · Pydantic · SQLite · Local-First AI · Human-in-the-Loop Review
Time Series · SARIMAX · LOWESS · Regression · Classification · Multivariate Analysis · Forecasting · Shrinkage Methods
Unit Testing · Code Review · Modular Architecture · ETL Automation · Data Validation · Reproducible Workflows · Documentation
I am especially interested in AI and data systems that are:
- Useful in real operational settings
- Reviewable and explainable
- Reliable under messy real-world data
- Statistically defensible
- Testable and maintainable
- Production-ready rather than demo-only
My preferred approach is pragmatic: use GenAI where it adds value, use classical ML where it is stronger, and use deterministic validation wherever reliability matters.
Please use the contact and social links in my GitHub profile sidebar.
GitHub: github.com/AaronNHorvitz
LinkedIn: linkedin.com/in/aaron-horvitz-a666215
Resume: Download Resume
