Skip to content
View dylan-almonte's full-sized avatar

Block or report dylan-almonte

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
dylan-almonte/README.md

πŸ‘‹ Hey, I'm Dylan Cabahug-Almonte

B.S. Electrical & Computer Engineering @ Carnegie Mellon University
Minor in Machine Learning | Class of 2026


⚑ About Me

πŸŽ“ I'm a systems and ML engineer passionate about building efficient, scalable, and intelligent infrastructure β€” from distributed protocols and real-time kernels to deep learning frameworks and generative models.

πŸ’‘ Currently, I’m a Research Assistant at CMU, where I:

  • 🧩 Develop LoRA-based fine-tuning pipelines for compact open-weight LLMs (LLaMA 3 8B)
  • 🧠 Design dynamic model routing systems using contextual bandit algorithms
  • πŸ“Š Benchmark fine-tuning efficiency across adapter architectures

πŸ’Ό Previously interned at:

  • Capital One (Summer 2025) – built scalable performance-testing infrastructure for 15+ microservices (10K+ TPS, 5 ms latency)
  • Ansys (Spring 2025) – optimized CMake builds and CI pipelines for mechanical simulation software
  • Efficio (Summer 2024) – designed cloud-cost optimization tools reducing deployment costs by 23%

πŸ› οΈ Tech Stack



πŸš€ Featured Projects

🧠 Mini Deep Learning Framework (Python + C++)

A PyTorch-like mini framework built from scratch featuring:

  • Reverse-mode autograd and dynamic computation graphs
  • Custom CUDA kernels for tensor ops β†’ 20Γ— faster than CPU
  • Modular API for model definition and training

🧩 Technologies: Python, C++, CUDA, NumPy, PyTorch Autograd internals

⛏️ Distributed Bitcoin Miner (Go)
  • Designed the Live Sequence Protocol (LSP) for reliable UDP communication
  • Implemented concurrency with goroutines + channels for scalable task distribution
  • Added fault-tolerant load balancing and miner reassignment logic

🧩 Technologies: Go, Networking, UDP, Concurrency, Distributed Systems

⏱️ Real-Time Operating System (C + ARM Assembly)
  • Built a real-time kernel supporting context switching, task scheduling, and mutexes
  • Implemented rate-monotonic scheduling (RMS) with schedulability checks
  • Designed thread control blocks (TCBs) for multi-threading management

🧩 Technologies: C, ARM Cortex-M, Embedded Systems, RTOS Design

πŸŒ€ Denoising Diffusion Probabilistic Model (PyTorch)
  • Implemented forward + reverse diffusion processes using cosine variance scheduling
  • Trained U-Net-based DDPM on Animal Faces-HQ for image generation
  • Evaluated using FrΓ©chet Inception Distance (FID) and visualized training convergence

🧩 Technologies: PyTorch, Diffusion Models, Generative AI, Deep Learning


πŸ“Š GitHub Stats

GitHub Streak

🎯 What I'm Working On

  • πŸ”¬ Research on efficient LoRA fine-tuning and adapter composition at CMU
  • 🧠 Expanding my custom DL framework with quantization + distributed training
  • ☁️ Experimenting with cloud-native LLM deployment pipelines using Docker + AWS

πŸ’¬ Let's Connect!

If you’re interested in ML systems, low-level optimization, or AI infrastructure,
feel free to reach out β€” I love collaborating and discussing new ideas!

πŸ“« [email protected]
πŸ’Ό linkedin.com/in/dcalmonte

Popular repositories Loading

  1. poker-ai poker-ai Public

    C++ 2

  2. hack112 hack112 Public

    hack 112

    Python

  3. 15112_Poker112 15112_Poker112 Public

    Python

  4. poker_ai poker_ai Public

    Forked from fedden/poker_ai

    πŸ€– An Open Source Texas Hold'em AI

    Python

  5. AWAP2024 AWAP2024 Public

    Python

  6. minidb minidb Public

    minidb