duongduc388222/README.md


😎 About me:

  • 🎓 Sophomore at Grinnell College, pursuing a B.A. in Computer Science & Mathematics (Concentration: Statistics)
  • 📊 Seeking roles in quantitative research, data engineering, and AI/ML
  • 💻 I was a Data Engineer and AI Engineer at Gtel Data Research Group in Summer 2025, and an NLP Intern at Data Glacier in Fall 2025.
  • 🌱 Learning diffusion models, reinforcement learning, and LLM fine-tuning
  • 👯 Open to collaborating on quant research, machine learning/computer vision projects, and solving sudoku problems
  • 💬 Ask me about machine learning, deep learning architectures, or just life in general
  • 📝 Portfolio & Blog: https://ducduong-portfolio.vercel.app/
  • ⚡ Fun fact: I love dabbling in variant Sudoku, badminton, and soccer

📫 Reach me at:


🎯 Hobbies & Interests

  • 🧩 Cracking the Cryptic is the best YouTube channel in the world.
  • ⚽ Born to play soccer but peaked at 🏸 badminton
  • 📖 Reading AI/ML research papers, quant finance literature, and manga

⚡ GitHub Stats


🛠️ Languages & Tools

Python

C++

C#

C

Java

Scala

R

SQL

JavaScript

HTML

CSS

MATLAB

Bash

Linux

Git

Docker

Kubernetes

AWS

GCP

Azure

Flask

React

Node.js

TensorFlow

PyTorch

Scikit-learn

Hugging Face

Pandas

NumPy

Airflow

BigQuery

Visual Studio Code

Google Colab

Jupyter Notebook



🚀 Featured Projects

🗺️ Spatial and Demographic Effects on Theft Distribution in Los Angeles.

🔗 Sponsored by American Statistical Association (ASA) & CAUSE. | Dec. 2025

Analyzed theft patterns across Los Angeles using the 2020 LAPD dataset to understand how spatial and demographic factors affect theft distribution.

  • Applied nested logistic regression models with predictors such as population size, density, victim age, sex, and race.
  • Found population density to be the strongest negative predictor of theft, while demographic analysis showed older victims and women were slightly more likely to be targeted.
  • Highlighted racial differences in exposure to theft vs. violent crimes.
  • Work was recognized nationally, earning 1st Prize in the USPROC Introductory Statistics Class Project competition.
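A minimal sketch of how nested logistic models can be compared, assuming "nested" here means each model adds predictors to the previous one. The data is synthetic, the names `density` and `theft` are hypothetical, and the hand-rolled `fit_logistic` helper is only for illustration; none of this is the project's actual code or the LAPD dataset.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(X, y, lr=0.1, steps=5000):
    """Fit logistic regression by batch gradient ascent.

    Each row of X must already contain a leading 1.0 for the intercept.
    Returns (weights, log_likelihood).
    """
    n, k = len(X), len(X[0])
    w = [0.0] * k
    for _ in range(steps):
        grad = [0.0] * k
        for row, target in zip(X, y):
            p = sigmoid(sum(wj * xj for wj, xj in zip(w, row)))
            for j in range(k):
                grad[j] += (target - p) * row[j]
        w = [wj + lr * gj / n for wj, gj in zip(w, grad)]
    log_lik = 0.0
    for row, target in zip(X, y):
        p = sigmoid(sum(wj * xj for wj, xj in zip(w, row)))
        log_lik += target * math.log(p) + (1 - target) * math.log(1 - p)
    return w, log_lik

# Synthetic, standardized "population density" and a 0/1 theft indicator
# (hypothetical values, chosen so that the classes overlap).
density = [-2.0, -1.5, -1.0, -1.0, -0.5, 0.0, 0.0, 0.5, 1.0, 1.0, 1.5, 2.0]
theft = [1, 1, 1, 0, 1, 1, 0, 0, 1, 0, 0, 0]

X_reduced = [[1.0] for _ in density]   # intercept only
X_full = [[1.0, d] for d in density]   # intercept + density

w_reduced, ll_reduced = fit_logistic(X_reduced, theft)
w_full, ll_full = fit_logistic(X_full, theft)

# Likelihood-ratio test: the full model adds one parameter, so the statistic
# is compared against a chi-square distribution with 1 degree of freedom.
lr_stat = max(2.0 * (ll_full - ll_reduced), 0.0)
p_value = math.erfc(math.sqrt(lr_stat / 2.0))  # chi-square(1) survival function

print("density coefficient:", w_full[1])
print("LR statistic:", lr_stat, "p-value:", p_value)
```

A negative density coefficient in this toy data mirrors the direction of the finding above. In practice a library such as statsmodels would likely replace the manual fitting step.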

🎵 Predicting Tonal Languages

🔗 GitHub Repository | Aug 2024 – May 2025

A research project exploring whether machine learning models can distinguish tonal vs. non-tonal languages from multilingual audio samples.

  • Collected and processed 125 multilingual audio clips from 18 countries.
  • Designed spectral and pitch-based features that reduced raw noise by 30% and improved dataset balance.
  • Benchmarked 7 ML models (logistic regression, SVM, random forest, neural nets, etc.) with cross-validation, achieving 65% accuracy (20% over baseline).
  • Built reproducible pipelines in scikit-learn and PyTorch for comparative metrics (precision, recall, F1).
  • Proposed scalable data collection strategies for future interdisciplinary research in linguistics + machine learning.
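The benchmarking idea above (cross-validated accuracy compared against a baseline, plus precision, recall, and F1) can be sketched in plain Python. Everything here is an assumption for illustration: the two features per clip are made up, the nearest-centroid classifier stands in for the models actually benchmarked, and the project itself used scikit-learn and PyTorch rather than these hand-written helpers.

```python
def k_fold_indices(n, k):
    """Yield (train_idx, test_idx); fold f tests every k-th sample starting at f."""
    for fold in range(k):
        test = [i for i in range(n) if i % k == fold]
        train = [i for i in range(n) if i % k != fold]
        yield train, test

def majority_baseline(train_X, train_y):
    """Baseline: always predict the most common training label."""
    label = max(sorted(set(train_y)), key=train_y.count)
    return lambda x: label

def nearest_centroid(train_X, train_y):
    """Predict the label whose mean feature vector is closest to x."""
    centroids = {}
    for label in sorted(set(train_y)):
        rows = [x for x, y in zip(train_X, train_y) if y == label]
        centroids[label] = [sum(col) / len(col) for col in zip(*rows)]
    def predict(x):
        return min(centroids,
                   key=lambda c: sum((a - b) ** 2 for a, b in zip(x, centroids[c])))
    return predict

def precision_recall_f1(y_true, y_pred, positive=1):
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != positive and p == positive)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1

def cross_validate(make_model, X, y, k=3):
    """Return mean accuracy across k folds."""
    accuracies = []
    for train, test in k_fold_indices(len(X), k):
        model = make_model([X[i] for i in train], [y[i] for i in train])
        correct = sum(1 for i in test if model(X[i]) == y[i])
        accuracies.append(correct / len(test))
    return sum(accuracies) / len(accuracies)

# Synthetic (pitch-range, spectral) features; label 1 = tonal, 0 = non-tonal.
X = [(2.0, 1.0), (1.8, 1.2), (2.2, 0.8), (1.0, 1.1), (0.8, 0.9), (1.2, 1.0),
     (1.9, 1.1), (1.3, 1.0), (2.1, 0.9), (0.9, 1.0), (1.7, 1.0), (1.1, 1.2)]
y = [1, 1, 1, 0, 0, 0, 1, 1, 1, 0, 0, 0]

baseline_acc = cross_validate(majority_baseline, X, y)
model_acc = cross_validate(nearest_centroid, X, y)
print("baseline accuracy:", baseline_acc)
print("nearest-centroid accuracy:", model_acc)
```

Reporting the baseline next to each model makes a figure like "65% accuracy" easier to interpret, since it shows how much of the score comes from class balance alone.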

📜 Certifications & Awards

🚧 Duc is still a work in progress... 🚧


Pinned

  1. portfolio-introDUCtion (Public)

    introDUCtion

    TypeScript

  2. predict-tonal-languages-machine-learning (Public)

    Jupyter Notebook

  3. market-pulse (Public)

    Python