Dimitris Markopoulos's GitHub

Machine Learning • Quantitative Finance

🎓 M.A. in Statistics at Columbia University (4.25/4.33)
🎓 B.S. in Applied Mathematics & Statistics from Stony Brook University (3.98/4.0)


Project Summary
latent-semantic-clustering: UMAP + EM-GMM clustering of book chapters via NLP frequency vectors
mnist-image-classification: Comparing Lasso, Naive Bayes, Ridge, SVM, and Group Lasso
quantitative-finance: BSM & Heston option pricing, Monte Carlo simulations, VaR, algorithmic trading, etc.
sepsis-prediction: Applied an ML pipeline to a secure CUMC + NYP patient-level dataset; HIPAA-compliant experiments using Azure Secure Environment; certified.
crime-predictor-analysis: Predicting crime using UCI community features; LASSO, Ridge, Elastic Net, kernel regression, and manually implemented cross-validation

Supervised Learning & Statistical Modeling: LASSO, Ridge, Elastic Net, Logistic Regression, LDA, ARIMA, Group Lasso, etc.
Dimensionality Reduction & Feature Analysis: PCA, UMAP, t-SNE, Spectral Embedding, MDS, NMF, Kernel PCA
Unsupervised Learning & Clustering: KMeans++, Gaussian Mixture Models (GMM), Spectral Clustering, Hierarchical Clustering

Languages: Python (primary), SQL, R, MATLAB
Libraries: PyTorch, TensorFlow, scikit-learn, XGBoost, NumPy, pandas, statsmodels
Visualization: Matplotlib, Seaborn, Streamlit
Workflow: VS Code + Jupyter Notebooks, Git/GitHub, Google Colab (for GPU compute), LaTeX
Infra: Azure, APIs, GitHub Actions

🌐 Connect With Me
LinkedIn
GitHub
[email protected]

"Averaged over all possible data-generating distributions, every classification algorithm has the same error rate."
— David H. Wolpert, No Free Lunch Theorems for Optimization

Pinned

  1. latent-semantic-clustering (Public)

    Clustering book chapters with unsupervised ML—custom EM-GMM, sklearn baselines, and dimensionality reduction.

    Jupyter Notebook 1

  2. quantitative-finance (Public)

    A collection of quantitative finance projects covering option pricing, risk analysis, volatility modeling, and investment strategies. Includes Monte Carlo simulations, Black-Scholes & Heston models…

    Jupyter Notebook 1

  3. mnist-image-classification (Public)

    Classifying MNIST digits (3, 5, 8) using supervised learning methods including Logistic Regression, LDA, SVM, Naive Bayes, and Group LASSO. Includes model comparison, confusion matrix visualization…

    Jupyter Notebook

  4. crime-predictor-analysis (Public)

    Predicting violent crime rates using high-dimensional community data from the UCI dataset. Implements a structured machine learning pipeline with extensive preprocessing, multiple feature selection…

    Jupyter Notebook