Skip to content

Commit 756d93a

Browse files
committed
feat: new articles
1 parent f296de8 commit 756d93a

19 files changed

+438
-113
lines changed

_posts/2020-11-05-probability_theory_basics.md

Lines changed: 8 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,8 @@ categories:
44
- Statistics
55
classes: wide
66
date: '2020-11-05'
7-
excerpt: An introduction to probability theory concepts every data scientist should know.
7+
excerpt: An introduction to probability theory concepts every data scientist should
8+
know.
89
header:
910
image: /assets/images/data_science_10.jpg
1011
og_image: /assets/images/data_science_10.jpg
@@ -17,15 +18,17 @@ keywords:
1718
- Random variables
1819
- Distributions
1920
- Data science
20-
seo_description: Learn the core principles of probability theory, from random variables to common distributions, with practical examples for data science.
21-
seo_title: 'Probability Theory Basics for Data Science'
21+
seo_description: Learn the core principles of probability theory, from random variables
22+
to common distributions, with practical examples for data science.
23+
seo_title: Probability Theory Basics for Data Science
2224
seo_type: article
23-
summary: This post reviews essential probability concepts like random variables, expectation, and common distributions, illustrating how they underpin data science workflows.
25+
summary: This post reviews essential probability concepts like random variables, expectation,
26+
and common distributions, illustrating how they underpin data science workflows.
2427
tags:
2528
- Probability
2629
- Statistics
2730
- Data science
28-
title: 'Probability Theory Basics for Data Science'
31+
title: Probability Theory Basics for Data Science
2932
---
3033

3134
Probability theory provides the mathematical foundation for modeling uncertainty. By understanding random variables and probability distributions, data scientists can quantify risks and make informed decisions.

_posts/2020-11-10-simple_linear_regression_intro.md

Lines changed: 8 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,8 @@ categories:
44
- Data Science
55
classes: wide
66
date: '2020-11-10'
7-
excerpt: Understand how simple linear regression models the relationship between two variables using a single predictor.
7+
excerpt: Understand how simple linear regression models the relationship between two
8+
variables using a single predictor.
89
header:
910
image: /assets/images/data_science_11.jpg
1011
og_image: /assets/images/data_science_11.jpg
@@ -16,15 +17,17 @@ keywords:
1617
- Linear regression
1718
- Least squares
1819
- Data analysis
19-
seo_description: Discover the mechanics of simple linear regression and how to interpret slope and intercept when fitting a straight line to data.
20-
seo_title: 'A Primer on Simple Linear Regression'
20+
seo_description: Discover the mechanics of simple linear regression and how to interpret
21+
slope and intercept when fitting a straight line to data.
22+
seo_title: A Primer on Simple Linear Regression
2123
seo_type: article
22-
summary: This article introduces simple linear regression and the least squares method, showing how a single predictor explains variation in a response variable.
24+
summary: This article introduces simple linear regression and the least squares method,
25+
showing how a single predictor explains variation in a response variable.
2326
tags:
2427
- Regression
2528
- Statistics
2629
- Data science
27-
title: 'A Primer on Simple Linear Regression'
30+
title: A Primer on Simple Linear Regression
2831
---
2932

3033
Simple linear regression is a foundational technique for modeling the relationship between a predictor variable and a response variable. By fitting a straight line, we can quantify how changes in one variable are associated with changes in another.

_posts/2020-11-20-bayesian_inference_basics.md

Lines changed: 8 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,8 @@ categories:
44
- Statistics
55
classes: wide
66
date: '2020-11-20'
7-
excerpt: Explore the fundamentals of Bayesian inference and how prior beliefs combine with data to form posterior conclusions.
7+
excerpt: Explore the fundamentals of Bayesian inference and how prior beliefs combine
8+
with data to form posterior conclusions.
89
header:
910
image: /assets/images/data_science_12.jpg
1011
og_image: /assets/images/data_science_12.jpg
@@ -17,15 +18,17 @@ keywords:
1718
- Priors
1819
- Posterior distributions
1920
- Data science
20-
seo_description: An overview of Bayesian inference, demonstrating how to update prior beliefs with new evidence to make data-driven decisions.
21-
seo_title: 'Bayesian Inference Explained'
21+
seo_description: An overview of Bayesian inference, demonstrating how to update prior
22+
beliefs with new evidence to make data-driven decisions.
23+
seo_title: Bayesian Inference Explained
2224
seo_type: article
23-
summary: Learn how Bayesian inference updates prior beliefs into posterior distributions, providing a flexible framework for reasoning under uncertainty.
25+
summary: Learn how Bayesian inference updates prior beliefs into posterior distributions,
26+
providing a flexible framework for reasoning under uncertainty.
2427
tags:
2528
- Bayesian
2629
- Inference
2730
- Statistics
28-
title: 'Bayesian Inference Explained'
31+
title: Bayesian Inference Explained
2932
---
3033

3134
Bayesian inference offers a powerful perspective on probability, treating unknown quantities as distributions that update when new evidence appears.

_posts/2020-11-25-hypothesis_testing_real_world_applications.md

Lines changed: 9 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,8 @@ categories:
44
- Statistics
55
classes: wide
66
date: '2020-11-25'
7-
excerpt: See how hypothesis testing helps draw meaningful conclusions from data in practical scenarios.
7+
excerpt: See how hypothesis testing helps draw meaningful conclusions from data in
8+
practical scenarios.
89
header:
910
image: /assets/images/data_science_13.jpg
1011
og_image: /assets/images/data_science_13.jpg
@@ -17,15 +18,18 @@ keywords:
1718
- P-values
1819
- Significance
1920
- Data science
20-
seo_description: Learn how to apply hypothesis tests in real-world analyses and avoid common pitfalls when interpreting p-values and confidence levels.
21-
seo_title: 'Applying Hypothesis Testing in the Real World'
21+
seo_description: Learn how to apply hypothesis tests in real-world analyses and avoid
22+
common pitfalls when interpreting p-values and confidence levels.
23+
seo_title: Applying Hypothesis Testing in the Real World
2224
seo_type: article
23-
summary: This post walks through frequentist hypothesis testing, showing how to formulate null and alternative hypotheses and interpret the results in practical data science tasks.
25+
summary: This post walks through frequentist hypothesis testing, showing how to formulate
26+
null and alternative hypotheses and interpret the results in practical data science
27+
tasks.
2428
tags:
2529
- Hypothesis testing
2630
- Statistics
2731
- Experiments
28-
title: 'Applying Hypothesis Testing in the Real World'
32+
title: Applying Hypothesis Testing in the Real World
2933
---
3034

3135
Hypothesis testing allows data scientists to objectively assess whether an observed pattern is likely due to chance or reflects a genuine effect.

_posts/2020-11-30-data_visualization_best_practices.md

Lines changed: 8 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,8 @@ categories:
44
- Data Science
55
classes: wide
66
date: '2020-11-30'
7-
excerpt: Discover best practices for creating clear and compelling data visualizations that communicate insights effectively.
7+
excerpt: Discover best practices for creating clear and compelling data visualizations
8+
that communicate insights effectively.
89
header:
910
image: /assets/images/data_science_14.jpg
1011
og_image: /assets/images/data_science_14.jpg
@@ -17,15 +18,17 @@ keywords:
1718
- Charts
1819
- Communication
1920
- Best practices
20-
seo_description: Guidelines for selecting chart types, choosing colors, and avoiding clutter when visualizing data for stakeholders.
21-
seo_title: 'Data Visualization Best Practices'
21+
seo_description: Guidelines for selecting chart types, choosing colors, and avoiding
22+
clutter when visualizing data for stakeholders.
23+
seo_title: Data Visualization Best Practices
2224
seo_type: article
23-
summary: Learn how to design effective visualizations by focusing on clarity, appropriate chart selection, and thoughtful use of color and labels.
25+
summary: Learn how to design effective visualizations by focusing on clarity, appropriate
26+
chart selection, and thoughtful use of color and labels.
2427
tags:
2528
- Visualization
2629
- Data science
2730
- Communication
28-
title: 'Data Visualization Best Practices'
31+
title: Data Visualization Best Practices
2932
---
3033

3134
Effective data visualization bridges the gap between complex datasets and human understanding. Following proven design principles ensures that your charts highlight the important messages without distractions.

_posts/2021-10-05-data_preprocessing_pipelines.md

Lines changed: 6 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,8 @@ categories:
44
- Data Science
55
classes: wide
66
date: '2021-10-05'
7-
excerpt: Learn how to design robust data preprocessing pipelines that prepare raw data for modeling.
7+
excerpt: Learn how to design robust data preprocessing pipelines that prepare raw
8+
data for modeling.
89
header:
910
image: /assets/images/data_science_6.jpg
1011
og_image: /assets/images/data_science_6.jpg
@@ -17,10 +18,12 @@ keywords:
1718
- Pipelines
1819
- Data cleaning
1920
- Feature engineering
20-
seo_description: Discover best practices for building reusable data preprocessing pipelines that handle missing values, encoding, and feature scaling.
21+
seo_description: Discover best practices for building reusable data preprocessing
22+
pipelines that handle missing values, encoding, and feature scaling.
2123
seo_title: Building Data Preprocessing Pipelines for Reliable Models
2224
seo_type: article
23-
summary: This post outlines the key steps in constructing data preprocessing pipelines using tools like scikit-learn to ensure consistent model inputs.
25+
summary: This post outlines the key steps in constructing data preprocessing pipelines
26+
using tools like scikit-learn to ensure consistent model inputs.
2427
tags:
2528
- Data preprocessing
2629
- Machine learning

_posts/2021-10-15-decision_tree_algorithms.md

Lines changed: 6 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,8 @@ categories:
44
- Machine Learning
55
classes: wide
66
date: '2021-10-15'
7-
excerpt: Understand how decision tree algorithms split data and how pruning improves generalization.
7+
excerpt: Understand how decision tree algorithms split data and how pruning improves
8+
generalization.
89
header:
910
image: /assets/images/data_science_7.jpg
1011
og_image: /assets/images/data_science_7.jpg
@@ -17,10 +18,12 @@ keywords:
1718
- Classification
1819
- Tree pruning
1920
- Machine learning
20-
seo_description: Learn the mechanics of decision tree algorithms, including entropy-based splits and pruning techniques that prevent overfitting.
21+
seo_description: Learn the mechanics of decision tree algorithms, including entropy-based
22+
splits and pruning techniques that prevent overfitting.
2123
seo_title: How Decision Trees Work and Why Pruning Matters
2224
seo_type: article
23-
summary: This article walks through the basics of decision tree construction and explains common pruning methods to create better models.
25+
summary: This article walks through the basics of decision tree construction and explains
26+
common pruning methods to create better models.
2427
tags:
2528
- Decision trees
2629
- Classification

_posts/2021-11-10-model_evaluation_metrics.md

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -18,14 +18,16 @@ keywords:
1818
- Precision
1919
- Recall
2020
- Regression metrics
21-
seo_description: A concise overview of essential metrics like precision, recall, F1-score, and RMSE for measuring model performance.
21+
seo_description: A concise overview of essential metrics like precision, recall, F1-score,
22+
and RMSE for measuring model performance.
2223
seo_title: Essential Metrics for Evaluating Machine Learning Models
2324
seo_type: article
24-
summary: Learn how to interpret common classification and regression metrics to choose the best model for your data.
25+
summary: Learn how to interpret common classification and regression metrics to choose
26+
the best model for your data.
2527
tags:
2628
- Accuracy
2729
- F1-score
28-
- RMSE
30+
- Rmse
2931
title: A Guide to Model Evaluation Metrics
3032
---
3133

_posts/2022-10-15-time_series_decomposition.md

Lines changed: 7 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,8 @@ categories:
55
- Time Series
66
classes: wide
77
date: '2022-10-15'
8-
excerpt: Learn how time series decomposition reveals trend, seasonality, and residual components for clearer forecasting insights.
8+
excerpt: Learn how time series decomposition reveals trend, seasonality, and residual
9+
components for clearer forecasting insights.
910
header:
1011
image: /assets/images/data_science_12.jpg
1112
og_image: /assets/images/data_science_12.jpg
@@ -19,10 +20,12 @@ keywords:
1920
- Seasonality
2021
- Forecasting
2122
- Decomposition
22-
seo_description: Discover how to separate trend and seasonal patterns from a time series using additive or multiplicative decomposition.
23-
seo_title: 'Time Series Decomposition Made Simple'
23+
seo_description: Discover how to separate trend and seasonal patterns from a time
24+
series using additive or multiplicative decomposition.
25+
seo_title: Time Series Decomposition Made Simple
2426
seo_type: article
25-
summary: This article explains how decomposing a time series helps isolate long-term trends and recurring seasonal effects so you can model data more effectively.
27+
summary: This article explains how decomposing a time series helps isolate long-term
28+
trends and recurring seasonal effects so you can model data more effectively.
2629
tags:
2730
- Time series
2831
- Forecasting

_posts/2025-06-06-exploratory_data_analysis_intro.md

Lines changed: 10 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,8 @@ categories:
44
- Data Science
55
classes: wide
66
date: '2025-06-06'
7-
excerpt: Discover the essential steps of Exploratory Data Analysis (EDA) and how to gain insights from your data before building models.
7+
excerpt: Discover the essential steps of Exploratory Data Analysis (EDA) and how to
8+
gain insights from your data before building models.
89
header:
910
image: /assets/images/data_science_5.jpg
1011
og_image: /assets/images/data_science_5.jpg
@@ -18,16 +19,19 @@ keywords:
1819
- Python
1920
- Pandas
2021
- Data cleaning
21-
seo_description: Learn the fundamentals of Exploratory Data Analysis using Python, including data cleaning, visualization, and summary statistics.
22-
seo_title: "Beginner's Guide to Exploratory Data Analysis (EDA)"
22+
seo_description: Learn the fundamentals of Exploratory Data Analysis using Python,
23+
including data cleaning, visualization, and summary statistics.
24+
seo_title: Beginner's Guide to Exploratory Data Analysis (EDA)
2325
seo_type: article
24-
summary: This guide covers the core principles of Exploratory Data Analysis, demonstrating how to inspect, clean, and visualize datasets to uncover patterns and inform subsequent modeling steps.
26+
summary: This guide covers the core principles of Exploratory Data Analysis, demonstrating
27+
how to inspect, clean, and visualize datasets to uncover patterns and inform subsequent
28+
modeling steps.
2529
tags:
26-
- EDA
30+
- Eda
2731
- Data science
2832
- Python
2933
- Visualization
30-
title: "Exploratory Data Analysis: A Beginner's Guide"
34+
title: 'Exploratory Data Analysis: A Beginner''s Guide'
3135
---
3236

3337
Exploratory Data Analysis (EDA) is the process of examining a dataset to understand its main characteristics before applying more formal statistical modeling or machine learning. By exploring your data upfront, you can identify patterns, spot anomalies, and test assumptions that might otherwise go unnoticed.

0 commit comments

Comments
 (0)