Commit 54bb19e

Author: M.Notter
Message: Updates ML/DL blog posts
Parent: 17696fa

35 files changed: +1061 / -78 lines

.gitignore

Lines changed: 3 additions & 1 deletion
@@ -8,4 +8,6 @@
 .tweet-cache
 Gemfile.lock
 _site
-vendor
+vendor
+model_backup*
+history_log*

_posts/2023-10-23-01_scikit_simple.md

Lines changed: 16 additions & 12 deletions
@@ -9,26 +9,30 @@ This post is part of a comprehensive machine learning series that takes you from
 
 1. **Getting Started with Machine Learning** (Current Post)
 Basic classification using Scikit-learn with the MNIST dataset
+([View code]({{ site.baseurl }}/scripts/01_scikit_simple.py))
 
 2. **Deep Learning Fundamentals**
 Introduction to neural networks using TensorFlow
+([View code]({{ site.baseurl }}/scripts/02_tensorflow_simple.py))
 
 3. **Advanced Machine Learning**
 Complex regression pipelines with Scikit-learn
+([View code]({{ site.baseurl }}/scripts/03_scikit_advanced.py))
 
 4. **Advanced Deep Learning**
 Sophisticated neural network architectures in TensorFlow
+([View code]({{ site.baseurl }}/scripts/04_tensorflow_advanced.py))
 
 Each tutorial builds upon concepts from previous posts while introducing new techniques and best practices. Whether you're new to machine learning or looking to expand your skills, this series provides hands-on experience with real-world datasets and modern ML tools.
 
 Have you ever wondered how to get started with machine learning? This series of posts will guide you through practical implementations using two of Python's most popular frameworks: Scikit-learn and TensorFlow. Whether you're a beginner looking to understand the basics or an experienced developer wanting to refresh your knowledge, we'll progress from basic classification tasks to more advanced regression problems.
 
 The series consists of four parts:
 
-1. **[Getting Started with Classification using Scikit-learn](../blog/2023/01_scikit_simple)** (You are here)<br>Introduction to machine learning basics using the MNIST dataset
-2. **[Basic Neural Networks with TensorFlow](../blog/2023/02_tensorflow_simple)** (Part 2)<br>Building your first neural network for image classification
-3. **[Advanced Machine Learning with Scikit-learn](../blog/2023/03_scikit_advanced)** (Part 3)<br>Exploring complex regression problems and model optimization
-4. **[Advanced Neural Networks with TensorFlow](../blog/2023/04_tensorflow_advanced)** (Part 4)<br>Implementing sophisticated neural network architectures
+1. **[Getting Started with Classification using Scikit-learn]({{ site.baseurl }}/blog/2023/01_scikit_simple)** (You are here)<br>Introduction to machine learning basics using the MNIST dataset
+2. **[Basic Neural Networks with TensorFlow]({{ site.baseurl }}/blog/2023/02_tensorflow_simple)** (Part 2)<br>Building your first neural network for image classification
+3. **[Advanced Machine Learning with Scikit-learn]({{ site.baseurl }}/blog/2023/03_scikit_advanced)** (Part 3)<br>Exploring complex regression problems and model optimization
+4. **[Advanced Neural Networks with TensorFlow]({{ site.baseurl }}/blog/2023/04_tensorflow_advanced)** (Part 4)<br>Implementing sophisticated neural network architectures
 
 ### Why These Tools?
 
@@ -89,7 +93,7 @@ for ax, image, label in zip(axes.ravel(), digits.images, digits.target):
 ```
 
 <div style="text-align: center">
-<img class="img-fluid rounded z-depth-1" src="../assets/ex_plots/01_scikit_digits_sample.png" data-zoomable width=600px style="padding-top: 20px; padding-right: 20px; padding-bottom: 20px; padding-left: 20px">
+<img class="img-fluid rounded z-depth-1" src="{{ site.baseurl }}/assets/ex_plots/01_scikit_digits_sample.png" data-zoomable width=600px style="padding-top: 20px; padding-right: 20px; padding-bottom: 20px; padding-left: 20px">
 <div class="caption">
 Figure 1: Sample of MNIST digits showing different handwritten numbers from 0-9. Each image is an 8x8 pixel grayscale representation.
 </div>
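
For orientation, the hunk header above references the post's plotting loop over `digits.images` and `digits.target`. A minimal, self-contained sketch of that kind of sample figure, assuming the standard 8x8 digits dataset from `sklearn.datasets.load_digits` (the post's exact plotting code is not part of this diff), could look like this:

```python
# Minimal sketch (assumed setup, not the post's exact code): show a few 8x8 digits
# with their labels, mirroring the zip(...) loop referenced in the hunk header.
import matplotlib.pyplot as plt
from sklearn.datasets import load_digits

digits = load_digits()

fig, axes = plt.subplots(2, 5, figsize=(10, 4))
for ax, image, label in zip(axes.ravel(), digits.images, digits.target):
    ax.imshow(image, cmap="gray_r")   # each image is an 8x8 grayscale array
    ax.set_title(f"Label: {label}")
    ax.set_axis_off()
plt.tight_layout()
plt.show()
```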
@@ -228,7 +232,7 @@ plt.close()
 ```
 
 <div style="text-align: center">
-<img class="img-fluid rounded z-depth-1" src="../assets/ex_plots/01_scikit_rf_heatmap.png" data-zoomable width=500px style="padding-top: 20px; padding-right: 20px; padding-bottom: 20px; padding-left: 20px">
+<img class="img-fluid rounded z-depth-1" src="{{ site.baseurl }}/assets/ex_plots/01_scikit_rf_heatmap.png" data-zoomable width=500px style="padding-top: 20px; padding-right: 20px; padding-bottom: 20px; padding-left: 20px">
 <div class="caption">
 Figure 2: Heatmap showing model accuracy (%) for different combinations of SVM hyperparameters gamma and C. Darker colors indicate better performance.
 </div>
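
Figure 2's caption describes an accuracy heatmap over the SVM hyperparameters `gamma` and `C`. The sweep that produces it sits outside this hunk; a compact sketch of such a sweep, with hypothetical parameter grids rather than the post's values, might be:

```python
# Hypothetical sketch of a gamma/C accuracy sweep (grid values are assumptions):
# train one SVC per combination and plot the test accuracies as a heatmap.
import numpy as np
import matplotlib.pyplot as plt
from sklearn.datasets import load_digits
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

digits = load_digits()
X_train, X_test, y_train, y_test = train_test_split(
    digits.data, digits.target, test_size=0.25, random_state=0
)

gammas = [1e-4, 1e-3, 1e-2]
Cs = [0.1, 1, 10]
scores = np.zeros((len(gammas), len(Cs)))
for i, gamma in enumerate(gammas):
    for j, C in enumerate(Cs):
        scores[i, j] = SVC(gamma=gamma, C=C).fit(X_train, y_train).score(X_test, y_test)

plt.imshow(scores * 100)               # accuracy in percent
plt.xticks(range(len(Cs)), Cs)
plt.yticks(range(len(gammas)), gammas)
plt.xlabel("C")
plt.ylabel("gamma")
plt.colorbar(label="Accuracy (%)")
plt.show()
```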
@@ -316,7 +320,7 @@ plt.close()
 ```
 
 <div style="text-align: center">
-<img class="img-fluid rounded z-depth-1" src="../assets/ex_plots/01_scikit_svm_heatmap.png" data-zoomable width=500px style="padding-top: 20px; padding-right: 20px; padding-bottom: 20px; padding-left: 20px">
+<img class="img-fluid rounded z-depth-1" src="{{ site.baseurl }}/assets/ex_plots/01_scikit_svm_heatmap.png" data-zoomable width=500px style="padding-top: 20px; padding-right: 20px; padding-bottom: 20px; padding-left: 20px">
 <div class="caption">
 Figure 3: Confusion matrix showing the model's prediction performance across all digit classes. Diagonal elements represent correct predictions.
 </div>
@@ -375,7 +379,7 @@ plt.close()
 ```
 
 <div style="text-align: center">
-<img class="img-fluid rounded z-depth-1" src="../assets/ex_plots/01_scikit_confusion_matrix.png" data-zoomable width=500px style="padding-top: 20px; padding-right: 20px; padding-bottom: 20px; padding-left: 20px">
+<img class="img-fluid rounded z-depth-1" src="{{ site.baseurl }}/assets/ex_plots/01_scikit_confusion_matrix.png" data-zoomable width=500px style="padding-top: 20px; padding-right: 20px; padding-bottom: 20px; padding-left: 20px">
 <div class="caption">
 Figure 4: Feature importance heatmap showing which pixels in the 8x8 grid contribute most to the Random Forest's classification decisions.
 </div>
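
Figure 4's caption refers to per-pixel feature importances of the Random Forest on the 8x8 grid. A short sketch of how such a map is typically produced (not the post's exact code; the forest's hyperparameters are placeholders):

```python
# Sketch: fit a Random Forest on the digits data and reshape its 64 feature
# importances back onto the 8x8 pixel grid, as Figure 4 describes.
import matplotlib.pyplot as plt
from sklearn.datasets import load_digits
from sklearn.ensemble import RandomForestClassifier

digits = load_digits()
rf = RandomForestClassifier(n_estimators=100, random_state=0)
rf.fit(digits.data, digits.target)

plt.imshow(rf.feature_importances_.reshape(8, 8), cmap="viridis")
plt.colorbar(label="Feature importance")
plt.title("Pixels driving the Random Forest's decisions")
plt.show()
```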
@@ -405,7 +409,7 @@ plt.close()
 ```
 
 <div style="text-align: center">
-<img class="img-fluid rounded z-depth-1" src="../assets/ex_plots/01_scikit_feature_importance.png" data-zoomable width=500px style="padding-top: 20px; padding-right: 20px; padding-bottom: 20px; padding-left: 20px">
+<img class="img-fluid rounded z-depth-1" src="{{ site.baseurl }}/assets/ex_plots/01_scikit_feature_importance.png" data-zoomable width=500px style="padding-top: 20px; padding-right: 20px; padding-bottom: 20px; padding-left: 20px">
 <div class="caption">
 Figure 5: Most confidently predicted digits from the test set, showing examples where the model has highest prediction probabilities.
 </div>
@@ -432,7 +436,7 @@ plt.savefig('../assets/ex_plots/01_scikit_confident_predictions.png', bbox_inche
 plt.close()
 ```
 
-<img class="img-fluid rounded z-depth-1" src="../assets/ex_plots/01_scikit_confident_predictions.png" data-zoomable width=800px style="padding-top: 20px; padding-right: 20px; padding-bottom: 20px; padding-left: 20px">
+<img class="img-fluid rounded z-depth-1" src="{{ site.baseurl }}/assets/ex_plots/01_scikit_confident_predictions.png" data-zoomable width=800px style="padding-top: 20px; padding-right: 20px; padding-bottom: 20px; padding-left: 20px">
 <div class="caption">
 Figure 6: Most confidently predicted digits from the test set, showing examples where the model has highest prediction probabilities.
 </div>
@@ -449,7 +453,7 @@ plt.savefig('../assets/ex_plots/01_scikit_uncertain_predictions.png', bbox_inche
 plt.close()
 ```
 
-<img class="img-fluid rounded z-depth-1" src="../assets/ex_plots/01_scikit_uncertain_predictions.png" data-zoomable width=800px style="padding-top: 20px; padding-right: 20px; padding-bottom: 20px; padding-left: 20px">
+<img class="img-fluid rounded z-depth-1" src="{{ site.baseurl }}/assets/ex_plots/01_scikit_uncertain_predictions.png" data-zoomable width=800px style="padding-top: 20px; padding-right: 20px; padding-bottom: 20px; padding-left: 20px">
 <div class="caption">
 Figure 7: Most challenging digits for the model to predict, showing examples where the model has lowest prediction confidence.
 </div>
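
Figures 6 and 7 rank test digits by how confident the model is. The underlying idea is to take the highest class probability per test sample; a sketch with assumed variable names (the post's classifier and data split are not shown in these hunks):

```python
# Sketch of confidence ranking (assumed names): the most confident predictions have
# the largest max class probability, the most uncertain ones the smallest.
import numpy as np
from sklearn.datasets import load_digits
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

digits = load_digits()
X_train, X_test, y_train, y_test = train_test_split(
    digits.data, digits.target, test_size=0.25, random_state=0
)
clf = RandomForestClassifier(random_state=0).fit(X_train, y_train)

proba = clf.predict_proba(X_test)             # shape: (n_test_samples, 10)
confidence = proba.max(axis=1)                # highest class probability per sample
most_confident = np.argsort(confidence)[::-1][:8]   # candidates for a Figure 6-style plot
least_confident = np.argsort(confidence)[:8]         # candidates for a Figure 7-style plot
print(confidence[most_confident], confidence[least_confident])
```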
@@ -580,4 +584,4 @@ In the next post, we'll tackle the same MNIST classification problem using Tenso
 
 In Part 2, we'll explore how neural networks approach the same problem using TensorFlow, introducing deep learning concepts and comparing the two approaches.
 
-[Continue to Part 2 →](../blog/2023/02_tensorflow_simple)
+[Continue to Part 2 →]({{ site.baseurl }}/blog/2023/02_tensorflow_simple)

_posts/2023-10-23-02_tensorflow_simple.md

Lines changed: 23 additions & 23 deletions
@@ -8,6 +8,8 @@ description: Building your first neural network for image classification
 
 In this second part of our machine learning series, we'll implement the same MNIST classification task using [TensorFlow](https://www.tensorflow.org/). While Scikit-learn excels at classical machine learning, TensorFlow shines when building neural networks. We'll see how deep learning approaches differ from traditional methods and learn the basic concepts of neural network architecture.
 
+The complete code for this tutorial can be found in the [02_tensorflow_simple.py]({{ site.baseurl }}/scripts/02_tensorflow_simple.py) script.
+
 ### Why Neural Networks?
 
 While our Scikit-learn models performed well in Part 1, neural networks offer several key advantages for image classification:
@@ -155,29 +157,27 @@ and the number of trainable and non-trainable parameters.
 model.summary()
 ```
 
-Model: "sequential_1"
+Model: "sequential"
 _________________________________________________________________
-Layer (type)                    Output Shape             Param #
+Layer (type)                    Output Shape             Param #
 =================================================================
-conv2d_2 (Conv2D)               (None, 26, 26, 32)       320
-re_lu (ReLU)                    (None, 26, 26, 32)       0
-max_pooling2d_2 (MaxPooling     (None, 13, 13, 32)       0
-2D)
-conv2d_3 (Conv2D)               (None, 11, 11, 64)       18496
-re_lu_1 (ReLU)                  (None, 11, 11, 64)       0
-max_pooling2d_3 (MaxPooling     (None, 5, 5, 64)         0
-2D)
-flatten_1 (Flatten)             (None, 1600)             0
-dropout_2 (Dropout)             (None, 1600)             0
-dense_2 (Dense)                 (None, 32)               51232
-re_lu_2 (ReLU)                  (None, 32)               0
-dropout_3 (Dropout)             (None, 32)               0
-dense_3 (Dense)                 (None, 10)               330
-softmax (Softmax)               (None, 10)               0
+conv2d (Conv2D)                 (None, 26, 26, 32)       320
+re_lu (ReLU)                    (None, 26, 26, 32)       0
+max_pooling2d (MaxPooling2D)    (None, 13, 13, 32)       0
+conv2d_1 (Conv2D)               (None, 11, 11, 64)       18496
+re_lu_1 (ReLU)                  (None, 11, 11, 64)       0
+max_pooling2d_1 (MaxPooling2D)  (None, 5, 5, 64)         0
+flatten (Flatten)               (None, 1600)             0
+dropout (Dropout)               (None, 1600)             0
+dense (Dense)                   (None, 32)               51232
+re_lu_2 (ReLU)                  (None, 32)               0
+dropout_1 (Dropout)             (None, 32)               0
+dense_1 (Dense)                 (None, 10)               330
+softmax (Softmax)               (None, 10)               0
 =================================================================
-Total params: 70,378
-Trainable params: 70,378
-Non-trainable params: 0
+Total params: 70,378 (274.91 KB)
+Trainable params: 70,378 (274.91 KB)
+Non-trainable params: 0 (0.00 Byte)
 _________________________________________________________________
 
 This summary tells us several important things:
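
For reference, an architecture that reproduces the layer names and parameter counts in the updated summary can be sketched as follows; the dropout rates and any compile settings are assumptions rather than the post's values:

```python
# Sketch of a model matching the summary above: two Conv2D/ReLU/MaxPooling blocks,
# a Dense(32) head and a Dense(10) softmax output for 28x28x1 MNIST images.
# Dropout rates are placeholders; the parameter total is 70,378 regardless.
from tensorflow import keras
from tensorflow.keras import layers

model = keras.Sequential([
    layers.Input(shape=(28, 28, 1)),
    layers.Conv2D(32, (3, 3)),        # 320 params     -> (26, 26, 32)
    layers.ReLU(),
    layers.MaxPooling2D((2, 2)),      #                -> (13, 13, 32)
    layers.Conv2D(64, (3, 3)),        # 18,496 params  -> (11, 11, 64)
    layers.ReLU(),
    layers.MaxPooling2D((2, 2)),      #                -> (5, 5, 64)
    layers.Flatten(),                 #                -> 1600 features
    layers.Dropout(0.25),
    layers.Dense(32),                 # 51,232 params
    layers.ReLU(),
    layers.Dropout(0.25),
    layers.Dense(10),                 # 330 params
    layers.Softmax(),
])
model.summary()                       # Total params: 70,378
```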
@@ -286,7 +286,7 @@ axs[1].set_ylabel("Accuracy")
 plt.show()
 ```
 
-<img class="img-fluid rounded z-depth-1" src="{{ site.baseurl }}/assets/ex_plots/ex_03_tensorflow_simple_output_16_0.png" data-zoomable width=800px style="padding-top: 20px; padding-right: 20px; padding-bottom: 20px; padding-left: 20px">
+<img class="img-fluid rounded z-depth-1" src="{{ site.baseurl }}/assets/ex_plots/02_tensorflow_training_history.png" data-zoomable width=800px style="padding-top: 20px; padding-right: 20px; padding-bottom: 20px; padding-left: 20px">
 <div class="caption">
 Figure 1: Training metrics over time showing model loss (left) and Mean Absolute Error (right) for both training and validation sets. The logarithmic scale helps visualize improvement across different magnitudes.
 </div>
@@ -330,7 +330,7 @@ plt.show()
 ```
 
 <div style="text-align: center">
-<img class="img-fluid rounded z-depth-1" src="{{ site.baseurl }}/assets/ex_plots/ex_03_tensorflow_simple_output_22_0.png" data-zoomable width=500px style="padding-top: 20px; padding-right: 20px; padding-bottom: 20px; padding-left: 20px">
+<img class="img-fluid rounded z-depth-1" src="{{ site.baseurl }}/assets/ex_plots/02_tensorflow_confusion_matrix.png" data-zoomable width=500px style="padding-top: 20px; padding-right: 20px; padding-bottom: 20px; padding-left: 20px">
 </div><br>
 
 ## 5. Model parameters
@@ -357,7 +357,7 @@ plt.tight_layout()
 plt.show()
 ```
 
-<img class="img-fluid rounded z-depth-1" src="{{ site.baseurl }}/assets/ex_plots/ex_03_tensorflow_simple_output_24_0.png" data-zoomable width=800px style="padding-top: 20px; padding-right: 20px; padding-bottom: 20px; padding-left: 20px">
+<img class="img-fluid rounded z-depth-1" src="{{ site.baseurl }}/assets/ex_plots/02_tensorflow_conv_kernels.png" data-zoomable width=800px style="padding-top: 20px; padding-right: 20px; padding-bottom: 20px; padding-left: 20px">
 
 ### Common Deep Learning Pitfalls
 When starting with TensorFlow and neural networks, watch out for these common issues:

_posts/2023-10-23-03_scikit_advanced.md

Lines changed: 13 additions & 18 deletions
@@ -8,6 +8,8 @@ description: Exploring complex regression problems and preprocessing pipelines
 
 In this third part of our series, we'll explore more sophisticated machine learning techniques using [Scikit-learn](https://scikit-learn.org/stable/). While Parts 1 and 2 focused on classification, we'll now tackle regression problems and learn how to build complex preprocessing pipelines. We'll use the California Housing dataset to demonstrate these concepts.
 
+The complete code for this tutorial can be found in the [03_scikit_advanced.py]({{ site.baseurl }}/scripts/03_scikit_advanced.py) script.
+
 **Note**: The purpose of this post is to highlight the flexibility and capabilities of scikit-learn's advanced features. Therefore, this tutorial focuses on introducing you to those advanced routines rather than creating the optimal regression model.
 
 ### Why Advanced Preprocessing?
@@ -437,16 +439,13 @@ import pandas as pd
 
 df_res = pd.DataFrame(res.cv_results_)
 df_res = df_res.iloc[:, ~df_res.columns.str.contains('time|split[0-9]*|rank|params')]
-new_columns = [
-    c.split('param_regressor__')[1] if 'param_regressor' in c else c
-    for c in df_res.columns
-]
-new_columns = [
-    c.split('preprocessor__')[1] if 'preprocessor__' in c else c for c in new_columns
-]
+new_columns = [c.split('param_regressor__')[1] if 'param_regressor' in c else c for c in df_res.columns]
+new_columns = [c.split('preprocessor__')[1] if 'preprocessor__' in c else c for c in new_columns]
 df_res.columns = new_columns
 df_res = df_res.sort_values('mean_test_score', ascending=False)
-df_res.head(10)
+
+print("\nTop 10 parameter combinations:")
+print(df_res.head(10))
 ```
 
 | :-------------: | :------------------: | :-----------------------------------------: | :-----------------------------: | :-------------------------------------: | :-----------------------------------------: | :--------------------------------: | :-------------------------------------------: | :-------------------------------------: | :----------------------------------------: | :----------------: | :---------------: | :-----------------: | :----------------: |
@@ -480,17 +479,13 @@ Prediction accuracy on test data: {score_te*100:.2f}%"
 )
 ```
 
-Prediction accuracy on train data: 7.10%
-Prediction accuracy on test data: 8.38%
-
-# Add this interpretation
 Let's interpret these regression metrics in practical terms:
-- **Train Error (7.10%)**: On average, predictions deviate by 7.10% from true house prices
-  - For a $300,000 house, this means predictions are typically within ±$21,300
-- **Test Error (8.38%)**: Slightly higher error on unseen data
-  - For a $300,000 house, predictions are typically within ±$25,140
-- **Error Difference (1.28%)**: Small gap indicates good generalization
-- **Context**: For house price prediction, ~8% error is relatively good considering market volatility
+- **Train Error**: On average, predictions deviate by about 7-8% from true house prices
+  - For a $300,000 house, this means predictions are typically within ±$21,000-24,000
+- **Test Error**: Slightly higher error on unseen data
+  - For a $300,000 house, predictions are typically within ±$24,000-27,000
+- **Error Difference**: Small gap indicates good generalization
+- **Context**: For house price prediction, ~8-9% error is relatively good considering market volatility
 
 Great, the score seems reasonably good! But now that we know better which preprocessing routine seems to be the
 best (thanks to `RandomizedSearchCV`), let's go ahead and further fine-tune the ridge model.
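
The bullet points above turn a relative prediction error into a dollar band. As a quick sanity check with hypothetical error rates in the 7-9% range (the exact scores are deliberately left unpinned in the new text), the arithmetic is:

```python
# Worked example (hypothetical error rates): convert a mean relative error into the
# +/- dollar band quoted in the bullet points above.
house_price = 300_000
for label, rel_error in [("train", 0.07), ("test", 0.085)]:
    band = house_price * rel_error
    print(f"{label}: ~+/-${band:,.0f} on a ${house_price:,} house")
    # 7% of $300,000 = $21,000; 8.5% of $300,000 = $25,500
```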
