_posts/2025-02-02-time_series_forecasting_sarima_seasonal_arima_explained.md (11 additions, 11 deletions)
@@ -80,9 +80,9 @@ $$
 
 Where:
 
-- \( p \): Number of autoregressive terms
-- \( d \): Number of differencing operations
-- \( q \): Number of moving average terms
+- $$ p $$: Number of autoregressive terms
+- $$ d $$: Number of differencing operations
+- $$ q $$: Number of moving average terms
 
 While ARIMA works well for many datasets, it does not explicitly model **seasonal structure**. For example, monthly sales data may show a 12-month cycle, which ARIMA cannot capture directly.
@@ -96,9 +96,9 @@ $$
 
 Where:
 
-- \( p, d, q \): Non-seasonal ARIMA parameters
-- \( P, D, Q \): Seasonal AR, differencing, and MA orders
-- \( s \): Seasonality period (e.g., 12 for monthly data with yearly seasonality)
+- $$ p, d, q $$: Non-seasonal ARIMA parameters
+- $$ P, D, Q $$: Seasonal AR, differencing, and MA orders
+- $$ s $$: Seasonality period (e.g., 12 for monthly data with yearly seasonality)
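
As an editor-added illustration of these orders (not part of the patch above), a minimal SARIMA fit with statsmodels' `SARIMAX` might look like the sketch below; the toy series and the chosen `(p, d, q)(P, D, Q, s)` values are placeholders, not recommendations.

```python
# Minimal SARIMA sketch: fit a seasonal ARIMA to a toy monthly series.
# The data and the orders are placeholders chosen only for illustration.
import pandas as pd
from statsmodels.tsa.statespace.sarimax import SARIMAX

# Hypothetical monthly series with a yearly cycle (s = 12).
y = pd.Series(
    [112, 118, 132, 129, 121, 135, 148, 148, 136, 119, 104, 118] * 4,
    index=pd.date_range("2020-01-01", periods=48, freq="MS"),
)

# order=(p, d, q) are the non-seasonal terms; seasonal_order=(P, D, Q, s).
model = SARIMAX(y, order=(1, 1, 1), seasonal_order=(1, 1, 1, 12))
result = model.fit(disp=False)

print(result.summary())
print(result.forecast(steps=12))  # 12-month-ahead forecast
```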
_posts/2025-05-01-agentbased_models_abm_macroeconomics_mathematical_perspective.md (8 additions, 8 deletions)
@@ -62,29 +62,29 @@ In macroeconomics, ABMs can simulate the evolution of the economy through the in
 
 Although agent-based models are primarily computational, they rest on well-defined mathematical components. A typical ABM can be formalized as a discrete-time dynamical system:
 
-Let the system state at time \( t \) be denoted as:
+Let the system state at time $$ t $$ be denoted as:
 
 $$
 S_t = \{a_{1,t}, a_{2,t}, ..., a_{N,t}\}
 $$
 
-where \( a_{i,t} \) represents the state of agent \( i \) at time \( t \), and \( N \) is the total number of agents.
+where $$ a_{i,t} $$ represents the state of agent $$ i $$ at time $$ t $$, and $$ N $$ is the total number of agents.
 
 ### 1. **Agent State and Behavior Functions**
 
 Each agent has:
 
-- A **state vector** \( a_{i,t} \in \mathbb{R}^k \) representing variables such as wealth, consumption, productivity, etc.
-- A **decision function** \( f_i: S_t \rightarrow \mathbb{R}^k \) that determines how the agent updates its state:
+- A **state vector** $$ a_{i,t} \in \mathbb{R}^k $$ representing variables such as wealth, consumption, productivity, etc.
+- A **decision function** $$ f_i: S_t \rightarrow \mathbb{R}^k $$ that determines how the agent updates its state:
-- \( \mathcal{E}_t \) is the macro environment (e.g., interest rates, inflation)
-- \( \mathcal{I}_{i,t} \) is local information accessible to the agent
+- $$ \mathcal{E}_t $$ is the macro environment (e.g., interest rates, inflation)
+- $$ \mathcal{I}_{i,t} $$ is local information accessible to the agent
 
 ### 2. **Interaction Structure**
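
To make the formal ingredients above concrete, here is a small, editor-added Python sketch of an agent with a state vector and a decision function; the fields, the macro-environment keys, and the update rule are illustrative assumptions, not the post's model.

```python
# Editor-added sketch (not the post's model): one possible encoding of an agent
# state vector a_{i,t} and decision function f_i. Fields, environment keys, and
# the update rule are illustrative assumptions.
from dataclasses import dataclass

import numpy as np


@dataclass
class Agent:
    wealth: float
    consumption: float
    productivity: float

    def state(self) -> np.ndarray:
        """State vector a_{i,t} in R^3."""
        return np.array([self.wealth, self.consumption, self.productivity])

    def decide(self, env: dict, info: dict) -> "Agent":
        """Decision function f_i: return the agent's state at t+1."""
        income = self.productivity * env["wage"]
        consumption = 0.8 * income + 0.02 * self.wealth          # toy consumption rule
        wealth = (1 + env["interest_rate"]) * self.wealth + income - consumption
        productivity = self.productivity * (1 + info.get("local_shock", 0.0))
        return Agent(wealth, consumption, productivity)


# Example: one update for a single agent under a hypothetical macro environment.
a_next = Agent(10.0, 1.0, 1.05).decide(
    env={"wage": 1.0, "interest_rate": 0.05},
    info={"local_shock": 0.01},
)
print(a_next.state())
```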
@@ -94,7 +94,7 @@ Agents may interact through a **network topology**, such as:
 - Small-world or scale-free networks
 - Spatial lattices
 
-These interactions define information flow and market exchanges. Let \( G = (V, E) \) be a graph with nodes \( V \) representing agents and edges \( E \) representing communication or trade links.
+These interactions define information flow and market exchanges. Let $$ G = (V, E) $$ be a graph with nodes $$ V $$ representing agents and edges $$ E $$ representing communication or trade links.
 
 ### 3. **Environment and Aggregation**
@@ -104,7 +104,7 @@ $$
 \mathcal{E}_{t+1} = g(S_t)
 $$
 
-Where \( g \) is a function that computes macro variables (e.g., GDP, inflation, aggregate demand) from the microstate \( S_t \). This allows for **micro-to-macro feedback loops**.
+Where $$ g $$ is a function that computes macro variables (e.g., GDP, inflation, aggregate demand) from the microstate $$ S_t $$. This allows for **micro-to-macro feedback loops**.
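
Continuing the illustration, a self-contained toy simulation of the micro-to-macro loop $$ \mathcal{E}_{t+1} = g(S_t) $$ might look like the following; every numeric rule here is an assumption made for the sketch, not something taken from the post.

```python
# Toy ABM loop: S is the microstate (one row per agent: wealth, consumption,
# productivity); g(S) aggregates it into the macro environment. All behavioral
# and aggregation rules below are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)
N, T = 200, 50

S = np.column_stack([
    np.full(N, 10.0),                 # wealth
    np.full(N, 1.0),                  # consumption
    rng.uniform(0.8, 1.2, size=N),    # productivity
])
env = {"wage": 1.0, "interest_rate": 0.05}            # E_0


def g(state: np.ndarray) -> dict:
    """Aggregation E_{t+1} = g(S_t): macro variables from the microstate."""
    return {
        "wage": 1.0 + 0.1 * state[:, 1].mean(),
        "interest_rate": max(0.0, 0.05 - 1e-4 * state[:, 0].mean()),
        "gdp": state[:, 1].sum(),
    }


for t in range(T):
    env_next = g(S)                                    # E_{t+1} from the current S_t
    wealth, consumption, productivity = S.T
    shock = rng.normal(0.0, 0.01, size=N)              # local information I_{i,t}
    income = productivity * env["wage"]                # decisions use E_t
    consumption = 0.8 * income + 0.02 * wealth
    wealth = (1 + env["interest_rate"]) * wealth + income - consumption
    productivity = productivity * (1 + shock)
    S = np.column_stack([wealth, consumption, productivity])
    env = env_next

print("final GDP (toy units):", round(g(S)["gdp"], 2))
```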
_posts/2025-06-07-why_math_statistics_foundations_data_science.md (51 additions, 13 deletions)
@@ -34,28 +34,66 @@ tags:
 title: Why Data Scientists Need Math and Statistics
 ---
 
-A common misconception is that data science is mostly about applying libraries and frameworks. While tools are helpful, they cannot replace a solid understanding of **mathematics** and **statistics**. These disciplines provide the language and theory that power every algorithm behind the scenes.
+It’s tempting to think that mastering a handful of libraries—pandas, Scikit-Learn, TensorFlow—is the fast track to data science success. Yet tools are abstractions built atop deep mathematical and statistical theory. Without understanding **why** an algorithm works—its assumptions, convergence guarantees, or failure modes—practitioners risk producing brittle models and misinterpreting outputs. Libraries accelerate development, but the true power of data science lies in the ability to reason about algorithms at a theoretical level.
 
-## The Role of Mathematics
+## 2. Mathematical Foundations: Linear Algebra and Calculus
 
-At the core of many machine learning algorithms are mathematical concepts such as **linear algebra** and **calculus**. Linear algebra explains how models handle vectors and matrices, enabling operations like matrix decomposition and gradient calculations. Calculus is vital for understanding optimization techniques that drive model training. Without these foundations, it is difficult to grasp how algorithms converge or why they sometimes fail to do so.
+At the heart of many predictive models are operations on vectors and matrices. Consider a data matrix $\mathbf{X}\in\mathbb{R}^{n\times p}$: understanding its **singular value decomposition**
+$$
+\mathbf{X} = U\,\Sigma\,V^\top
+$$
+reveals principal directions of variance, which underpin techniques like Principal Component Analysis. Eigenvalues and eigenvectors provide insight into covariance structure, guiding feature extraction and dimensionality reduction.
 
-## Why Statistics Matters
+Calculus provides the language of change, enabling optimization of complex loss functions. Gradient-based methods update parameters $\theta$ via
+$$
+\theta \leftarrow \theta - \eta\,\nabla_\theta L(\theta),
+$$
+where $\eta$ is the learning rate and $\nabla_\theta L$ the gradient of the loss. Delving into second-order information—the Hessian matrix $H = \nabla^2_\theta L$—explains curvature and motivates algorithms like Newton’s method or quasi-Newton schemes (e.g., BFGS). These concepts illuminate why some problems converge slowly, why learning rates must be tuned, and how saddle points impede optimization.
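
As an editor-added aside (not part of the patch), the two ideas in this added section—principal directions via SVD and the gradient-descent update—can be exercised in a few lines of NumPy; the data matrix and the quadratic loss are made-up examples.

```python
# Minimal sketch: (1) SVD of a centered data matrix to get principal directions,
# (2) gradient descent on a simple quadratic loss. Data and loss are toy examples.
import numpy as np

rng = np.random.default_rng(42)

# --- principal directions via SVD of the centered data matrix ---
X = rng.normal(size=(100, 3)) @ np.array([[3.0, 0.0, 0.0],
                                          [0.0, 1.0, 0.0],
                                          [0.0, 0.0, 0.1]])
Xc = X - X.mean(axis=0)                       # center the data
U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
print("singular values:", np.round(S, 2))     # variance concentrates in the first axes
print("first principal direction:", np.round(Vt[0], 3))

# --- gradient descent: theta <- theta - eta * grad L(theta) on a quadratic loss ---
A = np.array([[3.0, 1.0], [1.0, 2.0]])        # symmetric positive definite
b = np.array([1.0, -1.0])

def grad(theta):
    return A @ theta - b                      # gradient of 0.5*theta^T A theta - b^T theta

theta, eta = np.zeros(2), 0.1
for _ in range(200):
    theta = theta - eta * grad(theta)

print("gradient descent:", np.round(theta, 6))
print("closed form     :", np.round(np.linalg.solve(A, b), 6))
```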
-Statistics helps data scientists quantify uncertainty, draw reliable conclusions, and validate models. Techniques like **hypothesis testing**, **confidence intervals**, and **probability distributions** reveal whether observed patterns are significant or simply random noise. Lacking statistical insight can lead to overfitting or underestimating model errors.
+## 3. Statistical Principles: Inference, Uncertainty, and Validation
 
-## Understanding Algorithms Beyond Code
+Data science inevitably grapples with uncertainty. Statistics offers the framework to quantify and manage it. A common task is estimating the mean of a population from a sample of size $n$. The **confidence interval** for a normally distributed estimator $\hat\mu$ with known variance $\sigma^2$ is
+$$
+\hat\mu \pm z_{\alpha/2}\,\frac{\sigma}{\sqrt{n}},
+$$
+where $z_{\alpha/2}$ corresponds to the desired coverage probability (e.g., $1.96$ for 95%). Hypothesis testing formalizes decision-making: by computing a $p$-value, one assesses the probability of observing data at least as extreme as the sample under a null hypothesis.
 
-Popular algorithms—such as decision trees, regression models, and neural networks—are built on mathematical principles. Knowing the theory behind them clarifies their assumptions and limitations. Blindly applying a model without understanding its mechanics can produce misleading results, especially when the data violates those assumptions.
+Probability distributions—Bernoulli, Poisson, Gaussian—model data generation processes and inform likelihood-based methods. Maximum likelihood estimation (MLE) chooses parameters $\theta$ to maximize
+$$
+L(\theta) = \prod_{i=1}^{n} p(x_i \mid \theta),
+$$
+and its logarithm simplifies optimization to summing log-likelihoods. Statistical rigor guards against overfitting, data dredging, and false discoveries, ensuring that observed patterns reflect genuine signals rather than random noise.
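
Again as an illustrative, editor-added aside, the confidence interval and the maximum-likelihood idea above can be checked numerically; the sample is simulated and SciPy is assumed to be available.

```python
# Illustrative sketch: a 95% confidence interval for a mean (known sigma) and a
# maximum-likelihood fit of a Gaussian mean via the log-likelihood. Data are simulated.
import numpy as np
from scipy import optimize, stats

rng = np.random.default_rng(0)
sigma, n = 2.0, 100
x = rng.normal(loc=5.0, scale=sigma, size=n)       # sample of size n

# --- confidence interval: mu_hat +/- z_{alpha/2} * sigma / sqrt(n) ---
mu_hat = x.mean()
z = stats.norm.ppf(0.975)                          # ~1.96 for 95% coverage
half_width = z * sigma / np.sqrt(n)
print(f"95% CI: [{mu_hat - half_width:.3f}, {mu_hat + half_width:.3f}]")

# --- MLE: maximize the log-likelihood (minimize its negative) ---
def neg_log_likelihood(mu):
    return -np.sum(stats.norm.logpdf(x, loc=mu, scale=sigma))

result = optimize.minimize_scalar(neg_log_likelihood, bounds=(0, 10), method="bounded")
print("MLE of mu:", result.x)                      # matches the sample mean
```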
-## The Pitfalls of Ignoring Theory
+## 4. Theory in Action: Demystifying Algorithms
 
-When the underlying mathematics is ignored, it becomes challenging to debug models, tune hyperparameters, or interpret outcomes. Relying solely on automated tools may produce working code, but it often masks fundamental issues like data leakage, improper scaling, or incorrect loss functions. These mistakes can have severe consequences in real-world applications.
+Every algorithm embodies mathematical and statistical choices. A **linear regression** model
+$$
+\hat y = X\beta + \varepsilon
+$$
+assumes that residuals $\varepsilon$ are independent, zero-mean, and homoscedastic. Violations—such as autocorrelation or heteroscedasticity—invalidate inference unless addressed. **Decision trees** rely on information-theoretic splits, measuring impurity via entropy
+$$
+H(S) = -\sum_{k} p_k \log p_k,
+$$
+and choosing splits that maximize information gain. **Neural networks** approximate arbitrary functions by composing affine transformations and nonlinear activations, with backpropagation systematically computing gradients via the chain rule.
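
A small editor-added sketch of the entropy criterion above, computing $H(S)$ and the information gain of one candidate split on made-up labels:

```python
# Illustrative sketch: entropy H(S) = -sum_k p_k log2 p_k and the information
# gain of a candidate split. Class labels and the split are toy examples.
import numpy as np

def entropy(labels: np.ndarray) -> float:
    """Shannon entropy of a label array, in bits."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return float(-(p * np.log2(p)).sum())

y = np.array([0, 0, 0, 0, 1, 1, 1, 1, 1, 1])         # parent node labels
left, right = y[:4], y[4:]                            # a candidate split

parent_h = entropy(y)
child_h = (len(left) * entropy(left) + len(right) * entropy(right)) / len(y)
print("parent entropy   :", round(parent_h, 3))
print("information gain :", round(parent_h - child_h, 3))
```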
-## Building a Strong Foundation
+Understanding these mechanics clarifies why certain models excel on specific data types and fail on others. It empowers practitioners to select or adapt algorithms—pruning trees to prevent overfitting, regularizing regression with an $L_1$ penalty to induce sparsity, or choosing appropriate activation functions to avoid vanishing gradients.
 
-Learning the basics of calculus, linear algebra, and statistics does not require becoming a mathematician. However, dedicating time to these topics builds intuition about how models work. This deeper knowledge empowers data scientists to select appropriate algorithms, customize them for specific problems, and communicate results effectively.
+## 5. Common Errors from Theoretical Gaps
 
-## Conclusion
+Ignoring foundational theory leads to familiar pitfalls. Failing to standardize features in gradient-based models can cause one dimension to dominate updates, slowing convergence. Overlooking multicollinearity in regression inflates variance of coefficient estimates, making interpretation meaningless. Misapplying hypothesis tests without correcting for multiple comparisons increases false positive rates. Blind reliance on automated pipelines may conceal data leakage—where test information inadvertently influences training—resulting in overly optimistic performance estimates.
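
One of the pitfalls above—leakage through preprocessing—can be illustrated with a scikit-learn pipeline that fits the scaler on the training split only; this is an editor-added sketch on synthetic data, and the model choice is arbitrary.

```python
# Illustrative sketch: avoid data leakage by fitting preprocessing on the
# training data only (here via a Pipeline). The dataset is synthetic.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = make_classification(n_samples=500, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# The scaler is fit only on X_train inside the pipeline, so no statistics from
# the test set influence training; scaling also keeps gradient-based updates
# from being dominated by one feature's range.
model = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
model.fit(X_train, y_train)
print("held-out accuracy:", model.score(X_test, y_test))
```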
-Data science thrives on a solid grounding in mathematics and statistics. Understanding the theory behind algorithms not only improves model performance but also safeguards against hidden errors. Investing in these fundamentals is essential for anyone aspiring to be a competent data scientist.
+Building fluency in mathematics and statistics need not be daunting. Effective approaches include:
+
+- **Structured Coursework**: Enroll in linear algebra and real analysis to master vector spaces, eigenvalues, and limits.
+- **Applied Exercises**: Derive gradient descent updates by hand for simple models, then verify them in code.
+- **Textbook Deep Dives**: Study “Linear Algebra and Its Applications” (Strang) and “Statistical Inference” (Casella & Berger) for rigorous yet accessible treatments.
+- **Algorithm Implementations**: Recreate k-means clustering, logistic regression, or principal component analysis from first principles to internalize assumptions.
+- **Peer Discussions**: Teach core concepts—Bayes’ theorem, eigen decomposition—to colleagues or study groups, reinforcing understanding through explanation.
+
+These practices foster the intuition that transforms abstract symbols into actionable insights.
+
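
The “Algorithm Implementations” suggestion above can be taken literally; as one editor-added example, here is principal component analysis rebuilt from the covariance eigendecomposition and cross-checked against the SVD, on synthetic data.

```python
# Illustrative from-first-principles PCA: eigendecomposition of the sample
# covariance matrix, cross-checked against the SVD route. Data are synthetic.
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(200, 4)) @ rng.normal(size=(4, 4))   # correlated features

Xc = X - X.mean(axis=0)                       # center
cov = Xc.T @ Xc / (len(Xc) - 1)               # sample covariance
eigvals, eigvecs = np.linalg.eigh(cov)        # eigh returns ascending eigenvalues
order = np.argsort(eigvals)[::-1]             # sort descending
components = eigvecs[:, order].T              # principal axes (one per row)

explained = eigvals[order] / eigvals.sum()
print("explained variance ratio:", np.round(explained, 3))

# Cross-check: right singular vectors of the centered data span the same axes.
_, _, Vt = np.linalg.svd(Xc, full_matrices=False)
print("axes agree up to sign:",
      np.allclose(np.abs(components), np.abs(Vt), atol=1e-6))
```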
+## 7. Embracing Theory for Sustainable Data Science
+
+A robust grounding in mathematics and statistics elevates data science from a toolkit of shortcuts to a discipline of informed reasoning. When practitioners grasp the language of vectors, gradients, probabilities, and tests, they become adept at diagnosing model behavior, innovating new methods, and communicating results with credibility. Investing time in these core disciplines yields dividends: faster debugging, more reliable models, and the ability to adapt as algorithms and data evolve. In the evolving landscape of data science, theory remains the constant that empowers us to turn data into dependable knowledge.