-
Notifications
You must be signed in to change notification settings - Fork 1
Expand file tree
/
Copy pathblog.qmd
More file actions
99 lines (66 loc) · 5.69 KB
/
blog.qmd
File metadata and controls
99 lines (66 loc) · 5.69 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
---
pagetitle: "Blog"
---
Welcome to my blog, where I explore the dynamic world of **Data Science**, **Artificial Intelligence**, and **Machine Learning**. Here, you'll find insights, tutorials, and real-world applications aimed at demystifying complex concepts and showcasing the power of data-driven decision-making. Whether you're a seasoned professional or just beginning your journey, this space is designed to spark curiosity, share knowledge, and stay at the forefront of AI and ML innovation. Dive in and let's learn together!
## Heart Failure Clinical Records Synthetic Data Project
::: columns
::: {.column width="40%"}
{width=100%}
:::
::: {.column width="5%"}
:::
::: {.column width="55%"}
This project evaluates the quality of synthetic datasets derived from the Heart Failure Clinical Records data (299 patients, 13 variables). Our goal is to determine how well different generation strategies reproduce the statistical properties and analytic value of the original data while protecting privacy. We generate synthetic data using four approaches: (1) parametric imputation (MICE), (2) non-parametric imputation (CART via MICE), (3) distribution-driven synthesis (synthpop), and (4) metadata-guided rules. We then compare each synthetic dataset with the real data across three dimensions:
- Fidelity – univariate and multivariate similarity (distributions, ranges, correlations), histogram similarity, and mutual information.
- Utility – model transportability using XGBoost (TRTR vs. TSTR), feature-importance agreement, and SHAP-based behaviour.
- Privacy / Disclosure risk – exact record matches, neighbour-proximity checks, and membership-inference sensitivity.
[Read the full article here](https://linuschirchir.quarto.pub/heart-failure-clinical-records-synthetic-data-project/){target="_blank"}
:::
:::
## Navigating Health Data Analysis: My Experience in Hip and Knee Arthroplasty
::: columns
::: {.column width="55%"}
Landing in Swansea, Wales, UK, in late December 2021 from Eldoret, Kenya marked the beginning of a transformative journey during one of the most challenging times in recent history. The world was grappling with the peak of the Coronavirus disease (COVID-19) pandemic, and the outbreak of a new variant, Omicron, was a stark reminder of the urgency and importance of Health Data Science (HDS). Driven by a passion for using data to solve complex health challenges, I embarked on my master’s studies in HDS.
During my studies, I explored courses such as Machine Learning (ML), Data Modelling, Data Visualisation, Bioinformatics for Genome Analysis, Analysis of Linked Health Data, and Scientific Computing in Healthcare. My dissertation “Evaluating Cognitive Assessments in the Progression to Early-Stage Alzheimer’s Disease Using Machine Learning,” provided practical experience in applying these skills to real-world problems. Additionally, I engaged in projects using R and QGIS to visualise global COVID-19 trends and applied ML techniques in R to predict heart disease.
[Read the full article on Medium](https://medium.com/@linuschirchir/navigating-health-data-analysis-my-experience-in-hip-and-knee-arthroplasty-cd7017b89086){target="_blank"}
:::
::: {.column width="5%"}
:::
::: {.column width="40%"}

:::
:::
## How to Draw a Christmas Tree Using Crayon in R
::: columns
::: {.column width="40%"}

:::
::: {.column width="5%"}
:::
::: {.column width="55%"}
Christmas is approaching and this makes it a great time to write a festive tutorial to help you learn R.
Overview
This script creates a colorful Christmas tree in your R console. It uses the crayon package to add colors to the tree and prints a festive message below it.
Prerequisites
- R environment (such as RStudio)
- Basic knowledge of running R scripts
- Instructions
Step 1: Installation and Setup
The script below first checks if the crayon package is installed. If not, it installs the package automatically. The crayon package is essential for adding colors to the tree and the text.
[Read the full article on Medium](https://medium.com/@linuschirchir/how-to-draw-a-christmas-tree-using-crayon-in-r-493f603f6462){target="_blank"}
:::
:::
## Analysing PostgreSQL Data in R
::: columns
::: {.column width="55%"}
PostgreSQL (Postgres) is an open-source enterprise relational database management system (RDBMS) that supports both relational (SQL) and non-relational (JSON) querying. The Postgres project began at the University of California in 1986 and has been growing by leaps and bounds. This RDBMS is highly reputed for its high levels of resilience, integrity, and correctness. Many products and solutions have been developed based on PostgreSQL.
To get started, you may need to download and install PostgreSQL for your operating systems such as Linux, macOS and Windows from the link below. Follow on-screen instructions and leave all the default settings in place. The official PostgreSQL website provides a step by step guide on how to install PostgreSQL here. Download and install PostgreSQL for your operating system from the link below.
R on the other hand is a free programming language used for predictive analytics, statistical modelling, data visualization and machine learning. The language was developed by Robert Gentleman and Ross Ihaka of the University of Auckland’s Statistics Department in 1993 and has been continuously evolving over time. R is leveraged as an exploratory and investigative tool, especially in academia and healthcare.
[Read the full article on Medium](https://medium.com/@linuschirchir/analysing-postgresql-data-in-r-7ea2b2565ba7){target="_blank"}
:::
::: {.column width="5%"}
:::
::: {.column width="40%"}

:::
:::