Skip to content

Commit a8f3afd

Browse files
committed
update markdown cells to clarify fidelity computation and K-means clustering explanation
1 parent 42dc703 commit a8f3afd

1 file changed

Lines changed: 11 additions & 4 deletions

File tree

results_user_community.ipynb

Lines changed: 11 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -2644,12 +2644,13 @@
26442644
]
26452645
},
26462646
{
2647-
"cell_type": "markdown",
2648-
"id": "6c760ae2652933c4",
26492647
"metadata": {},
2648+
"cell_type": "markdown",
26502649
"source": [
2651-
"# TODO: show formulas for normalized entropy, fidelity, category entropy"
2652-
]
2650+
"Remark:\n",
2651+
"At first we computed fidelity as 1 - normalized_entropy, but later we decided to just use normalized_entropy directly so that higher values indicate more diversity (as for category entropy). We change the naming accordingly except in some part of the code and for the naming of some files."
2652+
],
2653+
"id": "c13e8b6121fe08dd"
26532654
},
26542655
{
26552656
"cell_type": "markdown",
@@ -2846,6 +2847,12 @@
28462847
"## 2.5 K-means clustering of groups"
28472848
]
28482849
},
2850+
{
2851+
"metadata": {},
2852+
"cell_type": "markdown",
2853+
"source": "To cluster the groups based on their features -> fidelity, category entropy, and number of channels, we use the K-means clustering algorithm. This helps us identify distinct profiles of commenting behavior among the groups. Based on our analysis, K=10 clusters is a good choice allowing to get meaningful profiles without overfitting.",
2854+
"id": "cdfea83475361bcc"
2855+
},
28492856
{
28502857
"cell_type": "markdown",
28512858
"id": "74ae8700d2d526e1",

0 commit comments

Comments
 (0)