cytomining
diff --git a/.gitignore b/.gitignore
diff --git a/README.md b/README.md
diff --git a/examples/null_size.ipynb b/examples/null_size.ipynb
@@ -160,4 +160,5 @@ cython_debug/
 #.idea/
 
 examples/data/
+examples/cache/
 .vscode/
@@ -46,13 +46,13 @@ We provide examples demonstrating how to use copairs for:
 ## Citation
 If you find this work useful for your research, please cite our [pre-print](https://doi.org/10.1101/2024.04.01.587631):
 
-Kalinin, A.A., Arevalo, J., Vulliard, L., Serrano, E., Tsang, H., Bornholdt, M., Rajwa, B., Carpenter, A.E., Way, G.P. and Singh, S., 2024. A versatile information retrieval framework for evaluating profile strength and similarity. bioRxiv, pp.2024-04. doi:10.1101/2024.04.01.587631
+Kalinin, A.A., Arevalo, J., Vulliard, L., Serrano, E., Tsang, H., Bornholdt, M., Muñoz, A.F., Sivagurunathan, S., Rajwa, B., Carpenter, A.E., Way, G.P. and Singh, S., 2024. A versatile information retrieval framework for evaluating profile strength and similarity. bioRxiv, pp.2024-04. doi:10.1101/2024.04.01.587631
 
 BibTeX:
 ```
 @article{kalinin2024versatile,
   title={A versatile information retrieval framework for evaluating profile strength and similarity},
-  author={Kalinin, Alexandr A and Arevalo, John and Vulliard, Loan and Serrano, Erik and Tsang, Hillary and Bornholdt, Michael and Rajwa, Bartek and Carpenter, Anne E and Way, Gregory P and Singh, Shantanu},
+  author={Kalinin, Alexandr A and Arevalo, John and Vulliard, Loan and Serrano, Erik and Tsang, Hillary and Bornholdt, Michael and Muñoz, Alán F and Sivagurunathan, Suganya and Rajwa, Bartek and Carpenter, Anne E and Way, Gregory P and Singh, Shantanu},
   journal={bioRxiv},
   pages={2024--04},
   year={2024},
 
@@ -185,7 +185,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "9738b6f4faa64847aac316120975c9fd",
+       "model_id": "98a5410c76d04b30a59c1e96909fa675",
        "version_major": 2,
        "version_minor": 0
       },
@@ -199,7 +199,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "3a5ff0b3fb384c4f95baa77a66b504d7",
+       "model_id": "a5882a4e63ba4a2da14f8c25e5e2c451",
        "version_major": 2,
        "version_minor": 0
       },
@@ -213,7 +213,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "1412262fdfca4de5a0ac2968e37defc1",
+       "model_id": "3f7dea536b344dc4a0cf07e4e2f94436",
        "version_major": 2,
        "version_minor": 0
       },
@@ -227,7 +227,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "2f3bce275fab434ba1e045908ac56188",
+       "model_id": "d8108b62de1b4127967c1184f4e14ec5",
        "version_major": 2,
        "version_minor": 0
       },
@@ -241,7 +241,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "31504e9ab9334054a5c1fa9244b00461",
+       "model_id": "71c8c110a0694b5a9c1925011d64bc29",
        "version_major": 2,
        "version_minor": 0
       },
@@ -255,7 +255,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "df3ee4295bf14a0fbbd3edc9b9bbce43",
+       "model_id": "e6d5a7adab3643dcb1638b3bc617f354",
        "version_major": 2,
        "version_minor": 0
       },
@@ -269,7 +269,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "71e241775fd74abfa4e8403d365f8136",
+       "model_id": "de529c15360f45948e04127c88135f16",
        "version_major": 2,
        "version_minor": 0
       },
@@ -283,7 +283,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "bc2e1817be6b4207af2458943e547500",
+       "model_id": "370a8fd49b874cc0aa4599416b087f7f",
        "version_major": 2,
        "version_minor": 0
       },
@@ -297,7 +297,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "e66adb85c1f74eaf87efae66dbbd644a",
+       "model_id": "5d0f64f1871746a4be91ca50ee8da395",
        "version_major": 2,
        "version_minor": 0
       },
@@ -311,7 +311,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "8b390b25b8964592bf479b42a6320d06",
+       "model_id": "f1c498718ebf467fafa9f1a5860156fb",
        "version_major": 2,
        "version_minor": 0
       },
@@ -325,7 +325,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "8dace7243c3841cc9898fe6d304ed3ed",
+       "model_id": "2f565faa215142dd8c0d4d477eba412a",
        "version_major": 2,
        "version_minor": 0
       },
@@ -339,7 +339,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "44c48e472c6f4d5eaf3d558a8acb4519",
+       "model_id": "bfbde159bfd64bd18d62358f694db31b",
        "version_major": 2,
        "version_minor": 0
       },
@@ -353,7 +353,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "aba7957fbbf74fd78474030d6d7cd63a",
+       "model_id": "3f9a18859a1243ea9f291e81574583ad",
        "version_major": 2,
        "version_minor": 0
       },
@@ -367,7 +367,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "1289bf5d7e61407e87169aba69b4bb54",
+       "model_id": "28e949a9495e4c9db59eba1bf5a488c6",
        "version_major": 2,
        "version_minor": 0
       },
@@ -381,7 +381,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "7534620ff82e4520995023f3830514f8",
+       "model_id": "b78cd24410064bdea1e9104eed78a372",
        "version_major": 2,
        "version_minor": 0
       },
@@ -395,7 +395,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "8afdd30c664742769161be671d05902f",
+       "model_id": "9ab9e9d8efb64b68b64266f13dbc516d",
        "version_major": 2,
        "version_minor": 0
       },
@@ -409,7 +409,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "4292e6701ae845e2b77eea11bbdb25b5",
+       "model_id": "bc2c40c409274b378395f23bcd8aed4d",
        "version_major": 2,
        "version_minor": 0
       },
@@ -423,7 +423,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "822431e328954a908c068da8b2bf124d",
+       "model_id": "15c6eaf126374babb2c757c93b6dcfb9",
        "version_major": 2,
        "version_minor": 0
       },
@@ -437,7 +437,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "86c4b77801b548a2a6ee90abea3e8b0e",
+       "model_id": "769ed6f0be494e5696b15b4334e8edf8",
        "version_major": 2,
        "version_minor": 0
       },
@@ -451,7 +451,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "99c2fb7c7ab04103bcd8938f96aa5111",
+       "model_id": "1a15d7bb96f84429bb513f1458de3e93",
        "version_major": 2,
        "version_minor": 0
       },
@@ -465,7 +465,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "dfc7b6b9ba654496be0bcefe5f717f59",
+       "model_id": "6998d02b1fc546c1808232bb398b3b1a",
        "version_major": 2,
        "version_minor": 0
       },
@@ -479,7 +479,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "c2f45921939d4742a0d5254e404a015b",
+       "model_id": "3b3250fa2e2d4688a6834105e57226c0",
        "version_major": 2,
        "version_minor": 0
       },
@@ -493,7 +493,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "14627090dac345a695601984a5ec128a",
+       "model_id": "2a92717d58b6495186a975d5ddf858c1",
        "version_major": 2,
        "version_minor": 0
       },
@@ -507,7 +507,7 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "5e7980c5c222436d895c64f85104de83",
+       "model_id": "507d3b9a1f89420082741923b12f258f",
        "version_major": 2,
        "version_minor": 0
       },
@@ -570,16 +570,21 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "Because the full null size $d_{null}=118755$, smaller sample sizes ($<5,000$) lead to poor estimation of significance for these data, while very large values ($>100,000$) cover the whole null and do not affect perturbation ranking results.\n",
+    "Because the full null size is $d_{null}=118755$, smaller sample sizes ($\\leq 1,000$) lead to poor estimation of significance for these data, while very large values ($>100,000$) cover the whole null and do not affect perturbation ranking results.\n",
     "\n",
-    "## Practical consideration for choosing null size\n",
+    "## Practical consideration for choosing the null size\n",
     "\n",
     "In practice, drawing a large number of samples is not always feasible, because the compute time for each AP calculation grows with the number of perturbations in the dataset, the number of metadata constraints for profile grouping, the sizes of perturbation groups (the number of perturbation replicates) and control groups (the number of control replicates), and profile dimensionality (the number of features in a profile).\n",
     "\n",
-    "Finding a `null_size` that works for a particular dataset is balancing between test resolution (for example, being able to tell apart vary small p-values) and compute. We provided `null_size` values for each real-world dataset in Supplemental Materials to our paper—please refer to:\n",
+    "Finding a `null_size` that works for a particular dataset means balancing test resolution (for example, being able to tell apart very small p-values) against compute cost. We provided `null_size` values for each real-world dataset in the Supplemental Materials to our paper; please refer to:\n",
     "\n",
     "> Kalinin, A. A. et al. A versatile information retrieval framework for evaluating profile strength and similarity. bioRxiv, 2024-04, (2024)."
    ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": []
   }
  ],
  "metadata": {