
Commit cdf3a09

Author: MSenden
slight improvements of workflow descriptions. Added short note on notebooks.
1 parent 573405e commit cdf3a09

1 file changed: +7 -4 lines changed

README.md

Lines changed: 7 additions & 4 deletions
@@ -37,7 +37,7 @@ Senden, M. (2025). NeuroScape (1.0.1) [Data set]. Zenodo. [https://doi.org/10.52

 1. **Scrape Data**
    - *scripts/ingestion/scraping.py*
-   - Query PubMed for relevant neuroscience articles.
+   - Query PubMed for relevant articles. Neuroscience, multidisciplinary and other (for training discipline classifier).

 3. **Merge and Clean**
    - *scripts/ingestion/merge_and_clean.py*
@@ -53,7 +53,7 @@ Senden, M. (2025). NeuroScape (1.0.1) [Data set]. Zenodo. [https://doi.org/10.52

 6. **Train Discipline Classifier**
    - *scripts/preprocessing/train_discipline_classifier.py*
-   - Train a neural network to identify neuroscience-related articles.
+   - Train a neural network to identify neuroscientific articles.

 7. **Filter Data**
    - *scripts/preprocessing/filter_disciplines.py*
@@ -81,7 +81,7 @@ Senden, M. (2025). NeuroScape (1.0.1) [Data set]. Zenodo. [https://doi.org/10.52

 12. **Cluster Definition**
    - *scripts/semantic_analysis/cluster_definition.py*
-   - Generate descriptive titles, keywords, and summaries for each cluster.
+   - Generate descriptive titles, keywords, and descriptions for each cluster.

 13. **Cluster Distinction**
    - *scripts/semantic_analysis/cluster_distinction.py*
@@ -93,7 +93,7 @@ Senden, M. (2025). NeuroScape (1.0.1) [Data set]. Zenodo. [https://doi.org/10.52

 15. **Dimension Categorization**
    - *scripts/semantic_analysis/assess_dimension_categories.py*
-   - Categorize clusters along specific sub-dimensions (e.g., spatial vs. temporal scales).
+   - Categorize clusters along specific sub-dimensions (e.g., fundamental vs clinical research within appliedness).

 16. **Open Questions**
    - *scripts/semantic_analysis/extract_open_questions.py*
@@ -110,6 +110,9 @@ Senden, M. (2025). NeuroScape (1.0.1) [Data set]. Zenodo. [https://doi.org/10.52
 **Optional**
 - *scripts/preprocessing/update_embedding.py* allows updating the general embeddings with a newer Voyage AI model if desired.

+## Notebooks
+
+Notebooks handle creation of Figures and Tables for the manuscript, including additional analyses.

 ## **Setup Instructions**
