You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Requires python 3.6 or above. Required packages and versions are listed in `requirements.txt`. Older versions may still work but have not been tested.
6
6
7
-
Note: This README contains latex equations and is best viewed on GitLab.
7
+
Note: This README contains latex equations and is best viewed on the GitLab site (https://gitlab.eecs.umich.edu/mld3/FIDDLE).
8
8
9
9
## Publications & Resources
10
+
- Title: <b>Democratizing EHR analyses with FIDDLE: a flexible data-driven preprocessing pipeline for structured clinical data.</b>
11
+
- Authors: Shengpu Tang, Parmida Davarmanesh, Yanmeng Song, Danai Koutra, Michael W. Sjoding, and Jenna Wiens.
12
+
- Published in JAMIA (Journal of the American Medical Informatics Association), October 2020: [article link](https://doi.org/10.1093/jamia/ocaa139)
13
+
- Previously presented at MLHC 2019 (<i>[Machine Learning for Healthcare](https://www.mlforhc.org/)</i>) as a [clinical abstract](https://www.mlforhc.org/s/Sjoding-jete.pdf)
14
+
- News coverage on HealthcareITNews: [link](https://www.healthcareitnews.com/news/new-framework-helps-streamline-ehr-data-extraction)
- Michael W. Sjoding, Shengpu Tang, Parmida Davarmanesh, Yanmeng Song, Danai Koutra, and Jenna Wiens. <b>[Democratizing EHR Analyses - a Comprehensive, Generalizable Pipeline for Learning from Clinical Data](https://www.mlforhc.org/s/Sjoding-jete.pdf)</b>. Presented at MLHC <i>([Machine Learning for Healthcare](https://www.mlforhc.org/), Clinical Abstract)</i>, 2019.
- Our journal paper is currently under review by JAMIA and will be made available soon
17
+
If you use FIDDLE in your research, please cite the following publication:
18
+
19
+
```
20
+
@article{10.1093/jamia/ocaa139,
21
+
author = {Tang, Shengpu and Davarmanesh, Parmida and Song, Yanmeng and Koutra, Danai and Sjoding, Michael W and Wiens, Jenna},
22
+
title = "{Democratizing EHR analyses with FIDDLE: a flexible data-driven preprocessing pipeline for structured clinical data}",
23
+
journal = {Journal of the American Medical Informatics Association},
24
+
year = {2020},
25
+
month = {10},
26
+
issn = {1527-974X},
27
+
doi = {10.1093/jamia/ocaa139},
28
+
}
29
+
```
15
30
16
31
## Usage Notes
17
32
FIDDLE generates feature vectors based on data within the observation period $`t\in[0,T]`$. This feature representation can be used to make predictions of adverse outcomes at t=T. More specifically, FIDDLE outputs a set of binary feature vectors for each example $`i`$, $`\{(s_i,x_i)\ \text{for}\ i=1 \dots N\}`$ where $`s_i \in R^d`$ contains time-invariant features and $`x_i \in R^{L \times D}`$ contains time-dependent features.
18
33
19
34
Input:
20
-
- formatted EHR data,`.csv` or `.p`/`.pickle`files, table with 4 columns:\[`ID`, `t`, `variable_name`, `variable_value`\]
35
+
- formatted EHR data:`.csv` or `.p`/`.pickle`file, a table with 4 columns \[`ID`, `t`, `variable_name`, `variable_value`\]
21
36
- population file: a list of unique `ID`s you want processed
22
37
- arguments:
23
38
- T: The time of prediction; time-dependent features will be generated using data in $`t\in[0,T]`$.
0 commit comments