You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
ehrapy transition to ehrdata: take over ehrapys data functionalities (#136)
* add basic dataframe support and test
* more tests, modified to_dataframe
* read_csv
* save commit
* more on csv, h5ad
* save commit
* updates, expanded from_pandas
* expand to_pandas
* type inference and missing value as strings
* iter commit, tests fail
* fix to_pandas and feature types, add tests
* add tests and doc for four new datasets
* save commit
* basic draft for read_zarr
* h5ad, zarr io draft
* harmonize to filename as anndata
* remove commented out code
* changelog, refactoring
* unreleased statement in changelog
* fix refactoring
* include feature type functions in doc:
* be more clear about file formats in download internally
* Apply suggestions from code review
Co-authored-by: Lukas Heumos <lukas.heumos@posteo.net>
* more intersphinx, more pandas examples, fixes
* iter commit
* address more review comments:
* sparse io test files
* cleaning round, reduce redundancy
* add missing h5ad, argument cleaning
* pandas description in tutorial
* remove backed comment zarr
* more explicit arguments, cleanup
* test nonnumeric layer
* utilzz refactor
* fix doc? fix typos
* refactor .tl, .io, fix typos
* fix display of first level functions?
* Apply suggestions from code review
Co-authored-by: Lukas Heumos <lukas.heumos@posteo.net>
* address comments
* split lines in csv
* split lins for more docstrings in io
* hdf5, fastarrayutils dependency
* csv read description like pandas
* more io like pandas wording
---------
Co-authored-by: Lukas Heumos <lukas.heumos@posteo.net>
- Cleaned up and updated tutorial notebooks ([#140](https://github.com/theislab/ehrdata/pull/140)) @agerardy
15
+
16
+
### Added
17
+
- {func}`~ehrdata.io.read_csv` Reads a csv file ([#136](https://github.com/theislab/ehrdata/pull/136)) @eroell
18
+
- {func}`~ehrdata.io.read_h5ad` Reads an h5ad file ([#136](https://github.com/theislab/ehrdata/pull/136)) @eroell
19
+
- {func}`~ehrdata.io.read_zarr` Reads a zarr file ([#136](https://github.com/theislab/ehrdata/pull/136)) @eroell
20
+
- {func}`~ehrdata.io.write_h5ad` Writes an h5ad file ([#136](https://github.com/theislab/ehrdata/pull/136)) @eroell
21
+
- {func}`~ehrdata.io.write_zarr` Writes a zarr file ([#136](https://github.com/theislab/ehrdata/pull/136)) @eroell
22
+
- {func}`~ehrdata.io.from_pandas` Transform a given {class}`~pandas.DataFrame` into an {class}`~ehrdata.EHRData` object ([#136](https://github.com/theislab/ehrdata/pull/136)) @eroell
23
+
- {func}`~ehrdata.io.to_pandas` Transform an {class}`~ehrdata.EHRData` object into a {class}`~pandas.DataFrame` ([#136](https://github.com/theislab/ehrdata/pull/136)) @eroell
24
+
- {func}`~ehrdata.dt.mimic_2` Loads the MIMIC-II dataset ([#136](https://github.com/theislab/ehrdata/pull/136)) @eroell
25
+
- {func}`~ehrdata.dt.mimic_2_preprocessed` Loads the preprocessed MIMIC-II dataset ([#136](https://github.com/theislab/ehrdata/pull/136)) @eroell
26
+
- {func}`~ehrdata.dt.diabetes_130_raw` Loads the raw diabetes-130 dataset ([#136](https://github.com/theislab/ehrdata/pull/136)) @eroell
27
+
- {func}`~ehrdata.dt.diabetes_130_fairlearn` Loads the preprocessed diabetes-130 dataset by fairlearn ([#136](https://github.com/theislab/ehrdata/pull/136)) @eroell
28
+
- {func}`~ehrdata.infer_feature_types` Infer feature types in an {class}`~ehrdata.EHRData` object ([#136](https://github.com/theislab/ehrdata/pull/136)) @eroell
29
+
- {func}`~ehrdata.feature_type_overview` Overview of inferred feature types ([#136](https://github.com/theislab/ehrdata/pull/136)) @eroell
Copy file name to clipboardExpand all lines: docs/references.bib
+44Lines changed: 44 additions & 0 deletions
Original file line number
Diff line number
Diff line change
@@ -1,3 +1,19 @@
1
+
@article{strack2014impact,
2
+
title={Impact of HbA1c measurement on hospital readmission rates: analysis of 70,000 clinical database patient records},
3
+
author={Strack, Beata and DeShazo, Jonathan P and Gennings, Chris and Olmo, Juan L and Ventura, Sebastian and Cios, Krzysztof J and Clore, John N},
4
+
journal={BioMed research international},
5
+
volume={2014},
6
+
number={1},
7
+
pages={781670},
8
+
year={2014},
9
+
publisher={Wiley Online Library}
10
+
}
11
+
@article{bird2020fairlearn,
12
+
title={Fairlearn: A toolkit for assessing and improving fairness in AI},
13
+
author={Bird, Sarah and Dud{\'\i}k, Miro and Edgar, Richard and Horn, Brandon and Lutz, Roman and Milan, Vanessa and Sameki, Mehrnoosh and Wallach, Hanna and Walker, Kathleen},
title = {The scverse project provides a computational ecosystem for single-cell omics data analysis},
9
25
journal = {Nature Biotechnology}
10
26
}
27
+
@book{critical2016secondary,
28
+
title={Secondary analysis of electronic health records},
29
+
author={Critical Data, MIT},
30
+
year={2016},
31
+
publisher={Springer Nature}
32
+
}
33
+
@article{du2023saits,
34
+
title={Saits: Self-attention-based imputation for time series},
35
+
author={Du, Wenjie and C{\^o}t{\'e}, David and Liu, Yan},
36
+
journal={Expert Systems with Applications},
37
+
volume={219},
38
+
pages={119619},
39
+
year={2023},
40
+
publisher={Elsevier}
41
+
}
42
+
@article{du2023pypots,
43
+
title={PyPOTS: a Python toolbox for data mining on Partially-Observed Time Series},
44
+
author={Du, Wenjie},
45
+
journal={arXiv preprint arXiv:2305.18811},
46
+
year={2023}
47
+
}
48
+
@article{kallfelz2021mimic,
49
+
title={MIMIC-IV demo data in the OMOP Common Data Model},
50
+
author={Kallfelz, Michael and Tsvetkova, Anna and Pollard, Tom and Kwong, Manlik and Lipori, Gigi and Huser, Vojtech and Osborn, Jeffrey and Hao, Sicheng and Williams, Andrew},
0 commit comments