Commit 4c978eb
Day 4: Implement Step 1 of timestamp features + pandas syntax review
- Added make_timestamp_features.py (initial draft)
• Implemented Step 1: load news2_scores.csv, convert charttime to datetime, sort by subject_id, stay_id, charttime
• Ensures clean chronological dataset per patient before feature engineering
- Reviewed pandas syntax and project scripts
• Annotated existing code (compute_news2.py, extract_news2_vitals.py, etc.) to understand DataFrame operations
• Gained clarity on indexing, merging, pivoting, and reset_index(drop=True)
- Notes.md updated with Day 4 reflections
• Focused on pandas learning curve, debugging strategies (print df.head()), and rationale for step order in pipeline
• Recognised need to master a small set of pandas operations to move faster in later steps
1 parent 3158ec3 commit 4c978eb
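For reference, Step 1 as described in the commit message (load, convert charttime to datetime, sort by subject_id, stay_id, charttime) might look roughly like the sketch below. This is not the actual content of make_timestamp_features.py, which is not shown here. The inline sample data, the news2_score column, and the load_and_sort helper name are assumptions for illustration. In the real script the source would be the path to news2_scores.csv.

```python
import io

import pandas as pd

# Hypothetical stand-in for news2_scores.csv; the real file's columns
# beyond subject_id, stay_id and charttime are not shown in this commit.
SAMPLE_CSV = """subject_id,stay_id,charttime,news2_score
2,20,2180-01-02 08:00:00,3
1,10,2180-01-01 12:00:00,5
1,10,2180-01-01 06:00:00,2
1,11,2180-03-05 09:30:00,1
"""


def load_and_sort(source):
    """Load NEWS2 scores, parse charttime, and sort rows per patient stay."""
    df = pd.read_csv(source)
    # Parse the timestamp strings so sorting is chronological, not lexical.
    df["charttime"] = pd.to_datetime(df["charttime"])
    # Sort by patient, then stay, then time; drop the old shuffled index.
    df = df.sort_values(["subject_id", "stay_id", "charttime"])
    return df.reset_index(drop=True)


df = load_and_sort(io.StringIO(SAMPLE_CSV))
print(df.head())  # quick sanity check, as mentioned in the Day 4 notes
```

Using reset_index(drop=True) after the sort gives a clean 0..n-1 index without keeping the old row order as an extra column, which should make later positional operations easier to reason about.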
50 files changed
Lines changed: 216 additions & 71 deletions
File tree
- data
- interim-data
- raw-data
- hosp
- icu
- src
- data-processing
- ml-data-prep