Skip to content

keanteng/wqd7005-assignment-1

Repository files navigation

WQD7005 Assignment 1

Python Jupyter Notebook LaTeX

Perform exploratory data analysis (EDA) and advanced data preprocessing on simulated patient data leveraging Generative AI (GenAI), Large Language Models (LLMs), and Small Language Models (SLMs). The dataset will cover six vital signs (oxygen saturation, heart rate, temperature, blood pressure, weight, and blood glucose), questionnaire responses, and timestamps.

AI Tools

GitHub Copilot Google Gemini

Tasks:

  1. Dataset Simulation using GenAI (3 marks)
  • Simulate a dataset representing 500 patients monitored over 1 month. Utilize GenAI to produce realistic numerical variations in vital signs and generate plausible textual questionnaire responses or clinical notes, incorporating scenarios with missing data.
  1. Exploratory Data Analysis (EDA) enhanced by LLMs (4 marks)
  • Conduct comprehensive exploratory data analysis using visualizations and statistical summaries.
  • Utilize Large Language Models (e.g., GPT-4) to interpret complex patterns, automatically summarize findings, identify trends, anomalies, and provide clinically relevant insights.
  1. Advanced Data Preprocessing utilizing SLMs/LLMs (4 marks)
  • Implement preprocessing techniques, including intelligent missing value handling, normalization, and categorical encoding.
  • Apply Small Language Models or fine-tuned LLMs to handle textual data preprocessing tasks, such as classifying questionnaire responses, sentiment analysis, or textual data imputation.
  1. AI-Assisted Summary Report and Visualization (4 marks)
  • Prepare a short, insightful report (2-3 pages) summarizing findings, preprocessing techniques, and key insights from the analysis.
  • Leverage LLMs to draft clear, coherent explanations for visualizations and data-driven insights.

Using this Repository

Load to your local machine:

git clone https://github.com/keanteng/wqd7005-assignment-1

Make sure to install tex-live and latex-workshop to compile the LaTeX. tex-live can be downloaded online and latex-workshop can be installed via VSCode extensions.

About

AI-Enhanced Exploratory Analysis and Preprocessing of Simulated Patient Vital Signs Data

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published