Skip to content

ua-datalab/DataScience-Tapas

Repository files navigation

Data Science - Tapas

(Image credit: Veronika Hradilová. Unsplash.com)


Data Science Tapas (aka DS-Tapas)

Applied Data Science Tapas knowledge capsules provide comprehensive educational content spanning multiple domains: from foundational data science principles and methodologies, through advanced machine learning techniques and algorithms, to cutting-edge Deep Learning applications in Artificial Intelligence. These capsules are designed to bridge theoretical concepts with practical implementations, offering insights into both established practices and emerging trends in the field of computational data analysis.

License: GPL v3


Schedule

Spring 2025


Date Title Description Instructor Materials YouTube
05-Feb-2025 Introduction to Python for Data Science Python stands as a leading programming language in data science, known for its powerful capabilities in mathematics, statistics, machine learning, data visualization, and scientific computing. Carlos Lizárraga Notes video
19-Feb-2025 Introduction to Machine Learning Algorithms Classical machine learning methods developed before the rise of deep learning include decision trees, support vector machines, and linear regression. These algorithms work best with structured data and need manual feature engineering. Carlos Lizárraga Notes video
05-Mar-2025 Introduction to Visualization: Theory and Practice Visualization combines art and science to help understand complex information. It requires both theoretical knowledge of how people perceive visual elements like colors and shapes, and practical skills in using tools to create charts and graphs. Like artists, data visualizers tell stories through visual representations. Devin Bayly Notes video
19-Mar-2025 Introduction to Deep Learning for Healthcare Advanced AI techniques like deep learning and sequence modeling play a transformative role in improving healthcare diagnostics and patient care. Understanding the ethical implications of AI in healthcare is crucial as we work to ensure responsible usage of these increasingly integrated technologies. Greg Chism Notes video
02-Apr-2025 Introduction to Speech to text with Whisper AI Whisper AI is a speech recognition technology that converts spoken language into written text. It's an example of AI technology that can transcribe audio content into readable text format. Megh Krishnaswamy Notes video
16-Apr-2025 Introduction to Python Accelerated Data Science with RAPIDS RAPIDS is a suite of software tools that accelerates data science workflows. Built to integrate with Python, it harnesses the parallel processing power of graphics processing units (GPUs). Like having thousands of tiny processors working simultaneously, GPUs enable much faster data processing. RAPIDS essentially acts as a turbocharger for data science projects. Devin Bayly Notes video

Previous Workshops

Spring 2024

Date Title Instructor Materials YouTube
01-17-2024 Command Line Interface Carlos Lizárraga Notes video
01-24-2024 UNIX Shell Command Line Programming Carlos Lizárraga Notes
01-31-2024 Git/Github Michele Cosi Notes video
03-27-2024 Introduction to Markdown Michele Cosi Notes video
05-31-2024 University of Arizona High Performance Computing Michele Cosi Notes

Spring 2023

Date Title Instructor Materials
02-Feb-2023 Network Visualization in R Greg Chism Notes / Recording
09-Feb-2023 Outliers Analysis and Anomalies Detection Carlos Lizárraga Notes / Jupyter Notebook Example
16-Feb-2023 REDCap to R Markdown, reproducibly Heidi Steiner Slides / Materials / Recording
23-Feb-2023 Simulating Data for Study Design in R Greg Chism Notes / Recording
02-Mar-2023 Statistical power analysis in R Greg Chism Notes / Recording
16-Mar-2023 GWAS in Hail Heidi Steiner Slides / Materials
23-Mar-2023 Low-code Exploratory Data Analysis Carlos Lizárraga Notes / Jupyter Notebook Example
30-Mar-2023 Low-code Time Series Analysis Carlos Lizárraga Notes / Jupyter Notebook Example
06-Apr-2023 Metagenomics with phyloseq in R Heidi Steiner Notes
13-Apr-2023 K-means clustering with tidy data principles Greg Chism Notes
20-Apr-2023 Low-code Machine Learning Platforms Carlos Lizárraga Notes / Jupyter Notebook
27-Apr-2023 Designing Quarto slides Heidi Steiner

Created: 01/10/2025 (C. Lizárraga)

Updated: 01/20/2025 (C. Lizárraga)

DataLab, Data Science Institute, University of Arizona.

CC BY-NC-SA 4.0

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •