Commit 5d916f4

Merge pull request #1311 from ELC/pydata-virginia-2025
Add Pydata Virginia 2025
2 parents 5d5d4de + 4a357bb commit 5d916f4

27 files changed (+758 -0 lines changed)
27 files changed

+758
-0
lines changed

pydata-virginia-2025/category.json

Lines changed: 3 additions & 0 deletions
@@ -0,0 +1,3 @@
+{
+  "title": "PyData Virginia 2025"
+}
Lines changed: 28 additions & 0 deletions
@@ -0,0 +1,28 @@
+{
+  "description": "SciPy is a powerful library for scientific and technical computing in Python. The primary objectives of this presentation are to explore the core concepts of Responsible AI and to demonstrate these concepts with SciPy.",
+  "duration": 3152,
+  "language": "eng",
+  "recorded": "2025-04-18",
+  "related_urls": [
+    {
+      "label": "Conference Website",
+      "url": "https://pydata.org/virginia2025"
+    },
+    {
+      "label": "https://github.com/numfocus/YouTubeVideoTimestamps",
+      "url": "https://github.com/numfocus/YouTubeVideoTimestamps"
+    }
+  ],
+  "speakers": [
+    "Andrea Hobby"
+  ],
+  "tags": [],
+  "thumbnail_url": "https://i.ytimg.com/vi/W6fTFSgyhMg/maxresdefault.jpg",
+  "title": "Responsible AI with SciPy",
+  "videos": [
+    {
+      "type": "youtube",
+      "url": "https://www.youtube.com/watch?v=W6fTFSgyhMg"
+    }
+  ]
+}
Lines changed: 28 additions & 0 deletions
@@ -0,0 +1,28 @@
+{
+  "description": "Multi-armed bandits are a reinforcement learning tool often used in environments where the cost or rewards of different choices are unknown or where those functions may change over time. The good news is that as far as implementation goes, bandits are surprisingly easy to implement; however, in practice, the difficulty comes from defining a reward function that best targets your specific use case. In this talk, we will discuss how to use bandit algorithms effectively, taking note of practical strategies for experimental design and deployment of bandits in your applications.",
+  "duration": 1819,
+  "language": "eng",
+  "recorded": "2025-04-18",
+  "related_urls": [
+    {
+      "label": "Conference Website",
+      "url": "https://pydata.org/virginia2025"
+    },
+    {
+      "label": "https://github.com/numfocus/YouTubeVideoTimestamps",
+      "url": "https://github.com/numfocus/YouTubeVideoTimestamps"
+    }
+  ],
+  "speakers": [
+    "Benjamin Bengfort"
+  ],
+  "tags": [],
+  "thumbnail_url": "https://i.ytimg.com/vi/jP978VKBl-w/maxresdefault.jpg",
+  "title": "Practical Multi Armed Bandits",
+  "videos": [
+    {
+      "type": "youtube",
+      "url": "https://www.youtube.com/watch?v=jP978VKBl-w"
+    }
+  ]
+}
Lines changed: 28 additions & 0 deletions
@@ -0,0 +1,28 @@
+{
+  "description": "When Bayesian modeling scales up to large datasets, traditional MCMC methods can become impractical due to their computational demands. Variational Inference (VI) offers a scalable alternative, trading exactness for speed while retaining the essence of Bayesian inference.\n\nIn this tutorial, we\u2019ll explore how to implement and compare VI techniques in PyMC, including Automatic Differentiation Variational Inference (ADVI) and the cutting-edge Pathfinder algorithm.\n\nStarting with simple models like linear regression, we\u2019ll gradually introduce more complex, real-world applications, comparing the performance of VI against Markov Chain Monte Carlo (MCMC) to understand the trade-offs in speed and accuracy.\n\nThis tutorial will arm participants with practical tools to deploy VI in their workflows and help answer pressing questions, like \"What do I do when MCMC is too slow?\", or \"How does VI compare to MCMC in terms of approximation quality?\".",
+  "duration": 5357,
+  "language": "eng",
+  "recorded": "2025-04-18",
+  "related_urls": [
+    {
+      "label": "Conference Website",
+      "url": "https://pydata.org/virginia2025"
+    },
+    {
+      "label": "https://github.com/numfocus/YouTubeVideoTimestamps",
+      "url": "https://github.com/numfocus/YouTubeVideoTimestamps"
+    }
+  ],
+  "speakers": [
+    "Chris Fonnesbeck"
+  ],
+  "tags": [],
+  "thumbnail_url": "https://i.ytimg.com/vi/XECLmgnS6Ng/maxresdefault.jpg",
+  "title": "A Beginner's Guide to Variational Inference",
+  "videos": [
+    {
+      "type": "youtube",
+      "url": "https://www.youtube.com/watch?v=XECLmgnS6Ng"
+    }
+  ]
+}
@@ -0,0 +1,28 @@
+{
+  "description": "Geospatial data can unlock valuable insights. OpenStreetMap includes electric power and telecommunication infrastructure geospatial data, and it is already \u201copen\u201d. This presentation will demonstrate how to use Python to \u201cunlock the insights\u201d available in OSM power and telecommunications geospatial data.",
+  "duration": 1522,
+  "language": "eng",
+  "recorded": "2025-04-18",
+  "related_urls": [
+    {
+      "label": "Conference Website",
+      "url": "https://pydata.org/virginia2025"
+    },
+    {
+      "label": "https://github.com/numfocus/YouTubeVideoTimestamps",
+      "url": "https://github.com/numfocus/YouTubeVideoTimestamps"
+    }
+  ],
+  "speakers": [
+    "Cory Eicher"
+  ],
+  "tags": [],
+  "thumbnail_url": "https://i.ytimg.com/vi/kmRyFmMThVo/maxresdefault.jpg",
+  "title": "Using Python to Unlock Insights from OpenStreetMap Data at Scale",
+  "videos": [
+    {
+      "type": "youtube",
+      "url": "https://www.youtube.com/watch?v=kmRyFmMThVo"
+    }
+  ]
+}
Lines changed: 36 additions & 0 deletions
@@ -0,0 +1,36 @@
+{
+  "description": "Tired of waiting for massive datasets to load on your local machine? In this beginner-friendly tutorial, we\u2019ll explore how to scale your data analysis skills from pandas to PySpark using a real-world anime dataset. We\u2019ll walk through the basics of distributed computing, discuss why Spark was created, and demonstrate the benefits of working with PySpark for big data tasks\u2014including reading, cleaning, and transforming millions of records with ease. By the end of this workshop, you\u2019ll understand how PySpark harnesses cluster computing to handle large-scale data and you\u2019ll be comfortable applying these techniques to your own projects.\n\nParticipant Requirements:\n- A laptop (any OS) with an internet connection\n- A Google account (to access Colab notebooks and slides)\n- Familiarity with Python and pandas\n\nHere's the link to the Google Colab to follow along \ud83d\udc47\ud83c\udffe\nhttps://colab.research.google.com/drive/1fi0cTQ1NIE5kDEH0ynp2sqDuVeiBJJWU?usp=sharing\n\nHere are the slides \ud83d\udc47\ud83c\udffe\nhttps://drive.google.com/file/d/11JIih1VzLxTJ9O6PeGzqD_e8vumTZQmw/view?usp=sharing",
+  "duration": 4399,
+  "language": "eng",
+  "recorded": "2025-04-18",
+  "related_urls": [
+    {
+      "label": "Conference Website",
+      "url": "https://pydata.org/virginia2025"
+    },
+    {
+      "label": "https://colab.research.google.com/drive/1fi0cTQ1NIE5kDEH0ynp2sqDuVeiBJJWU?usp=sharing",
+      "url": "https://colab.research.google.com/drive/1fi0cTQ1NIE5kDEH0ynp2sqDuVeiBJJWU?usp=sharing"
+    },
+    {
+      "label": "https://drive.google.com/file/d/11JIih1VzLxTJ9O6PeGzqD_e8vumTZQmw/view?usp=sharing",
+      "url": "https://drive.google.com/file/d/11JIih1VzLxTJ9O6PeGzqD_e8vumTZQmw/view?usp=sharing"
+    },
+    {
+      "label": "https://github.com/numfocus/YouTubeVideoTimestamps",
+      "url": "https://github.com/numfocus/YouTubeVideoTimestamps"
+    }
+  ],
+  "speakers": [
+    "Cynthia Ukawu"
+  ],
+  "tags": [],
+  "thumbnail_url": "https://i.ytimg.com/vi/McbJMdcKp5c/maxresdefault.jpg",
+  "title": "From Pandas to PySpark",
+  "videos": [
+    {
+      "type": "youtube",
+      "url": "https://www.youtube.com/watch?v=McbJMdcKp5c"
+    }
+  ]
+}
Lines changed: 28 additions & 0 deletions
@@ -0,0 +1,28 @@
+{
+  "description": "The team behind DVC has spent years tackling data versioning challenges. With the rise of AI, we\u2019ve seen new complexities emerge - especially with multimodal datasets like images, video, audio, and text. This talk shows why multimodal data versioning is different and how Pydantic provides a powerful way to structure and integrate metadata.",
+  "duration": 1659,
+  "language": "eng",
+  "recorded": "2025-04-18",
+  "related_urls": [
+    {
+      "label": "Conference Website",
+      "url": "https://pydata.org/virginia2025"
+    },
+    {
+      "label": "https://github.com/numfocus/YouTubeVideoTimestamps",
+      "url": "https://github.com/numfocus/YouTubeVideoTimestamps"
+    }
+  ],
+  "speakers": [
+    "Dmitry Petrov"
+  ],
+  "tags": [],
+  "thumbnail_url": "https://i.ytimg.com/vi/yNBoJSKl49U/maxresdefault.jpg",
+  "title": "Versioning Multimodal Data: Metadata & Beyond",
+  "videos": [
+    {
+      "type": "youtube",
+      "url": "https://www.youtube.com/watch?v=yNBoJSKl49U"
+    }
+  ]
+}
@@ -0,0 +1,28 @@
+{
+  "description": "Health disparities remain a critical challenge in public health, demanding innovative approaches to uncover inequities and drive actionable change. This webinar will demonstrate how Python can serve as a powerful tool for creating data visualizations that illustrate the unequal burden of HIV across different populations. Participants will learn how Python\u2019s popular libraries, such as Matplotlib, Seaborn, and Plotly, can transform complex datasets into accessible, impactful visuals.\nUsing an HIV dataset containing demographic, geographic, and clinical variables, this session will guide attendees through a series of practical examples. From creating heatmaps and geospatial maps to analyzing temporal trends, the webinar emphasizes how to identify and communicate key social determinants related to race, gender, socioeconomic status, and access to care. Through hands-on demonstrations, attendees will see how Python\u2019s capabilities streamline data analysis and visualization workflows.\nKey takeaways from the session include identifying regions and communities in Texas disproportionately affected by HIV, uncovering intersectional factors influencing health outcomes, and leveraging visual tools to inform policy and resource allocation. Special attention will be given to designing visuals that resonate with non-technical audiences, ensuring findings are actionable for public health professionals and policymakers.",
+  "duration": 4007,
+  "language": "eng",
+  "recorded": "2025-04-18",
+  "related_urls": [
+    {
+      "label": "Conference Website",
+      "url": "https://pydata.org/virginia2025"
+    },
+    {
+      "label": "https://github.com/numfocus/YouTubeVideoTimestamps",
+      "url": "https://github.com/numfocus/YouTubeVideoTimestamps"
+    }
+  ],
+  "speakers": [
+    "Kimberly Deas"
+  ],
+  "tags": [],
+  "thumbnail_url": "https://i.ytimg.com/vi/-BA2eXBoDoc/maxresdefault.jpg",
+  "title": "Data Viz in Python as a Tool to Study HIV Health Disparities",
+  "videos": [
+    {
+      "type": "youtube",
+      "url": "https://www.youtube.com/watch?v=-BA2eXBoDoc"
+    }
+  ]
+}
@@ -0,0 +1,29 @@
+{
+  "description": "Where do landlords engage in more eviction actions? What characteristics of renters or landlords increase the practice of serial filing? There is widespread interest in using administrative data -- information collected by government and agencies in the implementation of public programs -- to evaluate systems and promote more just outcomes. Working with the Civil Court Data Initiative of Legal Services Corporation, we use data collected from civil court records in Virginia to analyze the behavior of landlords. Expanding on our Virginia Evictors Catalog, we use data on court evictions to build additional data tools to support the work of legal and housing advocates and model key eviction outcomes to contribute to our understanding of landlord behavior.",
+  "duration": 1755,
+  "language": "eng",
+  "recorded": "2025-04-18",
+  "related_urls": [
+    {
+      "label": "Conference Website",
+      "url": "https://pydata.org/virginia2025"
+    },
+    {
+      "label": "https://github.com/numfocus/YouTubeVideoTimestamps",
+      "url": "https://github.com/numfocus/YouTubeVideoTimestamps"
+    }
+  ],
+  "speakers": [
+    "Michele Claibourn",
+    "Samantha Toet"
+  ],
+  "tags": [],
+  "thumbnail_url": "https://i.ytimg.com/vi/eE0D79trL2c/maxresdefault.jpg",
+  "title": "Exploring Eviction Trends in Virginia",
+  "videos": [
+    {
+      "type": "youtube",
+      "url": "https://www.youtube.com/watch?v=eE0D79trL2c"
+    }
+  ]
+}
@@ -0,0 +1,28 @@
+{
+  "description": "When every day counted during the COVID-19 pandemic, data science became an essential catalyst in accelerating the path to widespread vaccination. This talk delves into the data-driven strategies that enabled the U.S. government\u2019s vaccine trials to move faster, cutting crucial weeks\u20146 to 8, by our estimates\u2014off the timeline to deployment. Through sophisticated geospatial modeling, we identified and swiftly mobilized trial recruitment efforts in emerging hot zones, ensuring that each candidate pool was both numerically sufficient and demographically representative. Attendees will discover how advanced analytics, predictive modeling, and interdisciplinary collaboration converged to target the right communities at the right time, ultimately expediting vaccine availability. This behind-the-scenes look at rapid-response data science highlights not just the technical innovations, but the decisive cultural and operational shifts that turned real-time insights into life-saving action.",
+  "duration": 1733,
+  "language": "eng",
+  "recorded": "2025-04-18",
+  "related_urls": [
+    {
+      "label": "Conference Website",
+      "url": "https://pydata.org/virginia2025"
+    },
+    {
+      "label": "https://github.com/numfocus/YouTubeVideoTimestamps",
+      "url": "https://github.com/numfocus/YouTubeVideoTimestamps"
+    }
+  ],
+  "speakers": [
+    "Greg Michaelson"
+  ],
+  "tags": [],
+  "thumbnail_url": "https://i.ytimg.com/vi/zXKdjBv1SGc/maxresdefault.jpg",
+  "title": "How data science shortened the COVID-19 pandemic by 2 months",
+  "videos": [
+    {
+      "type": "youtube",
+      "url": "https://www.youtube.com/watch?v=zXKdjBv1SGc"
+    }
+  ]
+}
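
Every talk file added in this commit carries the same ten top-level keys. The Python sketch below is one way such a record could be sanity-checked before committing. The field list is read off the files shown above; treating every field as required, and the helper name `check_talk`, are assumptions made for illustration and not a statement of the repository's actual validation rules.

```python
import json
from datetime import date

# Field names and value types as they appear in the talk files of this commit.
# Treating all of them as required is an assumption made for this sketch.
EXPECTED_FIELDS = {
    "description": str,
    "duration": int,
    "language": str,
    "recorded": str,
    "related_urls": list,
    "speakers": list,
    "tags": list,
    "thumbnail_url": str,
    "title": str,
    "videos": list,
}


def check_talk(record):
    """Return a list of problems found in one talk record (empty if none)."""
    problems = []
    for field, expected_type in EXPECTED_FIELDS.items():
        if field not in record:
            problems.append(f"missing field: {field}")
        elif not isinstance(record[field], expected_type):
            problems.append(f"{field}: expected {expected_type.__name__}")
    # "recorded" is written as an ISO date (YYYY-MM-DD) in every file shown.
    recorded = record.get("recorded")
    if isinstance(recorded, str):
        try:
            date.fromisoformat(recorded)
        except ValueError:
            problems.append("recorded: not an ISO date")
    # Each entry under "related_urls" carries a label and a url.
    related = record.get("related_urls")
    if isinstance(related, list):
        for i, link in enumerate(related):
            if not isinstance(link, dict) or not {"label", "url"} <= set(link):
                problems.append(f"related_urls[{i}]: needs label and url")
    return problems


# One of the records from this commit, used as sample input.
SAMPLE = json.loads("""
{
  "description": "SciPy is a powerful library for scientific and technical computing in Python. The primary objectives of this presentation are to explore the core concepts of Responsible AI and to demonstrate these concepts with SciPy.",
  "duration": 3152,
  "language": "eng",
  "recorded": "2025-04-18",
  "related_urls": [
    {"label": "Conference Website", "url": "https://pydata.org/virginia2025"}
  ],
  "speakers": ["Andrea Hobby"],
  "tags": [],
  "thumbnail_url": "https://i.ytimg.com/vi/W6fTFSgyhMg/maxresdefault.jpg",
  "title": "Responsible AI with SciPy",
  "videos": [
    {"type": "youtube", "url": "https://www.youtube.com/watch?v=W6fTFSgyhMg"}
  ]
}
""")

print(check_talk(SAMPLE))
```

Returning a list of problems instead of raising on the first one lets a reviewer see every issue in a file at once.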
