Commit b967c06

add events (#39)
1 parent 9580c41 commit b967c06

26 files changed

+312
-83
lines changed

config/_default/menus.yaml

+3
@@ -22,3 +22,6 @@ main:
2222
- name: 🎤 Publications
2323
url: "#pubs"
2424
weight: 60
25+
- name: 🎪 Events
26+
url: "event"
27+
weight: 70
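
Hugo orders menu entries by ascending `weight`, so the new Events entry is meant to follow Publications. A minimal Python sketch of that ordering, assuming each entry carries an integer `weight`; the `order_menu` helper is hypothetical and only illustrates the sort, it is not Hugo's implementation:

```python
from typing import Dict, List, Union

MenuEntry = Dict[str, Union[str, int]]


def order_menu(entries: List[MenuEntry]) -> List[MenuEntry]:
    """Return entries sorted by ascending weight, ties broken by name."""
    return sorted(entries, key=lambda e: (e["weight"], e["name"]))


# The two entries visible in this hunk, listed out of order on purpose.
main_menu: List[MenuEntry] = [
    {"name": "🎪 Events", "url": "event", "weight": 70},
    {"name": "🎤 Publications", "url": "#pubs", "weight": 60},
]

ordered = order_menu(main_menu)
ordered_names = [entry["name"] for entry in ordered]
print(ordered_names)
```

The sort depends on the key being spelled exactly `weight`; an entry with any other key name would carry no usable weight in this sketch and `order_menu` would raise a `KeyError`.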

content/event/20230427/featured.png

985 KB

content/event/20230427/index.md

+62
@@ -0,0 +1,62 @@
1+
---
2+
title: 'CROSS, Skyhook, and Polyphy'
3+
4+
event: 'Skyhook POSE Workshop Series'
5+
#event_url: https://example.org
6+
7+
location: University of California, Santa Cruz
8+
address:
9+
street: 1156 High St
10+
city: Santa Cruz
11+
region: CA
12+
postcode: '95064'
13+
country: United States
14+
15+
summary: 'Institutional support for creating Paths to Open Source Ecosystems for Open Source Products in Research.'
16+
abstract: ''
17+
18+
# Talk start and end times.
19+
# End time can optionally be hidden by prefixing the line with `#`.
20+
date: '2023-04-27'
21+
#date_end: '2023-04-27'
22+
all_day: true
23+
24+
# Schedule page publish date (NOT talk date).
25+
publishDate: '2023-03-20'
26+
27+
authors: [admin, slieggi]
28+
tags: []
29+
30+
# Is this a featured talk? (true/false)
31+
featured: false
32+
33+
image:
34+
caption: ''
35+
focal_point: Right
36+
37+
url_code: ''
38+
url_pdf: ''
39+
url_slides: ''
40+
url_video: ''
41+
42+
# Markdown Slides (optional).
43+
# Associate this talk with Markdown slides.
44+
# Simply enter your slide deck's filename without extension.
45+
# E.g. `slides = "example-slides"` references `content/slides/example-slides.md`.
46+
# Otherwise, set `slides = ""`.
47+
slides:
48+
49+
# Projects (optional).
50+
# Associate this post with one or more of your projects.
51+
# Simply enter your project's folder or file name without extension.
52+
# E.g. `projects = ["internal-project"]` references `content/project/deep-learning/index.md`.
53+
# Otherwise, set `projects = []`.
54+
projects:
55+
---
56+
57+
We would like to invite you to a day-long hybrid workshop on April 27, 2023, entitled “CROSS, Skyhook, and Polyphy: Institutional support for creating Paths to Open Source Ecosystems for Open Source Products in Research.” The workshop will be held at UC Santa Cruz with remote access for participants unable to attend in person.
58+
59+
With support from an NSF Pathways to Enable Open Source Ecosystems (POSE) Phase 1 grant, the NSF Institute for Research and Innovation in Software for High-Energy Physics (IRIS-HEP), and the Alfred P. Sloan Foundation, the goal of the workshop is to scope out self-sustaining ecosystems for the [Skyhook](https://github.com/skyhookdm) and [Polyphy](https://polyphy.io/) projects and to create an institutional model within the [Center for Research in Open Source Software (CROSS)](https://cross.ucsc.edu) and the [Open Source Program Office (OSPO) UC Santa Cruz](/) to create paths to sustainable open source ecosystems for research products across UC campuses.
60+
61+
A draft agenda for the event ~~will be provided by March 27~~ is available [here](https://cross.ucsc.edu/news/news/20230427poseevent.html) as well as a [white paper](https://docs.google.com/document/d/1znmoRvnmoZk1YMGWu7wIusC7KxlI6R_sSuXuK5PQ9LQ/edit?usp=sharing) that will form the basis for the day’s discussion. The workshop’s interactive sessions will discuss strategies for creating and sustaining institutional support for creating open source ecosystems, using
62+
Skyhook and Polyphy as examples. We will also hear updates from our CROSS fellows about their research projects that are already creating open source products that likely will benefit from institutional support for open source ecosystems in the future.

content/event/20230817/featured.png

9.28 MB

content/event/20230817/index.md

+75
@@ -0,0 +1,75 @@
1+
---
2+
title: 'Computational I/O Stack Workshop'
3+
4+
event: 'Skyhook POSE Workshop Series'
5+
#event_url: https://example.org
6+
7+
location: University of California, Santa Cruz
8+
address:
9+
street: 1156 High St
10+
city: Santa Cruz
11+
region: CA
12+
postcode: '95064'
13+
country: United States
14+
15+
summary: A workshop featuring keynote speaker Yoichiro Tanaka (Tohoku University) that will take place on August 17, 2023 at UC Santa Cruz in the Engineering 2 building (room to be confirmed)
16+
abstract: ''
17+
18+
# Talk start and end times.
19+
# End time can optionally be hidden by prefixing the line with `#`.
20+
date: '2023-08-17'
21+
#date_end: '2023-08-17'
22+
all_day: true
23+
24+
# Schedule page publish date (NOT talk date).
25+
publishDate: '2023-03-20'
26+
27+
authors: [carlos.maltzahn, slieggi]
28+
tags: []
29+
30+
# Is this a featured talk? (true/false)
31+
featured: true
32+
33+
image:
34+
caption: ''
35+
focal_point: Right
36+
37+
url_code: ''
38+
url_pdf: ''
39+
url_slides: ''
40+
url_video: ''
41+
42+
# Markdown Slides (optional).
43+
# Associate this talk with Markdown slides.
44+
# Simply enter your slide deck's filename without extension.
45+
# E.g. `slides = "example-slides"` references `content/slides/example-slides.md`.
46+
# Otherwise, set `slides = ""`.
47+
slides:
48+
49+
# Projects (optional).
50+
# Associate this post with one or more of your projects.
51+
# Simply enter your project's folder or file name without extension.
52+
# E.g. `projects = ["internal-project"]` references `content/project/deep-learning/index.md`.
53+
# Otherwise, set `projects = []`.
54+
projects:
55+
---
56+
57+
👋 Join us for an exciting event featuring IEEE Distinguished Lecturer {{% mention yoichiro.tanaka %}} (Tohoku University) discussing technological and institutional innovations to make the computational I/O stack a reality!
58+
59+
{{% callout note %}}
60+
This is now a past event. In case you missed it, we have started posting speaker slides and recordings of the event in the agenda below.
61+
{{% /callout %}}
62+
63+
The introduction of computational data management services into the I/O stack, especially in storage and networking devices, requires both technological innovations and new relations between university and industry. This one-day workshop will convene experts from storage systems, open source, and community architecture to discuss technologies and strategies for a computational I/O stack with low market entry barriers.
64+
65+
The workshop will take place on August 17, 2023 from 10am to 5pm, at UC Santa Cruz, Engineering 2, Room 506 (5th floor, north-west of the lobby/elevators, see [floor plans](https://facilities.soe.ucsc.edu/floor-plans)), and is jointly organized by the [IEEE Magnetics Society's Distinguished Lecturers Program][web-ieee-lecturers], the Skyhook Data Management community with funding by the National Science Foundation ([TI-2229773][web-nsf-award]), the Center for Research in Open Source Software ([cross.ucsc.edu][web-cross]), and the Open Source Program Office, UC Santa Cruz ([ospo.ucsc.edu][web-ospo]).
66+
67+
{{< table path="agenda.csv" header="true" caption="Table: Agenda" >}}
68+
69+
70+
<!-- Resources -->
71+
[web-ieee-lecturers]: https://ieeemagnetics.org/membership/educational-outreach/distinguished-lecturers
72+
[web-nsf-award]: https://www.nsf.gov/awardsearch/showAward?AWD_ID=2229773
73+
[web-cross]: https://cross.ucsc.edu
74+
[web-ospo]: https://ospo.ucsc.edu
75+

content/event/20230907/featured.jpg

197 KB

content/event/20230907/index.md

+91
@@ -0,0 +1,91 @@
1+
---
2+
title: 'Adam Kennedy (Voltron Data): Polygraph'
3+
4+
event: 'Eusocial Interest Group Meeting'
5+
#event_url: https://example.org
6+
7+
location: University of California, Santa Cruz
8+
address:
9+
street: 1156 High St
10+
city: Santa Cruz
11+
region: CA
12+
postcode: '95064'
13+
country: United States
14+
15+
summary: Adam Kennedy (Voltron Data) is speaking about Polygraph, a new effort to make the processing and optimization of query plans more efficient
16+
abstract: ''
17+
18+
# Talk start and end times.
19+
# End time can optionally be hidden by prefixing the line with `#`.
20+
date: '2023-09-07T14:00:00-0700'
21+
date_end: '2023-09-07T15:00:00-0700'
22+
all_day: false
23+
24+
# Schedule page publish date (NOT talk date).
25+
publishDate: '2023-09-06'
26+
27+
authors: [adam.kennedy]
28+
tags: []
29+
30+
# Is this a featured talk? (true/false)
31+
featured: false
32+
33+
image:
34+
caption: ''
35+
focal_point: Right
36+
37+
url_code: ''
38+
url_pdf: ''
39+
url_slides: ''
40+
url_video: 'https://www.icloud.com/iclouddrive/0920UPOGUXIosE6viyjHJe6BQ#video1862580471'
41+
42+
# Markdown Slides (optional).
43+
# Associate this talk with Markdown slides.
44+
# Simply enter your slide deck's filename without extension.
45+
# E.g. `slides = "example-slides"` references `content/slides/example-slides.md`.
46+
# Otherwise, set `slides = ""`.
47+
slides:
48+
49+
# Projects (optional).
50+
# Associate this post with one or more of your projects.
51+
# Simply enter your project's folder or file name without extension.
52+
# E.g. `projects = ["internal-project"]` references `content/project/deep-learning/index.md`.
53+
# Otherwise, set `projects = []`.
54+
projects:
55+
---
56+
57+
{{% callout note %}}
58+
This is an expanded version of Adam Kennedy's presentation at the [2nd International Workshop on Composable Data Management Systems 2023 (CDMS)](https://ceur-ws.org/Vol-3462/CDMS0.pdf) ([agenda](https://ceur-ws.org/Vol-3462)). The following abstract is copied from [there](https://ceur-ws.org/Vol-3462/CDMS12.pdf).
59+
{{% /callout %}}
60+
61+
The maturity and substantial investment in Apache Calcite establish it as the open source standard for query planning and
62+
optimization across numerous data tools. Nevertheless, utilizing Apache Calcite for dynamic query planning in a diverse tool
63+
stack with multiple languages has proven challenging. Through the integration of Apache Arrow, we introduce Polygraph:
64+
a language-independent, parse-free, and efficient format for query plans. Its purpose is to enhance plan interoperability,
65+
diminish latency and overheads, and facilitate dynamic query optimization. This experimental format allows for the efficient
66+
exchange of query plans between tools in diverse languages with minimal serialization overhead.
67+
68+
While future query engines are steering away from Java, Calcite remains the solitary mature option for query planning
69+
across a broad spectrum of workloads. Few alternatives come close to matching its features. However, Calcite relies on
70+
tree-based JSON or XML plan representations that do not readily lend themselves to certain optimizations and necessitate
71+
substantial overhead for serialization, I/O, and parsing. The commingling of planners and engines across languages is rare,
72+
unusual, and complex. Such approaches typically result in ad hoc, internal formats with limited reusability. Addressing
73+
these challenges, Polygraph relocates the query plan to Arrow. Polygraph employs a graph structure encoded with columnar
74+
storage techniques. Preliminary experiments indicate an order of magnitude reduction in query plan size compared to JSON
75+
encoding, without incurring copying and serialization overheads. Arrow provides zero-copy, shared-memory, and parse-free
76+
capabilities, along with fast RPC via Arrow Flight. In this representation, plan consumers only need to load the components
77+
and properties of a query plan required for a given computation. These efficiencies substantially reduce the latency between
78+
plan generation and query execution. Moreover, we envision significant potential for other advancements, including resource
79+
planning, ML preprocessing, and integration into ML training and inference.
80+
81+
Until recently, there was no urgent imperative to represent query plans efficiently. However, the escalating complexity
82+
and size of query graphs will persist as data tools become more deeply integrated into intricate ML workloads. Polygraph’s
83+
agile and decomposable graph representation empowers data engines to contribute to query optimization and resource
84+
management. Enhanced integration with top-tier ML systems becomes more viable, facilitating the incorporation of run-time
85+
compute planning and resource management into the query plan, utilizing tools like Apache Acero. The benefits extend
86+
beyond improvements in space efficiency and latency. Query sub-plans can be optimized in-situ using real-time hardware
87+
metrics. Value relations and broadcast tables can be seamlessly embedded in the plan as Arrow objects, accessed in a zero-copy
88+
manner. Large models can be directly incorporated into the query plan, incurring no loading cost until required. Increased
89+
investment in query plan representations, exemplified by Polygraph, supports the data community in keeping pace with
90+
advancements in new architectures and problem domains, such as AI.
91+
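
To make the contrast in the abstract concrete, here is a rough Python sketch of a query plan stored as a graph in parallel columns rather than as a nested tree. It is only an illustration under stated assumptions: the `ColumnarPlan` class, its column names, and the `flatten` helper are hypothetical, plain Python lists stand in for Arrow arrays, and none of this reflects Polygraph's actual format.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ColumnarPlan:
    """Hypothetical layout: one column per node or edge attribute."""
    node_op: List[str] = field(default_factory=list)
    node_arg: List[str] = field(default_factory=list)
    edge_parent: List[int] = field(default_factory=list)
    edge_child: List[int] = field(default_factory=list)

    def add_node(self, op: str, arg: str = "") -> int:
        # A node id is simply the row index in the node columns.
        self.node_op.append(op)
        self.node_arg.append(arg)
        return len(self.node_op) - 1

    def add_edge(self, parent: int, child: int) -> None:
        self.edge_parent.append(parent)
        self.edge_child.append(child)

    def children(self, node_id: int) -> List[int]:
        # Only the two edge columns are scanned; node columns stay untouched.
        return [c for p, c in zip(self.edge_parent, self.edge_child) if p == node_id]


def flatten(tree: dict, plan: ColumnarPlan, parent: Optional[int] = None) -> int:
    """Convert a nested, JSON-like plan tree into the columnar graph form."""
    node_id = plan.add_node(tree["op"], tree.get("arg", ""))
    if parent is not None:
        plan.add_edge(parent, node_id)
    for child in tree.get("inputs", []):
        flatten(child, plan, node_id)
    return node_id


# A made-up three-operator plan in tree form.
tree_plan = {
    "op": "project", "arg": "name",
    "inputs": [{
        "op": "filter", "arg": "age > 30",
        "inputs": [{"op": "scan", "arg": "people"}],
    }],
}

plan = ColumnarPlan()
root = flatten(tree_plan, plan)
print(plan.node_op, plan.children(root))
```

A consumer that needs only the operator names can read the `node_op` column on its own, which mirrors the abstract's point that plan consumers load just the components a computation requires.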

content/event/20230929/index.md

+64
@@ -0,0 +1,64 @@
1+
---
2+
title: 'Workshop: Creating the SkyhookDM Ecosystem for the Computational I/O Stack'
3+
4+
event: '2023 UC Open Source Symposium'
5+
event_url: https://ucospo23.sched.com
6+
7+
location: University of California, Santa Cruz
8+
address:
9+
street: 1156 High St
10+
city: Santa Cruz
11+
region: CA
12+
postcode: '95064'
13+
country: United States
14+
15+
summary: The lack of an open and shared computational I/O software stack ecosystem hampers composability and innovation, and increases design cost. This workshop invites participants to discuss a roadmap for the technologies and governance of the Skyhook Data Management effort.
16+
abstract: ''
17+
18+
# Talk start and end times.
19+
# End time can optionally be hidden by prefixing the line with `#`.
20+
date: '2023-09-29T13:00:00-0700'
21+
date_end: '2023-09-29T14:20:00-0700'
22+
all_day: false
23+
24+
# Schedule page publish date (NOT talk date).
25+
publishDate: '2023-09-23'
26+
27+
authors: [carlos.maltzahn]
28+
tags: []
29+
30+
# Is this a featured talk? (true/false)
31+
featured: false
32+
33+
image:
34+
caption: ''
35+
focal_point: Right
36+
37+
url_code: ''
38+
url_pdf: ''
39+
url_slides: ''
40+
url_video: ''
41+
42+
# Markdown Slides (optional).
43+
# Associate this talk with Markdown slides.
44+
# Simply enter your slide deck's filename without extension.
45+
# E.g. `slides = "example-slides"` references `content/slides/example-slides.md`.
46+
# Otherwise, set `slides = ""`.
47+
slides:
48+
49+
# Projects (optional).
50+
# Associate this post with one or more of your projects.
51+
# Simply enter your project's folder or file name without extension.
52+
# E.g. `projects = ["internal-project"]` references `content/project/deep-learning/index.md`.
53+
# Otherwise, set `projects = []`.
54+
projects:
55+
---
56+
57+
{{% callout note %}}
58+
This event is part of the [2023 UC Open Source Symposium](https://ucospo23.sched.com), September 27-29, 2023 ([this workshop's link](https://ucospo23.sched.com/event/1RHgi/workshop-creating-the-skyhookdm-ecosystem-for-the-computational-io-stack))
59+
{{% /callout %}}
60+
61+
Hardware acceleration for computational I/O, that is, the integration of specialized computational devices into the I/O path, is one of the most promising technologies to further improve performance and energy efficiency of analyzing high-volume and high-velocity datasets and streams. Despite the general availability of a number of devices such as Data Processing Units (DPUs, also known as SmartNICs) and Samsung's SmartSSDs, the open source data science ecosystem lacks an open and shared computational I/O software stack ecosystem. This lack hampers composability and innovation, and increases design cost. To address this, the Center for Research in Open Source Software launched Skyhook Data Management to create open source blueprints for a computational I/O stack that can be adopted by industry. With seed funding from industry component makers, SkyhookDM had a promising start: a blueprint using the unmodified Ceph open source distributed storage system was contributed to Apache Arrow in 2022 and has been included in every release since v7.0.0. It serves as a use case for the SNIA Computational Storage TWG, and has attracted world-leading experts from industry and national labs.
62+
63+
This workshop invites participants to help put together a roadmap for an open and shared computational I/O software stack ecosystem at UC Santa Cruz following best practices in open source software techniques, strategies, and governance. We will discuss technical and organizational opportunities, leveraging readily available technologies and institutions.
64+

content/event/_index.md

+1-1
@@ -1,5 +1,5 @@
11
---
2-
title: Recent & Upcoming Events
2+
title: 🎪 Recent & Upcoming Events
33

44
# Listing view
55
view: compact

content/event/example/featured.jpg

-606 KB
Binary file not shown.

content/event/example/index.md

-63
This file was deleted.
