Skip to content

SQL UDFs in Iceberg #10432

Open
Open
@sxlin

Description

@sxlin

Proposed Change

A user-defined function (UDF) is a callable routine that accepts input parameters, executes the function body, and returns either a single scalar value or a table based on the UDF type.

Most compute engines (e.g. Dremio, Snowflake, Trino and Apache Spark) support UDFs . Without a common standard, the UDFs are hard to share among different engines. It allows versioning of UDFs if they are stored in Iceberg. This proposal standardizes the UDF metadata.

Proposal document

https://docs.google.com/document/d/1BDvOfhrH0ZQiQv9eLBqeAu8k8Vjfmeql9VzIiW1F0vc/edit?usp=sharing

Specifications

  • Table
  • View
  • REST
  • Puffin
  • Encryption
  • Other

Metadata

Metadata

Assignees

No one assigned

    Labels

    not-staleproposalIceberg Improvement Proposal (spec/major changes/etc)

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions