Skip to content

Improving Tuple write performance (open for discussion) #1473

Open
@mdonkers

Description

@mdonkers

Observed

  1. high number of slices being allocated for inserting Tuple type objects

Details

Split off from #1426 , to have a further discussion on the direction and its usefulness.


When using the Tuple type for inserting values, the values need an underlying Array / Slice type for ClickHouse Go to process them correctly. But depending on how data is internally presented before inserting, it might mean new slices need to be allocated. If these tuples then form the values of a map, the overhead of those allocations can quickly become significant.

Take for example the following struct:

type ValueWithTypeTuple struct {
	V string
	T int8
}

Currently to insert these, you need to do something like:

func (avt ValueWithTypeTuple) Value() (driver.Value, error) {
	return []any{avt.V, avt.T}, nil
}

So I'm wondering if a specific Tuple type makes sense, with specific types for the most common lengths (like Tuple2, Tuple3 and Tuple4).

The method to insert these would then look something like this:

// Get implements the column.Tuple2 interface from ClickHouse. It returns two values that can be inserted as Tuple.
func (avt ValueWithTypeTuple) Get() (any, any) {
	return &avt.V, int8(avt.T)
}

It then no longer requires the slice allocation, instead directly referencing the values.

I'd like to know any thoughts or other ideas to see if, and what kind of implementation might make sense.

Environment

  • clickhouse-go version: v2.30.0
  • Interface: ClickHouse API
  • Go version: 1.23.4
  • Operating system: Linux
  • ClickHouse version:
  • Is it a ClickHouse Cloud? No
  • ClickHouse Server non-default settings, if any:
  • CREATE TABLE statements for tables involved:
  • Sample data for all these tables, use clickhouse-obfuscator if necessary

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions