Skip to content

Releases: clflushopt/tpchgen-rs

v2.0.2

26 Dec 02:42
v2.0.2
8227b49

Choose a tag to compare

What's Changed

  • infra: automatically publish to crates.io on release by @kevinjqliu in #190
  • infra: configure dependabot for cargo & github action updates by @pmcgleenon in #191
  • chore(deps): bump actions/attest-build-provenance from 2 to 3 by @dependabot[bot] in #192
  • chore(deps): bump actions/upload-artifact from 4 to 5 by @dependabot[bot] in #193
  • chore(deps): bump actions/checkout from 4 to 5 by @dependabot[bot] in #194
  • chore(deps): bump actions/download-artifact from 4 to 6 by @dependabot[bot] in #195
  • chore(deps): bump actions/cache from 3 to 4 by @dependabot[bot] in #196
  • chore(deps): bump actions/checkout from 5 to 6 by @dependabot[bot] in #203
  • feat: tpchgen runners as a lib by @clflushopt in #202
  • infra: macos-13 is deprecated, replace with macos-15-intel by @kevinjqliu in #208
  • chore(deps): bump actions/upload-artifact from 5 to 6 by @dependabot[bot] in #206
  • chore(deps): bump actions/cache from 4 to 5 by @dependabot[bot] in #205
  • chore(deps): bump actions/download-artifact from 6 to 7 by @dependabot[bot] in #204
  • chore(deps): update parquet/arrow/arrow-csv from 56 to 57.1 by @kevinjqliu in #200
  • feat: release v2.0.2 by @clflushopt in #210

New Contributors

Full Changelog: v2.0.1...v2.0.2

v2.0.1

08 Sep 17:40
ad649e8

Choose a tag to compare

What's Changed

Full Changelog: v2.0.0...v2.0.1

v2.0.0

27 Aug 03:03
v2.0.0
f3de11a

Choose a tag to compare

What's Changed

  • bump arrow version to match datafusion by @joseph-isaacs in #153
  • refactor: Extract plan generation to GenerationPlan, add docs and tests by @alamb in #157
  • feat: Add integration tests for tpchgen-cli by @alamb in #156
  • Minor: parallelize cli part validation test by @alamb in #162
  • feat: Chunkify single parts to generate them in parallel by @clflushopt in #155
  • feat: expose answers in tbl format as part of the package by @clflushopt in #159
  • refactor: Make --part and --parts Option<i32> and add better error handling by @alamb in #166
  • refactor: use RangeInclusive<i32> in GenerationPlan by @alamb in #169
  • fix: Create multiple row groups when writing single --parts by @alamb in #168
  • Minor: Update ARCHITECTURE.md by @alamb in #171
  • feat: make parquet row groups size configurable by @kevinjqliu in #158
  • Update Rust to 1.89 by @alamb in #177
  • Upgrade to arrow/parquet 56.0.0 (8%-10% faster) by @alamb in #176
  • Automatically create multiple files with single --part command by @alamb in #175
  • Docs: Readme revamp by @alamb in #179
  • Prepare for 2.0.0 (upgrade version) by @alamb in #178

New Contributors

Full Changelog: v1.1.1...v2.0.0

v1.1.1

06 Jun 04:15
v1.1.1
d16b0db

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v1.1.0...v1.1.1

v1.1.0

29 Apr 21:32
v1.1.0
2ed321e

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v1.0.0...v1.1.0

v1.0.0

12 Apr 20:30
ea94a80

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.1.1...v1.0.0

v0.1.1

05 Apr 15:15
6fa2b4e

Choose a tag to compare

What's Changed

Full Changelog: v0.1.0...v0.1.1

v0.1.0

30 Mar 20:26
4568384

Choose a tag to compare

What's Changed

This is the initial release of the tpchgen-rs package that contains the tpchgen data generator crate and tpchgen-cli which is a drop-in replacement for dbgen written in Rust !

Changelog

  • fix: run clippy fix to address some linting issues by @clflushopt in #8
  • feat: Introduce gzipped test fixtures and steps to reproduce. by @clflushopt in #11
  • Add instructions for verifying checked in results by @alamb in #13
  • fix: date format for fixtures by @clflushopt in #15
  • feat: Introduce end-to-end conformance integration tests. by @clflushopt in #16
  • chore: Add issue and pr templates by @clflushopt in #18
  • feat: Create TextPool once per process rather than once per table by @alamb in #19
  • feat: Use tbl format for fixtures and tests by @clflushopt in #20
  • feat: scaffold basic dbgen like api by @clflushopt in #12
  • feat: Update conformance test to run at scale factors up to 1 by @clflushopt in #22
  • feat: Create CONTRIBUTING.md by @clflushopt in #23
  • docs: Some small README improvements by @clflushopt in #25
  • Change Random* generators to return &str or structs rather than String (10% faster) by @alamb in #26
  • feat: optimize single threaded performance by @clflushopt in #21
  • Change date handling to use TPCHDate type rather than String (10% faster) by @alamb in #27
  • feat: Drop unused dependencies and regex crate by @clflushopt in #29
  • feat: Add fmt + clippy to the build action by @clflushopt in #30
  • Avoid copying String when generating LineItem (20% faster) by @alamb in #32
  • Unbox writer (3-5% faster) by @alamb in #33
  • Revert "Avoid copying String when generating LineItem (20% faster… by @alamb in #36
  • Add lifetimes to generators and types by @scsmithr in #35
  • Remove lazy_static dep, replace with std::sync::LazyLock by @scsmithr in #40
  • Pass distribution references to generators by @scsmithr in #43
  • Avoid copying strings for Addresses (10% faster for customers, 3% faster overall) by @alamb in #39
  • feat: Derive Debug for all generator and iterator types by @clflushopt in #47
  • Remove more String copying (5% faster) by @alamb in #37
  • Add decimal type, avoid floating point calculations (7% faster) by @scsmithr in #48
  • Precompute date strings (10% faster) by @alamb in #50
  • Make 'static lifetime explicit in Generators by @alamb in #38
  • Add ARCHITECTURE.md documentation with design goals (keep data generator crate dependencies minimal) by @alamb in #49
  • Add docs, improve doc tests by @alamb in #55
  • Add into_inner for wrapper types, and derive some more traits` by @alamb in #52
  • Implement output as CSV by @alamb in #54
  • Remove hashmap lookups in distribution creation, remove IndexMap dependency (2x startup time speedup) by @alamb in #62
  • (POC): Remove some hashmap lookups in distribution lookup by @scsmithr in #59
  • Avoid String creation while loading Distributions (2% less startup time) by @alamb in #63
  • feat: Use lookup tables for julian date format (3-5% improvement) by @clflushopt in #64
  • Parallel output in tpchgen-cli (Nx faster, where N is number of cores) by @alamb in #58
  • Speed up initial TextPool generation more (15% faster) by @alamb in #67
  • feat: remove chrono dependency by @clflushopt in #68
  • Minor README update by @alamb in #70
  • fix: quote columns that may contain comma for csv output by @niebayes in #66
  • Provide access to TPCHDates as (y,m,d) by @alamb in #69
  • Add tpchgen-arrow crate to generate Arrow RecordBatches, implement conversion for LineItem by @alamb in #71
  • Implement arrow conversion for remaining tables by @alamb in #74
  • Support writing directly to parquet files by @alamb in #61
  • Stop early on parquet write error by @alamb in #82
  • Avoid double buffering when writing CSV/TBL by @alamb in #79
  • Quit early on error for tbl/csv by @alamb in #84
  • feat: allow writing to stdout by @clflushopt in #83
  • fix: Use a different shorthand for parquet compression argument by @clflushopt in #86
  • Support writing parquet to stdout, document use of pv by @alamb in #85
  • Limit number of parquet row groups by @alamb in #87
  • Do not generate CSV header multiple times by @alamb in #78
  • Add documentation for zstd compression by @alamb in #88
  • fix: Align the cli with the original dbgen by @clflushopt in #90
  • fix: Make sure table name handling is done by the crate instead of clap by @clflushopt in #92
  • fix: Align all manifest files to use workspace level configuration for v0.1.0 by @clflushopt in #93

New Contributors

Full Changelog: https://github.com/clflushopt/tpchgen-rs/commits/v0.1.0