Releases: clflushopt/tpchgen-rs
Releases · clflushopt/tpchgen-rs
v2.0.2
What's Changed
- infra: automatically publish to crates.io on release by @kevinjqliu in #190
- infra: configure dependabot for cargo & github action updates by @pmcgleenon in #191
- chore(deps): bump actions/attest-build-provenance from 2 to 3 by @dependabot[bot] in #192
- chore(deps): bump actions/upload-artifact from 4 to 5 by @dependabot[bot] in #193
- chore(deps): bump actions/checkout from 4 to 5 by @dependabot[bot] in #194
- chore(deps): bump actions/download-artifact from 4 to 6 by @dependabot[bot] in #195
- chore(deps): bump actions/cache from 3 to 4 by @dependabot[bot] in #196
- chore(deps): bump actions/checkout from 5 to 6 by @dependabot[bot] in #203
- feat: tpchgen runners as a lib by @clflushopt in #202
- infra: macos-13 is deprecated, replace with macos-15-intel by @kevinjqliu in #208
- chore(deps): bump actions/upload-artifact from 5 to 6 by @dependabot[bot] in #206
- chore(deps): bump actions/cache from 4 to 5 by @dependabot[bot] in #205
- chore(deps): bump actions/download-artifact from 6 to 7 by @dependabot[bot] in #204
- chore(deps): update parquet/arrow/arrow-csv from 56 to 57.1 by @kevinjqliu in #200
- feat: release v2.0.2 by @clflushopt in #210
New Contributors
- @pmcgleenon made their first contribution in #191
- @dependabot[bot] made their first contribution in #192
Full Changelog: v2.0.1...v2.0.2
v2.0.1
What's Changed
- fix: disable s390x support in tpchgen-cli-publish-pypi.yml by @clflushopt in #181
- tpchgen-cli: get readme/license/authors from Cargo.toml by @kevinjqliu in #182
- infra: add a twine check before uploading to pypi by @kevinjqliu in #183
- infra: add
continue-on-error: trueto the wheel builds by @kevinjqliu in #184 - CI: ensure python cli wheels and sdist can build by @kevinjqliu in #187
- bump to 2.0.1 to fix README in pypi by @kevinjqliu in #188
Full Changelog: v2.0.0...v2.0.1
v2.0.0
What's Changed
- bump arrow version to match datafusion by @joseph-isaacs in #153
- refactor: Extract plan generation to
GenerationPlan, add docs and tests by @alamb in #157 - feat: Add integration tests for
tpchgen-cliby @alamb in #156 - Minor: parallelize cli part validation test by @alamb in #162
- feat: Chunkify single parts to generate them in parallel by @clflushopt in #155
- feat: expose answers in tbl format as part of the package by @clflushopt in #159
- refactor: Make
--partand--partsOption<i32>and add better error handling by @alamb in #166 - refactor: use
RangeInclusive<i32>inGenerationPlanby @alamb in #169 - fix: Create multiple row groups when writing single
--parts by @alamb in #168 - Minor: Update ARCHITECTURE.md by @alamb in #171
- feat: make parquet row groups size configurable by @kevinjqliu in #158
- Update Rust to 1.89 by @alamb in #177
- Upgrade to arrow/parquet 56.0.0 (8%-10% faster) by @alamb in #176
- Automatically create multiple files with single
--partcommand by @alamb in #175 - Docs: Readme revamp by @alamb in #179
- Prepare for 2.0.0 (upgrade version) by @alamb in #178
New Contributors
- @joseph-isaacs made their first contribution in #153
Full Changelog: v1.1.1...v2.0.0
v1.1.1
What's Changed
- fix: Inconsistent implementation of IntoIterator by @askyx in #148
- feat: prepare v1.1.1 by @clflushopt in #149
New Contributors
Full Changelog: v1.1.0...v1.1.1
v1.1.0
What's Changed
- Add rust-toolchain.toml to control rust version by @alamb in #120
- Change arrow version by @matthewmturner in #122
- fix: Move README to the top level to fix broken link issue by @clflushopt in #125
- Documentation tweaks by @alamb in #117
- feat: fix CLI option docs by @kevinjqliu in #127
- feat: fix CI and run CI for every PR by @kevinjqliu in #128
- [feat] update tpchgen-cli's README by @kevinjqliu in #129
- add python CLI bindings by @kevinjqliu in #121
- fix: Document via examples the behavior of multipart generation by @clflushopt in #135
- Fix up
tpchgen-cli's README by @kevinjqliu in #133 - [feat] pyproject use cargo version by @kevinjqliu in #136
- [feat] publish to pypi by @kevinjqliu in #134
- [feat] skip conformance test for markdown file changes by @kevinjqliu in #137
- Replace ByteArrayBuilder with Vec by @scsmithr in #138
- fix: Drop unused TextPoolGenerator by @clflushopt in #141
- feat: Add TPC-H queries by @clflushopt in #140
- feat: prepare v1.1.0 by @clflushopt in #142
- feat: prepare v1.1.0 by @clflushopt in #143
New Contributors
- @kevinjqliu made their first contribution in #127
Full Changelog: v1.0.0...v1.1.0
v1.0.0
What's Changed
- Move chrono to dev dep by @matthewmturner in #107
- Improve documentation on additional shasum checks by @alamb in #109
- Fix typo by @alamb in #111
- fix: Use lowercase for possible values of table names by @szarnyasg in #114
- refactor: Simplify DuckDB export by @szarnyasg in #112
- feat: Link to @alamb's demo and blog in README by @clflushopt in #115
- feat: Small documentation improvements by @clflushopt in #116
- feat: prepare cargo release for v1.0.0 by @clflushopt in #118
New Contributors
- @matthewmturner made their first contribution in #107
- @szarnyasg made their first contribution in #114
Full Changelog: v0.1.1...v1.0.0
v0.1.1
What's Changed
- Update project documentation and benchmarking by @alamb in #72
- Add
as_strhelper toOrderStatusby @scsmithr in #95 - fix: pin local crate versions by @clflushopt in #94
- fix: minor README improvements by @clflushopt in #96
- Update with benchmark numbers by @alamb in #98
- feat: add to_unix_epoch helper to TPCHDate by @clflushopt in #97
- chore: tweak doc comments, add
inlinefor good measure by @alamb in #102 - feat: pin chrono package to 0.4.39 by @clflushopt in #105
- fix: bump tpchgen-arrow and tpchgen-cli to 0.1.1 by @clflushopt in #106
- fix: bump tpchgen to 0.1.1 by @clflushopt in #108
Full Changelog: v0.1.0...v0.1.1
v0.1.0
What's Changed
This is the initial release of the tpchgen-rs package that contains the tpchgen data generator crate and tpchgen-cli which is a drop-in replacement for dbgen written in Rust !
Changelog
- fix: run clippy fix to address some linting issues by @clflushopt in #8
- feat: Introduce gzipped test fixtures and steps to reproduce. by @clflushopt in #11
- Add instructions for verifying checked in results by @alamb in #13
- fix: date format for fixtures by @clflushopt in #15
- feat: Introduce end-to-end conformance integration tests. by @clflushopt in #16
- chore: Add issue and pr templates by @clflushopt in #18
- feat: Create
TextPoolonce per process rather than once per table by @alamb in #19 - feat: Use tbl format for fixtures and tests by @clflushopt in #20
- feat: scaffold basic dbgen like api by @clflushopt in #12
- feat: Update conformance test to run at scale factors up to 1 by @clflushopt in #22
- feat: Create CONTRIBUTING.md by @clflushopt in #23
- docs: Some small README improvements by @clflushopt in #25
- Change Random* generators to return
&stror structs rather thanString(10% faster) by @alamb in #26 - feat: optimize single threaded performance by @clflushopt in #21
- Change date handling to use TPCHDate type rather than String (10% faster) by @alamb in #27
- feat: Drop unused dependencies and regex crate by @clflushopt in #29
- feat: Add fmt + clippy to the build action by @clflushopt in #30
- Avoid copying
Stringwhen generatingLineItem(20% faster) by @alamb in #32 - Unbox writer (3-5% faster) by @alamb in #33
- Revert "Avoid copying
Stringwhen generatingLineItem(20% faster… by @alamb in #36 - Add lifetimes to generators and types by @scsmithr in #35
- Remove lazy_static dep, replace with std::sync::LazyLock by @scsmithr in #40
- Pass distribution references to generators by @scsmithr in #43
- Avoid copying strings for Addresses (10% faster for customers, 3% faster overall) by @alamb in #39
- feat: Derive Debug for all generator and iterator types by @clflushopt in #47
- Remove more String copying (5% faster) by @alamb in #37
- Add decimal type, avoid floating point calculations (7% faster) by @scsmithr in #48
- Precompute date strings (10% faster) by @alamb in #50
- Make
'staticlifetime explicit in Generators by @alamb in #38 - Add
ARCHITECTURE.mddocumentation with design goals (keep data generator crate dependencies minimal) by @alamb in #49 - Add docs, improve doc tests by @alamb in #55
- Add
into_innerfor wrapper types, and derive some more traits` by @alamb in #52 - Implement output as CSV by @alamb in #54
- Remove hashmap lookups in distribution creation, remove IndexMap dependency (2x startup time speedup) by @alamb in #62
- (POC): Remove some hashmap lookups in distribution lookup by @scsmithr in #59
- Avoid String creation while loading
Distributions(2% less startup time) by @alamb in #63 - feat: Use lookup tables for julian date format (3-5% improvement) by @clflushopt in #64
- Parallel output in tpchgen-cli (Nx faster, where N is number of cores) by @alamb in #58
- Speed up initial TextPool generation more (15% faster) by @alamb in #67
- feat: remove chrono dependency by @clflushopt in #68
- Minor README update by @alamb in #70
- fix: quote columns that may contain comma for csv output by @niebayes in #66
- Provide access to TPCHDates as (y,m,d) by @alamb in #69
- Add
tpchgen-arrowcrate to generate ArrowRecordBatches, implement conversion forLineItemby @alamb in #71 - Implement arrow conversion for remaining tables by @alamb in #74
- Support writing directly to parquet files by @alamb in #61
- Stop early on parquet write error by @alamb in #82
- Avoid double buffering when writing CSV/TBL by @alamb in #79
- Quit early on error for tbl/csv by @alamb in #84
- feat: allow writing to stdout by @clflushopt in #83
- fix: Use a different shorthand for parquet compression argument by @clflushopt in #86
- Support writing parquet to stdout, document use of
pvby @alamb in #85 - Limit number of parquet row groups by @alamb in #87
- Do not generate CSV header multiple times by @alamb in #78
- Add documentation for zstd compression by @alamb in #88
- fix: Align the cli with the original dbgen by @clflushopt in #90
- fix: Make sure table name handling is done by the crate instead of clap by @clflushopt in #92
- fix: Align all manifest files to use workspace level configuration for v0.1.0 by @clflushopt in #93
New Contributors
- @alamb made their first contribution in #13
- @scsmithr made their first contribution in #35
- @niebayes made their first contribution in #66
Full Changelog: https://github.com/clflushopt/tpchgen-rs/commits/v0.1.0