Releases: awslabs/deequ
Releases · awslabs/deequ
2.0.16
What's Changed
- Use release_version directly in version bump by @sudsali in #683
- Sort tied categorical histogram frequencies alphabetically by @kyraman in #684
- Fix: WHERE clause with zero matching rows incorrectly fails for hasMi… by @sohama4 in #677
- Update version in pom.xml to 2.0.16-spark-3.5 by @github-actions[bot] in #689
New Contributors
Full Changelog: 2.0.15...2.0.16
2.0.15
2.0.14
What's Changed
- Add support for RowCountMatch rule by @joshuazexter in #652
- Add ReferentialIntegrity DQDL Rule Support by @joshuazexter in #654
- Semi-automate Maven Central publishing workflow by @sudsali in #657
- Add DataFreshness DQDL Rule Support by @EdisonXie in #661
- Added ColumnNamesMatchPattern DQDL rule by @avallabhani in #658
- Add composite rule evaluation with AND/OR operators by @shriyavanvari in #665
- Add DatasetMatch DQDL Rule Support by @joshuazexter in #656
- Added AggregateMatch rule by @avallabhani in #663
- Add SchemaMatch Rule Support by @awang923 in #659
- Add support for ColumnValues DQDL rule by @sudsali in #655
- Add ColumnCount DQDL rule support by @sudsali in #664
- Fix NULL handling in WHERE clause for Compliance analyzer by @EdisonXie in #671
- Add support for ColumnDataType DQDL rule by @sudsali in #662
- Update version in pom.xml to 2.0.14-spark-3.5 by @sudsali in #672
New Contributors
- @sudsali made their first contribution in #657
- @EdisonXie made their first contribution in #661
- @avallabhani made their first contribution in #658
- @awang923 made their first contribution in #659
Full Changelog: 2.0.13...2.0.14
2.0.13
What's Changed
- Add HistogramBinned analyzer and extend HistogramBase shared functionality by @kyraman in #641
- Add HistogramBinned analyzer and serialization support by @kyraman in #644
- Optimize loading all results in FileSystemMetricRepository by @praneeth527 in #626
- Replace Travis CI badge with Github Actions Badge by @Laerte in #643
- HistogramBinned should count empty and null bins in total by @kyraman in #645
- Update version in pom.xml to 2.0.13-spark-3.5 by @kyraman in #649
New Contributors
- @praneeth527 made their first contribution in #626
- @Laerte made their first contribution in #643
Full Changelog: 2.0.12...2.0.13
2.0.12
What's Changed
-
Added Implementation of DQDL Rules and Execution
- add implementation of DQDL rule execution by @happy-coral in #620
- Add implementation of outcome mapping in DeequOutcomeTranslator by @happy-coral in #621
- Add implementation for DQDL rules: CompletenessRule, IsCompleteRule, UniquenessRule, IsUniqueRule, ColumnCorrelationRule by @happy-coral in #622
- Add implementation for DQDL rules: DistinctValuesCount, Entropy, Mean, StandardDeviation, Sum, UniqueValueRatio by @happy-coral in #624
- Update README to describe DQDL support and add Java & Scala DQDL examples by @happy-coral in #634
- Add support for DQDL IsPrimaryKey rule by @happy-coral in #635
- Add support for DQDL ColumnLength rule by @eycho-am in #636
-
Modify Histogram to be in descending frequency by @kyraman in #630
-
Introduce HistogramBase for common histogram behavior by @kyraman in #631
-
Modify maven publishing to use central portal by @eycho-am in #633
-
Add support for DQDL CustomSql rule & Deequ CustomSql check by @happy-coral in #632
-
fix(kll): Add SerDe Implementation for KLLSketch by @mdrakiburrahman in #628
-
Updated version in pom.xml to 2.0.12-spark-3.5 by @eycho-am in #637
New Contributors
- @kyraman made their first contribution in #630
- @mdrakiburrahman made their first contribution in #628
Full Changelog: 2.0.11...2.0.12
2.0.11
What's Changed
- Add AnalyzerOptions to Analyzer serialize / deserialize logic by @kchaturvedi in #597
- Refine row count retrieval to skip redundant Size() scans by @lawofcycles in #605
- Updated version in pom.xml to 2.0.11-spark-3.5 by @eycho-am in #615
New Contributors
- @kchaturvedi made their first contribution in #597
- @lawofcycles made their first contribution in #605
Full Changelog: 2.0.10...2.0.11
2.0.10
New Features
- Are unique check by @eycho-am in #599
- add DQDL parser dependency by @happy-coral in #603
- scaffolding for checking data quality agains DQDL rulesets by @happy-coral in #604
- Implement translation of rules and add converter for RowCount rule by @happy-coral in #606
Maintenance / Fixes
- feature/replace-rdd by @shriyavanvari in #586
- Adds a test to verify that Deequ's isContainedIn constraint correctly handles string values containing single quotes in the verification process. by @D-Minor in #602
New Contributors
- @shriyavanvari made their first contribution in #586
- @D-Minor made their first contribution in #602
- @happy-coral made their first contribution in #603
Full Changelog: 2.0.9...2.0.10
2.0.9
2.0.8
New Features
- Configurable RetainCompletenessRule by @zeotuan in #564
- Optional specification of instance name in CustomSQL analyzer metric. by @tylermcdaniel0 in #569
- Adding Wilson Score Confidence Interval Strategy by @zeotuan in #567
- CustomAggregator by @joshuazexter in #572
- Add commits from master branch to release/2.0.8-spark-3.5 by @eycho-am in #587
Maintenance / Fixes
- fix typo by @bojackli in #574
- Fix performance of building row-level results by @marcantony in #577
New Contributors
- @joshuazexter made their first contribution in #572
- @bojackli made their first contribution in #574
Full Changelog: 2.0.7...2.0.8
2.0.7
What's Changed
Upgrades
New Features
- New type of MetricsRepository by @VenkataKarthikP:
- Using Spark tables as the data source in #518
- Row Level Result Treatment Options by @eycho-am:
- Anomaly Detection Changes by @zeotuan:
- Add Daily Season with Hourly Interval to HoltWinter in #546
- New analyzers:
- RatioOfSums by @scott-gunn in #552
- Column Count Analyzer and Check by @mentekid in #555
Maintenance/Fixes
- Fix Breeze dependency conflict in Anomaly Detection Spark 3.4+ by @zeotuan in #545
- Data Sync / DatasetMatch changes by @VenkataKarthikP:
- Row level results fixes:
- Add analyzerOption to add filteredRowOutcome for isPrimaryKey Check by @eycho-am in #537
- Fix bug in MinLength and MaxLength when NullBehavior.EmptyString by @eycho-am in #538
- [Min/Max] Apply filtered row behavior at the row level evaluation by @rdsharma26 in #543
- [MinLength/MaxLength] Apply filtered row behavior at the row level evaluation by @rdsharma26 in #547
- Fix for satisfies row level results bug by @rdsharma26 in #553
New Contributors
- @VenkataKarthikP made their first contribution in #518
- @scott-gunn made their first contribution in #552
Full Changelog: 2.0.6...2.0.7