fix: Ignore string column statistics for parquet-mr versions before 1.8.2 by lifulong · Pull Request #16744 · facebookincubator/velox

lifulong · 2026-03-12T11:15:35Z

Parquet files written by parquet-mr versions before 1.8.2 contain a bug where string (binary) column statistics min and max are computed using signed byte ordering rather than the standard unsigned lexicographic (memcmp) ordering.
Velox compares row group statistics against filter values using memcmp, which is an unsigned byte comparison. When reading files produced by parquet-mr < 1.8.2, this mismatch causes incorrect row group filtering: row groups that should be scanned are skipped, leading to wrong query results.
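To make the ordering mismatch concrete, here is a minimal sketch of the two byte orderings described above. The helper names `lessSigned` and `lessUnsigned` are hypothetical and exist only for illustration; they are not part of Velox or parquet-mr.

```cpp
#include <algorithm>
#include <string>

// Hypothetical helper: byte-wise "less than" that treats each byte as signed,
// the ordering the description attributes to parquet-mr before 1.8.2.
bool lessSigned(const std::string& a, const std::string& b) {
  return std::lexicographical_compare(
      a.begin(), a.end(), b.begin(), b.end(), [](char x, char y) {
        return static_cast<signed char>(x) < static_cast<signed char>(y);
      });
}

// Hypothetical helper: byte-wise "less than" that treats each byte as
// unsigned, which matches the ordering memcmp produces.
bool lessUnsigned(const std::string& a, const std::string& b) {
  return std::lexicographical_compare(
      a.begin(), a.end(), b.begin(), b.end(), [](char x, char y) {
        return static_cast<unsigned char>(x) < static_cast<unsigned char>(y);
      });
}
```

Any UTF-8 string that starts with a non-ASCII character has a lead byte of 0x80 or above, which is negative as a signed byte. Such strings sort before ASCII strings under `lessSigned` and after them under `lessUnsigned`.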

fix #16743

netlify · 2026-03-12T11:15:41Z

✅ Deploy Preview for meta-velox canceled.

Name	Link
🔨 Latest commit	`507370c`
🔍 Latest deploy log	https://app.netlify.com/projects/meta-velox/deploys/69bb6564deef0c0008549b82

velox/dwio/parquet/reader/SemanticVersion.cpp

PingLiuPing

Did you check this comment: #16743 (comment)? What's your opinion?
If you agree, we should change the PR description and title to reflect that (i.e., the root cause is not missing min/max). If not, the bug is still hiding and we need more investigation.

Your parquet shows:
min = "三星应用商店"
max = "vivo预装"

And you are using "360手机助手" as filter.

With signed byte ordering, the above stats are correct, meaning:
三星应用商店 < 360手机助手 < vivo预装
But Velox compares the stats using memcmp, which is a standard lexicographic (unsigned) byte comparison, so the order is:
360手机助手 < vivo预装 < 三星应用商店

See

velox/velox/type/Filter.cpp

Line 1165 in e84a676

int compare = memcmp(lhs, rhs.data(), size);
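The effect on row group pruning can be sketched as follows. This is a simplified illustration under stated assumptions, not the actual Velox filter code: `compareBytes` and `mayContain` are hypothetical names, and the range check is reduced to a single closed interval.

```cpp
#include <algorithm>
#include <cstring>
#include <string>

// Hypothetical helper: unsigned lexicographic comparison built on memcmp.
// Returns -1, 0, or 1.
int compareBytes(const std::string& a, const std::string& b) {
  const size_t n = std::min(a.size(), b.size());
  const int c = std::memcmp(a.data(), b.data(), n);
  if (c != 0) {
    return c < 0 ? -1 : 1;
  }
  if (a.size() == b.size()) {
    return 0;
  }
  return a.size() < b.size() ? -1 : 1;
}

// Hypothetical pruning check: returns true if `value` may lie in [min, max]
// under memcmp ordering, i.e. the row group must be scanned.
bool mayContain(
    const std::string& min,
    const std::string& max,
    const std::string& value) {
  return compareBytes(value, min) >= 0 && compareBytes(value, max) <= 0;
}
```

With the stats from the reported file (min = "三星应用商店", max = "vivo预装") and the filter value "360手机助手", `mayContain` returns false, because the filter value sorts below the recorded min under memcmp. The row group would therefore be skipped even though it holds matching rows.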

PingLiuPing · 2026-03-13T10:10:17Z

velox/dwio/parquet/tests/reader/ParquetReaderTest.cpp

  EXPECT_EQ(reader->numberOfRows(), 10ULL);
 }

+TEST_F(ParquetReaderTest, parquetMR181SkipsFilterRowGroupsByStringStats) {


Could you add a case similar to TEST_F(ParquetReaderTest, varcharFilters) as well?

I couldn't generate a simple data file that reproduces this issue. Is this test case OK? It checks parquet-mr versions 1.8.1 and 1.8.2.

Can you add this case to E2EFilterTest.cpp ?

// Reproduces the real-world scenario from the bug report. Parquet-mr 1.8.1
// computed binary column min/max using signed byte ordering, which differs from
// the unsigned lexicographic (memcmp) ordering Velox uses.
//
// With signed byte ordering: 三星应用商店 < 360手机助手 < vivo预装
// With memcmp byte ordering: 360手机助手 < vivo预装 < 三星应用商店
//
// A row group containing {"三星应用商店", "vivo预装"} has memcmp-based stats
// min="vivo预装", max="三星应用商店". A filter for "360手机助手" falls below the
// memcmp min, so the row group would be incorrectly skipped, even though it
// should match under the signed ordering that parquet-mr 1.8.1 used to write
// the stats.
TEST_F(E2EFilterTest, parquetMRVersionStringStatsRowGroupFiltering) {
  const std::string kSanXing = "三星应用商店";
  const std::string kVivo = "vivo预装";
  const std::string k360 = "360手机助手";

  auto rowType = ROW({"s"}, {VARCHAR()});

  auto writeAndGetStats = [&](const std::string& createdBy,
                              RuntimeStatistics& stats) {
    options_.memoryPool = E2EFilterTestBase::rootPool_.get();
    options_.createdBy = createdBy;
    // Flush after every 5 rows to create separate row groups.
    options_.flushPolicyFactory = []() {
      return std::make_unique<LambdaFlushPolicy>(
          /*rowsInRowGroup=*/5,
          /*bytesInRowGroup=*/1'024 * 1'024,
          []() { return false; });
    };

    auto sink = std::make_unique<MemorySink>(
        200 * 1024 * 1024, FileSink::Options{.pool = leafPool_.get()});
    auto* sinkPtr = sink.get();
    auto writer = std::make_unique<parquet::Writer>(
        std::move(sink), options_, rowType);

    // Row group 1: contains the value we will filter for ("360手机助手").
    writer->write(makeRowVector(
        {"s"},
        {makeFlatVector<std::string>(
            {k360, kSanXing, kVivo, k360, kSanXing})}));
    // Row group 2: does not contain "360手机助手".
    writer->write(makeRowVector(
        {"s"},
        {makeFlatVector<std::string>(
            {kSanXing, kVivo, kSanXing, kVivo, kSanXing})}));
    writer->close();

    dwio::common::ReaderOptions readerOptions{leafPool_.get()};
    auto input = std::make_unique<BufferedInput>(
        std::make_shared<InMemoryReadFile>(
            std::string(sinkPtr->data(), sinkPtr->size())),
        readerOptions.memoryPool());
    auto reader = makeReader(readerOptions, std::move(input));
    auto& parquetReader = dynamic_cast<ParquetReader&>(*reader);
    EXPECT_EQ(parquetReader.fileMetaData().numRowGroups(), 2);

    auto scanSpec = std::make_shared<ScanSpec>("");
    scanSpec->addAllChildFields(*rowType);
    // Equality filter: s = "360手机助手".
    scanSpec->getOrCreateChild(Subfield("s"))
        ->setFilter(std::make_unique<BytesRange>(
            k360, false, false, k360, false, false, false));

    RowReaderOptions rowReaderOpts;
    rowReaderOpts.select(
        std::make_shared<ColumnSelector>(rowType, rowType->names()));
    rowReaderOpts.setScanSpec(scanSpec);
    auto rowReader = reader->createRowReader(rowReaderOpts);

    VectorPtr result = BaseVector::create(rowType, 1, leafPool_.get());
    uint64_t totalRows{0};
    while (rowReader->next(1'000, result)) {
      totalRows += result->size();
    }
    EXPECT_EQ(totalRows, 2);
    rowReader->updateRuntimeStats(stats);
  };

  // parquet-mr 1.8.2: stats are trusted. Under memcmp ordering, row group 1
  // has min="360手机助手" max="三星应用商店" which contains "360手机助手", so it
  // is read. Row group 2 has min="vivo预装" max="三星应用商店" which does not
  // contain "360手机助手" (it falls below memcmp min), so it is skipped.
  RuntimeStatistics stats182;
  writeAndGetStats("parquet-mr version 1.8.2", stats182);
  EXPECT_EQ(stats182.skippedStrides, 1);
  EXPECT_EQ(stats182.processedStrides, 1);

  // parquet-mr 1.8.1: stats are untrusted (signed byte ordering bug), so no
  // row groups are skipped. Both row groups are scanned.
  RuntimeStatistics stats181;
  writeAndGetStats("parquet-mr version 1.8.1", stats181);
  EXPECT_EQ(stats181.skippedStrides, 0);
  EXPECT_EQ(stats181.processedStrides, 2);
}

This unit test is really professional. Learned a lot.

PingLiuPing · 2026-03-13T12:15:53Z

An example: https://godbolt.org/z/dj9acqzT9

lifulong · 2026-03-16T09:55:26Z

@PingLiuPing thanks for your reply!
I tried to create a file that reproduces this issue, but we no longer have any engines that use parquet-mr 1.8.1 to generate data, so I was not able to create a simple data file that reproduces it. Providing the data files from our company's production environment directly is not feasible.
As you said, the problem is indeed with ordering, not with undefined min/max values. It appears to come from the parquet-mr 1.8.1 version we used in the past: historical data from 2023 has this problem, while the latest data no longer shows such inconsistent results. I also found the records of the relevant changes in the Parquet community:
apache/parquet-java#362
apache/parquet-java#367

PingLiuPing · 2026-03-16T10:01:40Z

@lifulong Thanks. Let’s update the PR description to better reflect the root cause, and add a test case to verify that when toggling SemanticVersion, the row group filter is either applied or skipped accordingly.
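For illustration, the version gate discussed here could look roughly like the sketch below. It is not the PR's actual `SemanticVersion` code: `parseParquetMrVersion` and `shouldIgnoreStringStats` are hypothetical names, and the `created_by` format is assumed to match the "parquet-mr version X.Y.Z" strings used in the test in this thread.

```cpp
#include <optional>
#include <regex>
#include <string>
#include <tuple>

// Hypothetical version triple parsed from a Parquet created_by string.
struct WriterVersion {
  int majorVersion;
  int minorVersion;
  int patchVersion;
};

// Hypothetical parser: accepts strings such as "parquet-mr version 1.8.1"
// and returns nullopt for any other writer or format.
std::optional<WriterVersion> parseParquetMrVersion(
    const std::string& createdBy) {
  static const std::regex kPattern(
      R"(^parquet-mr version (\d+)\.(\d+)\.(\d+))");
  std::smatch match;
  if (!std::regex_search(createdBy, match, kPattern)) {
    return std::nullopt;
  }
  return WriterVersion{
      std::stoi(match[1].str()),
      std::stoi(match[2].str()),
      std::stoi(match[3].str())};
}

// Hypothetical gate: string min/max stats are ignored only when the file was
// written by parquet-mr older than 1.8.2. Unrecognized writers are trusted
// here, which is an assumption of this sketch.
bool shouldIgnoreStringStats(const std::string& createdBy) {
  const auto version = parseParquetMrVersion(createdBy);
  if (!version.has_value()) {
    return false;
  }
  return std::make_tuple(
             version->majorVersion,
             version->minorVersion,
             version->patchVersion) < std::make_tuple(1, 8, 2);
}
```

A test that toggles the `created_by` string between "parquet-mr version 1.8.1" and "parquet-mr version 1.8.2" would then expect row group skipping to be disabled for the former and enabled for the latter.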

lifulong · 2026-03-16T10:18:23Z

> @lifulong Thanks. Let’s update the PR description to better reflect the root cause, and add a test case to verify that when toggling SemanticVersion, the row group filter is either applied or skipped accordingly.

ok

lifulong · 2026-03-18T01:42:09Z

> @lifulong Thanks. Let’s update the PR description to better reflect the root cause, and add a test case to verify that when toggling SemanticVersion, the row group filter is either applied or skipped accordingly.

@PingLiuPing Hi, I have updated the test. Can you review again?

PingLiuPing · 2026-03-18T22:07:50Z

velox/dwio/parquet/tests/reader/ParquetReaderTest.cpp

  EXPECT_EQ(reader->numberOfRows(), 10ULL);
 }

+TEST_F(ParquetReaderTest, parquetMR181SkipsFilterRowGroupsByStringStats) {


shouldIgnoreStatsForParquetMRVersions

PingLiuPing · 2026-03-18T22:42:28Z

velox/dwio/parquet/tests/reader/ParquetReaderTest.cpp

  EXPECT_EQ(reader->numberOfRows(), 10ULL);
 }

+TEST_F(ParquetReaderTest, parquetMR181SkipsFilterRowGroupsByStringStats) {


Can you add this case to E2EFilterTest.cpp ?


PingLiuPing

Thanks.

PingLiuPing · 2026-03-21T11:23:28Z

@mbasmanova Could you please take a final look at your convenience? The root cause has been clearly identified, and the test coverage is now solid.

mbasmanova · 2026-03-21T12:54:07Z

@PingLiuPing Would it be possible to update the PR description?

PingLiuPing · 2026-03-21T13:06:55Z

> @PingLiuPing Would it be possible to update the PR description?

Thank you @mbasmanova, the PR description has been updated.

lifulong requested a review from majetideepak as a code owner March 12, 2026 11:15

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Mar 12, 2026

lifulong changed the title ~~Fix: compatible with parquet-1.8.1 min/max not defined~~ Fix: fix error equal filter result with parquet-1.8.1 and column meta min/max not defined Mar 12, 2026

lifulong changed the title ~~Fix: fix error equal filter result with parquet-1.8.1 and column meta min/max not defined~~ fix: Fix error equal filter result with parquet-1.8.1 and column meta min/max not defined Mar 12, 2026

mbasmanova requested a review from PingLiuPing March 12, 2026 11:26

PingLiuPing reviewed Mar 12, 2026


lifulong force-pushed the fix_parquet_181_undefined_min_max branch 3 times, most recently from 3865ecc to 1cf0ced Compare March 13, 2026 07:43

PingLiuPing reviewed Mar 13, 2026


lifulong mentioned this pull request Mar 16, 2026

Parquet-1.8.1 min/max order error #16743

Open

lifulong changed the title ~~fix: Fix error equal filter result with parquet-1.8.1 and column meta min/max not defined~~ fix: parquet-1.8.1 column meta min/max order use signed format, diff with new parquet version Mar 16, 2026

lifulong force-pushed the fix_parquet_181_undefined_min_max branch from 1cf0ced to a33b383 Compare March 16, 2026 11:20

lifulong changed the title ~~fix: parquet-1.8.1 column meta min/max order use signed format, diff with new parquet version~~ fix: Parquet-1.8.1 column meta min/max order use signed format, diff with new parquet version Mar 16, 2026

PingLiuPing reviewed Mar 18, 2026


PingLiuPing changed the title ~~fix: Parquet-1.8.1 column meta min/max order use signed format, diff with new parquet version~~ fix: Ignore string column statistics for parquet-mr versions before 1.8.2 Mar 18, 2026

lifulong force-pushed the fix_parquet_181_undefined_min_max branch 2 times, most recently from fc0176d to a4a81b0 Compare March 19, 2026 02:44

compatible with parquet-1.8.1 min/max not defined

507370c

lifulong force-pushed the fix_parquet_181_undefined_min_max branch from a4a81b0 to 507370c Compare March 19, 2026 02:54

PingLiuPing approved these changes Mar 19, 2026


mbasmanova added the ready-to-merge PR that have been reviewed and are ready for merging. PRs with this tag notify the Velox Meta oncall label Mar 21, 2026
