Skip to content

fix passthrough flatmap raw size accounting#466

Open
HuamengJiang wants to merge 3 commits intofacebookincubator:mainfrom
HuamengJiang:export-D92242565
Open

fix passthrough flatmap raw size accounting#466
HuamengJiang wants to merge 3 commits intofacebookincubator:mainfrom
HuamengJiang:export-D92242565

Conversation

@HuamengJiang
Copy link

Summary:
Fix passthrough flatmap raw size accounting in mainly two issues:

  • string flatmap keys
  • constant encoding
    Now raw size accounting is fully consistent with field writer stats collection.

Differential Revision: D92242565

Huameng (Michael) Jiang added 3 commits February 4, 2026 02:12
Summary:

When making changes in stats collection path and now cross validating raw/logical size calculation from vector and field writer collection, we found some test failures in feature reaper tests due to test data specifying inconsistent input vector type and schema type. The writer supports this upcast implicitly otherwise, so the right thing to do is to support it in a lightweight fashion.

We do still have to fix the test case where it tried to pass integer to a float column.

Differential Revision: D91559828
Summary:
As we went through the current validations, we can start to deprecate the current top level raw size accounting from the input vector and use the logical size from root column stats instead. We do so via a flag in writer options, which can then be wired up in the different downstream call sites.

Still leaves the flag on for continuous e2e validation (vader).

Differential Revision: D91559827
Summary:
Fix passthrough flatmap raw size accounting in mainly two issues:
* string flatmap keys
* constant encoding 
Now raw size accounting is fully consistent with field writer stats collection.

Differential Revision: D92242565
@meta-cla meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Feb 4, 2026
@meta-codesync
Copy link

meta-codesync bot commented Feb 4, 2026

@HuamengJiang has exported this pull request. If you are a Meta employee, you can view the originating Diff in D92242565.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Meta Open Source bot. fb-exported meta-exported

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant