feat(engine): ahead of time catalog lookups #20189
Conversation
As a preparation for distributed metastore lookups I introduce ahead-of-time catalog lookups: the metastore is queried explicitly before the physical plan is built. The workflow is:
- build a physical plan to collect the catalog requests required
- resolve the catalog requests to responses (at the moment I use the metastore catalog to do that)
- save the responses to a new ResolvedCatalog
- pass the resolved catalog to the physical planner to build the real physical plan
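To make the two-pass flow concrete, here is a minimal, self-contained Go sketch of the idea. Every name in it (`catalogRequest`, `unresolvedCatalog`, `resolvedCatalog`, `lookupSections`) is a hypothetical stand-in for illustration, not the engine's actual API; in the PR the corresponding pieces are `physical.NewUnresolvedCatalog`, the new `ResolvedCatalog`, and the planner calls quoted below.

```go
package main

import "fmt"

// Hypothetical stand-ins for this PR's types (illustrative only, not the
// actual engine API): an unresolved catalog records the metastore requests
// a plan would need, and a resolved catalog serves their responses.
type catalogRequest struct{ matchers string }

type catalogResponse struct{ sections []string }

type unresolvedCatalog struct{ requests []catalogRequest }

type resolvedCatalog struct{ responses map[catalogRequest]catalogResponse }

// lookupSections stands in for the planner asking its catalog for data.
// Against the unresolved catalog it only records the request.
func (u *unresolvedCatalog) lookupSections(req catalogRequest) []string {
	u.requests = append(u.requests, req)
	return nil // no data yet; this pass only collects requests
}

// Against the resolved catalog the same call returns real responses.
func (r *resolvedCatalog) lookupSections(req catalogRequest) []string {
	return r.responses[req].sections
}

func main() {
	req := catalogRequest{matchers: `{app="foo"}`}

	// Pass 1: "plan" against the unresolved catalog to collect requests.
	unresolved := &unresolvedCatalog{}
	unresolved.lookupSections(req)

	// Resolve every collected request (here, a fake metastore lookup).
	resolved := &resolvedCatalog{responses: map[catalogRequest]catalogResponse{}}
	for _, r := range unresolved.requests {
		resolved.responses[r] = catalogResponse{sections: []string{"obj-1/section-0"}}
	}

	// Pass 2: plan for real, now served from the resolved catalog.
	fmt.Println("sections:", resolved.lookupSections(req))
}
```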
```go
// FilterDescriptorsForShard filters the section descriptors for a given shard.
// It returns the locations, streams, and sections for the shard.
// TODO: Improve filtering: this method could be improved because it doesn't
// resolve the stream IDs to sections, even though this information is
// available. Instead, it resolves stream IDs to the whole object.
func FilterDescriptorsForShard(shard ShardInfo, sectionDescriptors []*metastore.DataobjSectionDescriptor) ([]FilteredShardDescriptor, error) {
```
I don't think we use the ShardInfo anymore with the new scheduler. It might be time to remove it. We can do it after this is merged though.
```go
unresolved := physical.NewUnresolvedCatalog()
collectorPlanner := physical.NewPlanner(physical.NewContext(params.Start(), params.End()), unresolved)
```
It feels a bit strange to me that we do physical planning twice, once to gather fake requests and once with the real data. Did you do it this way to maintain a separation of concerns, or for some other reason?
Physical planning is reasonably cheap without the metastore calls now, but that might change in the future if we build different physical plans depending on the output of the catalog requests, or even issue new requests based off the old ones. Does it make sense to instead implement a catalog which will resolve requests via distributing metaqueries directly, rather than doing this two-step process?
That said, if this is just a stepping stone to the full implementation, I'm happy to approve it to keep things moving!
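For comparison, here is a rough sketch of that alternative under assumed names (`distributedCatalog`, `metaQuerier`, `querySections` are all hypothetical, not the engine's actual interfaces): the planner's catalog resolves each request on demand by issuing the metaquery directly, so planning stays a single pass.

```go
package main

import (
	"context"
	"fmt"
)

// Hypothetical sketch of the suggested alternative: instead of a separate
// collect-then-resolve step, the planner's catalog resolves each request
// on demand by issuing the metaquery directly.
type sectionRef string

type metaQuerier interface {
	querySections(ctx context.Context, matchers string) ([]sectionRef, error)
}

type distributedCatalog struct{ mq metaQuerier }

// resolveSections would be called by the planner while it builds the plan;
// the metaquery happens right here rather than in a second planning pass.
func (c *distributedCatalog) resolveSections(ctx context.Context, matchers string) ([]sectionRef, error) {
	return c.mq.querySections(ctx, matchers)
}

// fakeQuerier stands in for the engine's distributed metaquery execution.
type fakeQuerier struct{}

func (fakeQuerier) querySections(_ context.Context, _ string) ([]sectionRef, error) {
	return []sectionRef{"obj-1/section-0"}, nil
}

func main() {
	cat := &distributedCatalog{mq: fakeQuerier{}}
	sections, err := cat.resolveSections(context.Background(), `{app="foo"}`)
	fmt.Println(sections, err)
}
```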
I wanted to give the engine full control of the (meta)query execution:
- to streamline the query execution pipeline (the engine triggers all the queries, not the planner implicitly)
- and to unblock certain optimizations. The requests we collect are essentially metaqueries we need to execute, so if we know them beforehand we can optimize their execution (deduplicate / run in parallel / etc.; see the sketch below).
At the moment I think there is no difference in how we collect / execute metaqueries (given there's at most 1 catalog request per physical plan, right?), so I can simplify the approach and delegate the work to a new catalog.
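The sketch mentioned above, with hypothetical `catalogRequest` / `resolveOne` helpers: once all metaqueries are known up front, duplicates can be dropped and the remaining requests resolved concurrently rather than one by one during the planner walk.

```go
package main

import (
	"fmt"
	"sync"
)

// Hypothetical types: once every catalog request (metaquery) is known
// before execution, duplicates can be dropped and the rest resolved in
// parallel instead of one by one during the planner walk.
type catalogRequest struct{ matchers string }

type catalogResponse struct{ sections int }

// resolveOne stands in for a single metastore / metaquery lookup.
func resolveOne(req catalogRequest) catalogResponse {
	return catalogResponse{sections: 1}
}

func resolveAll(reqs []catalogRequest) map[catalogRequest]catalogResponse {
	// Deduplicate: identical requests are resolved only once.
	unique := make(map[catalogRequest]struct{}, len(reqs))
	for _, r := range reqs {
		unique[r] = struct{}{}
	}

	// Resolve the unique requests in parallel.
	var (
		mu   sync.Mutex
		wg   sync.WaitGroup
		resp = make(map[catalogRequest]catalogResponse, len(unique))
	)
	for r := range unique {
		wg.Add(1)
		go func(r catalogRequest) {
			defer wg.Done()
			res := resolveOne(r)
			mu.Lock()
			resp[r] = res
			mu.Unlock()
		}(r)
	}
	wg.Wait()
	return resp
}

func main() {
	reqs := []catalogRequest{
		{matchers: `{app="foo"}`},
		{matchers: `{app="foo"}`}, // duplicate, resolved only once
		{matchers: `{app="bar"}`},
	}
	fmt.Println("resolved:", len(resolveAll(reqs))) // resolved: 2
}
```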
> if we build different physical plans depending on the output of the catalog requests, or even issue new requests based off the old ones
Yes! The current approach doesn't work in this case, so I was thinking about a loop like:
```go
for catalogReqs := plan.Unresolved(); len(catalogReqs) > 0; catalogReqs = plan.Unresolved() {
	catalogResps := e.queryCatalog(catalogReqs)
	plan.Resolve(catalogResps)
}
```

I'm still slightly hesitant about executing metaqueries one by one in an order that depends on the physical planner walk, just because we won't have enough context to optimize these queries (parallelize / decouple).
Wdyt about all of the above?
I'm considering simplifying this part for now (applying the "implement a catalog which will resolve requests via distributing metaqueries directly" suggestion) and just leaving a comment that we need to revisit it once we have multiple metaqueries per physical plan.
As a preparation for distributed metastore lookups I introduce ahead-of-time catalog lookups: the metastore is queried explicitly before the physical plan is built, following the workflow described above.
Even though the metastore catalog is still used, this approach allows us to later reuse the engine's capability to run distributed queries to resolve catalog requests (by replacing this resolve func). That change will be introduced in a different PR (WIP example).
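A minimal sketch of the "replace this resolve func" idea, again with hypothetical names (`resolveFunc`, `resolveWithMetastore`, `buildResolvedCatalog`): the ahead-of-time step depends only on a function that turns collected catalog requests into responses, so the metastore-backed resolver could later be swapped for one that fans the requests out as distributed metaqueries without touching the rest of the flow.

```go
package main

import (
	"context"
	"fmt"
)

// Hypothetical names: the ahead-of-time resolution step only depends on a
// resolveFunc, so swapping the metastore-backed implementation for one that
// runs distributed metaqueries does not change the surrounding flow.
type catalogRequest struct{ matchers string }

type catalogResponse struct{ sections int }

type resolveFunc func(ctx context.Context, reqs []catalogRequest) ([]catalogResponse, error)

// resolveWithMetastore stands in for today's metastore catalog lookups.
func resolveWithMetastore(_ context.Context, reqs []catalogRequest) ([]catalogResponse, error) {
	out := make([]catalogResponse, len(reqs))
	for i := range reqs {
		out[i] = catalogResponse{sections: 1} // pretend the metastore found one section
	}
	return out, nil
}

// buildResolvedCatalog is the part that stays the same whichever resolver
// is plugged in.
func buildResolvedCatalog(ctx context.Context, reqs []catalogRequest, resolve resolveFunc) (map[string]catalogResponse, error) {
	resps, err := resolve(ctx, reqs)
	if err != nil {
		return nil, err
	}
	resolved := make(map[string]catalogResponse, len(reqs))
	for i, req := range reqs {
		resolved[req.matchers] = resps[i]
	}
	return resolved, nil
}

func main() {
	reqs := []catalogRequest{{matchers: `{app="foo"}`}}
	// Today: metastore-backed resolution. Later: pass a resolveFunc that
	// executes the requests as distributed metaqueries instead.
	resolved, err := buildResolvedCatalog(context.Background(), reqs, resolveWithMetastore)
	fmt.Println(resolved, err)
}
```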
Checklist
- Reviewed the `CONTRIBUTING.md` guide (required)
- `feat` PRs are unlikely to be accepted unless a case can be made for the feature actually being a bug fix to existing behavior.
- Changes that require user attention or interaction to upgrade are documented in `docs/sources/setup/upgrade/_index.md`
- If the change is deprecating or removing a configuration option, update the `deprecated-config.yaml` and `deleted-config.yaml` files respectively in the `tools/deprecated-config-checker` directory. Example PR