De-sugar First Author Search by JCRPaquin · Pull Request #232 · adsabs/montysolr

JCRPaquin · 2025-04-14T08:27:53Z

What?

The PR masks first author searches created using the dedicated syntax sugar. It also introduces a configuration path to expand this desugaring to other fields as needed.

Why?

After the Solr upgrade from 7 to 9 we experienced a large number of failing position queries. This is because the max clause limit, previously enforced only for boolean clauses, was expanded to more query types. Unfortunately, prefix queries, which are commonly used in author searches, are expanded to all matching terms prior to lookup; these terms are each represented as a single clause and joined together into a (con/dis)junction. Given the size of our collection, it's not uncommon for author searches to pull in 10k+ author names, which is far higher than the default limit.

Alternatives

We could have altered the max clause limit to be something ludicrously high-- this would have mostly reverted the behavior. However, in some limited circumstances the limit would still be breached and users would likely be even more confused than they are today. There's also the chance that a user might discover the increased limit and use it to DDoS ADS/SciX.

Thanks to @sstults for the pairing time spent on this PR.

sstults

I think this looks right.

kelockhart

The expected behavior in the unit tests looks good to me

…uthor-desugar

JCRPaquin added 11 commits March 21, 2025 03:09

Drop pos operator around first author query

9d484e6

Pipe query config handler to subquery parsers

fff0724

Add query config handler to pos parser

0ff28ca

Add first position remapping config key

55ba9cf

Remap first position queries using the config data

7eb7044

Remove if-block that didn't use the new config

1424bcd

Add extra config values

5877b27

Correct first author remapping test expectations

f670c63

Remove first author node processor

6d29667

Repair test cases after first author masking change

b1f6caf

Merge branch 'main' into jcrpaquin/bug/first-author-desugar

50ca5cb

JCRPaquin requested review from kelockhart and sstults April 14, 2025 08:28

sstults approved these changes Apr 14, 2025

View reviewed changes

kelockhart approved these changes Apr 15, 2025

View reviewed changes

Merge remote-tracking branch 'origin/main' into jcrpaquin/bug/first-a…

8ae5987

…uthor-desugar

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

De-sugar First Author Search#232

De-sugar First Author Search#232
JCRPaquin wants to merge 12 commits intomainfrom
jcrpaquin/bug/first-author-desugar

JCRPaquin commented Apr 14, 2025

Uh oh!

sstults left a comment

Uh oh!

kelockhart left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

JCRPaquin commented Apr 14, 2025

What?

Why?

Alternatives

Uh oh!

sstults left a comment

Choose a reason for hiding this comment

Uh oh!

kelockhart left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants