Fix duplicated definitions #14573

ed-henrique · 2025-10-11T02:06:04Z

Deduplicates definitions' locations before rendering. When a collision happens for the locations using their URIs and ranges as keys, the one with the highest offset encoding is used (chosen because OffsetEncoding::Utf16 is the highest in the enum, and also the default)

helix-term/src/commands/lsp.rs

…rder

ed-henrique · 2025-10-12T15:06:06Z

I used an auxiliar order Vec to keep the original locations' order after keeping them deduplicated through the HashMap.

helix-term/src/commands/lsp.rs

the-mikedavis · 2025-10-16T14:34:31Z

helix-term/src/commands/lsp.rs

+                if location.offset_encoding > existing.offset_encoding {
+                    *existing = location;
+                }


This handling of offset encoding here is not correct. Offset encoding controls how the character offset in a lsp::Position (within lsp::Range, within Location) should be interpreted: either as a byte offset (UTF-8), UTF-16 code unit offset or character offset (UTF-32). We can't know how a Location using one offset encoding compares to a Location using another offset encoding until we read the file's contents.

Instead let's not attempt to deduplicate locations using different offset encodings, i.e. only deduplicate when the full Location is equal

If we depuplicate only when the full Location is equal, this does not solve the problem in #14551, since the OffsetEncoding is the only difference between the Locations given by the different LSPs. If I remove the curent behavior, the user won't feel any difference between before and after the fix, because they will still be duplicated in the picker.

Also, when actually following the definitions, I couldn't find a difference between cursor or buffer position in either encoding while testing.

How do you think we should handle that? Are there two known LSPs that provide different results when following Locations where the OffsetEncoding is different?

Also, considering this, I used the highest OffsetEncoding as the default, but we could just as well use the first one that appears, if no difference is actually detectable.

You wouldn't notice the difference on ASCII text. For an example, if a line contains a character like 🏴 (U+1F3F4) then the byte offset after the '🏴' would be 4 for UTF-8, 2 for UTF-16 and 1 for UTF-32. So if the contents of a line are not all ASCII then you can't know that two lsp::Ranges with different offset encodings are equal.

I see. So, the duplicates are expected behavior, then? If the whole Location should be taken into account, I can use a IndexSet<Location> and simplify the function even further.

Is this PR still useful or should I original issue be closed concluding that it's expected behavior?

ed-henrique added 2 commits October 10, 2025 22:00

fix(lsp): deduplicate definitions' locations before rendering

21a48f5

test(lsp): add a new test for deduplicate_locations

39395b7

poliorcetics reviewed Oct 12, 2025

View reviewed changes

helix-term/src/commands/lsp.rs Show resolved Hide resolved

ed-henrique added 2 commits October 12, 2025 11:02

fix(lsp): keep original locations' order

9271ece

test(lsp): add a new test for deduplicate_locations that checks for o…

c602a1e

…rder

m4rch3n1ng reviewed Oct 12, 2025

View reviewed changes

helix-term/src/commands/lsp.rs Outdated Show resolved Hide resolved

refactor(lsp): use IndexMap instead of HashMap with an auxiliar Vec

c1b39d7

m4rch3n1ng reviewed Oct 12, 2025

View reviewed changes

helix-term/src/commands/lsp.rs Outdated Show resolved Hide resolved

refactor(lsp): remove unnecessary clone

963ceba

the-mikedavis reviewed Oct 16, 2025

View reviewed changes

fix(lsp): use whole Location to check for duplicates using IndexSet

0aefb2c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Fix duplicated definitions #14573

Fix duplicated definitions #14573

ed-henrique commented Oct 11, 2025

Uh oh!

Uh oh!

ed-henrique commented Oct 12, 2025

Uh oh!

Uh oh!

Uh oh!

the-mikedavis Oct 16, 2025

Uh oh!

ed-henrique Oct 16, 2025

Uh oh!

ed-henrique Oct 16, 2025

Uh oh!

the-mikedavis Oct 16, 2025

Uh oh!

ed-henrique Oct 16, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Fix duplicated definitions #14573

Are you sure you want to change the base?

Fix duplicated definitions #14573

Conversation

ed-henrique commented Oct 11, 2025

Uh oh!

Uh oh!

ed-henrique commented Oct 12, 2025

Uh oh!

Uh oh!

Uh oh!

the-mikedavis Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

ed-henrique Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

ed-henrique Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

the-mikedavis Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

ed-henrique Oct 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ed-henrique Oct 16, 2025 •

edited

Loading