fix: handle Unicode characters correctly in mutator byte offset calculation #2832

stevennevins · 2025-12-08T13:45:00Z

Fix source code corruption when slither-mutate processes Solidity files containing Unicode characters (e.g., arrow symbols, non-ASCII comments).

Example

// → Unicode arrow
contract Counter {
    function increment() public {
        number++;
    }
}

The → arrow is 3 bytes in UTF-8 but 1 character. This 2-byte difference shifts every later offset, so mutations are applied at the wrong positions.
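The mismatch can be reproduced in plain Python. The sketch below is illustrative only and is not taken from slither-mutate: the source string mirrors the example contract, and `byte_start` stands in for an offset that solc would report.

```python
# Illustrative only: the example contract, with one multi-byte character in the comment.
source = (
    "// → Unicode arrow\n"
    "contract Counter {\n"
    "    function increment() public {\n"
    "        number++;\n"
    "    }\n"
    "}\n"
)
encoded = source.encode("utf-8")

# The arrow is one code point but three UTF-8 bytes.
arrow_chars = len("→")
arrow_bytes = len("→".encode("utf-8"))

# Stand-in for a solc-reported location: byte offset and byte length of `number++`.
byte_start = encoded.index(b"number++")
byte_length = len(b"number++")

# Wrong: treating the byte offset as a character index lands 2 characters late.
wrong_slice = source[byte_start:byte_start + byte_length]

# Right: slice the encoded bytes, then decode the result.
right_slice = encoded[byte_start:byte_start + byte_length].decode("utf-8")

print(repr(wrong_slice))  # 'mber++;\n'
print(repr(right_slice))  # 'number++'
```

The two characters skipped by the wrong slice (`nu`) are the same stray prefix that shows up in the garbage output.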

Without fix - garbage output:

nurevert()
fufunction setNumber
nu++number

With fix - correct mutations:

revert()
function setNumber
++number

…lations

Solc reports source locations as byte offsets, but the mutator code was using Python string indexing, which operates on character counts. For files with multi-byte Unicode characters (e.g., Japanese comments), this caused mutations to be applied at wrong positions.

- Change test_patch() to read/write files in binary mode
- Fix apply_patch() offset calculation to use byte length
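A minimal sketch of the byte-oriented approach described above, assuming a patch is just a byte offset, a byte length, and a replacement string. The helpers `apply_byte_patch` and `patch_file` are hypothetical names for illustration and are not the actual `apply_patch()` or `test_patch()` from slither-mutate.

```python
import os
import tempfile


def apply_byte_patch(source: bytes, start: int, length: int, replacement: str) -> bytes:
    """Replace `length` bytes at byte offset `start` (hypothetical helper)."""
    return source[:start] + replacement.encode("utf-8") + source[start + length:]


def patch_file(path: str, start: int, length: int, replacement: str) -> None:
    """Read and write in binary mode so offsets stay in bytes end to end."""
    with open(path, "rb") as f:
        original = f.read()
    with open(path, "wb") as f:
        f.write(apply_byte_patch(original, start, length, replacement))


# Usage: mutate `number++` into `++number` in a file containing a Unicode arrow.
contract = (
    "// → Unicode arrow\n"
    "contract Counter {\n"
    "    function increment() public {\n"
    "        number++;\n"
    "    }\n"
    "}\n"
)

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "Counter.sol")
    with open(path, "wb") as f:
        f.write(contract.encode("utf-8"))

    # Stand-in for a solc-reported byte offset.
    start = contract.encode("utf-8").index(b"number++")
    patch_file(path, start, len(b"number++"), "++number")

    with open(path, "rb") as f:
        patched = f.read().decode("utf-8")

print(patched)
```

Keeping the file contents as `bytes` until the final decode means the offset never has to be converted between byte and character units.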

stevennevins force-pushed the fix/mutator-unicode-byte-offsets branch from 4d70812 to e5d2efa December 8, 2025 13:46

stevennevins marked this pull request as ready for review December 11, 2025 21:39

stevennevins requested review from bohendo and smonicas as code owners December 11, 2025 21:39

Merge branch 'dev' into fix/mutator-unicode-byte-offsets

da905c8
