Standardizing memcpy/memset/etc. as pcode operations?

Many processors have bulk memory instructions: x86 has "rep movs/stos" and "repe cmps", AArch64 has "CPY*/SET*", WebAssembly has "memory.copy/memory.fill", etc. Some also have string operations, for example "rep scas" (strlen) in x86.

There are two implementation strategies for such operations. One is to create a small loop, which will be translated into a loop in the decompiler; x86 takes this approach, for example. The downside is that the decompiled output becomes harder to read. The second is to use a custom pcodeop, which is much more readable but then requires custom emulation and is opaque to the decompiler. In particular, when memcpying to the stack, not all effects will be visible to the decompiler.

**Describe the solution you'd like**

I propose to have a dedicated set of core pcodeops to implement bulk memory operations. These would be understood by SLEIGH and by the decompiler, allowing us to have much nicer decompilation for such constructs. 

As an additional benefit, a dedicated pcodeop could be the target for a memory operation fusing pass in the decompiler, which recognizes common memory operation idioms (e.g. copying in a loop, or unrolled assignment to consecutive memory addresses) and replaces them with a single bulk memory op. This would enable further simplifications and optimizations. The decompiler could then properly model the effects of bulk memory operations. 

I would suggest, at minimum, to have four bulk operations:

- `copy(dst, src, unit_size, count)`: copy from `src` to `dst`, `unit_size` bytes at a time, a total of `count` times
- `fill(dst, value, count)`: fill `dst` with `count` copies of `value`; the unit size is taken from the size of the `value` varnode
- `read_effect(src, size)`: a no-op that indicates that the memory region [src, src+size) is read from. Used to support decompilation of custom pcodeops.
- `write_effect(dst, size)`: a no-op that indicates to the decompiler that the memory region [dst, dst+size) is written to. Used to support decompilation of custom pcodeops.

The latter two are added so that the decompiler can precisely model the memory effects of custom pcodeops. 

Other operations are possible, e.g. memcmp, strlen, strcpy, but they are more situational and may be harder to implement, and thus are not part of this proposal.

**Describe alternatives you've considered**

As noted above, alternative implementation approaches include an inline loop or a custom pcodeop, but both have downsides. 

**Additional context**
Add any other context or screenshots about the feature request here.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Standardizing memcpy/memset/etc. as pcode operations? #5769

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Standardizing memcpy/memset/etc. as pcode operations? #5769

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions