Skip to content

missleading benchmarks #26

Open
Open
@Licenser

Description

@Licenser

Hi,

I wanted to suggest changing the benchmark output slightly, as they are presented, it is somewhat misleading.

The way serde_json and simd-json treat the Dom is very different from how simdjson treats the Dom. Both are valid tradeoffs to make, but comparing them is not very meaningful.

Both serde_json and simd-json when presenting a Dom create a nested data structure that is modifiable and has indexed maps - a data structure on its own. That comes at the cost of allocations and filling data structures, but it's a valid tradeoff when either map are accessed frequently, or the date needs to be modified.

simdjson presents a pointer to the tape as a Dom, which means it does not perform extra allocations but does not allow mutations, and lookups are always in linear time.

Again, both are valid tradeoffs for different use cases. However, comparing them is problematic as what we compare isn't the same result.

I think the best way would be to create a third category aside of Dom, Struct called Tape, which is the fully validated JSON but not put in a nested data structure. serde_json does not provide an interface like that, simd-json does provide to_tape which provides an equivalent data structure to simdjson but without the nicer access functions (so that should be easy to implement oneself or add).

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions