Performance improvements by NikoGrano · Pull Request #1621 · schmittjoh/serializer

NikoGrano · 2026-03-11T22:52:01Z

Improve library performance. See commits for exact details.

Q	A
Bug fix?	yes/no
New feature?	no
Doc updated	no
BC breaks?	no
Deprecations?	no
Tests pass?	yes
License	MIT

Types like "string", "DateTime<'Y-m-d'>", "array<MyClass>" are re-parsed on every serialize/deserialize call. Cache eliminates redudant lexer/parser work. Wired as default in SerializerBuilder; users calling setTypeParser() still override entirely.

Avoid creating new ReflectionClass and running array_search() on every call. Use a static flipped constants map for O(1) lookup.

Replace O(n) linear scan with O(1) hash lookup using a static flipped array.

NikoGrano · 2026-03-11T23:04:36Z

+            self::$constantNames = array_flip((new \ReflectionClass(Lexer::class))->getConstants());
+        }

-        return array_search($value, $oClass->getConstants());


array_search defaulted to false if not found and would have caused crash as function has expected output of string.

After this change error might change, maybe add ?? false, but again this will just cause same type related error in php...

In my opinion this is such edge case it should not happen as it would mean this library being broken internally and would just add extra check which should not be ever needed. Any opinion on this could be helpful, or just leave it "as is".

Avoid creating new DateTimeZone objects on every deserialization call for the same timezone string. Instance-level caching.

Isolates type parsing performance: 9 realistic type strings, 1000 revs x 5 iterations, comparing raw Parser vs CachingParser.

uniqid() is just slow and not even cryptographically secure. The prefix only needs to be unique within a single XPath registerXPathNamespace() call, not globally. Replaced with simpler int counter.

…zationVisitor Added instance-leven caches for resolveNamespacePrefix() avoiding repeated SHA1 calculation and isElementNameValid() avoiding repeated preg_match for the same keys.

Cache the handler result from the polymorphic type check and reuse it if the type hasn't changed after the pre_serialize event dispatch, eliminating a redundant hash lookup per object node.

…trategy When nestedGroups is enabled, shouldSkipProperty() used in_array() O(n) to check group membership. Build a hashmap from getGroupsFor() result and use isset() O(1) instead.

Use three fast paths based on stack delta analysis to avoid full scans on every shouldSkipProperty()/shouldSkipClass() call. Same stack count retunrs cached result O(1). Stack shrunk and was not too deep O(1). Stack grew, no maxDepth on stack, not too deep, check only deltas O(delta). hasMaxDepthOnStack flag tracks if any maxDepth attributes exist on the stack. When false, growth never triggers full scan. When true or cazched result is too deep, the logic falls back to original full scan.

In PHP 7.x/8.x arrays outperform SplStack with noticable difference.

NikoGrano · 2026-03-12T19:45:13Z

After these changes I can see some improvements (avg from 5 runs):

Benchmark	Before	After	Δ	Improvement
JsonSerializationBench	218.4ms	201.8ms	−16.6ms	7.6%
JsonMaxDepthSerializationBench	273.7ms	249.4ms	−24.3ms	8.9%
XmlSerializationBench	405.1ms	361.9ms	−43.2ms	10.7%

Memory usage unchanged across all benchmarks (~39.2 MB for JSON, ~11.7 MB for XML).

@scyzoryck What do you think, is this something to finalize and prep for merge?

I might still go trough and try minimize changes, but at this stage this is what we would get if I decide to continue this PR.

scyzoryck · 2026-03-24T10:47:50Z

Hey!
I checked the results on the Github actions - I can see the difference on the XmlSerializationBench, but no difference in Json serialization :( I will check it on my local this week.

NikoGrano · 2026-03-24T11:14:34Z

👍 Just a side note, I found this to be varying between envs. On my workstation I get clean results, but again with older laptop the difference is smaller. Especially PHP 7 vs PHP 8.

scyzoryck · 2026-03-31T14:38:34Z

@NikoGrano can you rebase to rerun tests with the PHP8.5?

NikoGrano force-pushed the performance branch from 6734a66 to 168906f Compare March 11, 2026 22:55

NikoGrano force-pushed the performance branch from 168906f to e6d4568 Compare March 11, 2026 22:58

NikoGrano added 2 commits March 12, 2026 00:59

Cache ReflectionClass constants in Parser::getConstant()

b00cf3f

Avoid creating new ReflectionClass and running array_search() on every call. Use a static flipped constants map for O(1) lookup.

Use isset() instead of in_array() in UnionHandler::isPrimitiveType()

b6333e2

Replace O(n) linear scan with O(1) hash lookup using a static flipped array.

NikoGrano commented Mar 11, 2026

View reviewed changes

NikoGrano added 8 commits March 12, 2026 01:13

Cache DateTimeZone instances in DateHandler

8890b13

Avoid creating new DateTimeZone objects on every deserialization call for the same timezone string. Instance-level caching.

Add TypeParsingBench for phpbench

9a80b3a

Isolates type parsing performance: 9 realistic type strings, 1000 revs x 5 iterations, comparing raw Parser vs CachingParser.

Replace uniqid() with counter in XmlDeserializationVisitor

2ca688f

uniqid() is just slow and not even cryptographically secure. The prefix only needs to be unique within a single XPath registerXPathNamespace() call, not globally. Replaced with simpler int counter.

Cache SHA1 namespace prefix and element name validation in XmlSeriali…

5a35952

…zationVisitor Added instance-leven caches for resolveNamespacePrefix() avoiding repeated SHA1 calculation and isElementNameValid() avoiding repeated preg_match for the same keys.

Fix O(n2) getCurrentPath() in Context using append+reverse

38ea896

Avoid redundant handler lookup in SerializationGraphNavigator

6632df4

Cache the handler result from the polymorphic type check and reuse it if the type hasn't changed after the pre_serialize event dispatch, eliminating a redundant hash lookup per object node.

Rector fix

adf5ad2

Replace in_array() with isset() for nested groups in GroupsExclusionS…

82a579a

…trategy When nestedGroups is enabled, shouldSkipProperty() used in_array() O(n) to check group membership. Build a hashmap from getGroupsFor() result and use isset() O(1) instead.

NikoGrano force-pushed the performance branch from 06a735f to 82a579a Compare March 11, 2026 23:40

NikoGrano force-pushed the performance branch from e479b11 to 5092601 Compare March 12, 2026 13:15

This comment was marked as outdated.

Sign in to view

Replace SplStack with plain arrays

40e3f0b

In PHP 7.x/8.x arrays outperform SplStack with noticable difference.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Performance improvements#1621

Performance improvements#1621
NikoGrano wants to merge 13 commits intoschmittjoh:masterfrom
NikoGrano:performance

NikoGrano commented Mar 11, 2026

Uh oh!

NikoGrano Mar 11, 2026

Uh oh!

This comment was marked as outdated.

NikoGrano commented Mar 12, 2026

Uh oh!

scyzoryck commented Mar 24, 2026

Uh oh!

NikoGrano commented Mar 24, 2026

Uh oh!

scyzoryck commented Mar 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

NikoGrano commented Mar 11, 2026

Uh oh!

NikoGrano Mar 11, 2026

Choose a reason for hiding this comment

Uh oh!

This comment was marked as outdated.

NikoGrano commented Mar 12, 2026

Uh oh!

scyzoryck commented Mar 24, 2026

Uh oh!

NikoGrano commented Mar 24, 2026

Uh oh!

scyzoryck commented Mar 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants