Fix for issue 20371 related to deep depencies #20392

fgarciacorona · 2025-02-19T10:50:59Z

The goal of this PR is to fix #20371 and improve the performance of csharp runtime when dealing with deep nested dependencies of .proto files.

The fix consists in caching the recursive calls for the search of extensions.
A test dataset has been created with deeply nested .proto files that showcases the performance penalty.
A set of benchmark metrics have been created to evaluate the performance impact before and after the fix.
Unfortunately due to not being able to unload all the existing objects inside the same testcase, it is only possible to run the test before and after enabling the caching mechanism. I am open to any other idea on how to improve this.

The test results before and after the fix are the following:
Before the proposed fix:

FileDescriptor_CreateMessageWithDeepDependencies_BuildFromByteString
   Source: DescriptorsTest.cs line 40
   Duration: 2,1 min

  Message: 
  Expected: 399
  But was:  76306932


  Stack Trace: 
DescriptorsTest.FileDescriptor_CreateMessageWithDeepDependencies_BuildFromByteString() line 61
1)    at Google.Protobuf.Reflection.DescriptorsTest.FileDescriptor_CreateMessageWithDeepDependencies_BuildFromByteString() in D:\Git\protobuf\csharp\src\Google.Protobuf.Test\Reflection\DescriptorsTest.cs:line 61

  Standard Output: 
Running performance test for extension registry caching: With caching
{ }
GetAllExtensionsCount: 402
GetAllGeneratedExtensionsCount: 573
GetAllDependedExtensionsCount: 76306932
GetAllDependedExtensionsFromMessageCount: 644966683
TotalReturnedExtensionsCount: 118
w/ cache elapsed: 00:02:04.6190917

After the proposed fix:

 FileDescriptor_CreateMessageWithDeepDependencies_BuildFromByteString
   Source: DescriptorsTest.cs line 40
   Duration: 117 ms

  Standard Output: 
Running performance test for extension registry caching: With caching
{ }
GetAllExtensionsCount: 402
GetAllGeneratedExtensionsCount: 573
GetAllDependedExtensionsCount: 399
GetAllDependedExtensionsFromMessageCount: 39
TotalReturnedExtensionsCount: 118
w/ cache elapsed: 00:00:00.0700036

google-cla · 2025-02-19T10:51:05Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

tonyliaoss · 2025-02-19T16:01:21Z

Hello @fgarciacorona, we won't be able to start reviewing your PR until the CLA has been signed.

Also, I suspect that we won't accept the newly added test protos, but we can discuss that after we start our review process.

fgarciacorona · 2025-02-25T11:12:48Z

Hello @fgarciacorona, we won't be able to start reviewing your PR until the CLA has been signed.

Also, I suspect that we won't accept the newly added test protos, but we can discuss that after we start our review process.

@tonyliaoss CLA has been passed! I cannot modify the wait for user action label

fgarciacorona · 2025-03-04T10:13:23Z

@tonyliaoss Is there any additional action required from my side to have the PR marked as safe for test and proceed with the testing of the PR? Additionally, I think it is missing the C# label.

JasonLunn · 2025-03-05T15:51:02Z

Could you rebase this PR? Github thinks it is changing almost 800 files which makes review impractical.

Edit, nevermind about the rebase. I see what @tonyliaoss means about the test protos, though. Is there no other way to excite the issue that this PR aims to fix?

fgarciacorona · 2025-03-05T23:47:47Z

Could you rebase this PR? Github thinks it is changing almost 800 files which makes review impractical.

Edit, nevermind about the rebase. I see what @tonyliaoss means about the test protos, though. Is there no other way to excite the issue that this PR aims to fix?

Right, there are 3 files that have actually been changed and the rest are half of them .proto and the other half the .cs generated classes from the protos.

I could only think of dynamically generating .proto files highly nested from within the testcase (if you want to reduce the number of existing files in the PR) and then somehow try to run protoc to generate the corresponding classes.

fgarciacorona · 2025-03-11T11:23:58Z

@JasonLunn how could I have the repo marked as "safe for tests" without having to request it for every commit?

JasonLunn · 2025-03-12T14:32:06Z

@JasonLunn how could I have the repo marked as "safe for tests" without having to request it for every commit?

Humans have to apply the tag by policy for third party contributions to make sure they're safe to run on our CI infrastructure.

JasonLunn · 2025-03-12T15:10:51Z

Failed FileDescriptor_CreateMessageWithDeepDependencies_BuildFromByteString [263 ms]
  Error Message:
     Expected: 402
  But was:  419

  Stack Trace:
     at Google.Protobuf.Reflection.DescriptorsTest.FileDescriptor_CreateMessageWithDeepDependencies_BuildFromByteString() in D:\a\protobuf\protobuf\csharp\src\Google.Protobuf.Test\Reflection\DescriptorsTest.cs:line 59

1)    at Google.Protobuf.Reflection.DescriptorsTest.FileDescriptor_CreateMessageWithDeepDependencies_BuildFromByteString() in D:\a\protobuf\protobuf\csharp\src\Google.Protobuf.Test\Reflection\DescriptorsTest.cs:line 59

Interestingly, this only fails on the Windows test configuration, but it passed on Linux...

Improve performance of Extension search Profiling/Benchmarking code still included and enabled ! Add assertions to unit test and enable caching by default Added .proto files based on a deeper hierarchy

fgarciacorona · 2025-03-14T15:52:34Z

Failed FileDescriptor_CreateMessageWithDeepDependencies_BuildFromByteString [263 ms]
  Error Message:
     Expected: 402
  But was:  419

  Stack Trace:
     at Google.Protobuf.Reflection.DescriptorsTest.FileDescriptor_CreateMessageWithDeepDependencies_BuildFromByteString() in D:\a\protobuf\protobuf\csharp\src\Google.Protobuf.Test\Reflection\DescriptorsTest.cs:line 59

1)    at Google.Protobuf.Reflection.DescriptorsTest.FileDescriptor_CreateMessageWithDeepDependencies_BuildFromByteString() in D:\a\protobuf\protobuf\csharp\src\Google.Protobuf.Test\Reflection\DescriptorsTest.cs:line 59

Interestingly, this only fails on the Windows test configuration, but it passed on Linux...

@JasonLunn the reason why it was not failing in linux and arm64 is because the metrics for benchmarking are only generated for Debug (windows) but not for Release (linux and arm64).

Additionally we were storing the benchmark metrics in Debug mode in a static variable that was impacted by all unit tests. This needs to be reset before the specific test that looked at them.

fgarciacorona · 2025-03-27T22:46:06Z

@JasonLunn would it be possible to remove the assigned reviewers and I will do a rebase and hopefully based on that the right reviewers will be assigned? I think because I didn't rebase it properly the first time a lot of commits from main triggered so many people.

JasonLunn · 2025-03-28T05:43:44Z

It would expedite review if you could refactor the PR so that there are not ~800 files. I assume you have a script that generated the deeply nested test .proto files in the first place - could you use that in a genrule to generate the test input files at test time so that they don't have to be individually reviewed?

fgarciacorona · 2025-03-28T11:24:47Z

It would expedite review if you could refactor the PR so that there are not ~800 files. I assume you have a script that generated the deeply nested test .proto files in the first place - could you use that in a genrule to generate the test input files at test time so that they don't have to be individually reviewed?

The script that we have anonymized our data, so the proto files are based on actual data. But thanks for the feedback, knowing that a generator could also be used, I will focus on that.

Question though: do the generated .cs files need to be also committed? because I see them committed in the repo for other test .proto files, and that would eventually defeat the purpose (we would have ~400 only). Additionally if we don't commit the generated .cs files I am not sure whether it would be an accepted approach to have the test project not able to build if the test classes have not been generated.

If we can assume that every developer will always run generate_protos.sh script before starting development then the test csharp classes will be generated.

fgarciacorona requested a review from a team as a code owner February 19, 2025 10:51

fgarciacorona requested review from jskeet and removed request for a team February 19, 2025 10:51

jskeet removed their request for review February 19, 2025 11:05

tonyliaoss added cla: no wait for user action labels Feb 19, 2025

google-cla bot added cla: yes and removed cla: no labels Feb 25, 2025

tonyliaoss removed the wait for user action label Feb 25, 2025

fgarciacorona requested review from a team as code owners March 5, 2025 14:14

fgarciacorona requested review from JasonLunn, haberman, dmaclach, googleberg and Logofile and removed request for a team March 5, 2025 14:14

JasonLunn added c# wait for user action 🅰️ safe for tests Mark a commit as safe to run presubmits over labels Mar 5, 2025

github-actions bot removed the 🅰️ safe for tests Mark a commit as safe to run presubmits over label Mar 5, 2025

fgarciacorona marked this pull request as draft March 11, 2025 10:20

fgarciacorona force-pushed the fix_20371_csharp_deep_dependencies branch 2 times, most recently from 3515086 to a521e55 Compare March 11, 2025 11:22

JasonLunn added 🅰️ safe for tests Mark a commit as safe to run presubmits over and removed wait for user action labels Mar 12, 2025

github-actions bot removed the 🅰️ safe for tests Mark a commit as safe to run presubmits over label Mar 12, 2025

JasonLunn added the wait for user action label Mar 12, 2025

fgarciacorona added 3 commits March 14, 2025 16:45

Fix for issue 20371 related to deep depencies

c1d079c

Improve performance of Extension search Profiling/Benchmarking code still included and enabled ! Add assertions to unit test and enable caching by default Added .proto files based on a deeper hierarchy

Added proto files explicitely for deep dependencies tests

b69193f

Reset counters on the static fields before testing

5bb0e0a

fgarciacorona force-pushed the fix_20371_csharp_deep_dependencies branch from a521e55 to 5bb0e0a Compare March 14, 2025 15:45

JasonLunn added 🅰️ safe for tests Mark a commit as safe to run presubmits over and removed wait for user action labels Mar 14, 2025

github-actions bot removed the 🅰️ safe for tests Mark a commit as safe to run presubmits over label Mar 14, 2025

fgarciacorona marked this pull request as ready for review March 14, 2025 16:07

JasonLunn added the wait for user action label Mar 28, 2025

Logofile removed their request for review April 2, 2025 17:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix for issue 20371 related to deep depencies #20392

Fix for issue 20371 related to deep depencies #20392

Uh oh!

fgarciacorona commented Feb 19, 2025 •

edited

Loading

Uh oh!

google-cla bot commented Feb 19, 2025

Uh oh!

tonyliaoss commented Feb 19, 2025

Uh oh!

fgarciacorona commented Feb 25, 2025 •

edited

Loading

Uh oh!

fgarciacorona commented Mar 4, 2025

Uh oh!

JasonLunn commented Mar 5, 2025 •

edited

Loading

Uh oh!

fgarciacorona commented Mar 5, 2025

Uh oh!

fgarciacorona commented Mar 11, 2025

Uh oh!

JasonLunn commented Mar 12, 2025

Uh oh!

JasonLunn commented Mar 12, 2025

Uh oh!

fgarciacorona commented Mar 14, 2025

Uh oh!

fgarciacorona commented Mar 27, 2025

Uh oh!

JasonLunn commented Mar 28, 2025

Uh oh!

fgarciacorona commented Mar 28, 2025 •

edited

Loading

Uh oh!

Uh oh!

Fix for issue 20371 related to deep depencies #20392

Are you sure you want to change the base?

Fix for issue 20371 related to deep depencies #20392

Uh oh!

Conversation

fgarciacorona commented Feb 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

google-cla bot commented Feb 19, 2025

Uh oh!

tonyliaoss commented Feb 19, 2025

Uh oh!

fgarciacorona commented Feb 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fgarciacorona commented Mar 4, 2025

Uh oh!

JasonLunn commented Mar 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fgarciacorona commented Mar 5, 2025

Uh oh!

fgarciacorona commented Mar 11, 2025

Uh oh!

JasonLunn commented Mar 12, 2025

Uh oh!

JasonLunn commented Mar 12, 2025

Uh oh!

fgarciacorona commented Mar 14, 2025

Uh oh!

fgarciacorona commented Mar 27, 2025

Uh oh!

JasonLunn commented Mar 28, 2025

Uh oh!

fgarciacorona commented Mar 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

fgarciacorona commented Feb 19, 2025 •

edited

Loading

fgarciacorona commented Feb 25, 2025 •

edited

Loading

JasonLunn commented Mar 5, 2025 •

edited

Loading

fgarciacorona commented Mar 28, 2025 •

edited

Loading