Support very long string as benchmark arguments #1248

adamsitnik · 2019-09-17T08:49:46Z

AndreyAkinshin · 2019-09-17T10:45:36Z

src/BenchmarkDotNet.Diagnostics.Windows/Sessions.cs

+            if (methodName.Length <= MaxBenchmarkNameLength)
+                return methodName;
+
+            return methodName.Substring(0, MaxBenchmarkNameLength);


It seems that such an approach may lead to the same "BenchmarkName"s for different benchmarks. Can it be a problem?

Very good catch! In such case, we are going to overwrite the trace file.

This is obviously far from perfect. I could generate some unique file name (using guid for example) but it would make it harder for the end-user to connect the trace file name with the benchmark.

@eerhardt let's say that you want to run following benchmark and use EtwProfiler to profile it:

public IEnumerable<object> Arguments() { yield return new string('a', 200_000); yield return new string('a', 200_000 - 1) + "b"; } [Benchmark] [ArgumentsSource(nameof(Arguments))] public void Some(string value)

We can't use full benchmark name with the super long string as a trace file name and we can not take just the first part of the string because the file name would be the same for two benchmarks.

Would using a guid in such case as a trace file name be suprising to the end user?

I would suggest keeping the beginning of the name the same, so I could tell that these 2 benchmarks were Some benchmarks. But then appending the Guid to the end would be understandable to me.

So either leaving out the args all together:

MyNamespace.MyClass.MyMethod.NewGuid

or chopping the arguments NewGuid characters before the max, and then appending NewGuid to the end.

MyNamespace.MyClass.MyMethod.aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaNewGuid

I don't think purely using a Guid for the whole name would be understandable.

It will be nice if generate the same name for the same benchmarking. I'm suggesting to use a hashcode of the whole string at the end of the "shortified" name instead of a random GUID.

It will be nice if generate the same name for the same benchmarking. I'm suggesting to use a hashcode of the whole string at the end of the "shortified" name instead of a random GUID

This is a good suggestion, however, afaik in .NET Core, the hash codes are the same for given string value only for current program lifetime. If you close the app and run it again, then the value is different (afaik for some security reasons)

One possibility would be to use a different hash algorithm, ex. SHA or MurmurHash.

We use Murmur hashing in ML.NET if you want to copy the code:

https://github.com/dotnet/machinelearning/blob/master/src/Microsoft.ML.Core/Utilities/Hashing.cs

tests/BenchmarkDotNet.IntegrationTests/ArgumentsTests.cs

# Conflicts: # src/BenchmarkDotNet.Diagnostics.Windows/Sessions.cs # tests/BenchmarkDotNet.IntegrationTests/ArgumentsTests.cs

…mentLength" This reverts commit 9cc7fbb.

…as a different command length limit)

…ions

adamsitnik · 2020-04-27T15:52:25Z

@AndreyAkinshin @eerhardt it's been a while but the PR is now ready for review.

eerhardt · 2020-04-27T16:48:30Z

// file copied from https://github.com/dotnet/runtime/blob/master/src/libraries/Common/tests/System/IO/PathFeatures.cs

You probably want to retain the copyright on this and Hashing.cs files.

Refers to: src/BenchmarkDotNet/Extensions/PathFeatures.cs:1 in 183927e. [](commit_id = 183927e, deletion_comment = False)

src/BenchmarkDotNet/Extensions/Hashing.cs

eerhardt

Looks good to me (for what its worth).

# Conflicts: # tests/BenchmarkDotNet.IntegrationTests/ArgumentsTests.cs

AndreyAkinshin

When you specify the source of a copied file like

// file copied from https://github.com/dotnet/machinelearning/blob/master/src/Microsoft.ML.Core/Utilities/Hashing.c

it's better to reference a specific commit or a specific tag. The source code in the master branch can be changed at any moment; it may complicate tracking the origin of the copied source code.

src/BenchmarkDotNet/Extensions/Hashing.cs

src/BenchmarkDotNet/Extensions/PathFeatures.cs

adamsitnik · 2020-07-16T07:10:11Z

it's better to reference a specific commit or a specific tag

done

adamsitnik added 4 commits September 17, 2019 10:25

implement test

dd7120b

extend FullNameProvider with a possibility to provide maxArgumentLength

9cc7fbb

make EtwProfiler support benchmarks with long string arguments, fixes #…

6ba97eb

…1198

Support benchmarks with very long string arguments, fixes #1247

8d8f793

adamsitnik requested review from AndreyAkinshin and WojciechNagorski September 17, 2019 08:49

AndreyAkinshin reviewed Sep 17, 2019

View reviewed changes

tests/BenchmarkDotNet.IntegrationTests/ArgumentsTests.cs Outdated Show resolved Hide resolved

code review: test more complex case

db9338a

adamsitnik mentioned this pull request Apr 27, 2020

System.IO.FileNotFoundException with EtwProfiler #1431

Closed

adamsitnik added 10 commits April 27, 2020 14:36

Merge remote-tracking branch 'origin/master' into longStringAsArguments

d88fa84

# Conflicts: # src/BenchmarkDotNet.Diagnostics.Windows/Sessions.cs # tests/BenchmarkDotNet.IntegrationTests/ArgumentsTests.cs

Revert "extend FullNameProvider with a possibility to provide maxArgu…

7d98858

…mentLength" This reverts commit 9cc7fbb.

use hashcode as session name

db2f35c

copy code from CoreFX that detects long paths support

75f3c02

copy the Hashing implementation from ML.NET repo

f350c6f

make sure that GetTraceFilePath handles path length limits

4383e71

make sure we don't send too long command to Process.Start (every OS h…

dc0dd97

…as a different command length limit)

use Hashing for ETW session name as well

e40198c

mention BDN in the session name to allow for identifying BDN ETW sess…

213ba7e

…ions

refactor

183927e

adamsitnik requested a review from AndreyAkinshin April 27, 2020 15:51

adamsitnik added this to the v0.12.2 milestone Apr 27, 2020

eerhardt reviewed Apr 27, 2020

View reviewed changes

src/BenchmarkDotNet/Extensions/Hashing.cs Outdated Show resolved Hide resolved

eerhardt previously approved these changes Apr 27, 2020

View reviewed changes

adamsitnik added 2 commits April 28, 2020 10:13

code review fixes

941b848

minor refactor

d711c14

adamsitnik dismissed eerhardt’s stale review via d711c14 April 28, 2020 08:13

adamsitnik mentioned this pull request Jun 22, 2020

ConcurrencyVisualizerProfiler Attribute Causing Exception #1198

Closed

Merge branch 'master' into longStringAsArguments

a3ce013

# Conflicts: # tests/BenchmarkDotNet.IntegrationTests/ArgumentsTests.cs

AndreyAkinshin requested changes Jul 16, 2020

View reviewed changes

adamsitnik commented Jul 16, 2020

View reviewed changes

src/BenchmarkDotNet/Extensions/Hashing.cs Outdated Show resolved Hide resolved

adamsitnik commented Jul 16, 2020

View reviewed changes

src/BenchmarkDotNet/Extensions/PathFeatures.cs Outdated Show resolved Hide resolved

use permalinks

74c699c

AndreyAkinshin approved these changes Jul 16, 2020

View reviewed changes

AndreyAkinshin merged commit 59080cd into master Jul 16, 2020

AndreyAkinshin deleted the longStringAsArguments branch July 16, 2020 07:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Support very long string as benchmark arguments #1248

Support very long string as benchmark arguments #1248

adamsitnik commented Sep 17, 2019 •

edited

Loading

Uh oh!

AndreyAkinshin Sep 17, 2019

Uh oh!

adamsitnik Sep 23, 2019

Uh oh!

eerhardt Sep 23, 2019

Uh oh!

AndreyAkinshin Sep 29, 2019

Uh oh!

adamsitnik Oct 5, 2019

Uh oh!

eerhardt Oct 7, 2019

Uh oh!

Uh oh!

adamsitnik commented Apr 27, 2020

Uh oh!

eerhardt commented Apr 27, 2020

Uh oh!

Uh oh!

eerhardt left a comment

Uh oh!

AndreyAkinshin left a comment

Uh oh!

Uh oh!

Uh oh!

adamsitnik commented Jul 16, 2020

Uh oh!

Uh oh!

Support very long string as benchmark arguments #1248

Support very long string as benchmark arguments #1248

Conversation

adamsitnik commented Sep 17, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AndreyAkinshin Sep 17, 2019

Choose a reason for hiding this comment

Uh oh!

adamsitnik Sep 23, 2019

Choose a reason for hiding this comment

Uh oh!

eerhardt Sep 23, 2019

Choose a reason for hiding this comment

Uh oh!

AndreyAkinshin Sep 29, 2019

Choose a reason for hiding this comment

Uh oh!

adamsitnik Oct 5, 2019

Choose a reason for hiding this comment

Uh oh!

eerhardt Oct 7, 2019

Choose a reason for hiding this comment

Uh oh!

Uh oh!

adamsitnik commented Apr 27, 2020

Uh oh!

eerhardt commented Apr 27, 2020

Uh oh!

Uh oh!

eerhardt left a comment

Choose a reason for hiding this comment

Uh oh!

AndreyAkinshin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

adamsitnik commented Jul 16, 2020

Uh oh!

adamsitnik commented Sep 17, 2019 •

edited

Loading