Parallelize MSBuild, Roslyn analysis, and Neo4j inserts across projects#15
Open
mscottford wants to merge 6 commits intovladbatushkov:mainfrom
Open
Parallelize MSBuild, Roslyn analysis, and Neo4j inserts across projects#15mscottford wants to merge 6 commits intovladbatushkov:mainfrom
mscottford wants to merge 6 commits intovladbatushkov:mainfrom
Conversation
- Targets net10.0 (up from net9.0) - Buildalyzer/Workspaces 7.1.0 → 8.0.0 - Microsoft.CodeAnalysis/CSharp 4.13.0 → 5.3.0 (Analyzers pin removed) - Neo4j.Driver 5.27.0 → 6.0.0 - System.CommandLine 2.0.0-beta4 → 2.0.5 (first stable release) - Dockerfile base image updated to dotnet/sdk:10.0-alpine - Neo4j image updated to 2026.03.1 with corrected env var names - docker-compose.yml: removed obsolete version attribute, replaced hardcoded Windows volume path with relative ./SystemUnderTest path - Code updated for breaking API changes in System.CommandLine 2.0 (SetAction, Options.Add, Aliases.Add, Required, AllowMultipleArgumentsPerToken) and Neo4j.Driver 6.0 (IDriver.CloseAsync removed, use await using) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…from analysis When AddToWorkspace(workspace, addProjectReferences: true) is called for a project, Buildalyzer eagerly adds its referenced projects into the workspace. The previous deduplication check silently dropped those projects on their own loop iteration, causing them to receive no CONTAINS triple and no code analysis. Adds a regression test using a two-project fixture solution where ProjectA references ProjectB — ensuring both appear in the analysis context regardless of workspace insertion order. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
- Stage 1: Build projects in parallel using .AsParallel() in GetAnalysisContext; each p.Build() spawns an isolated MSBuild process so there is no shared state. AdhocWorkspace mutations remain sequential (not thread-safe). - Stage 3: Analyze and insert all projects concurrently using Task.WhenAll with Task.Run so CPU-bound syntax tree walking runs across thread-pool threads. The delete step is extracted to DbManager.DeleteData and run once upfront before the parallel loop to avoid races. - Stage 4: A SemaphoreSlim(4) gates concurrent Neo4j sessions to avoid exhausting the connection pool on large solutions. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
The workspace population step (AddToWorkspace) is the slowest sequential phase on large solutions and previously emitted no output, causing a silent pause between "Building projects - finished." and "done." Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
This was referenced Apr 13, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
GetAnalysisContextnow use.AsParallel(). Eachp.Build()spawns an isolated MSBuild process, so there is no shared mutable state.AdhocWorkspacemutations remain sequential (Roslyn is not thread-safe for concurrent mutations).Analyzeis replaced withTask.WhenAlloverTask.Runtasks, so CPU-bound syntax tree walking runs across thread-pool threads concurrently. The graph delete (--deleteflag) is extracted toDbManager.DeleteDataand called once upfront before the parallel loop to avoid races.SemaphoreSlim(4)gates concurrent Neo4j sessions to avoid exhausting the connection pool on large solutions.AddToWorkspacephase, which was previously a silent pause on large solutions.Test plan
dotnet test— existing regression test should pass🤖 Generated with Claude Code