data or process outputs.


Profiling and Benchmarking
--------------------------

This repository provides tools for profiling and benchmarking Vivarium simulations
to analyze their performance characteristics. See the tutorials at
https://vivarium.readthedocs.io/en/latest/tutorials/running_a_simulation/index.html
and https://vivarium.readthedocs.io/en/latest/tutorials/exploration.html for general
instructions on running simulations with Vivarium.
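To run a single model specification without any profiling, the standard ``simulate``
command-line tool from Vivarium works as usual. A minimal sketch, using the scaling model
specification shipped with this repository (the installation path is illustrative, and the
``-v`` flag enables verbose logging with a message at every time step)::

    (vivarium_profiling) :~$ simulate run -v /<REPO_INSTALLATION_DIRECTORY>/vivarium_profiling/src/vivarium_profiling/model_specifications/model_spec_scaling.yaml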

Configuring Scaling Simulations
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

This repository includes a custom ``MultiComponentParser`` plugin that allows you to
easily create scaling simulations by defining multiple instances of diseases and risks
using a simplified YAML syntax.

To use the parser, add it to your model specification::

    plugins:
        required:
            component_configuration_parser:
                controller: "vivarium_profiling.plugins.parser.MultiComponentParser"

Then use the ``causes`` and ``risks`` multi-config blocks:

**Causes Configuration**

Define multiple disease instances with automatic numbering::

    components:
        causes:
            lower_respiratory_infections:
                number: 4        # Creates 4 disease instances
                duration: 28     # Disease duration in days
                observers: True  # Auto-create DiseaseObserver components

This creates components named ``lower_respiratory_infections_1``,
``lower_respiratory_infections_2``, etc., each with its own observer if enabled.

**Risks Configuration**

Define multiple risk instances and their effects on causes::

    components:
        risks:
            high_systolic_blood_pressure:
                number: 2
                observers: False  # Set False for continuous risks
                affected_causes:
                    lower_respiratory_infections:
                        effect_type: nonloglinear
                        measure: incidence_rate
                        number: 2  # Affects first 2 LRI instances

            unsafe_water_source:
                number: 2
                observers: True  # Set True for categorical risks
                affected_causes:
                    lower_respiratory_infections:
                        effect_type: loglinear
                        number: 2

See ``model_specifications/model_spec_scaling.yaml`` for a complete working example
of a scaling simulation configuration.


Running Benchmark Simulations
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The ``profile_sim`` command profiles runtime and memory usage for a single simulation of a
Vivarium model, given a model specification file. The underlying simulation model can be any
Vivarium-based model, including the scaling simulations described above as well as models
defined in a separate repository. In addition to the standard simulation outputs, it generates
profiling data, the form of which depends on the profiler backend you choose. By default,
runtime profiling is performed with ``cProfile``, but you can also use ``scalene`` for more
detailed call stack analysis.
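A minimal sketch of a profiling run, assuming ``profile_sim`` accepts the model specification
as a positional argument (consult the command's help output for the actual options, including
how to select the ``scalene`` backend)::

    # Assumed invocation: profile one run of the scaling spec with the default cProfile backend
    (vivarium_profiling) :~$ profile_sim src/vivarium_profiling/model_specifications/model_spec_scaling.yaml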

The ``run_benchmark`` command runs multiple iterations of one or more model specifications
in order to compare the results. It requires at least one baseline model (specified as
``model_spec_baseline.yaml``) for comparison, plus any number of 'experiment' models to
benchmark against the baseline, which can be passed via glob patterns. You can separately
configure the number of runs for the baseline and experiment models. The command aggregates
the profiling results and generates summary statistics and visualizations for a default set
of important function calls to help identify performance bottlenecks.
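A sketch of a benchmark run comparing the baseline against the scaling specifications; the
positional arguments and run-count flags below are assumptions about the CLI, not its
documented interface::

    # Assumed invocation: 5 baseline runs compared against 3 runs of each experiment spec
    (vivarium_profiling) :~$ run_benchmark model_spec_baseline.yaml "model_spec_scaling_*.yaml" \
        --baseline-runs 5 --experiment-runs 3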

The command creates a timestamped directory containing:

- ``benchmark_results.csv``: Raw profiling data for each run
- ``summary.csv``: Aggregated statistics (automatically generated)
- ``performance_analysis.png``: Performance charts (automatically generated)
- Additional analysis plots for runtime phases and bottlenecks


Analyzing Benchmark Results
~~~~~~~~~~~~~~~~~~~~~~~~~~~

The ``summarize`` command processes benchmark results and creates visualizations.
This runs automatically after ``run_benchmark``, but can also be run manually
for custom analysis after the fact.

By default, this creates the following files in the specified output directory:

- ``summary.csv``: Aggregated statistics with mean, median, std, min, and max
  for all metrics, plus percent differences from baseline
- ``performance_analysis.png``: Runtime and memory usage comparison charts
- ``runtime_analysis_*.png``: Individual phase runtime charts (setup, run, etc.)
- ``bottleneck_fraction_*.png``: Bottleneck fraction scaling analysis

You can also generate an interactive Jupyter notebook containing the same default plots
and the summary dataframe by passing the ``--nb`` flag; in that case the command also
creates an ``analysis.ipynb`` file in the output directory.
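For example, to regenerate the summary and plots from an existing benchmark directory and
also build the notebook (the results-directory argument is an assumption; only ``--nb`` is
described above)::

    # Assumed invocation: re-summarize prior results and emit analysis.ipynb
    (vivarium_profiling) :~$ summarize <BENCHMARK_RESULTS_DIRECTORY> --nb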


Customizing Result Extraction
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

By default, the benchmarking tools extract standard profiling metrics:

- Simulation phases: setup, initialize_simulants, run, finalize, report
- Common bottlenecks: gather_results, pipeline calls, population views
- Memory usage and total runtime

You can customize which metrics to extract by creating an extraction config YAML file.
See ``extraction_config_example.yaml`` for a complete annotated example.

**Basic Pattern Structure**::

    patterns:
      - name: my_function            # Logical name for the metric
        filename: my_module.py       # Source file containing the function
        function_name: my_function   # Function name to match
        extract_cumtime: true        # Extract cumulative time (default: true)
        extract_percall: false       # Extract time per call (default: false)
        extract_ncalls: false        # Extract number of calls (default: false)

This YAML file can then be passed to the ``run_benchmark`` and ``summarize`` commands
using the ``--extraction_config`` flag. ``summarize`` will automatically create runtime
analysis plots for the specified functions.
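Reusing the hypothetical ``run_benchmark`` invocation sketched above (the positional
arguments are illustrative; only the ``--extraction_config`` flag itself comes from this
repository's description)::

    # Assumed invocation: benchmark with a custom set of extraction patterns
    (vivarium_profiling) :~$ run_benchmark model_spec_baseline.yaml "model_spec_scaling_*.yaml" \
        --extraction_config my_extraction_config.yaml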