ENH: more intuitive profiling-target selection #337

TTsangSC · 2025-04-21T11:25:10Z

Synopsis

This PR aims to make the selection of auto-profiling targets more intuitive (see e.g. issue #318) by:

(Updated 18 May; the previously-proposed -e flag is removed) Making -p/--prof-mod "eager" by default, which essentially generates and loads an extra setup module to ensure that the supplied targets are all added to the profiler, regardless of whether they are directly imported by the run script/module or not.
(Added 18 May) The old behavior can be recovered by passing the --no-preimports flag.
Making methods like LineProfiler.add_module() and .add_imported_function_or_module() more aggressive in adding namespace members, instead of just giving up if the member isn't a class or a types.FunctionType; this allows us to leverage PR FIX: (+ENH?) fixed and extended dispatching for LineProfiler.__call__() #332 to profile e.g. class methods and properties in imported modules/classes.
(Added 24 May) Added scope-checking to function(-like) namespace members so that we don't accidentally profile imported functions.
(Added 26 May) Import targets that are packages are recursed/descended into.
(Added 27 May) Added handling of callable wrappers wrapping classes (see use-case); LineProfiler (and by extension, GlobalProfiler) can now also be directly used as a class decorator.
(Added 29 May) kernprof now takes the new flags -v/--verbose (to which --view is now aliased) and -q/--quiet, which allows for either outputting diagnostics or suppressing outputs.
(Added 14 Jun) Merged PRs:
- [wip] Debugging TTsangSC/line_profiler#1 (from @Erotemic)
  - Added logging facilities and backend switching
  - Added "dev-mode" environment-variable switches e.g. for tempfile preservation and diagnostics output
- Eager prof mod tempfile updates TTsangSC/line_profiler#2
  - Now making sure that tempfiles exist when tracebacks are formatted
  - Now removing functions that only exist in tempfiles from the written profiling results
(Late edit; added 7 Jul) Merged PR Review suggestions TTsangSC/line_profiler#3 (from @Erotemic):
- Simplified logging and its output
- Extensive readability refactoring (unnesting functions, isolation of side effects)

Code changes

kernprof.py:
- Updated .__doc__
- find_script():
  Added optional argument exit_on_error so that we don't always have to try: ... except SystemExit: ... when using it
- main():
  - Updated help text for option -p/--prof-mod
  - (Added 18 May) Made -p targets eagerly pre-imported for profiling by default
  - (Added 18 May) Added flag --no-preimports for restoring the old behavior (only profile -p targets that are imported in the executed script/module)
  - (Added 18 May) Now clearing the .enable_count and .disable()-ing the created profiler to further reduce the side effects main() has
  - (Added 26 May) Eager pre-imports now recurse/descend into packages by default; this can be suppressed by specifying the import target as <pkg>.__init__
  - (Added 29 May) Replaced the -v/--view boolean flag with the following:
    - -v/--verbose/--view: increments verbosity
    - -q/--quiet: decrements verbosity
    The verbosity levels are as follows:
    - 2: show diagnostic output (e.g. the pre-import module written, the function call used for running the script)
    - 1: show profiling results (equivalent to the previous --view)
    - 0: default behavior (equivalent to the previous default)
    - -1: suppress kernprof help messages (e.g. Wrote profile results to <filename>)
    - -2: suppress stdout of the executed code
    - -3: suppress stderr of the executed code
  - (Added 15 Jun) Tempfile removal (pre-imports, executed script) now deferred so that tracebacks are properly formatted
  - (Added 15 Jun) Profiling data from functions living in tempfiles are filtered out of the written lprof file, so that users don't see the Could not find file error message when viewing the file with python -m line_profiler
  - (Added 15 Jun) The following "dev-mode" environmental switches now control the behavior of main():
    - ${LINE_PROFILER_DEBUG}: ensures debugging/diagnostic output even at reduced verbosity levels
    - ${LINE_PROFILER_NO_EXEC}: does a "dry-run" without actually writing profiling results or executing the following:
      - The pre-profiling setup file (supplied by -s/--setup)
      - The pre-import file (generated from -p/--prof-mod)
      - The profiled script
    - ${LINE_PROFILER_KEEP_TEMPDIRS}: keeps the tempfiles written (the pre-import file and the profiled script (if supplied via the stdin or the -c flag))
    - ${LINE_PROFILER_STATIC_ANALYSIS}: use only static and path-based analysis (line_profiler.autoprofile.util_static) to handle pre-imports, instead of going through the import system (importlib.util and pkgutil)
- Misc:
  (Updated 19 May) Updated various docstrings to be more sphinx-friendly
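
The verbosity scheme listed under main() can be pictured with a small argparse sketch. This is only an illustration under assumptions: the helper name parse_verbosity and the parser layout are hypothetical and are not taken from kernprof.py; only the flag names and the "increment/decrement" semantics come from the description above.

```python
import argparse


def parse_verbosity(argv):
    """Hypothetical helper: return the net verbosity for a list of CLI args."""
    parser = argparse.ArgumentParser(prog='kernprof-sketch')
    # Each occurrence of -v/--verbose/--view raises the level by one
    parser.add_argument('-v', '--verbose', '--view',
                        action='count', default=0)
    # Each occurrence of -q/--quiet lowers the level by one
    parser.add_argument('-q', '--quiet', action='count', default=0)
    options = parser.parse_args(argv)
    return options.verbose - options.quiet
```

With this layout, `--view` alone lands on level 1 (show profiling results), and `-qqq` lands on level -3.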
line_profiler/__init__.py:
(Added 19 May) Updated various docstrings to be more sphinx-friendly
line_profiler/_logger.py:
(Merged 14 Jun; contributed by @Erotemic) New module for logging facilities
line_profiler/_diagnostics.py:
(Merged 14 Jun; contributed by @Erotemic) New module for "dev-mode" switches, package-wide logging, etc.
line_profiler/autoprofile/eager_preimports.py[i]:
New module for implementing eager pre-imports: explicitly importing all the profiling targets in a generated script to add them to the profiler
- split_dotted_path():
  Function for determining where the module stops and the chained attribute access starts in a dotted path like package.submodule.SomeClass.some_attribute
- write_eager_import_module():
  Function for writing the module which imports the targets and adds them to the profiler
- resolve_profiling_targets():
  (Added 15 Jun) Function for resolving dotted paths into the exact targets to import and pass to the profiler
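
To illustrate what "determining where the module stops and the chained attribute access starts" can mean, here is a rough sketch that goes through the import system. The function name split_dotted_path_sketch and its return convention are assumptions made for illustration; the actual signature of split_dotted_path() is not shown in this thread.

```python
import importlib.util


def split_dotted_path_sketch(dotted_path):
    """
    Hypothetical helper: split 'pkg.mod.Class.attr' into
    ('pkg.mod', 'Class.attr') by finding the longest importable prefix.
    """
    parts = dotted_path.split('.')
    for n in range(len(parts), 0, -1):
        candidate = '.'.join(parts[:n])
        try:
            # Note: find_spec() imports parent packages as a side effect
            spec = importlib.util.find_spec(candidate)
        except (ImportError, AttributeError, ValueError):
            spec = None
        if spec is not None:
            attribute_chain = '.'.join(parts[n:]) or None
            return candidate, attribute_chain
    raise ModuleNotFoundError(f'no importable prefix in {dotted_path!r}')
```

For example, `split_dotted_path_sketch('json.decoder.JSONDecoder.decode')` separates the module `json.decoder` from the attribute chain `JSONDecoder.decode`.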
line_profiler/autoprofile/line_profiler_utils.py[i]::add_imported_function_or_module():
- (Updated 24 May) Added optional argument scoping_policy for limiting what functions/-wrappers, classes, and modules to descend into when they are found as members to other classes and/or modules
- Added optional argument wrap for controlling whether to replace added class and module members with @LineProfiler.wrap_callable wrappers
- Changed return type to int (1 if anything has been added to the profiler, 0 otherwise) for consistency with the .add_callable(), .add_module(), and .add_class() methods
- Refactored to permit the adding of callable wrappers (e.g. class methods and properties), nested classes, etc.
line_profiler/line_profiler_utils.py[i]:
(Added 19 May) New module for utilities
- StringEnum:
  Convenience subclass/backport of enum.StrEnum
line_profiler/line_profiler.py[i]:
- LineProfiler:
  - .wrap_callable():
    (Updated 1 Jun) Now a no-op on C(-ython)-level callables (e.g. the various callable types in the types module that aren't FunctionType or MethodType)
  - .add_callable():
    - Updated return-type annotation
    - (Updated 1 Jun) Now a no-op (and correctly returns 0) on C(-ython)-level callables (e.g. the various callable types in the types module that aren't FunctionType or MethodType)
    - (Added 24 May) Added optional argument guard for controlling what callables to pass onto .add_function()
    - (Added 15 Jun) Added optional argument name for logging purposes
    - (Update 15 Jun) Now writing a debug message to the log for each function object added via .add_function()
  - .add_module():
    - (Updated 24 May; supersedes the previous match_scope) Added optional argument scoping_policy for limiting what functions/-wrappers, classes, and modules to descend into when they are found as members to other classes and/or modules
    - Added optional argument wrap for controlling whether to replace added class and module members with @LineProfiler.wrap_callable wrappers
    - Refactored to permit the adding of callable wrappers (e.g. class methods and properties), nested classes, etc.
    - Added recursion/duplication check so that self- and mutually-referential namespaces don't cause problems
    - (Update 15 Jun) Now writing a debug message to the log for each object which has at least one member added via .add_callable()
  - .add_class():
    New method (shares implementation with .add_module())
  - Misc:
    (Updated 19 May) Updated various docstrings to be more sphinx-friendly
line_profiler/profiler_mixin.py[i]::ByCountProfilerMixin:
- (Updated 27 May) Updated various docstrings to be more sphinx-friendly
- wrap_callable():
  (Updated 27 May) Added handling for (1) classes and (2) callable wrappers (e.g. classmethod) which wrap around classes instead of functions; by extension, LineProfiler and GlobalProfiler can now be used as class decorators
- wrap_class():
  (Added 27 May) New method for wrapping around classes (by wrapping all its locally-defined methods and similar)
- get_underlying_functions():
  (Added 27 May) New class method migrated and refactored from line_profiler/line_profiler.py::_get_underlying_functions()
  - Added handling for recursions
  - Added handling for classes
  - Fixed corner case when a callable object with a C-level .__call__() method is passed
  - (Updated 15 Jun) Added handling for Cython-level callables (treated the same as C-level callables, i.e. not line-profile-able (see Issue Line profiling in Cython seems to be totally broken #200))
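
The idea of using a profiler object as a class decorator can be sketched with a toy stand-in. Everything here is hypothetical: CallCounter is not LineProfiler, it only counts calls, and it only handles plain functions defined in the class body (no classmethods, properties, or nested classes, all of which the real wrap_class() is described as covering).

```python
import functools
import inspect


class CallCounter:
    """Toy stand-in for a profiler; counts calls instead of timing lines."""
    def __init__(self):
        self.counts = {}

    def wrap_function(self, func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            key = func.__qualname__
            self.counts[key] = self.counts.get(key, 0) + 1
            return func(*args, **kwargs)
        return wrapper

    def wrap_class(self, cls):
        # Only look at members defined locally in the class body
        for name, member in list(vars(cls).items()):
            if inspect.isfunction(member):
                setattr(cls, name, self.wrap_function(member))
        return cls

    def __call__(self, obj):
        # Dispatch on the kind of object being decorated
        if inspect.isclass(obj):
            return self.wrap_class(obj)
        return self.wrap_function(obj)


counter = CallCounter()


@counter
class Greeter:
    def greet(self):
        return 'hi'
```

Returning the same class object (rather than a wrapper around it) keeps isinstance() checks and inheritance working as before.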
line_profiler/scoping_policy.py[i]::ScopingPolicy:
(Added 19 May; migrated from line_profiler/line_profiler.py 27 May) New string enum for documenting and implementing the valid values of the scoping_policy parameter
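
As a rough picture of the scope-checking mentioned above (not accidentally profiling imported functions), one simple criterion is to compare a function's __module__ with the name of the module it was found in. The helper names below are hypothetical, and this sketch covers only a strict "defined here" check, not the full set of policies.

```python
import inspect
import os.path
import types


def local_functions(module):
    """Hypothetical helper: functions whose __module__ matches the module."""
    return {name: member for name, member in vars(module).items()
            if inspect.isfunction(member)
            and member.__module__ == module.__name__}


def make_demo_module():
    """Build a throwaway module with one local and one imported function."""
    demo = types.ModuleType('demo_mod')
    # Functions defined via exec() pick up __module__ from the globals'
    # __name__, i.e. 'demo_mod'
    exec('def local():\n    return 1\n', demo.__dict__)
    # Simulates `from os.path import join` inside the module
    demo.join = os.path.join
    return demo
```

Here `local_functions(make_demo_module())` keeps `local` and skips `join`, since the latter reports a different `__module__`.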

Doc changes

docs/source/auto/line_profiler.rst, line_profiler.autoprofile.rst:
(Updated 27 May) Added new doc pages to the indices
docs/source/auto/line_profiler.autoprofile.ast_profile_transformer.rst:
(Updated 26 May) Renamed from the typo-ed ast_profle_transformer.rst (see FIX: auto-profile transformer typo #325), fixing the disappearance of the page from the output HTML
docs/source/auto/line_profiler.autoprofile.run_module.rst, autoprofile.eager_preimports.rst, profile_mixin.rst, scoping_policy.rst:
(Updated 27 May) Added (missing) doc pages for preexisting and new modules

Test-suite changes

tests/test_autoprofile.py:
- test_autoprofile_exec_package(), test_autoprofile_exec_module():
  - (Updated 18 May) Updated in accordance with the new behavior of kernprof -p
  - (Updated 19 May) Refactored to simplify the parametrization signatures
  - (Added 18 May) Added subtests for --no-preimports
  - (Added 26 May) Added subtests for package descent
- test_autoprofile_callable_wrapper_objects():
  New test that the callable wrappers like class methods and properties are added to the profiler on import
tests/test_eager_preimports.py:
New test module for line_profiler/autoprofile/eager_preimports.py
- test_write_eager_import_module_wrong_adder():
  Test that write_eager_import_module() complains about bad adder (callable to be used verbatim to add objects, i.e. 'profile.add_imported_function_or_module') values
- test_written_module_pep8_compliance():
  (Added 29 May) Test that write_eager_import_module() writes a module that is PEP-8-compliant (requires flake8)
- test_written_module_error_handling():
  (Added 29 May) Test the warning/error-raising behavior of write_eager_import_module() and the written module:
  - Targets which can't be resolved (e.g. nonexistent module) result in a generation-time warning
  - Targets which are resolved but don't exist (e.g. nonexistent member of a module) result in an execution-time warning
  - Targets which exist but are pathological (e.g. module which raises an error when being executed/imported) result in:
    - An execution-time warning if it is included indirectly (via descent/recursion), or
    - An exception if it is a direct import target
- test_split_dotted_path_staticity(), test_resolve_profiling_targets_staticity():
  (Added 15 Jun) Test the different behaviors when handling dotted paths with pure static analysis and via the import system
- (Removed 19 May) ~~test_doctest_*()~~
tests/test_explicit_profile.py:
- test_profiler_add_methods():
  New test for the new wrap argument of LineProfiler.add_imported_function_or_module(), .add_module(), and .add_class()
- test_profiler_add_class_recursion_guard():
  New test for the handling of self-/mutually-referential classes
- test_profiler_warn_unwrappable():
  (Added 27 Apr) New test for the warning issued when wrappers around added namespace members cannot be set back to the namespace
- test_profiler_class_scope_matching():
  (Updated 24 May; supersedes the previous test_profiler_scope_matching()) New test for how the scoping_policy argument limits descent into classes
- test_profiler_func_scope_matching() (resp. test_profiler_module_scope_matching()):
  (Added 24 May) Corresponding new tests for scoping_policy and the profiling of functions (resp. descent into modules)
tests/test_line_profiler.py:
- test_profiler_c_callable_no_op():
  (Updated 1 Jun) New test for the no-op on C(-ython)-level callables
- test_class_decorator():
  (Added 27 May) New test for decorating classes
- test_add_class_wrapper():
  (Added 27 May) New test for using .add_callable() on a callable wrapper (classmethod) which wraps around a class instead of a normal function
tests/test_kernprof.py:
- test_kernprof_verbosity():
  (Added 29 May) New test for kernprof's output at different verbosity levels

Conflicts

This PR conflicts with #335 because both made (at times overlapping) modifications to kernprof.py and the test suite. (Should be easy to resolve though since I wrote both.)

Acknowledgements

It was originally proposed by @Erotemic that we rework how kernprof --prof-mod functions.
The extension of the capabilities of LineProfiler.add_imported_function_or_module() builds upon FIX: (+ENH?) fixed and extended dispatching for LineProfiler.__call__() #332 and is inspired by/back-ported from pytest_autoprofile.profiler.LineProfiler.

codecov · 2025-04-21T12:06:38Z

Codecov Report

Attention: Patch coverage is 77.57475% with 135 lines in your changes missing coverage. Please review.

Project coverage is 69.45%. Comparing base (6889534) to head (0b8cc7e).
Report is 75 commits behind head on main.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| line_profiler/profiler_mixin.py | 38.05% | 54 Missing and 16 partials ⚠️ |
| line_profiler/autoprofile/eager_preimports.py | 87.42% | 13 Missing and 9 partials ⚠️ |
| line_profiler/_logger.py | 85.58% | 11 Missing and 5 partials ⚠️ |
| line_profiler/scoping_policy.py | 80.72% | 14 Missing and 2 partials ⚠️ |
| line_profiler/line_profiler_utils.py | 62.50% | 6 Missing ⚠️ |
| line_profiler/line_profiler.py | 96.34% | 1 Missing and 2 partials ⚠️ |
| line_profiler/autoprofile/line_profiler_utils.py | 81.81% | 2 Missing ⚠️ |

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #337      +/-   ##
==========================================
+ Coverage   65.33%   69.45%   +4.12%     
==========================================
  Files          13       18       +5     
  Lines        1073     1611     +538     
  Branches      234      341     +107     
==========================================
+ Hits          701     1119     +418     
- Misses        310      415     +105     
- Partials       62       77      +15

| Files with missing lines | Coverage Δ | |
|---|---|---|
| line_profiler/__init__.py | `100.00% <ø> (ø)` | |
| line_profiler/_diagnostics.py | `100.00% <100.00%> (ø)` | |
| line_profiler/autoprofile/line_profiler_utils.py | `84.61% <81.81%> (+34.61%)` | ⬆️ |
| line_profiler/line_profiler.py | `77.77% <96.34%> (+4.55%)` | ⬆️ |
| line_profiler/line_profiler_utils.py | `62.50% <62.50%> (ø)` | |
| line_profiler/_logger.py | `85.58% <85.58%> (ø)` | |
| line_profiler/scoping_policy.py | `80.72% <80.72%> (ø)` | |
| line_profiler/autoprofile/eager_preimports.py | `87.42% <87.42%> (ø)` | |
| line_profiler/profiler_mixin.py | `40.46% <38.05%> (+1.07%)` | ⬆️ |

... and 4 files with indirect coverage changes

Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update a344928...0b8cc7e.

Erotemic · 2025-04-29T01:25:30Z

Haven't looked at this yet. Busy crunching for a paper deadline. But I did come across a use-case where I attempted to demo auto-profiling but it came up with an error.

I would expect that python -m kernprof -lvr -e calendar -m calendar would be able to demo using the stdlib calendar main, and it somewhat does, but the way we are handling decorators doesn't seem to work with the global enum they use.

However, I tested: python -m kernprof -lvr -e uuid -m uuid, and that did seem to work nicely.

TTsangSC · 2025-04-29T05:27:12Z

It seems that global enum is indeed problematic, but the problem lies deeper than kernprof:

>>> import runpy
>>> runpy.run_module('calendar', {}, '__main__')
Traceback (most recent call last):
  File "<python-input-1>", line 1, in <module>
    runpy.run_module('calendar', {}, '__main__')
    ~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "<frozen runpy>", line 229, in run_module
  File "<frozen runpy>", line 88, in _run_code
  File "/opt/homebrew/Cellar/[email protected]/3.13.2/Frameworks/Python.framework/Versions/3.13/lib/python3.13/calendar.py", line 813, in <module>
    main()
    ~~~~^^
  File "/opt/homebrew/Cellar/[email protected]/3.13.2/Frameworks/Python.framework/Versions/3.13/lib/python3.13/calendar.py", line 800, in main
    result = cal.formatyear(datetime.date.today().year, **optdict)
  File "/opt/homebrew/Cellar/[email protected]/3.13.2/Frameworks/Python.framework/Versions/3.13/lib/python3.13/calendar.py", line 420, in formatyear
    for (i, row) in enumerate(self.yeardays2calendar(theyear, m)):
                              ~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^
  File "/opt/homebrew/Cellar/[email protected]/3.13.2/Frameworks/Python.framework/Versions/3.13/lib/python3.13/calendar.py", line 317, in yeardays2calendar
    months = [self.monthdays2calendar(year, m) for m in Month]
              ~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^
  File "/opt/homebrew/Cellar/[email protected]/3.13.2/Frameworks/Python.framework/Versions/3.13/lib/python3.13/calendar.py", line 289, in monthdays2calendar
    days = list(self.itermonthdays2(year, month))
  File "/opt/homebrew/Cellar/[email protected]/3.13.2/Frameworks/Python.framework/Versions/3.13/lib/python3.13/calendar.py", line 245, in itermonthdays2
    for i, d in enumerate(self.itermonthdays(year, month), self.firstweekday):
                ~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/opt/homebrew/Cellar/[email protected]/3.13.2/Frameworks/Python.framework/Versions/3.13/lib/python3.13/calendar.py", line 233, in itermonthdays
    day1, ndays = monthrange(year, month)
                  ~~~~~~~~~~^^^^^^^^^^^^^
  File "/opt/homebrew/Cellar/[email protected]/3.13.2/Frameworks/Python.framework/Versions/3.13/lib/python3.13/calendar.py", line 172, in monthrange
    ndays = mdays[month] + (month == FEBRUARY and isleap(year))
                                     ^^^^^^^^
NameError: name 'FEBRUARY' is not defined

Since runpy is the standard way of doing the equivalent of python -m in code, I'd say that is problematic behavior either on the part of runpy or enum. Apparently someone got a similar error in the discussion of python/cpython#103935 but got brushed aside since it didn't exactly have to do with the issue at hand...

The problem seems to be that @enum.global_enum does some witchery to sys.modules to retrieve the module whose globals should be modified, instead of going through the stack frames to retrieve it (which of course has its own issues). Since runpy.run_module() doesn't touch sys.modules by default, the enums are inserted into the wrong namespace, hence the error.

Calling runpy.run_module() with alter_sys=True seems to solve the issue; will write a separate PR to fix that later today.
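
A minimal sketch of that workaround, for reference. The helper name run_calendar_main is hypothetical; it runs the stdlib calendar module in-process with alter_sys=True, which temporarily installs the module being run as sys.modules['__main__'] so that code looking itself up through sys.modules finds the right namespace. The sys.argv juggling is only there so that calendar's own argument parser sees a clean command line.

```python
import contextlib
import io
import runpy
import sys


def run_calendar_main(year):
    """Run the equivalent of `python -m calendar <year>`; return its stdout."""
    old_argv = sys.argv
    sys.argv = ['calendar', str(year)]
    buffer = io.StringIO()
    try:
        with contextlib.redirect_stdout(buffer):
            # alter_sys=True swaps in a temporary sys.modules['__main__']
            # (and sys.argv[0]) for the duration of the run
            runpy.run_module('calendar', run_name='__main__', alter_sys=True)
    finally:
        sys.argv = old_argv
    return buffer.getvalue()
```

Whether the plain `runpy.run_module('calendar', {}, '__main__')` call fails depends on the Python version, since it hinges on whether calendar uses @enum.global_enum in that release.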

Erotemic · 2025-05-18T01:19:42Z

Conflicts need to be fixed here.

I'm also not sure of adding both -p and -e. It seems like we might want to just make -p be eager by default and then maybe have a flag for the user to opt-out of preimporting (e.g. --preimport=False or --no-preimport).

TTsangSC · 2025-05-18T01:49:50Z

Fair enough, since the eager behavior is the more intuitive one; will do.

TTsangSC · 2025-05-18T07:27:19Z

Dunno why but the CI tests seem substantially more sluggish on newer Pythons:

| Python version\Platform | `ubuntu` | `macOS` | `windows` |
|---|---|---|---|
| `3.8` | 18.95s | 21.35s | 31.40s |
| `3.10` | 28.72s | 34.74s | 43.88s |
| `3.12` | 31.08s | 33.46s | 50.26s |
| `3.13` | 28.51s | 32.30s | 46.19s |

It's even more egregious on my own machine:

| Python version\Command | `run_tests.py` (i.e. w/ `coverage`) | `pytest` (i.e. w/o `coverage`) |
|---|---|---|
| `3.8` | 16.37s | 8.62s |
| `3.13` | 33.46s | 23.51s |

#327 might be partially to blame since that's extra stuff (though menial) that Python 3.12+ has to do,¹ but it couldn't have been the only reason since 3.10 was also slow...

¹ This brings us back to one of our old discussions: if that becomes problematic (unlikely as it is), we may have to consider migrating line_profiler._line_profiler._sys_monitoring_[de]register() to C(-ython) implementations.

Erotemic · 2025-05-18T18:50:07Z

Hmm, I ran ./run_tests.py twice on main on my local venvs. I got:

3.13: 17.30s, 17.26s
3.12: 17.24s, 17.10s
3.11: 11.97s, 11.86s
3.10: 11.13s, 10.76s
3.9: 11.36s, 11.06s

So, my 3.10 tests were faster than yours. We should add some performance regression tests, but I'm not super worried about a 30 second CI runtime. I deal with projects that have 20 minute-per-run CI times.

If only there was some sort of profiling tool that we could use to test which lines are slower between versions... :)

TTsangSC · 2025-05-18T19:28:10Z

Making line_profiler "self-hosting" – being able to profile itself – will be the holy grail. IDK how doable that will be or if it's even possible (maybe not, due to the highly probable recursion shenanigans), but it also feels somewhat in reach considering the progress we've been making.

I did try using my own plugin (https://gitlab.com/TTsangSC/pytest-autoprofile) to profile the test suite, but there are two major bottlenecks:

For in-process tests, that would imply having more than one profiler active. That is currently not allowed (in 3.12+) thanks to ENH: sys.monitoring compliance #327, but even if we were to be more lax with that and allow all LineProfiler instances to share the sys.monitoring lock:
- We may still run into the aforementioned recursion issue.
- All instances will need to have their trace functions run. Maybe Restore trace callback when the profiler is disabled #334 will help but it's probably not getting us all the way there.
For subprocess tests (the majority of the test suite, at least the part which calls kernprof), there's obviously no way for the profiler to track them.

Maybe I can try to do as coverage does to combine profiling data from different Python processes (see also #219, which IMO seems like a good idea but needs to be developed), but I haven't got that figured out yet, since we need to know when a Python process is started in order to set up the instrumentation. Or maybe I'm overthinking it and we can either:

Set up explicit profiling throughout line_profiler (or a copy thereof) and use LINE_PROFILE=1, or
Generate a good ol' shimmed-in .pth file on-the-fly so that the new process automatically sets up the profiling tooling.

EDIT: a possible solution to the multiple-profiler problem may be that we make the C-level profiler a singleton responsible for all the actual profiling and tracing in the process, and have line_profiler.line_profiler.LineProfiler just:

Be a wrapper over it – which it kinda already is – instead of inheriting from it, and either
Subscribe from each instance to the functions that it is profiling, or
Handle the bookkeeping and filtering of profiling data in each instance's get_stats().

And thanks for the data – I've had the suspicion that it's my machine that's acting odd since we didn't have that kind of slowdown in CI... really hoping that I won't have to do anything more involved than reinstalling my Pythons to fix that.

Erotemic

Finally got to this. I was able to review everything. Once we take care of these comments we can merge and move to the next one.

Erotemic · 2025-05-18T19:06:40Z

kernprof.py

@@ -605,6 +738,9 @@ def _main(options, module=False):
                print(f'{py_exe} -m pstats "{options.outfile}"')
            else:
                print(f'{py_exe} -m line_profiler -rmt "{options.outfile}"')
+        # Fully disable the profiler


Why is this necessary?

Some tests like tests/test_kernprof.py::test_kernprof_sys_restoration() (and a good chunk of tests/test_line_profiler.py) are in-process, and for that it's best that kernprof.py::main() cleans up after itself, or we'll get failures like https://github.com/pyutils/line_profiler/actions/runs/15092575370/job/42423114894 (because the previous profiler instance isn't disabled and still has the sys.monitoring lock).

One can argue that I should've written said test so that it handles the cleanup instead, but since other components in kernprof.py have been imported in other tests, I'd say it's best to treat kernprof as a module (instead of a mere script) and minimize the side effects that its public functions have upon being called.

Ah, that makes a lot of sense. I agree with minimizing side effects and looking at kernprof as a module.

Erotemic · 2025-05-18T19:29:10Z

line_profiler/line_profiler.py

-    def add_module(self, mod):
-        """ Add all the functions in a module and its classes.
+    def _add_namespace(self, duplicate_tracker, namespace, *,
+                       match_scope='none', wrap=False):


Small comment here about how this is used to recursively iterate through the members of a container and wrap profile-able objects according to match_scope.

As a nitpick, I think the duplicate_tracker should be a keyword-only argument that defaults to None, and is initialized to a set if it is None. Then we can get rid of the cryptic empty set instances created in the add_class and add_module functions. I would also recommend calling it "seen", and just using "seen.add" instead of assigning a variable to its add method. That just feels more idiomatic and obvious to me.

Could probably do the same with add_func and wrap_func, as they are only used once. It won't make the lines too long and actually has a minor positive impact on performance as you don't do an attribute access if you don't need it.

Fair enough, the idea was to pre-fetch commonly-used callables into the namespace to avoid attribute access in a tight-ish loop, but then again the two shouldn't really cost that differently unless we have some .__getattribute__() magic (which we don't).
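
The "seen" idiom suggested above might look roughly like the following. This is a simplified sketch, not the PR's _add_namespace(): the function name collect_functions is hypothetical, it only collects plain functions from classes, and it tracks visited namespaces by id() so that self- and mutually-referential classes terminate.

```python
import inspect


def collect_functions(namespace, *, seen=None):
    """Recursively collect plain functions from a class and nested classes."""
    # Keyword-only tracker, created on the outermost call only
    if seen is None:
        seen = set()
    if id(namespace) in seen:
        return []
    seen.add(id(namespace))
    found = []
    for member in vars(namespace).values():
        if inspect.isfunction(member):
            found.append(member)
        elif inspect.isclass(member):
            # Share the same tracker across the whole traversal
            found.extend(collect_functions(member, seen=seen))
    return found
```

Callers never have to pass an empty set themselves; the tracker only shows up in the recursive calls.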

Erotemic · 2025-05-18T19:39:57Z

line_profiler/line_profiler.py

+                                 'none']):
+                Whether (and how) to match the scope of member classes
+                and decide on whether to add them:
+                - 'exact': only add classes defined locally in this


I'm wondering if there is a better name for match_scope. I think a name ending in "_policy" might make it more clear that the right choice for this might depend on context.

I think I have a grasp on what it's doing, but an example might be helpful, as would a note on when you might want to change the policy.

Is it user-facing at all? I only see it used in line_profiler.autoprofile.line_profiler_utils.add_imported_function_or_module, and I don't see how a user could change it. Is this added mostly in case we want to modify behavior later? It's adding quite a bit of complexity for something that seems to always take the value "siblings".

It also might be cleaner and more extensible to define it as a StrEnum, which requires Python 3.11, but we can add a backport in a utils module:

    from enum import Enum


    class StrEnum(str, Enum):
        """
        Minimal string enum that works with Python 3.6+
        """
        def __str__(self):
            return self.value

        @classmethod
        def _missing_(cls, value):
            # Allow case-insensitive lookups
            for member in cls:
                if member.value.lower() == value.lower():
                    return member
            return None

Then we could move the documentation for each into the MatchScope docstr and say refer to :class:MatchScope to make each method docstr more concise.
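
For illustration, this is roughly how such a backport would behave when subclassed. The MatchScope members below are assumptions pieced together from the values mentioned in this thread ('exact', 'siblings', 'none'); the actual enum in the PR may define different or additional members.

```python
from enum import Enum


class StrEnum(str, Enum):
    """Minimal string-enum backport with case-insensitive value lookup."""
    def __str__(self):
        return self.value

    @classmethod
    def _missing_(cls, value):
        # Called by Enum when `value` matches no member exactly
        for member in cls:
            if member.value.lower() == value.lower():
                return member
        return None


class MatchScope(StrEnum):
    """Hypothetical policy enum; member names are assumptions."""
    EXACT = 'exact'
    SIBLINGS = 'siblings'
    NONE = 'none'
```

Since members are also str instances, existing call sites that pass or compare plain strings such as 'siblings' keep working.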

I propose in that case that we change the name to scoping_policy.

As of now end-users (assuming that they only ever use kernprof.py) indeed have no way of changing it since they won't be directly calling LineProfiler.add_class(), LineProfiler.add_module(), or line_profiler.autoprofile.line_profiler_utils.add_imported_function_or_module(), but that can (and maybe should) change with #335. But yes maybe at this moment YAGNI.

StrEnum sounds like a good idea, will do.

I'm ok with scoping_policy. I'm not in love with it, but I can't think of anything better. It might be worth sleeping on it and trying to come up with a more intuitive name. I feel like this arg is going to cause confusion, but I understand why it's useful and I think we should have it.

It makes sense that pyproject.toml will expose the option to the user.

Once we've set up the utils module I guess it will also make sense to update #335 so that line_profiler.cli_utils is merged into that, but that's something for future us to worry about.

Erotemic · 2025-05-18T19:46:19Z

tests/test_eager_preimports.py

@@ -0,0 +1,190 @@
+"""
+Tests for `line_profiler.autoprofile.eager_preimports`.


Wait, are we not running doctests in the CI?

That's a huge oversight on my part if that is the case - or I'm forgetting a reason for not having it. The test requirements contain xdoctest, and I also see --xdoctest in .github/workflows/tests.yml, but it's not in run_tests.py, which it probably should be.

I don't think we should need to add this file. Not sure why it's called test_eager_preimports.py; am I missing something?

Ah yes I missed that we do have doctests in CI via xdoctest. In that case most of this test module is probably unnecessary. However:

The module is for testing line_profiler/autoprofile/eager_preimports.py, hence the name.

There's currently one bona-fide test (test_write_eager_import_module_wrong_adder()) here which isn't a doctest wrapper. If we don't keep the module we might want to think about whither to rehabilitate it.

Ah, I missed that small test at the end. Let's keep test_write_eager_import_module_wrong_adder in this file, and remove everything else.

I should probably update pyproject.toml to add xdoctest by default. I'll do that in a different PR.

Erotemic · 2025-05-18T19:47:01Z

tests/test_explicit_profile.py

@@ -7,6 +8,15 @@
 import ubelt as ub


+@contextlib.contextmanager
+def enter_tmpdir():


Let's make this class-based for my own sanity. I find it too easy to get confused with contextlib decorators.

Ah yes, considering the discussions we've had over #340. Will do.
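
A class-based version could look something like the sketch below. The behavior of the original enter_tmpdir() is not shown in this thread, so this assumes (from the name alone) that it creates a temporary directory, changes into it, and restores the previous working directory on exit; the class name EnterTmpdir is hypothetical.

```python
import os
import tempfile


class EnterTmpdir:
    """Context manager: chdir into a fresh temporary directory."""
    def __init__(self):
        self._tmpdir = None
        self._old_cwd = None

    def __enter__(self):
        self._old_cwd = os.getcwd()
        self._tmpdir = tempfile.TemporaryDirectory()
        os.chdir(self._tmpdir.name)
        return self._tmpdir.name

    def __exit__(self, exc_type, exc_value, traceback):
        # Leave the directory before deleting it (required on Windows)
        os.chdir(self._old_cwd)
        self._tmpdir.cleanup()
        return False  # never swallow exceptions from the with-block
```

Compared with the @contextlib.contextmanager form, the setup and teardown steps sit in separate, explicitly named methods.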

Erotemic · 2025-05-18T20:21:11Z

tests/test_autoprofile.py

+     (False, None, False, True,
+      False, False, False, False, False, False),
+     (True, None, False, True,
+      False, False, False, False, False, False)])


Similar refactor here:

    @pytest.mark.parametrize(
        ['use_kernprof_exec', 'prof_mod', 'extra_args', 'expected_funcs'],
        [
            (False, 'test_mod.submod2,test_mod.subpkg.submod3.add_three',
             ['--no-preimports'], ['add_two']),
            (False, 'test_mod.submod2,test_mod.subpkg.submod3.add_three',
             [], ['add_two', 'add_three', 'add_operator']),
            (False, 'test_mod.submod1', [], ['add_one', 'add_operator']),
            (False, 'test_mod.subpkg.submod4', ['--prof-imports'],
             ['add_one', 'add_two', 'add_four', 'add_operator', '_main']),
            (False, None, ['--prof-imports'], []),
            (True, None, ['--prof-imports'], []),
        ]
    )
    def test_autoprofile_exec_module(use_kernprof_exec, prof_mod,
                                     extra_args, expected_funcs):
        """
        Test the execution of a module.
        """
        temp_dpath = ub.Path(tempfile.mkdtemp())
        _write_demo_module(temp_dpath)

        if use_kernprof_exec:
            args = ['kernprof']
        else:
            args = [sys.executable, '-m', 'kernprof']
        if prof_mod is not None:
            args.extend(['-p', prof_mod])
        args.extend(extra_args)
        args.extend(['-l', '-m', 'test_mod.subpkg.submod4', '1', '2', '3'])
        proc = ub.cmd(args, cwd=temp_dpath, verbose=2)
        print(proc.stdout)
        print(proc.stderr)
        proc.check_returncode()

        prof = temp_dpath / 'test_mod.subpkg.submod4.lprof'

        args = [sys.executable, '-m', 'line_profiler', os.fspath(prof)]
        proc = ub.cmd(args, cwd=temp_dpath)
        raw_output = proc.stdout
        print(raw_output)
        proc.check_returncode()

        all_possible_funcs = ['add_one', 'add_two', 'add_three', 'add_four',
                              'add_operator', '_main']
        for func in all_possible_funcs:
            assert (f'Function: {func}' in raw_output) == (func in expected_funcs)

Great idea, yeah all the positional args in the tests are getting a bit out of hand. Will do.

Erotemic · 2025-05-18T20:21:27Z

tests/test_autoprofile.py

+     (False, None, False, True,
+      False, False, False, False),
+     (True, None, False, True,
+      False, False, False, False)])


This is why I don't like pytest parametrize. It gets so messy. For some reason I'm having a hard time with GitHub's suggestion feature, but I'm thinking a refactor like this might make it slightly easier to read:

@pytest.mark.parametrize(
    ['use_kernprof_exec', 'prof_mod', 'extra_args', 'expected_funcs'],
    [
        (False, 'test_mod.submod1', [], ['add_one', 'add_operator']),
        # By using --no-preimports, only explicitly listed prof_mod is profiled
        (False, 'test_mod.submod1', ['--no-preimports'], ['add_one']),
        (False, 'test_mod.submod2', ['--prof-imports'],
         ['add_two', 'add_operator']),
        (False, 'test_mod', ['--prof-imports'],
         ['add_one', 'add_two', 'add_operator', '_main']),
        # Multiple -p modules without --prof-imports
        (False, ['test_mod', 'test_mod.submod1,test_mod.submod2'], [],
         ['add_one', 'add_two', 'add_operator', '_main']),
        (False, None, ['--prof-imports'], []),
        (True, None, ['--prof-imports'], []),
    ]
)
def test_autoprofile_exec_package(use_kernprof_exec, prof_mod,
                                  extra_args, expected_funcs):
    """
    Test the execution of a package.
    """
    temp_dpath = ub.Path(tempfile.mkdtemp())
    _write_demo_module(temp_dpath)

    if use_kernprof_exec:
        args = ['kernprof']
    else:
        args = [sys.executable, '-m', 'kernprof']
    if prof_mod is not None:
        if isinstance(prof_mod, str):
            prof_mod = [prof_mod]
        for pm in prof_mod:
            args.extend(['-p', pm])
    args.extend(extra_args)
    args.extend(['-l', '-m', 'test_mod', '1', '2', '3'])
    proc = ub.cmd(args, cwd=temp_dpath, verbose=2)
    print(proc.stdout)
    print(proc.stderr)
    proc.check_returncode()

    prof = temp_dpath / 'test_mod.lprof'

    args = [sys.executable, '-m', 'line_profiler', os.fspath(prof)]
    proc = ub.cmd(args, cwd=temp_dpath)
    raw_output = proc.stdout
    print(raw_output)
    proc.check_returncode()

    all_possible_funcs = ['add_one', 'add_two', 'add_operator', '_main']
    for func in all_possible_funcs:
        assert (f'Function: {func}' in raw_output) == (func in expected_funcs)

Double check that I didn't miss something in this refactor. I used ChatGPT to generate it.

Erotemic · 2025-05-18T20:23:22Z

line_profiler/line_profiler.py

+        return count
+
+    @staticmethod
+    def _add_module_filter(mod, match_scope):


Feels weird to have these as staticmethods. Maybe if we change the match_scope to a StrEnum, we can add these as methods there?
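A rough sketch of that idea, assuming `match_scope` takes a small set of string values. The member names and the `get_module_filter()` method are illustrative only, and the `str` mix-in is used because `enum.StrEnum` only exists on Python 3.11+:

```python
import enum


class MatchScope(str, enum.Enum):
    """Hypothetical stand-in for a string-valued `match_scope` option."""
    EXACT = 'exact'
    DESCENDANTS = 'descendants'
    NONE = 'none'

    def get_module_filter(self, mod):
        """
        Return a predicate telling whether a function found in the
        namespace of `mod` counts as in scope under this policy.
        """
        name = mod.__name__

        def origin(func):
            # `__module__` may be None for some callables
            return getattr(func, '__module__', None) or ''

        if self is MatchScope.NONE:
            return lambda func: True
        if self is MatchScope.EXACT:
            return lambda func: origin(func) == name
        # DESCENDANTS: the module itself or anything nested below it
        prefix = name + '.'
        return lambda func: (origin(func) == name
                             or origin(func).startswith(prefix))


# Plain strings (e.g. from the command line) convert to members directly
policy = MatchScope('descendants')
```

With this layout the filtering logic travels with the policy value, so the profiler class no longer needs static helper methods that take the policy as an argument.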

Erotemic · 2025-05-18T20:24:07Z

line_profiler/line_profiler.py

@@ -28,9 +31,28 @@
 # NOTE: This needs to be in sync with ../kernprof.py and __init__.py
 __version__ = '4.3.0'

+# These objects are callables, but are defined in C so we can't handle
+# them anyway
+c_level_callable_types = (types.BuiltinFunctionType,


Data constants should generally be all caps.

I've heard opposing views about this, but yes personally I agree and I do the same in my personal projects. Since you gave your blessing, will do it here.

Erotemic · 2025-05-18T20:24:30Z

kernprof.py

+    if not options.prof_mod:
+        options.no_preimports = True
+    if options.line_by_line and not options.no_preimports:
+        # We assume most items in `.prof_mod` to be import-able without


Let's factor this new block out into its own function to try to minimize how long each individual function is. This could probably be moved into the autoprofile.eager_preimports submodule.

... yeah this is obvious in hindsight, that function was getting a bit too big. Will do.

TTsangSC · 2025-05-19T17:17:41Z

Sorry for taking this long, I kinda got distracted by trying to get sphinx to work.

One thing that I've noticed is that EDIT: while

:py:deco:`profile`

is correctly rendered into @profile, it fails to resolve to either @line_profiler.profile or @line_profiler.explicit_profiler.profile despite my adding the latter to the corresponding files in docs/source/auto/, like

.. E.g. this was in ``docs/source/auto/line_profiler.rst``

.. py:decorator:: line_profiler.profile

    :py:class:`~.GlobalProfiler` instance.

(I've since rolled the changes in the RST files back because the links don't work anyway.) And it wasn't an issue of duplicate references anyway, since even if I were to do

:py:deco:`~.profile`

and only keep one of the two .. py:decorator:: ... blocks (so that there is no ambiguity which the ref should resolve to), it would get rendered into @~.profile and still doesn't get linked to the block. Maybe I should've just given up and used :py:data:...

Erotemic · 2025-05-19T19:21:21Z

Fun,

I found that:

:py:deco:`line_profiler.explicit_profiler.GlobalProfiler`

does work to produce @line_profiler.explicit_profiler.GlobalProfiler, and using ChatGPT I found that:

:py:deco:`profile <line_profiler.explicit_profiler.GlobalProfiler>`

Will allow you to use custom text @profile and overwrite the link location to GlobalProfiler.

Erotemic · 2025-05-22T14:10:34Z

So I'm testing this out in a real-world use-case. And it's definitely working in that if I include enough -p arguments, I get pretty much what I want, but it would be very nice if there was a way to tell -p to also include all submodules if you give it a top-level package name.

Is that something that changing the scope policy could help with?

Here is the example I was playing with: https://gist.github.com/Erotemic/3abd223c55b761b39fc7746ea4307faf

I want to see where the webdataset and wids package are taking time. I didn't write them, but I want to use line-profiler to explore the code that's called based on my example.

Another improvement I see that might help is that when we write the output for a specific method, we give the method name and the file it was from, but we don't report the name of the class that it belongs to. This is separate from this PR, but we should record and report this information. (I think we can get this behavior if we use code.co_qualname instead of code.co_name in the label function in _line_profiler.pyx; Haven't checked if there are unintended consequences yet)

TTsangSC · 2025-05-22T16:53:35Z

Is that something that changing the scope policy could help with?

Currently there are several idiosyncrasies with the scoping_policy system:

We don't EVER descend into other modules found in a namespace. One may argue that e.g.
- scoping_policy='descendants' should imply that if --prof-mod=pkg then --prof-mod=pkg.submod is fully profiled too... and
- For the default scoping_policy='siblings', if --prof-mod=pkg.submod1 and if pkg.submod1 imports pkg.submod2, the latter should be profiled too.
However things can quickly go sideways if we do allow descending into modules, and there probably isn't a way to do this consistently while keeping the default semantics the same as before (when --prof-mod=module is specified and you do indeed import module in the profiled script) – if that's ever a priority.
Another thing is that the scoping_policy only affects classes but not functions, which are unconditionally profiled whenever they're found in a namespace.
Lastly, we don't have a policy corresponding to disabling descension into child namespaces altogether. I contemplated adding one for the longest time (if only for completeness), but I'm not sure what to name it. Perhaps ScopingPolicy.NONE should be renamed ScopingPolicy.YES instead (since we always descend), and this new one should be called ScopingPolicy.NO? ... maybe that's why you weren't too on-board with naming it scoping_policy, and indeed there doesn't seem to be an entirely consistent way to name them.

One fix could be that we provide separate scoping-policy switches for classes, modules, and functions. The current way main does it (and I attempted to clumsily emulate with the one policy 'siblings') is basically {'classes': 'yes', 'module': 'no', 'functions': 'yes'} – note that as with the above bullet point 'yes' means "always descend into the object" and 'no' means "always ignore the object". In that case it will be useful for us to first decide here what defaults make the most sense, then we can update the PR to include that.

a way to tell -p to also include all submodules

Recursive descension into submodules can also be finicky in that

Whether a submodule turns up in the namespace of a parent package depends on whether anything has caused its import.
In most use-cases we probably won't want that for all the targets in --prof-mod but just a few limited ones.

To address the first, we can maybe use pkgutil.iter_modules() to force pre-imports of submodules. (Or again I can try to port some import hooks over from pytest_autoprofile.importers.ProfileModulesImporter ... but this PR is already plenty big and I think it'd be better to leave that for a follow-up PR. That, or maybe the importlib-based stuff can supersede the entirety of line_profiler.autoprofile.eager_preimports.)

And for the second we may want to include a separate flag in kernprof from --prof-mod. Maybe --recursive-prof-mod? In pytest_autoprofile I have a --recursive-autoprof flag which allows both for explicitly providing recursive targets (w/args) and implicitly (w/o args) setting all --always-autoprof (basically my --prof-mod equivalent) targets to be recursive.
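As a rough illustration of the forced pre-import idea, the sketch below walks a package with `pkgutil.iter_modules()` and imports every submodule it finds. The function name `import_submodules()` is hypothetical, and this is not the code used in `line_profiler`:

```python
import importlib
import pkgutil


def import_submodules(name):
    """
    Import `name` and, if it is a package, all of its submodules
    (recursively); return the dotted names that were imported.
    """
    module = importlib.import_module(name)
    names = [module.__name__]
    path = getattr(module, '__path__', None)
    if path is None:
        # A plain module has nothing to descend into
        return names
    prefix = module.__name__ + '.'
    for info in pkgutil.iter_modules(path, prefix=prefix):
        try:
            names.extend(import_submodules(info.name))
        except Exception:
            # Skip submodules that fail to import
            continue
    return names
```

After a call like this, every submodule shows up in its parent package's namespace, so a recursive descent through namespaces no longer depends on what the profiled script happened to import.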

code.co_qualname

That should be all advantages to the end-user and I'm very much on board with it (in another PR as you've suggested). It will probably break some tests here in line_profiler and in downstream tools using it though... but that should be easy to fix.

TTsangSC · 2025-05-22T20:11:26Z

Addendum: just checked and .co_qualname is 3.11+. I guess we can always check PY_VERSION_HEX and use one or the other accordingly...
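At the Python level, the version switch could look something like the sketch below. The real label function lives in `_line_profiler.pyx` and would check `PY_VERSION_HEX` instead; the helper name `code_label()` is made up for illustration:

```python
import sys


def code_label(code):
    """Return the most descriptive name available for a code object."""
    # `sys.hexversion` is the runtime counterpart of `PY_VERSION_HEX`
    if sys.hexversion >= 0x030B0000:  # Python 3.11+
        return code.co_qualname
    return code.co_name
```

On 3.11+ a method is then reported with its class prefix (e.g. `SomeClass.method`), while older interpreters keep the bare function name.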

TTsangSC · 2025-05-24T00:01:01Z

I'm still working on ironing out the stuff we've discussed in the last three comments; will get back to you in a couple hours.

Erotemic · 2025-05-24T00:01:59Z

I'm going to take a shot at writing a patch that will eager import all submodules when the argument to -p is a package without the .__init__ (in which case it will behave like normal). This is basically going to port code over from mkinit that I use to enumerate a package tree and autogenerate __init__.py files. I think that will offer the best balance between power and flexibility.

EDIT: Well, I may not get to the actual patch to this lib, but I can point to the code that can make it happen. It was in xdoctest, not mkinit, and it was already ported to this package in util_static.package_modpaths. We should be able to pass all paths to that and extend the -p argument list with its results. For non-directory packages, they should just return themselves.

TTsangSC · 2025-05-24T00:16:52Z

Sounds interesting, I'll take a look at that, thanks for the pointers.

Any input on the ScopingPolicy semantics (separate policies for functions, classes, and modules; renaming the policies) though? This is where I'm currently at:

    :py:class:`StrEnum` for scoping policies, that is, how it is                        
    decided whether to:                                                                 
                                                                                        
    * Profile a function found in a namespace (a class or a module), and                
    * Descend into nested namespaces so that their methods and functions                
      are profiled,                                                                     
                                                                                        
    when using :py:meth:`LineProfiler.add_class`,                                       
    :py:meth:`LineProfiler.add_module`, and                                             
    :py:func:`~.add_imported_function_or_module()`.                                     
                                                                                
    Available policies are:                                                     
                                                                                
    :py:attr:`ScopingPolicy.EXACT`                                              
        Only profile *functions* found in the namespace fulfilling              
        :py:attr:`ScopingPolicy.CHILDREN` as defined below, without             
        descending into nested namespaces                                       
                                                                                
    :py:attr:`ScopingPolicy.CHILDREN`                                           
        Only profile/descend into *child* objects, which are:                   
                                                                                
        * Classes and functions defined *locally* in the very                   
          module, or in the very class as its "inner classes" and               
          methods                                                               
        * Direct submodules, in case when the namespace is a module             
          object representing a package                                         
                                                                                
    :py:attr:`ScopingPolicy.DESCENDANTS`                                        
        Only profile/descend into *descendant* objects, which are:              
                                                                                
        * Child classes, functions, and modules, as defined above in            
          :py:attr:`ScopingPolicy.CHILDREN`                                     
        * Their child classes, functions, and modules, ...                      
        * ... and so on                                                         
                                                                                
    :py:attr:`ScopingPolicy.SIBLINGS`                                           
        Only profile/descend into *sibling* and descendant objects,             
        which are:                                                              
                                                                                
        * Descendant classes, functions, and modules, as defined above          
          in :py:attr:`ScopingPolicy.DESCENDANTS`                               
        * Classes and functions (and descendants thereof) defined in the        
          same parent namespace to this very class, or in modules (and          
          subpackages and their descendants) sharing a parent package           
          to this very module                                                   
        * Modules (and subpackages and their descendants) sharing a             
          parent package, when the namespace is a module                        
                                                                                
    :py:attr:`ScopingPolicy.NONE`                                               
        Don't check scopes;  profile all functions found in the local           
        namespace of the class/module, and descend into all nested              
        namespaces recursively                                                  
                                                                                
        Note:                                                                   
            This is probably a very bad idea for module scoping;                
            proceed with care.

so the previous EXACT is renamed to CHILDREN, and EXACT now serves as the "don't descend" option that I proposed before.

Erotemic · 2025-05-24T00:24:59Z

I don't think you need siblings. You could just target the common parent. You always have a tree right? So there can't be more than one parent? Examples or a table like what is shown: https://chatgpt.com/share/6831118e-2f34-8002-b371-6233b0c5132c would be helpful.

TTsangSC · 2025-05-24T00:33:18Z

The point for siblings was, e.g.

We're profiling a.api.
a.api imports stuff (say func) from a._impl, which is considered an implementation detail.
But a.api also imports a ton of random stuff like inspect, sys, and functools.

By setting the scoping policy to 'siblings' and passing -p a.api, we can pin-point func which:

Actually originated from the a package, and
Is made a part of the public API in a.api.

Hence the user doesn't need to know where func (and other stuff) comes from, just that it's part of the package and available via a.api. But otherwise you're right in that the other options probably cover most bases, EDIT and maybe the user can just pass -p a.api.func in that case.
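The a.api scenario can be sketched as a filter over a module's namespace that keeps only functions originating from the same parent package. The helper `select_siblings()` is hypothetical and only meant to show the intent of the 'siblings' policy:

```python
import types


def select_siblings(module):
    """
    Yield `(name, func)` pairs for the functions in the namespace of
    `module` that were defined in it or elsewhere in its parent package.
    """
    own_name = module.__name__
    parent = own_name.rpartition('.')[0]  # '' for top-level modules
    for name, obj in vars(module).items():
        if not isinstance(obj, types.FunctionType):
            continue
        origin = obj.__module__ or ''
        if origin == own_name:
            yield name, obj
        elif parent and (origin == parent
                         or origin.startswith(parent + '.')):
            yield name, obj
```

Applied to a module named `a.api`, a function whose `__module__` is `a._impl` passes the filter, whereas helpers imported from `functools` or `inspect` do not.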

line_profiler/autoprofile/eager_preimport.py[i]
    New module for generating a dummy module where the profiling targets are pre-imported, so that they can be added to the profiler; main functionalities:
    - `split_dotted_path()`: split a dotted path into a module part and an attribute part
    - `write_eager_import_module()`: write the text of a module which does all the supplied imports and adds them to the profiler

tests/test_eager_preimports.py
    create_doctest_wrapper(), regularize_doctests()
        New functions to create hooks running doctests, even when `--doctest-modules` or `--xdoctest` is not passed
    test_doctest_*()
        Hook tests for the `line_profiler.autoprofile.eager_preimports` doctest
    test_write_eager_import_module_wrong_adder()
        Test for passing bad `adder` values to `write_eager_import_module()`

kernprof.py
    __doc__
        Updated with the new option
    main()
        Added new option `-e`/`--eager-preimports` for eagerly importing the `--prof-mod` targets, so that they are all unconditionally profiled (where possible) regardless of whether they are imported in the test script/module

line_profiler/autoprofile/eager_preimports.py[i]
    __all__
        New module attribute
    split_dotted_path()
        Added new argument `static` for choosing whether to use static analysis (`line_profiler.autoprofile.util_static.modname_to_modpath()`) or the import system (`importlib.util.find_spec()`) to resolve dotted paths
    resolve_profiling_targets()
        Split from `write_eager_import_module()` for easier testing
    write_eager_import_module()
        - Fixed malformed RST in docstring
        - Added new argument `static` for choosing whether to use static analysis or the import system to:
          - Resolve dotted paths (see `split_dotted_path()`), and
          - Find subpackages and -modules (`~.util_static.package_modpaths()` vs `pkgutil.walk_packages()`)

tests/test_eager_preimports.py
    sample_package()
        - Renamed from `sample_module()`
        - Simplified implementation
    preserve_sys_state(), sample_namespace_package()
        New fixtures
    gen_names()
        New utility function
    test_{split_dotted_path,resolve_profiling_targets}_staticity()
        New tests for how the `static` parameter influences the behavior of `line_profiler.autoprofile.eager_preimports.split_dotted_path()` and `.resolve_profiling_targets()`

kernprof.py::main(), _write_preimports()
    Now choosing whether to use static-only analysis when writing the pre-import module by `line_profiler._diagnostics.STATIC_ANALYSIS`
line_profiler/_diagnostics.py::STATIC_ANALYSIS
    New "dev-mode" variable set by the environment variable `${LINE_PROFILER_STATIC_ANALYSIS}`

line_profiler/line_profiler.py[i]
    LineProfiler.add_callable()
        - New argument `name` for referring to the added object in log messages
        - Now emitting a log message for each of the underlying functions
    LineProfiler.add_class(), .add_module()
        Now emitting a log message if any member in the class/module has been added

line_profiler/profiler_mixin.py[i]
    is_cython_callable()
        - Now a separate function
        - Now checking the name of the object's type instead of doing an instance check, so that we retain compatibility with extension modules built with different Cython versions
    is_c_level_callable()
        Now calls `is_cython_callable()`, instead of hard-coding `type(line_profiler._line_profiler.label)` into `C_LEVEL_CALLABLE_TYPES`

TTsangSC · 2025-06-16T01:15:38Z

@Erotemic

Just got everything merged:

7b8b7ef– abc6818 (i.e. Eager prof mod tempfile updates TTsangSC/line_profiler#2):
- Tempfiles now preserved until the tracebacks are formatted
- Profiling data from functions defined in tempfiles now scrubbed from the written results so as to avoid the Could not find file error message when showing them with python -m line_profiler
12ae3e2, 6488a9a
Integrating features added in [wip] Debugging TTsangSC/line_profiler#1:
- kernprof now uses (and temporarily overrides) line_profiler._diagnostics.log for writing both normal and debugging outputs
- Debug mode, tempfile preservation, and dry-running in kernprof now toggled by the line_profiler._diagnostics switches
4f81a8c– 7288e1a
- Added a static argument to various line_profiler.autoprofile.eager_preimports functions to toggle between target resolution using solely path-based and static analysis or the full import system
- Added corresponding tests
- Added line_profiler._diagnostics.STATIC_ANALYSIS (environmental switch: ${LINE_PROFILER_STATIC_ANALYSIS}) for controlling which is used by kernprof._write_preimports()
7f23e23
Added logging to LineProfiler.add_callable(), .add_module(), and .add_class()
e842d23
Refactored check for Cython callables so as to cover fused-type callables and Cython callables built with different Cython versions
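To illustrate the name-based check mentioned in the last item: the sketch below compares the name of an object's type against the type names that Cython is assumed to give its compiled functions (`cython_function_or_method` and `fused_cython_function`). Both the function name and the exact set of names are assumptions here, not a copy of the code in `profiler_mixin.py`:

```python
# Assumed type names of Cython-compiled callables (regular and fused)
CYTHON_CALLABLE_TYPE_NAMES = frozenset({
    'cython_function_or_method',
    'fused_cython_function',
})


def looks_like_cython_callable(obj):
    """
    Check by type *name* rather than by `isinstance()`, since each
    Cython-built extension module carries its own function type.
    """
    return type(obj).__name__ in CYTHON_CALLABLE_TYPE_NAMES
```

An `isinstance()` check against the type of one particular Cython function only recognizes callables sharing that exact type object, which is why the comparison is done on names instead.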

The static switch has been added in response to our last discussion (comment #1, comment #2), so that we now have completely separate code paths for the two, as demonstrated in the second linked comment. However, I have refrained from making kernprof default to static=True (i.e. using package_modpaths() as you've suggested) owing to the following reasons:

Realistically speaking, unlike when we're AST-rewriting modules, there is no advantage in relying only on static analysis. Yes, going through the import system may cause modules to be imported (e.g. find_spec('foo.bar.baz') would try to import foo and foo.bar) and alter the state of sys.modules, but since we ARE importing and profiling the targets by executing the written pre-import module anyway it isn't really a drawback.
On the other hand, static analysis cannot (nor can it reasonably be expected to) follow common use-cases as are made possible by the import system. Most notably:
- It can't catch customized module lookup made possible via meta-paths (e.g. distutils which is installed via a hook into setuptools).
- It can't handle namespace packages (as illustrated by the test tests/test_eager_preimports.py:: test_resolve_profiling_targets_staticity()).
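For context, resolving a dotted path through the import system can be sketched as below: try progressively shorter prefixes with `importlib.util.find_spec()` until one resolves to a module, and treat the remainder as the attribute part. This is a simplified illustration with a made-up function name, not the actual `split_dotted_path()` implementation:

```python
import importlib.util


def split_dotted_path_sketch(dotted_path):
    """
    Split e.g. 'pkg.mod.Class.method' into ('pkg.mod', 'Class.method');
    the attribute part is None if the whole path is a module.
    """
    parts = dotted_path.split('.')
    for i in range(len(parts), 0, -1):
        candidate = '.'.join(parts[:i])
        try:
            # May import parent packages as a side effect
            spec = importlib.util.find_spec(candidate)
        except (ImportError, AttributeError, ValueError):
            spec = None
        if spec is not None:
            attribute = '.'.join(parts[i:]) or None
            return candidate, attribute
    raise ModuleNotFoundError(f'cannot resolve {dotted_path!r}')
```

Because the lookup goes through the regular import machinery, it also respects meta-path finders and namespace packages, which is the behavior argued for above.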

So I just ended up adding the line_profiler._diagnostics.STATIC_ANALYSIS toggle, which defaults to false. If we ever wanted to override that and fall back to static analysis, we can always use the envvar switch.

Another thing: should we be documenting the envvar switches, or do we not bother since they aren't meant for end-users (and can end up on the chopping board without notice) anyway?

TTsangSC · 2025-06-29T13:18:11Z

Hi, just checking in. Is there any update? Cheers.

Erotemic · 2025-06-29T19:10:55Z

I've had to handle other tasks with higher priority. I haven't forgotten about these PRs. My task list is getting smaller, but not at the rate I would've liked. Reviewing for NeurIPS has taken a lot of my time over the last month, in addition to other unforeseen responsibilities.

When I come back to this, the main thing I'm going to look for is autoprofiling of the entire module (e.g. when I -p ubelt I will expect that every function inside of ubelt is auto-profiled). It doesn't matter if it's done statically or dynamically, but the expansion of all submodules within a package is the important bit. From my brief reading of the comments it looks like you may have that in. If so, then it will be a fairly easy review, mostly about style, efficiency, and safety, but that is the major feature I want to make the next line-profiler release.

Erotemic

I've made a PR that I think addresses a lot of concerns I raised here. Take a look and let me know what you think.

TTsangSC#3

Erotemic · 2025-07-05T18:44:12Z

line_profiler/profiler_mixin.py

+        return cls._get_underlying_functions(func)
+
+    @classmethod
+    def _get_underlying_functions(cls, func, seen=None, stop_at_classes=False):


This function is quite complex, and difficult to reason about. I'm not a huge fan of the check function being defined as a comprehension variable and a variable in the main scope. I suppose lack of comments is also making it hard to get a handle on.

I recently learned about a newish complexity measure called "cognitive complexity", which addresses some issues with cyclomatic complexity. There is a tool complexipy that can score a function, and this gets a complexity rating of 35, which is very large. There's just way too much going on here. I threw it at ChatGPT, and it broke it down into 3 functions: _traverse_class, _get_underlying_functions and _unwrap_function. Not sure if that is the best way to break it down, but I think we do want to clean this up a bit before merging.

check function being defined as a comprehension variable

Not entirely sure about this... do you mean the

if any(check(func) for check in (...)): ...

part? In that case I'm not sure if separating the checks into their own if blocks would enhance readability all that much...

Anyway yeah we should probably break this up into smaller functions/methods.

Erotemic · 2025-07-05T18:59:26Z

kernprof.py

+            printer(*args, **kwargs)
+            logger.debug(sio.getvalue())
+
+    def print_code_block_diagnostics(


I think adding full code output of what we are profiling might be overkill. I also prefer to avoid local scope functions whenever possible. In addition to adding a lot of complexity, they push design away from simplicity. Then, later on these are assigned as attributes of the options namespace, so they leak out of the local scope. It can start to be a nightmare for someone else reading the code.

I think we can just get rid of any code_diagnostics calls.

Erotemic · 2025-07-05T19:01:36Z

kernprof.py

-    options = real_parser.parse_args(args)
+    options = SimpleNamespace(**vars(real_parser.parse_args(args)))
+    # TODO: make flags later where appropriate
+    options.dryrun = diagnostics.NO_EXEC


I'd rather avoid setting attributes on a namespace, again in the name of complexity reduction.

Although after my PR, I think achieving this here is out of scope.

Erotemic · 2025-07-05T19:07:00Z

kernprof.py

    else:
-        return _main(options, module)
+        options.message = logger.info
+    options.diagnostics = print_diagnostics


I really don't like giving the namespace an attribute with the same name as something different in the global namespace. Yes, it is technically distinct, but it makes searching for the name confusing as it means two things.

Erotemic · 2025-07-05T19:48:08Z

kernprof.py

+    temp_mod_path = _touch_tempfile(dir=options.tmpdir,
+                                    prefix='kernprof-eager-preimports-',
+                                    suffix='.py')
+    write_module = functools.partial(


I try to avoid functools.partial when it makes sense. Often I find just making a kwargs dictionary and calling the function explicitly with it is nearly as concise, but you also have the benefit of the function being called directly without any indirection. It's much easier for static analysis tools to reason about the code when you don't have to consider the function itself as a variable.
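A small side-by-side of the two styles, using a made-up `write_module()` function whose signature is purely illustrative:

```python
import functools
import io


def write_module(targets, stream, indent='    ', static=False):
    """Hypothetical writer: emit one import line per target."""
    for target in targets:
        stream.write(f'{indent}import {target}  # static={static}\n')


targets = ['pkg.mod_a', 'pkg.mod_b']

# Style 1: bind the options with functools.partial
out_partial = io.StringIO()
write = functools.partial(write_module, indent='', static=True)
write(targets, out_partial)

# Style 2: keep the options in a dict and call the function directly
out_kwargs = io.StringIO()
write_kwargs = {'indent': '', 'static': True}
write_module(targets, out_kwargs, **write_kwargs)
```

Both produce the same output; in the second style the call site names `write_module` itself, so a reader or a static analysis tool can jump straight to its definition.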

Erotemic · 2025-07-05T19:56:32Z

kernprof.py

+            return
+        return func(*args, **kwargs)
+
+    def dump_filtered_stats(prof, filename):


Can likely move this out of the local scope and just add the tmpdir as an argument.
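One possible shape for such a module-level helper is sketched below. It assumes the timings are a mapping keyed by `(filename, start_lineno, function_name)` tuples; the function names are hypothetical and this is not the code that was eventually merged:

```python
import os


def is_in_dir(filename, dirname):
    """Check whether `filename` lies inside the directory `dirname`."""
    dirname = os.path.realpath(dirname)
    filename = os.path.realpath(filename)
    try:
        return os.path.commonpath([dirname, filename]) == dirname
    except ValueError:
        # E.g. paths on different drives on Windows
        return False


def drop_tempfile_entries(timings, tmpdir):
    """
    Return a copy of `timings` without the entries for functions
    defined in files under `tmpdir`.
    """
    return {key: value for key, value in timings.items()
            if not is_in_dir(key[0], tmpdir)}
```

Taking `tmpdir` as an explicit argument means the helper no longer closes over local state, so it can be defined and tested at module level.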

Erotemic · 2025-07-05T20:59:10Z

I've been able to work through items on my priority list, and I can give this some time.

The first thing I checked is if my webdataset / wids use case was showing me all of the functions being hit, and it is, so I'm very happy here. I think other users of the tool will be as well.

Another thing: should we be documenting the envvar switches, or do we not bother since they aren't meant for end-users (and can end up on the chopping board without notice) anyway?

We should consider environs not part of the stable API. I see no need to document them outside of docstrings in the file they are defined in, and even inside their file, they can remain undocumented.

I'd like to cleanup the code just a little bit more. I've made these changes here: TTsangSC#3

Please take a look. I think just one or two more iterations before we merge this, and it's all cleanup. The feature set I want from this PR looks good and complete.

Review suggestions

Erotemic · 2025-07-06T22:21:25Z

Woo, finally made it. Check to see if any other PRs have conflicts and I'll start looking through those.

TTsangSC · 2025-07-06T22:38:55Z

Thanks. Oof, there are some quick cleanup changes that I intended to push before merging; will write another PR for those.

Since we touched a lot in this PR there's probably a lot of rebasing and resolving to be done in the others. Of course I'll look at all of them over the week but please do tell me if you want me to prioritize any of them and/or if any particular ones catch your interest.

FIX: debug-mode and logging bugs in #337

TTsangSC force-pushed the eager-prof-mod branch from 65b1f46 to c33d62b Compare April 27, 2025 01:26

TTsangSC mentioned this pull request Apr 29, 2025

FIX: kernprof -m: presence of the executed module as sys.modules['__main__'] #339

Merged

TTsangSC force-pushed the eager-prof-mod branch from 49ad5cb to ea44266 Compare May 5, 2025 14:18

TTsangSC force-pushed the eager-prof-mod branch from ea44266 to 17dd5b2 Compare May 17, 2025 17:32

TTsangSC force-pushed the eager-prof-mod branch from 17dd5b2 to 8a649af Compare May 18, 2025 05:06

Erotemic requested changes May 18, 2025

View reviewed changes

TTsangSC force-pushed the eager-prof-mod branch from ca7f552 to 0e56350 Compare May 20, 2025 20:54

Erotemic mentioned this pull request May 23, 2025

Use co_qualname in reporting when possible #345

Merged

TTsangSC mentioned this pull request May 23, 2025

ENH: use multiple profiler instances #347

Merged

TTsangSC added 3 commits May 24, 2025 06:50

TTsangSC added 5 commits June 15, 2025 09:57

TTsangSC mentioned this pull request Jun 19, 2025

FIX: restore Cython compatibility + pivot to sys.monitoring #352

Merged

Erotemic added 9 commits July 5, 2025 15:00

remove code diagnostics

447ab0a

remove code diagnostics

d942680

Simplified logger usage

1b88d97

refactor functools.partial in write_module

6763031

Refactor dump_filtered_stats

e059764

Simplify _write_preimports

161ddf8

Move _call_with_diagnostics to global scope

cad8998

Refactor main to reduce complexity

1fdd0da

Breakup _profile_main into smaller parts to reduce complexity

4a420db

Erotemic requested changes Jul 5, 2025

View reviewed changes

TTsangSC mentioned this pull request Jul 6, 2025

Review suggestions TTsangSC/line_profiler#3

Merged

Merge pull request #3 from Erotemic/review-suggestions

0b8cc7e

Review suggestions

Erotemic merged commit d0d117d into pyutils:main Jul 6, 2025
36 checks passed

TTsangSC deleted the eager-prof-mod branch July 6, 2025 22:39

This was referenced Jul 6, 2025

Post-337 patches: docs, fixes, minor refactoring #353

Merged

ENH: read TOML files for configurations #335

Open

Erotemic added a commit that referenced this pull request Jul 7, 2025

Merge pull request #354 from TTsangSC/logging-fixes

cd09484

FIX: debug-mode and logging bugs in #337

TTsangSC mentioned this pull request Jul 8, 2025

FIX: Update hash tables of affected profiler instances when rewriting code objects #351

Merged


TTsangSC commented Apr 21, 2025 • edited

Synopsis

Code changes

Doc changes

Test-suite changes

Conflicts

Acknowledgements

codecov bot commented Apr 21, 2025 • edited

Codecov Report

Erotemic commented Apr 29, 2025 • edited

TTsangSC commented Apr 29, 2025 • edited

Erotemic commented May 18, 2025

TTsangSC commented May 18, 2025

TTsangSC commented May 18, 2025 • edited

Footnotes

Erotemic commented May 18, 2025

TTsangSC commented May 18, 2025 • edited

Erotemic left a comment

TTsangSC May 18, 2025 • edited

TTsangSC May 18, 2025 • edited

TTsangSC commented May 19, 2025 • edited

Erotemic commented May 19, 2025 • edited

Erotemic commented May 22, 2025 • edited

Erotemic commented May 24, 2025 • edited

TTsangSC commented May 24, 2025 • edited