Skip to content

Latest commit

 

History

History
4550 lines (4160 loc) · 82.2 KB

bm-20240615-azure-x86_64-python-a19bb261a327e1008f21-3.13.0b2+-a19bb26-pystats-pickle.md

File metadata and controls

4550 lines (4160 loc) · 82.2 KB

Pystats results

  • benchmark: pickle
  • fork: python
  • ref: a19bb261a327e1008f219b62f6465941c981899b
  • commit hash: a19bb26
  • commit date: 2024-06-15T19:10:50+00:00

Execution counts

Execution counts for Tier 1 instructions.

The "miss ratio" column shows the percentage of times the instruction executed that it deoptimized. When this happens, the base unspecialized instruction is not counted.

Name Count Self Cumulative Miss ratio
POP_TOP 53,420 34.8% 34.8%
ENTER_EXECUTOR 42,180 27.5% 62.2%
LOAD_FAST 13,360 8.7% 70.9%
PUSH_NULL 12,800 8.3% 79.3%
LOAD_FAST_LOAD_FAST 12,400 8.1% 87.3%
CALL_BUILTIN_FAST_WITH_KEYWORDS 12,000 7.8% 95.2%
STORE_FAST 1,420 0.9% 96.1%
CALL 960 0.6% 96.7%
FOR_ITER_TUPLE 700 0.5% 97.2%
JUMP_BACKWARD 680 0.4% 97.6%
GET_ITER 480 0.3% 97.9%
FOR_ITER_RANGE 380 0.2% 98.2%
LOAD_GLOBAL_MODULE 360 0.2% 98.4%
LOAD_GLOBAL 280 0.2% 98.6%
LOAD_DEREF 240 0.2% 98.7%
LOAD_ATTR_MODULE 240 0.2% 98.9%
LOAD_ATTR 200 0.1% 99.0%
CALL_NON_PY_GENERAL 180 0.1% 99.1%
RETURN_VALUE 160 0.1% 99.2%
CALL_FUNCTION_EX 160 0.1% 99.3%
RESUME_CHECK 120 0.1% 99.4%
NOP 80 0.1% 99.5%
BUILD_LIST 80 0.1% 99.5%
BUILD_TUPLE 80 0.1% 99.6%
CALL_INTRINSIC_1 80 0.1% 99.6%
COPY_FREE_VARS 80 0.1% 99.7%
FOR_ITER 80 0.1% 99.7%
LIST_EXTEND 80 0.1% 99.8%
BINARY_OP_SUBTRACT_FLOAT 60 0.0% 99.8%
CALL_BUILTIN_CLASS 60 0.0% 99.9%
LOAD_ATTR_WITH_HINT 60 0.0% 99.9%
LOAD_GLOBAL_BUILTIN 60 0.0% 99.9%
BINARY_OP 40 0.0% 100.0%
RESUME 40 0.0% 100.0%

Pair counts

Pair counts for top 100 opcode pairs

Pairs of specialized operations that deoptimize and are then followed by the corresponding unspecialized instruction are not counted as pairs.

Pair Count Self Cumulative
ENTER_EXECUTOR POP_TOP 40,920 26.6% 26.6%
POP_TOP ENTER_EXECUTOR 40,900 26.6% 53.3%
PUSH_NULL LOAD_FAST_LOAD_FAST 12,400 8.1% 61.3%
LOAD_FAST PUSH_NULL 12,400 8.1% 69.4%
CALL_BUILTIN_FAST_WITH_KEYWORDS POP_TOP 12,000 7.8% 77.2%
POP_TOP LOAD_FAST 11,780 7.7% 84.9%
LOAD_FAST_LOAD_FAST CALL_BUILTIN_FAST_WITH_KEYWORDS 11,600 7.6% 92.4%
ENTER_EXECUTOR ENTER_EXECUTOR 1,260 0.8% 93.3%
STORE_FAST LOAD_FAST 1,260 0.8% 94.1%
LOAD_FAST_LOAD_FAST CALL 800 0.5% 94.6%
FOR_ITER_TUPLE STORE_FAST 600 0.4% 95.0%
POP_TOP JUMP_BACKWARD 580 0.4% 95.4%
LOAD_FAST GET_ITER 480 0.3% 95.7%
CALL POP_TOP 420 0.3% 96.0%
CALL CALL_BUILTIN_FAST_WITH_KEYWORDS 400 0.3% 96.2%
GET_ITER FOR_ITER_TUPLE 380 0.2% 96.5%
FOR_ITER_RANGE STORE_FAST 380 0.2% 96.7%
JUMP_BACKWARD FOR_ITER_RANGE 300 0.2% 96.9%
JUMP_BACKWARD FOR_ITER_TUPLE 300 0.2% 97.1%
LOAD_ATTR_MODULE PUSH_NULL 180 0.1% 97.2%
PUSH_NULL LOAD_FAST 160 0.1% 97.3%
LOAD_DEREF PUSH_NULL 160 0.1% 97.4%
PUSH_NULL CALL 120 0.1% 97.5%
PUSH_NULL CALL_NON_PY_GENERAL 120 0.1% 97.6%
LOAD_GLOBAL LOAD_GLOBAL_MODULE 120 0.1% 97.7%
LOAD_GLOBAL_MODULE LOAD_ATTR_MODULE 120 0.1% 97.7%
FOR_ITER_TUPLE JUMP_BACKWARD 100 0.1% 97.8%
NOP LOAD_DEREF 80 0.1% 97.9%
POP_TOP NOP 80 0.1% 97.9%
RETURN_VALUE RETURN_VALUE 80 0.1% 98.0%
BUILD_LIST LOAD_DEREF 80 0.1% 98.0%
BUILD_TUPLE STORE_FAST 80 0.1% 98.1%
CALL_FUNCTION_EX COPY_FREE_VARS 80 0.1% 98.1%
CALL_INTRINSIC_1 CALL_FUNCTION_EX 80 0.1% 98.2%
LIST_EXTEND CALL_INTRINSIC_1 80 0.1% 98.2%
LOAD_ATTR LOAD_ATTR_MODULE 80 0.1% 98.3%
LOAD_DEREF LIST_EXTEND 80 0.1% 98.3%
LOAD_FAST BUILD_LIST 80 0.1% 98.4%
LOAD_FAST CALL_FUNCTION_EX 80 0.1% 98.4%
LOAD_FAST LOAD_ATTR 80 0.1% 98.5%
STORE_FAST LOAD_GLOBAL 80 0.1% 98.5%
STORE_FAST LOAD_GLOBAL_MODULE 80 0.1% 98.6%
LOAD_GLOBAL_MODULE LOAD_GLOBAL_MODULE 80 0.1% 98.6%
GET_ITER FOR_ITER_RANGE 60 0.0% 98.7%
CALL CALL_NON_PY_GENERAL 60 0.0% 98.7%
CALL_FUNCTION_EX RESUME_CHECK 60 0.0% 98.8%
COPY_FREE_VARS RESUME_CHECK 60 0.0% 98.8%
LOAD_ATTR PUSH_NULL 60 0.0% 98.8%
LOAD_GLOBAL LOAD_ATTR 60 0.0% 98.9%
BINARY_OP_SUBTRACT_FLOAT RETURN_VALUE 60 0.0% 98.9%
CALL_BUILTIN_CLASS STORE_FAST 60 0.0% 99.0%
CALL_NON_PY_GENERAL POP_TOP 60 0.0% 99.0%
CALL_NON_PY_GENERAL LOAD_FAST 60 0.0% 99.0%
CALL_NON_PY_GENERAL STORE_FAST 60 0.0% 99.1%
LOAD_ATTR_MODULE STORE_FAST 60 0.0% 99.1%
LOAD_ATTR_WITH_HINT STORE_FAST 60 0.0% 99.2%
LOAD_GLOBAL_BUILTIN LOAD_FAST 60 0.0% 99.2%
LOAD_GLOBAL_MODULE BUILD_TUPLE 60 0.0% 99.2%
LOAD_GLOBAL_MODULE LOAD_ATTR 60 0.0% 99.3%
RESUME_CHECK LOAD_DEREF 60 0.0% 99.3%
GET_ITER FOR_ITER 40 0.0% 99.3%
POP_TOP LOAD_GLOBAL 40 0.0% 99.4%
POP_TOP LOAD_GLOBAL_MODULE 40 0.0% 99.4%
RETURN_VALUE LOAD_GLOBAL 40 0.0% 99.4%
RETURN_VALUE LOAD_GLOBAL_MODULE 40 0.0% 99.4%
CALL STORE_FAST 40 0.0% 99.5%
FOR_ITER STORE_FAST 40 0.0% 99.5%
JUMP_BACKWARD FOR_ITER 40 0.0% 99.5%
LOAD_ATTR STORE_FAST 40 0.0% 99.5%
LOAD_FAST BINARY_OP 40 0.0% 99.6%
LOAD_FAST CALL 40 0.0% 99.6%
LOAD_FAST BINARY_OP_SUBTRACT_FLOAT 40 0.0% 99.6%
LOAD_FAST CALL_BUILTIN_CLASS 40 0.0% 99.6%
LOAD_FAST LOAD_ATTR_MODULE 40 0.0% 99.7%
LOAD_FAST LOAD_ATTR_WITH_HINT 40 0.0% 99.7%
LOAD_GLOBAL LOAD_GLOBAL 40 0.0% 99.7%
LOAD_GLOBAL_MODULE LOAD_GLOBAL 40 0.0% 99.8%
RESUME_CHECK LOAD_GLOBAL_BUILTIN 40 0.0% 99.8%
BINARY_OP RETURN_VALUE 20 0.0% 99.8%
BINARY_OP BINARY_OP_SUBTRACT_FLOAT 20 0.0% 99.8%
CALL LOAD_FAST 20 0.0% 99.8%
CALL CALL_BUILTIN_CLASS 20 0.0% 99.8%
CALL_FUNCTION_EX RESUME 20 0.0% 99.8%
COPY_FREE_VARS RESUME 20 0.0% 99.9%
FOR_ITER FOR_ITER_RANGE 20 0.0% 99.9%
FOR_ITER FOR_ITER_TUPLE 20 0.0% 99.9%
JUMP_BACKWARD POP_TOP 20 0.0% 99.9%
JUMP_BACKWARD ENTER_EXECUTOR 20 0.0% 99.9%
LOAD_ATTR LOAD_ATTR_WITH_HINT 20 0.0% 99.9%
LOAD_GLOBAL BUILD_TUPLE 20 0.0% 99.9%
LOAD_GLOBAL LOAD_FAST 20 0.0% 99.9%
LOAD_GLOBAL LOAD_GLOBAL_BUILTIN 20 0.0% 100.0%
RESUME LOAD_DEREF 20 0.0% 100.0%
RESUME LOAD_GLOBAL 20 0.0% 100.0%
RESUME_CHECK LOAD_GLOBAL 20 0.0% 100.0%

Predecessor/Successor Pairs

Top 5 predecessors and successors of each Tier 1 opcode.

This does not include the unspecialized instructions that occur after a specialized instruction deoptimizes.

GET_ITER

Successors and predecessors for GET_ITER
Predecessors Count Percentage
LOAD_FAST 480 100.0%
Successors Count Percentage
FOR_ITER_TUPLE 380 79.2%
FOR_ITER_RANGE 60 12.5%
FOR_ITER 40 8.3%

NOP

Successors and predecessors for NOP
Predecessors Count Percentage
POP_TOP 80 100.0%
Successors Count Percentage
LOAD_DEREF 80 100.0%

POP_TOP

Successors and predecessors for POP_TOP
Predecessors Count Percentage
ENTER_EXECUTOR 40,920 76.6%
CALL_BUILTIN_FAST_WITH_KEYWORDS 12,000 22.5%
CALL 420 0.8%
CALL_NON_PY_GENERAL 60 0.1%
JUMP_BACKWARD 20 0.0%
Successors Count Percentage
ENTER_EXECUTOR 40,900 76.6%
LOAD_FAST 11,780 22.1%
JUMP_BACKWARD 580 1.1%
NOP 80 0.1%
LOAD_GLOBAL 40 0.1%

PUSH_NULL

Successors and predecessors for PUSH_NULL
Predecessors Count Percentage
LOAD_FAST 12,400 96.9%
LOAD_ATTR_MODULE 180 1.4%
LOAD_DEREF 160 1.2%
LOAD_ATTR 60 0.5%
Successors Count Percentage
LOAD_FAST_LOAD_FAST 12,400 96.9%
LOAD_FAST 160 1.2%
CALL 120 0.9%
CALL_NON_PY_GENERAL 120 0.9%

RETURN_VALUE

Successors and predecessors for RETURN_VALUE
Predecessors Count Percentage
RETURN_VALUE 80 50.0%
BINARY_OP_SUBTRACT_FLOAT 60 37.5%
BINARY_OP 20 12.5%
Successors Count Percentage
RETURN_VALUE 80 50.0%
LOAD_GLOBAL 40 25.0%
LOAD_GLOBAL_MODULE 40 25.0%

BINARY_OP

Successors and predecessors for BINARY_OP
Predecessors Count Percentage
LOAD_FAST 40 100.0%
Successors Count Percentage
RETURN_VALUE 20 50.0%
BINARY_OP_SUBTRACT_FLOAT 20 50.0%

BUILD_LIST

Successors and predecessors for BUILD_LIST
Predecessors Count Percentage
LOAD_FAST 80 100.0%
Successors Count Percentage
LOAD_DEREF 80 100.0%

BUILD_TUPLE

Successors and predecessors for BUILD_TUPLE
Predecessors Count Percentage
LOAD_GLOBAL_MODULE 60 75.0%
LOAD_GLOBAL 20 25.0%
Successors Count Percentage
STORE_FAST 80 100.0%

CALL

Successors and predecessors for CALL
Predecessors Count Percentage
LOAD_FAST_LOAD_FAST 800 83.3%
PUSH_NULL 120 12.5%
LOAD_FAST 40 4.2%
Successors Count Percentage
POP_TOP 420 43.8%
CALL_BUILTIN_FAST_WITH_KEYWORDS 400 41.7%
CALL_NON_PY_GENERAL 60 6.2%
STORE_FAST 40 4.2%
LOAD_FAST 20 2.1%

CALL_FUNCTION_EX

Successors and predecessors for CALL_FUNCTION_EX
Predecessors Count Percentage
CALL_INTRINSIC_1 80 50.0%
LOAD_FAST 80 50.0%
Successors Count Percentage
COPY_FREE_VARS 80 50.0%
RESUME_CHECK 60 37.5%
RESUME 20 12.5%

CALL_INTRINSIC_1

Successors and predecessors for CALL_INTRINSIC_1
Predecessors Count Percentage
LIST_EXTEND 80 100.0%
Successors Count Percentage
CALL_FUNCTION_EX 80 100.0%

COPY_FREE_VARS

Successors and predecessors for COPY_FREE_VARS
Predecessors Count Percentage
CALL_FUNCTION_EX 80 100.0%
Successors Count Percentage
RESUME_CHECK 60 75.0%
RESUME 20 25.0%

ENTER_EXECUTOR

Successors and predecessors for ENTER_EXECUTOR
Predecessors Count Percentage
POP_TOP 40,900 97.0%
ENTER_EXECUTOR 1,260 3.0%
JUMP_BACKWARD 20 0.0%
Successors Count Percentage
POP_TOP 40,920 97.0%
ENTER_EXECUTOR 1,260 3.0%

FOR_ITER

Successors and predecessors for FOR_ITER
Predecessors Count Percentage
GET_ITER 40 50.0%
JUMP_BACKWARD 40 50.0%
Successors Count Percentage
STORE_FAST 40 50.0%
FOR_ITER_RANGE 20 25.0%
FOR_ITER_TUPLE 20 25.0%

JUMP_BACKWARD

Successors and predecessors for JUMP_BACKWARD
Predecessors Count Percentage
POP_TOP 580 85.3%
FOR_ITER_TUPLE 100 14.7%
Successors Count Percentage
FOR_ITER_RANGE 300 44.1%
FOR_ITER_TUPLE 300 44.1%
FOR_ITER 40 5.9%
POP_TOP 20 2.9%
ENTER_EXECUTOR 20 2.9%

LIST_EXTEND

Successors and predecessors for LIST_EXTEND
Predecessors Count Percentage
LOAD_DEREF 80 100.0%
Successors Count Percentage
CALL_INTRINSIC_1 80 100.0%

LOAD_ATTR

Successors and predecessors for LOAD_ATTR
Predecessors Count Percentage
LOAD_FAST 80 40.0%
LOAD_GLOBAL 60 30.0%
LOAD_GLOBAL_MODULE 60 30.0%
Successors Count Percentage
LOAD_ATTR_MODULE 80 40.0%
PUSH_NULL 60 30.0%
STORE_FAST 40 20.0%
LOAD_ATTR_WITH_HINT 20 10.0%

LOAD_DEREF

Successors and predecessors for LOAD_DEREF
Predecessors Count Percentage
NOP 80 33.3%
BUILD_LIST 80 33.3%
RESUME_CHECK 60 25.0%
RESUME 20 8.3%
Successors Count Percentage
PUSH_NULL 160 66.7%
LIST_EXTEND 80 33.3%

LOAD_FAST

Successors and predecessors for LOAD_FAST
Predecessors Count Percentage
POP_TOP 11,780 88.2%
STORE_FAST 1,260 9.4%
PUSH_NULL 160 1.2%
CALL_NON_PY_GENERAL 60 0.4%
LOAD_GLOBAL_BUILTIN 60 0.4%
Successors Count Percentage
PUSH_NULL 12,400 92.8%
GET_ITER 480 3.6%
BUILD_LIST 80 0.6%
CALL_FUNCTION_EX 80 0.6%
LOAD_ATTR 80 0.6%

LOAD_FAST_LOAD_FAST

Successors and predecessors for LOAD_FAST_LOAD_FAST
Predecessors Count Percentage
PUSH_NULL 12,400 100.0%
Successors Count Percentage
CALL_BUILTIN_FAST_WITH_KEYWORDS 11,600 93.5%
CALL 800 6.5%

LOAD_GLOBAL

Successors and predecessors for LOAD_GLOBAL
Predecessors Count Percentage
STORE_FAST 80 28.6%
POP_TOP 40 14.3%
RETURN_VALUE 40 14.3%
LOAD_GLOBAL 40 14.3%
LOAD_GLOBAL_MODULE 40 14.3%
Successors Count Percentage
LOAD_GLOBAL_MODULE 120 42.9%
LOAD_ATTR 60 21.4%
LOAD_GLOBAL 40 14.3%
BUILD_TUPLE 20 7.1%
LOAD_FAST 20 7.1%

STORE_FAST

Successors and predecessors for STORE_FAST
Predecessors Count Percentage
FOR_ITER_TUPLE 600 42.3%
FOR_ITER_RANGE 380 26.8%
BUILD_TUPLE 80 5.6%
CALL_BUILTIN_CLASS 60 4.2%
CALL_NON_PY_GENERAL 60 4.2%
Successors Count Percentage
LOAD_FAST 1,260 88.7%
LOAD_GLOBAL 80 5.6%
LOAD_GLOBAL_MODULE 80 5.6%

RESUME

Successors and predecessors for RESUME
Predecessors Count Percentage
CALL_FUNCTION_EX 20 50.0%
COPY_FREE_VARS 20 50.0%
Successors Count Percentage
LOAD_DEREF 20 50.0%
LOAD_GLOBAL 20 50.0%

BINARY_OP_SUBTRACT_FLOAT

Successors and predecessors for BINARY_OP_SUBTRACT_FLOAT
Predecessors Count Percentage
LOAD_FAST 40 66.7%
BINARY_OP 20 33.3%
Successors Count Percentage
RETURN_VALUE 60 100.0%

CALL_BUILTIN_CLASS

Successors and predecessors for CALL_BUILTIN_CLASS
Predecessors Count Percentage
LOAD_FAST 40 66.7%
CALL 20 33.3%
Successors Count Percentage
STORE_FAST 60 100.0%

CALL_BUILTIN_FAST_WITH_KEYWORDS

Successors and predecessors for CALL_BUILTIN_FAST_WITH_KEYWORDS
Predecessors Count Percentage
LOAD_FAST_LOAD_FAST 11,600 96.7%
CALL 400 3.3%
Successors Count Percentage
POP_TOP 12,000 100.0%

CALL_NON_PY_GENERAL

Successors and predecessors for CALL_NON_PY_GENERAL
Predecessors Count Percentage
PUSH_NULL 120 66.7%
CALL 60 33.3%
Successors Count Percentage
POP_TOP 60 33.3%
LOAD_FAST 60 33.3%
STORE_FAST 60 33.3%

FOR_ITER_RANGE

Successors and predecessors for FOR_ITER_RANGE
Predecessors Count Percentage
JUMP_BACKWARD 300 78.9%
GET_ITER 60 15.8%
FOR_ITER 20 5.3%
Successors Count Percentage
STORE_FAST 380 100.0%

FOR_ITER_TUPLE

Successors and predecessors for FOR_ITER_TUPLE
Predecessors Count Percentage
GET_ITER 380 54.3%
JUMP_BACKWARD 300 42.9%
FOR_ITER 20 2.9%
Successors Count Percentage
STORE_FAST 600 85.7%
JUMP_BACKWARD 100 14.3%

LOAD_ATTR_MODULE

Successors and predecessors for LOAD_ATTR_MODULE
Predecessors Count Percentage
LOAD_GLOBAL_MODULE 120 50.0%
LOAD_ATTR 80 33.3%
LOAD_FAST 40 16.7%
Successors Count Percentage
PUSH_NULL 180 75.0%
STORE_FAST 60 25.0%

LOAD_ATTR_WITH_HINT

Successors and predecessors for LOAD_ATTR_WITH_HINT
Predecessors Count Percentage
LOAD_FAST 40 66.7%
LOAD_ATTR 20 33.3%
Successors Count Percentage
STORE_FAST 60 100.0%

LOAD_GLOBAL_BUILTIN

Successors and predecessors for LOAD_GLOBAL_BUILTIN
Predecessors Count Percentage
RESUME_CHECK 40 66.7%
LOAD_GLOBAL 20 33.3%
Successors Count Percentage
LOAD_FAST 60 100.0%

LOAD_GLOBAL_MODULE

Successors and predecessors for LOAD_GLOBAL_MODULE
Predecessors Count Percentage
LOAD_GLOBAL 120 33.3%
STORE_FAST 80 22.2%
LOAD_GLOBAL_MODULE 80 22.2%
POP_TOP 40 11.1%
RETURN_VALUE 40 11.1%
Successors Count Percentage
LOAD_ATTR_MODULE 120 33.3%
LOAD_GLOBAL_MODULE 80 22.2%
BUILD_TUPLE 60 16.7%
LOAD_ATTR 60 16.7%
LOAD_GLOBAL 40 11.1%

RESUME_CHECK

Successors and predecessors for RESUME_CHECK
Predecessors Count Percentage
CALL_FUNCTION_EX 60 50.0%
COPY_FREE_VARS 60 50.0%
Successors Count Percentage
LOAD_DEREF 60 50.0%
LOAD_GLOBAL_BUILTIN 40 33.3%
LOAD_GLOBAL 20 16.7%

Specialization stats

Specialization stats by family

BINARY_OP

specialization stats for BINARY_OP family
Kind Count Ratio
deferred

Lists the number of "deferred" (i.e. not specialized) instructions executed.

20 20.0%
hit

Specialized instructions that complete.

60 60.0%
Success Count Ratio
Success 20 100.0%
Failure 0 0.0%

CALL

specialization stats for CALL family
Kind Count Ratio
deferred

Lists the number of "deferred" (i.e. not specialized) instructions executed.

480 3.7%
hit

Specialized instructions that complete.

12,060 92.6%
Success Count Ratio
Success 480 100.0%
Failure 0 0.0%

FOR_ITER

specialization stats for FOR_ITER family
Kind Count Ratio
deferred

Lists the number of "deferred" (i.e. not specialized) instructions executed.

40 3.4%
hit

Specialized instructions that complete.

1,080 93.1%
Success Count Ratio
Success 40 100.0%
Failure 0 0.0%

LOAD_ATTR

specialization stats for LOAD_ATTR family
Kind Count Ratio
deferred

Lists the number of "deferred" (i.e. not specialized) instructions executed.

100 20.0%
hit

Specialized instructions that complete.

300 60.0%
Success Count Ratio
Success 100 100.0%
Failure 0 0.0%

LOAD_GLOBAL

specialization stats for LOAD_GLOBAL family
Kind Count Ratio
deferred

Lists the number of "deferred" (i.e. not specialized) instructions executed.

140 20.0%
hit

Specialized instructions that complete.

420 60.0%
Success Count Ratio
Success 140 100.0%
Failure 0 0.0%

Specialization effectiveness

specialization effectiveness

All entries are execution counts. Should add up to the total number of Tier 1 instructions executed.

Instructions Count Ratio
Basic

Instructions that are not and cannot be specialized, e.g. LOAD_FAST.

137,820 89.7%
Not specialized

Instructions that could be specialized but aren't, e.g. LOAD_ATTR, BINARY_SLICE.

1,560 1.0%
Specialized hits

Specialized instructions, e.g. LOAD_ATTR_MODULE that complete.

14,220 9.3%
Specialized misses

Specialized instructions, e.g. LOAD_ATTR_MODULE that deopt.

0 0.0%

Deferred by instruction

Breakdown of deferred (not specialized) instruction counts by family
Name Count Ratio
CALL 480 61.5%
LOAD_GLOBAL 140 17.9%
LOAD_ATTR 100 12.8%
FOR_ITER 40 5.1%
BINARY_OP 20 2.6%
BINARY_SLICE 0 0.0%
STORE_SLICE 0 0.0%
BINARY_SUBSCR 0 0.0%
GET_ITER 0 0.0%
NOP 0 0.0%

Misses by instruction

Breakdown of misses (specialized deopts) instruction counts by family

Call stats

Inlined calls and frame stats

This shows what fraction of calls to Python functions are inlined (i.e. not having a call at the C level) and for those that are not, where the call comes from. The various categories overlap.

Also includes the count of frame objects created.

Count Ratio
Calls to PyEval_EvalDefault 0 0.0%
Calls to Python functions inlined 160 100.0%
Calls via PyEval_EvalFrame (total) 0 0.0%
Calls via PyEval_EvalFrame (vector) 0 0.0%
Calls via PyEval_EvalFrame (generator) 0 0.0%
Calls via PyEval_EvalFrame (legacy) 0 0.0%
Calls via PyEval_EvalFrame (function vectorcall) 0 0.0%
Calls via PyEval_EvalFrame (build class) 0 0.0%
Calls via PyEval_EvalFrame (slot) 0 0.0%
Calls via PyEval_EvalFrame (function ex) 160 100.0%
Calls via PyEval_EvalFrame (api) 0 0.0%
Calls via PyEval_EvalFrame (method) 0 0.0%
Frame objects created 0 0.0%
Frames pushed 160 100.0%

Object stats

Allocations, frees and dict materializatons

Below, "allocations" means "allocations that are not from a freelist". Total allocations = "Allocations from freelist" + "Allocations".

"Inline values" is the number of values arrays inlined into objects.

The cache hit/miss numbers are for the MRO cache, split into dunder and other names.

Count Ratio
Allocations from freelist 4,096,400 15.1%
Frees to freelist 4,096,340
Allocations 22,999,340 84.9%
Allocations to 512 bytes 16,445,700 60.7%
Allocations to 4 kbytes 3,276,800 12.1%
Allocations over 4 kbytes 3,276,840 12.1%
Frees 22,997,255
Inline values 0
Interpreter increfs 5,162,620 2.2%
Interpreter decrefs 7,682,100 3.0%
Increfs 232,079,793 97.8%
Decrefs 246,823,411 97.0%
Materialize dict (on request) 0
Materialize dict (new key) 0
Materialize dict (too big) 0
Materialize dict (str subclass) 0
Method cache hits 5,530,188
Method cache misses 1,023,532
Method cache collisions 1,176,566
Method cache dunder hits 7,219,558
Method cache dunder misses 153,242

GC stats

GC collections and effectiveness

Collected/visits gives some measure of efficiency.

Generation Collections Objects collected Object visits
0 0 0 0
1 0 0 0
2 0 0 0

Optimization (Tier 2) stats

statistics about the Tier 2 optimizer
Count Ratio
Optimization attempts

The number of times a potential trace is identified. Specifically, this occurs in the JUMP BACKWARD instruction when the counter reaches a threshold.

140
Traces created

The number of traces that were successfully created.

40 28.6%
Trace stack overflow

A trace is truncated because it would require more than 5 stack frames.

0 0.0%
Trace stack underflow

A potential trace is abandoned because it pops more frames than it pushes.

0 0.0%
Trace too long

A trace is truncated because it is longer than the instruction buffer.

0 0.0%
Trace too short

A potential trace is abandoced because it it too short.

100 71.4%
Inner loop found

A trace is truncated because it has an inner loop

120 85.7%
Recursive call

A trace is truncated because it has a recursive call.

0 0.0%
Low confidence

A trace is abandoned because the likelihood of the jump to top being taken is too low.

0 0.0%
Executors invalidated

The number of executors that were invalidated due to watched dictionary changes.

0 0.0%
Traces executed

The number of traces that were executed

123,740
Uops executed

The total number of uops (micro-operations) that were executed

23,352,340 18,872.1%
Count Ratio
Optimizer attempts

The number of times the trace optimizer (_Py_uop_analyze_and_optimize) was run.

40
Optimizer successes

The number of traces that were successfully optimized.

40 100.0%
Optimizer no memory

The number of optimizations that failed due to no memory.

0 0.0%
Remove globals builtins changed

The builtins changed during optimization

0 0.0%
Remove globals incorrect keys

The keys in the globals dictionary aren't what was expected

0 0.0%

Trace length histogram

trace length histogram
Range Count Ratio
<= 1 0 0.0%
<= 2 0 0.0%
<= 4 0 0.0%
<= 8 0 0.0%
<= 16 0 0.0%
<= 32 0 0.0%
<= 64 0 0.0%
<= 128 0 0.0%
<= 256 20 50.0%
<= 512 20 50.0%

Optimized trace length histogram

optimized trace length histogram
Range Count Ratio
<= 1 0 0.0%
<= 2 0 0.0%
<= 4 0 0.0%
<= 8 0 0.0%
<= 16 0 0.0%
<= 32 0 0.0%
<= 64 0 0.0%
<= 128 0 0.0%
<= 256 40 100.0%

Trace run length histogram

trace run length histogram
Range Count Ratio
<= 1 0 0.0%
<= 2 0 0.0%
<= 4 0 0.0%
<= 8 80 0.1%
<= 16 0 0.0%
<= 32 0 0.0%
<= 64 0 0.0%
<= 128 0 0.0%
<= 256 40,580 32.8%
<= 512 40,840 33.0%

Uop execution stats

uop execution stats
Name Count Self Cumulative Miss ratio
_SET_IP 2,485,760 10.6% 10.6%
_CHECK_VALIDITY 2,485,760 10.6% 21.3%
_POP_TOP 2,445,200 10.5% 31.8%
_PUSH_NULL 2,445,200 10.5% 42.2%
_CALL_BUILTIN_FAST_WITH_KEYWORDS 2,445,200 10.5% 52.7%
_CHECK_PERIODIC 2,445,200 10.5% 63.2%
_LOAD_FAST 2,445,200 10.5% 73.6%
_LOAD_FAST_4 2,445,200 10.5% 84.1%
_LOAD_FAST_6 2,445,200 10.5% 94.6%
_TIER2_RESUME_CHECK 163,200 0.7% 95.3%
_GUARD_NOT_EXHAUSTED_TUPLE 163,120 0.7% 96.0% 25.0%
_ITER_CHECK_TUPLE 163,120 0.7% 96.7%
_STORE_FAST 162,820 0.7% 97.4%
_ITER_NEXT_TUPLE 122,260 0.5% 97.9%
_JUMP_TO_TOP 81,700 0.3% 98.3%
_EXIT_TRACE 81,500 0.3% 98.6%
_START_EXECUTOR 81,500 0.3% 98.9%
_COLD_EXIT 42,240 0.2% 99.1%
_GUARD_NOT_EXHAUSTED_RANGE 40,640 0.2% 99.3% 0.2%
_ITER_CHECK_RANGE 40,640 0.2% 99.5%
_GET_ITER 40,560 0.2% 99.7%
_ITER_NEXT_RANGE 40,560 0.2% 99.8%
_LOAD_FAST_5 40,560 0.2% 100.0%

Pair counts

Pair counts for top 100 Non-JIT uop pairs

Pairs of specialized operations that deoptimize and are then followed by the corresponding unspecialized instruction are not counted as pairs.

Pair Count Self Cumulative
_PUSH_NULL _LOAD_FAST 2,445,200 10.5% 10.5%
_SET_IP _CALL_BUILTIN_FAST_WITH_KEYWORDS 2,445,200 10.5% 20.9%
_CALL_BUILTIN_FAST_WITH_KEYWORDS _CHECK_PERIODIC 2,445,200 10.5% 31.4%
_CHECK_PERIODIC _CHECK_VALIDITY 2,445,200 10.5% 41.9%
_CHECK_VALIDITY _POP_TOP 2,445,200 10.5% 52.4%
_LOAD_FAST _LOAD_FAST_6 2,445,200 10.5% 62.8%
_LOAD_FAST_4 _PUSH_NULL 2,445,200 10.5% 73.3%
_LOAD_FAST_6 _SET_IP 2,445,200 10.5% 83.8%
_POP_TOP _LOAD_FAST_4 2,322,940 9.9% 93.7%
_ITER_CHECK_TUPLE _GUARD_NOT_EXHAUSTED_TUPLE 163,120 0.7% 94.4%
_TIER2_RESUME_CHECK _ITER_CHECK_TUPLE 122,560 0.5% 94.9%
_GUARD_NOT_EXHAUSTED_TUPLE _ITER_NEXT_TUPLE 122,260 0.5% 95.5%
_ITER_NEXT_TUPLE _STORE_FAST 122,260 0.5% 96.0%
_STORE_FAST _LOAD_FAST_4 122,260 0.5% 96.5%
_POP_TOP _JUMP_TO_TOP 81,700 0.3% 96.9%
_JUMP_TO_TOP _TIER2_RESUME_CHECK 81,700 0.3% 97.2%
_START_EXECUTOR _TIER2_RESUME_CHECK 81,500 0.3% 97.6%
_GUARD_NOT_EXHAUSTED_TUPLE _EXIT_TRACE 40,860 0.2% 97.7%
_ITER_CHECK_RANGE _GUARD_NOT_EXHAUSTED_RANGE 40,640 0.2% 97.9%
_TIER2_RESUME_CHECK _ITER_CHECK_RANGE 40,640 0.2% 98.1%
_GET_ITER _CHECK_VALIDITY 40,560 0.2% 98.3%
_POP_TOP _EXIT_TRACE 40,560 0.2% 98.4%
_SET_IP _GET_ITER 40,560 0.2% 98.6%
_CHECK_VALIDITY _ITER_CHECK_TUPLE 40,560 0.2% 98.8%
_GUARD_NOT_EXHAUSTED_RANGE _ITER_NEXT_RANGE 40,560 0.2% 98.9%
_ITER_NEXT_RANGE _STORE_FAST 40,560 0.2% 99.1%
_LOAD_FAST_5 _SET_IP 40,560 0.2% 99.3%
_STORE_FAST _LOAD_FAST_5 40,560 0.2% 99.5%
_GUARD_NOT_EXHAUSTED_RANGE _EXIT_TRACE 80 0.0% 99.5%

Unsupported opcodes

unsupported opcodes

Optimizer errored out with opcode

Optimization stopped after encountering this opcode

Rare events

Counts of rare/unlikely events
Event Count
set class

Setting an object's class, obj.__class__ = ...

0
set bases

Setting the bases of a class, cls.__bases__ = ...

0
set eval frame func

Setting the PEP 523 frame eval function _PyInterpreterState_SetFrameEvalFunc()

0
builtin dict

Modifying the builtins, __builtins__.__dict__[var] = ...

0
func modification

Modifying a function, e.g. func.__defaults__ = ..., etc.

0
watched dict modification

A watched dict has been modified

0
watched globals modification

A watched globals() dict has been modified

0

Meta stats

Meta statistics
Count
Number of data files 20

Stats gathered on: 2024-06-16