[onert] Performance Regression: Latency Increase & Memory Reduction with MobileNetV2 #15362

### Description
The following results were measured for MobileNetV2 on the CPU backend:

| Metric | Previous | Current |
| --- | --- | --- |
| Prepare | 10.193 ms | 10.606 ms |
| Avg I/O | 0.081 ms | 0.084 ms |
| Avg Run | 10.520 ms | 21.962 ms |
| RSS EXECUTE | 72,160 KB | 37,484 KB |

### Measurement Reference
- Previous measurement: https://github.com/Samsung/ONE/pull/15192#issuecomment-2817726278
- Current measurement at commit b2f193a3663


### Steps to Reproduce
1. Run
```
python3 runtime/onert/sample/minimal-python/inference_benchmark.py mobilenetv2 --backends cpu --input-shape 1,224,224,3 --repeat 100
```
2. Compare the “Avg Run” and “RSS EXECUTE” values with those from previous runs
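
The comparison in step 2 can be scripted. The following is a minimal sketch, assuming the benchmark prints metrics as `Name : value unit` lines like those quoted in the description. `parse_metrics` and `relative_change` are hypothetical helpers written for this illustration and are not part of onert.

```python
import re

# Assumed line format, taken from the output quoted in this issue, e.g.
# "Avg Run       : 10.520 ms" or "RSS EXECUTE   : 72 160 KB".
METRIC_RE = re.compile(
    r"^\s*(?P<name>[A-Za-z/ ]+?)\s*:\s*(?P<value>[\d ,.]+?)\s*(?P<unit>ms|KB)\s*$"
)

def parse_metrics(text):
    """Return {metric name: numeric value} for every line matching the assumed format."""
    metrics = {}
    for line in text.splitlines():
        m = METRIC_RE.match(line)
        if m:
            # Drop thousands separators (space or comma) before converting.
            raw = m.group("value").replace(" ", "").replace(",", "")
            metrics[m.group("name")] = float(raw)
    return metrics

def relative_change(previous, current):
    """Return (current - previous) / previous for metrics present in both runs."""
    return {
        name: (current[name] - previous[name]) / previous[name]
        for name in previous
        if name in current
    }

PREVIOUS = """
Prepare       : 10.193 ms
Avg I/O       : 0.081 ms
Avg Run       : 10.520 ms
RSS EXECUTE   : 72 160 KB
"""

CURRENT = """
Prepare       : 10.606 ms
Avg I/O       : 0.084 ms
Avg Run       : 21.962 ms
RSS EXECUTE   : 37 484 KB
"""

previous = parse_metrics(PREVIOUS)
current = parse_metrics(CURRENT)
changes = relative_change(previous, current)

for name, change in changes.items():
    print(f"{name}: {previous[name]} -> {current[name]} ({change:+.1%})")
```

With the numbers from this issue, the sketch reports a relative change of about +109% for Avg Run and about -48% for RSS EXECUTE, while Prepare and Avg I/O move by only a few percent.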


### Expected Behavior
- Avg Run should remain at about 10.5 ms
- RSS EXECUTE should remain at about 72 MB


### Actual Behavior
- Avg Run has more than doubled, from 10.520 ms to 21.962 ms (about 2.09x)
- RSS EXECUTE has dropped by about 48%, from 72,160 KB to 37,484 KB


### Impact
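Prepare and Avg I/O are essentially unchanged, so the change is confined to execution. A latency increase combined with a drop in RSS could indicate a change in memory allocation strategy or an unintended slowdown in the execution path. This needs investigation.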


