refactor: avoid to allocate device memory for mean and std in every loop by ktro2828 · Pull Request #58 · ktro2828/mmros

ktro2828 · 2025-10-02T20:47:51Z

Description

This pull request refactors the configuration and preprocessing logic for all 2D detector and segmenter classes to improve efficiency and clarity. The main change is moving the image normalization parameters (mean and std) to device memory once during configuration construction, rather than reloading and copying them on every preprocessing call. Additionally, constructors now use move semantics for configuration objects, ensuring better resource management.

Configuration and Device Memory Improvements:

Refactored all config structs (Detector2dConfig, InstanceSegmenter2dConfig, PanopticSegmenter2dConfig, SemanticSegmenter2dConfig) to upload mean and std arrays to device memory in their constructors, replacing host-side vectors with device pointers (CudaUniquePtr<float[]>). This avoids repeated device memory allocations and copies during preprocessing. [1] [2] [3] [4]

Constructor and Resource Management Updates:

Updated all detector and segmenter class constructors to accept configuration objects via move semantics (&&), and store them using std::move, ensuring efficient resource transfer and ownership. [1] [2] [3] [4] [5] [6] [7] [8]

Preprocessing Efficiency:

Simplified the preprocess methods in all detector and segmenter classes to use the device-side mean and std arrays directly, removing redundant host-to-device memory operations and related temporary allocations. [1] [2] [3] [4]

How was this PR tested?

Confirmed build passed
Confirmed some projects worked including yolox, deimv2, pidnet, eomt

Notes for reviewers

None.

Effects on system behavior

None.

Signed-off-by: ktro2828 <kotaro.uetake@tier4.jp>

Copilot

Pull Request Overview

This PR refactors configuration and preprocessing logic for 2D detector and segmenter classes to improve performance by avoiding repeated device memory allocations. The main optimization moves image normalization parameters (mean and std) to device memory once during configuration construction instead of reallocating them on every preprocessing call.

Refactored all config structs to upload mean and std arrays to device memory in constructors
Updated constructors to use move semantics for better resource management
Simplified preprocessing methods to use pre-allocated device memory directly

Reviewed Changes

Copilot reviewed 8 out of 8 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
mmros/include/mmros/detector/detector2d.hpp	Added constructor to Detector2dConfig for device memory allocation and updated class constructor signature
mmros/include/mmros/detector/instance_segmenter2d.hpp	Added constructor to InstanceSegmenter2dConfig for device memory allocation and updated class constructor signature
mmros/include/mmros/detector/panoptic_segmenter2d.hpp	Added constructor to PanopticSegmenter2dConfig for device memory allocation and updated class constructor signature
mmros/include/mmros/detector/semantic_segmenter2d.hpp	Added constructor to SemanticSegmenter2dConfig for device memory allocation and updated class constructor signature
mmros/src/detector/detector2d.cpp	Updated constructor to use move semantics and simplified preprocessing to use device-allocated arrays
mmros/src/detector/instance_segmeter2d.cpp	Updated constructor to use move semantics and simplified preprocessing to use device-allocated arrays
mmros/src/detector/panoptic_segmenter2d.cpp	Updated constructor to use move semantics and simplified preprocessing to use device-allocated arrays
mmros/src/detector/semantic_segmenter2d.cpp	Updated constructor to use move semantics and simplified preprocessing to use device-allocated arrays

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

Copilot · 2025-10-02T20:49:07Z

+  Detector2dConfig(
+    const std::vector<double> & _mean, const std::vector<double> & _std,
+    archetype::BoxFormat2D _box_format, double _score_threshold)
+  : box_format(_box_format), score_threshold(_score_threshold)


The constructor should validate that the mean and std vectors have the same size and are not empty before allocating device memory. Consider adding size validation to prevent runtime errors.

Copilot · 2025-10-02T20:49:07Z

+  InstanceSegmenter2dConfig(
+    const std::vector<double> & _mean, const std::vector<double> & _std,
+    archetype::BoxFormat2D _box_format, double _score_threshold)
+  : box_format(_box_format), score_threshold(_score_threshold)


The constructor should validate that the mean and std vectors have the same size and are not empty before allocating device memory. Consider adding size validation to prevent runtime errors.

Copilot · 2025-10-02T20:49:07Z

+  PanopticSegmenter2dConfig(
+    const std::vector<double> & _mean, const std::vector<double> & _std,
+    archetype::BoxFormat2D _box_format, double _score_threshold)
+  : box_format(_box_format), score_threshold(_score_threshold)


The constructor should validate that the mean and std vectors have the same size and are not empty before allocating device memory. Consider adding size validation to prevent runtime errors.

Copilot · 2025-10-02T20:49:08Z

 {
-  std::vector<double> mean;  //!< Image mean.
-  std::vector<double> std;   //!< Image std.
+  SemanticSegmenter2dConfig(const std::vector<double> & _mean, const std::vector<double> & _std)


The constructor should validate that the mean and std vectors have the same size and are not empty before allocating device memory. Consider adding size validation to prevent runtime errors.

refactor: avoid to allocate device memory for mean and std in every loop

5d466a1

Signed-off-by: ktro2828 <kotaro.uetake@tier4.jp>

Copilot AI review requested due to automatic review settings October 2, 2025 20:47

ktro2828 linked an issue Oct 2, 2025 that may be closed by this pull request

[PERF] Inefficient memory allocation in preprocessing loop #53

Closed

Copilot AI reviewed Oct 2, 2025

View reviewed changes

ktro2828 merged commit 1998b95 into main Oct 2, 2025
1 check failed

ktro2828 deleted the refactor/detector/allocate-mean-std branch October 2, 2025 21:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: avoid to allocate device memory for mean and std in every loop#58

refactor: avoid to allocate device memory for mean and std in every loop#58
ktro2828 merged 1 commit intomainfrom
refactor/detector/allocate-mean-std

ktro2828 commented Oct 2, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Oct 2, 2025

Uh oh!

Copilot AI Oct 2, 2025

Uh oh!

Copilot AI Oct 2, 2025

Uh oh!

Copilot AI Oct 2, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ktro2828 commented Oct 2, 2025

Description

How was this PR tested?

Notes for reviewers

Effects on system behavior

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Copilot AI Oct 2, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 2, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 2, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 2, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants