Optimization Techniques

**Referenced Files in This Document** - [combinate.cpp](file://hikyuu_cpp/hikyuu/analysis/combinate.cpp) - [combinate.h](file://hikyuu_cpp/hikyuu/analysis/combinate.h) - [WalkForwardSystem.h](file://hikyuu_cpp/hikyuu/trade_sys/system/imp/WalkForwardSystem.h) - [WalkForwardSystem.cpp](file://hikyuu_cpp/hikyuu/trade_sys/system/imp/WalkForwardSystem.cpp) - [OptimalSelectorBase.h](file://hikyuu_cpp/hikyuu/trade_sys/selector/imp/optimal/OptimalSelectorBase.h) - [OptimalSelectorBase.cpp](file://hikyuu_cpp/hikyuu/trade_sys/selector/imp/optimal/OptimalSelectorBase.cpp) - [PerformanceOptimalSelector.cpp](file://hikyuu_cpp/hikyuu/trade_sys/selector/imp/optimal/PerformanceOptimalSelector.cpp) - [OptimalEvaluateSelector.cpp](file://hikyuu_cpp/hikyuu/trade_sys/selector/imp/optimal/OptimalEvaluateSelector.cpp) - [_System.cpp](file://hikyuu_pywrap/trade_sys/_System.cpp) - [_Selector.cpp](file://hikyuu_pywrap/trade_sys/_Selector.cpp) - [Performance.h](file://hikyuu_cpp/hikyuu/trade_manage/Performance.h) - [Performance.cpp](file://hikyuu_cpp/hikyuu/trade_manage/Performance.cpp) - [test_SYS_WalkForward.cpp](file://hikyuu_cpp/unit_test/hikyuu/trade_sys/system/test_SYS_WalkForward.cpp) - [test_SE_PerformanceOptimal.cpp](file://hikyuu_cpp/unit_test/hikyuu/trade_sys/selector/test_SE_PerformanceOptimal.cpp) - [test_SE_MaxFundsOptimal.cpp](file://hikyuu_cpp/unit_test/hikyuu/trade_sys/selector/test_SE_MaxFundsOptimal.cpp) - [Turtle_SG.py](file://hikyuu/examples/Turtle_SG.py)

Introduction

This document explains the optimization techniques supported by hikyuu’s backtesting framework with a focus on walk-forward analysis for robust strategy parameter optimization and validation. It covers:

How the combinate module enables rapid testing of parameter combinations for signals and indicators
How walk-forward analysis divides historical data into in-sample and out-of-sample periods, optimizes parameters on in-sample windows, and validates on out-of-sample windows
How the framework explores parameter spaces and integrates with the backtesting engine
Guidance on avoiding overfitting and curve-fitting, selecting parameter ranges, and choosing lookback periods
Practical performance considerations for computationally intensive optimization runs

Project Structure

The optimization capabilities are implemented across several modules:

Analysis: combinate module for indicator combination testing
Strategy/Backtesting: WalkForwardSystem for rolling window optimization
Selector: OptimalSelectorBase and derived selectors for in-sample evaluation and selection
Performance: Performance statistics for evaluation metrics
Python bindings: convenient constructors and helpers for walk-forward and custom evaluators

graph TB
subgraph "Analysis"
COMB["combinate.cpp/.h"]
end
subgraph "Strategy/Backtesting"
WFS["WalkForwardSystem.h/.cpp"]
end
subgraph "Selector"
OSB["OptimalSelectorBase.h/.cpp"]
POS["PerformanceOptimalSelector.cpp"]
OES["OptimalEvaluateSelector.cpp"]
end
subgraph "Performance"
PERF["Performance.h/.cpp"]
end
subgraph "Python Bindings"
PY_SYS["_System.cpp"]
PY_SEL["_Selector.cpp"]
end
COMB --> PERF
WFS --> OSB
OSB --> POS
OSB --> OES
WFS --> PERF
PY_SYS --> WFS
PY_SEL --> OSB

Diagram sources

combinate.cpp
combinate.h
WalkForwardSystem.h
WalkForwardSystem.cpp
OptimalSelectorBase.h
OptimalSelectorBase.cpp
PerformanceOptimalSelector.cpp
OptimalEvaluateSelector.cpp
Performance.h
_System.cpp
_Selector.cpp

Section sources

combinate.cpp
combinate.h
WalkForwardSystem.h
WalkForwardSystem.cpp
OptimalSelectorBase.h
OptimalSelectorBase.cpp
PerformanceOptimalSelector.cpp
OptimalEvaluateSelector.cpp
Performance.h
_System.cpp
_Selector.cpp

Core Components

combinate module: Generates indicator combinations and evaluates them across a single stock or a block of stocks, returning performance metrics for each combination.
WalkForwardSystem: Implements rolling window optimization by evaluating candidate systems on training windows and applying the selected system to subsequent test windows.
OptimalSelectorBase and derived selectors: Define the in-sample evaluation process and produce run ranges for walk-forward execution.
Performance: Provides standardized performance statistics used for optimization criteria.

Section sources

combinate.cpp
combinate.h
WalkForwardSystem.h
OptimalSelectorBase.h
Performance.h

Architecture Overview

The optimization architecture centers around two complementary workflows:

Indicator/Signal Combination Testing (Analysis): Quickly explore parameter combinations for signals/indicators and rank by performance.
Walk-Forward Parameter Optimization (Strategy): Divide history into rolling in-sample and out-of-sample windows, select the best-performing system in-sample, and validate on-out-of-sample.

sequenceDiagram
participant User as "User"
participant PySys as "Python SYS_WalkForward"
participant WFS as "WalkForwardSystem"
participant SEL as "OptimalSelectorBase"
participant POS as "PerformanceOptimalSelector"
participant PERF as "Performance"
participant SYS as "System List"
User->>PySys : "Create walk-forward system with candidate systems"
PySys->>WFS : "Initialize with train_len, test_len, selector"
User->>WFS : "Run on KData"
WFS->>SEL : "calculate(query)"
SEL->>POS : "Iterate train ranges"
loop For each train range
POS->>SYS : "Clone and run candidate systems"
POS->>PERF : "statistics(tm, end_date)"
POS-->>SEL : "Selected system for test window"
end
SEL-->>WFS : "RunRanges and selected systems"
WFS->>WFS : "Apply selected system to test windows"
WFS-->>User : "Trades and performance"

Diagram sources

_System.cpp
WalkForwardSystem.cpp
OptimalSelectorBase.cpp
PerformanceOptimalSelector.cpp
Performance.cpp

Detailed Component Analysis

Indicator Combination Testing with combinate

The combinate module enables fast parameter-space exploration for signals/indicators by:

Generating indicator combinations from buy and sell indicator sets
Creating boolean signals from combinations
Running systems with these signals and computing performance metrics
Supporting single-stock and block-wide evaluation with parallelism

Key capabilities:

Index combination generation with a practical upper bound on input size
Indicator combination creation with a configurable lookback window
Single-stock and block-wide analysis returning performance vectors per combination

flowchart TD
Start(["Start"]) --> Inputs["Collect buy and sell indicator lists"]
Inputs --> GenIdx["Generate index combinations"]
GenIdx --> BuildCombo["Build indicator combinations<br/>with lookback window"]
BuildCombo --> BuildSignals["Create boolean signals from combinations"]
BuildSignals --> RunSys["Run systems with signals"]
RunSys --> Perf["Compute performance metrics"]
Perf --> Output["Return combination performance map/vector"]
Output --> End(["End"])

Diagram sources

combinate.h
combinate.h
combinate.h
combinate.cpp
combinate.cpp
combinate.cpp

Practical usage examples:

See the example signal class in Python that demonstrates parameterized signals and how to integrate them into the backtesting pipeline.
The combinate module is ideal for quickly scanning small to medium-sized parameter grids for signal thresholds or indicator parameters.

Section sources

combinate.h
combinate.h
combinate.h
combinate.cpp
combinate.cpp
combinate.cpp
Turtle_SG.py

Walk-Forward Analysis Implementation

Walk-forward analysis divides the historical dataset into overlapping rolling windows:

In-sample (training) window: used to evaluate candidate systems and select the best performer
Out-of-sample (test) window: used to validate the selected system’s performance

The framework:

Builds trading calendar from query and market
Defines train ranges and shifts by test_len to create overlapping windows
Evaluates candidates in-sample and maps selected systems to out-of-sample dates
Applies the selected system to test windows during run

sequenceDiagram
participant WFS as "WalkForwardSystem"
participant SEL as "OptimalSelectorBase"
participant POS as "PerformanceOptimalSelector"
participant SYS as "Candidate Systems"
participant PERF as "Performance"
WFS->>SEL : "calculate(query)"
SEL->>SEL : "Build train ranges from trading calendar"
loop For each train range
SEL->>POS : "Select best system"
POS->>SYS : "Clone and run"
POS->>PERF : "statistics(tm, end_date)"
POS-->>SEL : "Selected system"
SEL->>SEL : "Map selected system to test dates"
end
SEL-->>WFS : "RunRanges and selected systems"
WFS->>WFS : "Run selected system on test windows"

Diagram sources

WalkForwardSystem.cpp
OptimalSelectorBase.cpp
PerformanceOptimalSelector.cpp
PerformanceOptimalSelector.cpp
OptimalSelectorBase.h

Section sources

WalkForwardSystem.h
WalkForwardSystem.cpp
WalkForwardSystem.cpp
OptimalSelectorBase.h
OptimalSelectorBase.cpp
PerformanceOptimalSelector.cpp
PerformanceOptimalSelector.cpp

Parameter Space Exploration and Evaluation

Grid search: The combinate module generates combinations of indicator parameters and evaluates them systematically. This is a form of grid search over small parameter sets.
Custom evaluation: The OptimalSelectorBase allows custom evaluation functions via SE_EvaluateOptimal, enabling advanced optimization strategies beyond simple grid search.
Performance metrics: Performance statistics provide standardized measures for ranking candidate systems.

classDiagram
class OptimalSelectorBase {
+calculate(pf_realSysList, query)
+getRunRanges()
+evaluate(sys, enddate) double
}
class PerformanceOptimalSelector {
+calculate(pf_realSysList, query)
-_calculate_single(...)
-_calculate_parallel(...)
}
class OptimalEvaluateSelector {
+evaluate(sys, enddate) double
}
OptimalSelectorBase <|-- PerformanceOptimalSelector
OptimalSelectorBase <|-- OptimalEvaluateSelector

Diagram sources

OptimalSelectorBase.h
PerformanceOptimalSelector.cpp
OptimalEvaluateSelector.cpp

Section sources

OptimalSelectorBase.h
PerformanceOptimalSelector.cpp
OptimalEvaluateSelector.cpp
_Selector.cpp

Methodology: In-Sample vs Out-of-Sample Periods

In-sample (train) windows: Used to evaluate candidate systems and select the best performing system according to a chosen metric.
Out-of-sample (test) windows: Used to validate the selected system’s performance without re-optimizing parameters.
Overlapping windows: train_len and test_len define the window sizes; the sliding window ensures continuous coverage of the dataset.

Validation and correctness checks:

Unit tests demonstrate correct run-range construction and selection behavior for both single and multiple candidate systems.

Section sources

OptimalSelectorBase.cpp
PerformanceOptimalSelector.cpp
test_SE_PerformanceOptimal.cpp
test_SE_PerformanceOptimal.cpp
test_SE_MaxFundsOptimal.cpp
test_SE_MaxFundsOptimal.cpp

Relationship Between Optimization Results and Strategy Robustness

Robustness hinges on out-of-sample validation: systems that perform well in-sample but fail out-of-sample likely overfit.
Walk-forward analysis mitigates overfitting by preventing parameter leakage from test windows into training decisions.
Metrics-based selection: Using Performance statistics ensures consistent evaluation criteria across systems.

Guidance:

Prefer out-of-sample metrics for final selection.
Use walk-forward with reasonable train_len/test_len to simulate realistic deployment conditions.
Avoid excessive parameter tuning; keep parameter ranges narrow and meaningful.

Section sources

Performance.h
Performance.cpp
test_SYS_WalkForward.cpp
test_SYS_WalkForward.cpp

Selecting Parameter Ranges and Lookback Periods

Parameter ranges: Start narrow and expand gradually; use domain knowledge to constrain plausible ranges.
Lookback periods: Align with the typical turnover and mean reversion characteristics of the asset class; too short may overfit noise; too long may miss recent regime changes.
Train/test lengths: Choose train_len to capture enough dynamics for robust selection; choose test_len to provide meaningful validation without excessive computation.

[No sources needed since this section provides general guidance]

Concrete Examples from combinate.cpp

Indicator combination generation and evaluation for buy/sell signals
Block-wide evaluation with parallel execution and error handling
Output structure for downstream analysis

Refer to the following paths for implementation details:

combinate.cpp
combinate.cpp
combinate.cpp
combinate.h
combinate.h

Section sources

combinate.cpp
combinate.cpp
combinate.cpp
combinate.h
combinate.h

Dependency Analysis

The optimization stack exhibits clear separation of concerns:

combinate depends on indicator and system infrastructure and returns performance vectors
WalkForwardSystem orchestrates run ranges and applies selected systems
OptimalSelectorBase defines the evaluation contract and produces run ranges
Performance provides the evaluation metrics used by selectors

graph TB
COMB["combinate.cpp/.h"] --> PERF["Performance.h/.cpp"]
WFS["WalkForwardSystem.h/.cpp"] --> OSB["OptimalSelectorBase.h/.cpp"]
OSB --> POS["PerformanceOptimalSelector.cpp"]
OSB --> OES["OptimalEvaluateSelector.cpp"]
WFS --> PERF
PY_SYS["Python _System.cpp"] --> WFS
PY_SEL["Python _Selector.cpp"] --> OSB

Diagram sources

combinate.cpp
combinate.h
WalkForwardSystem.h
WalkForwardSystem.cpp
OptimalSelectorBase.h
OptimalSelectorBase.cpp
PerformanceOptimalSelector.cpp
OptimalEvaluateSelector.cpp
Performance.h
_System.cpp
_Selector.cpp

Section sources

combinate.cpp
WalkForwardSystem.cpp
OptimalSelectorBase.cpp
PerformanceOptimalSelector.cpp

Performance Considerations

Parallel evaluation: Both combinate block-wide analysis and selector calculations support parallel execution to reduce wall-clock time.
Memory and CPU: Large parameter grids and long histories increase memory and compute requirements; tune train_len/test_len accordingly.
I/O and serialization: Serialization support in selectors and systems enables persistence and reuse of evaluation results.

[No sources needed since this section provides general guidance]

Troubleshooting Guide

Common issues and remedies:

Empty candidate system list: Ensure candidate systems are added and properly initialized with a stock.
Invalid market or query: Verify market code and query parameters align with available trading calendar.
Overfitting symptoms: If in-sample performance drops significantly out-of-sample, reduce parameter complexity or increase test_len.
Performance evaluation errors: Wrap custom evaluators in try-catch blocks; the framework logs errors and continues.

Section sources

test_SYS_WalkForward.cpp
test_SYS_WalkForward.cpp
OptimalEvaluateSelector.cpp
combinate.cpp

Conclusion

hikyuu’s optimization toolkit combines:

Rapid indicator/signal combination testing via combinate for small-scale parameter grids
Robust walk-forward analysis for parameter-free validation across rolling windows
Flexible evaluation via custom selectors and standardized performance metrics

Adopting walk-forward with careful parameter range selection and out-of-sample validation helps mitigate overfitting and improves strategy robustness.

[No sources needed since this section summarizes without analyzing specific files]

Appendices

Python convenience: SYS_WalkForward and SE_EvaluateOptimal streamline setup and custom evaluation in Python bindings.

Section sources

_System.cpp
_Selector.cpp

Optimization Techniques

Optimization Techniques

Table of Contents

Introduction

Project Structure

Core Components

Architecture Overview

Detailed Component Analysis

Indicator Combination Testing with combinate

Walk-Forward Analysis Implementation

Parameter Space Exploration and Evaluation

Methodology: In-Sample vs Out-of-Sample Periods

Relationship Between Optimization Results and Strategy Robustness

Selecting Parameter Ranges and Lookback Periods

Concrete Examples from combinate.cpp

Dependency Analysis

Performance Considerations

Troubleshooting Guide

Conclusion

Appendices

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!