Backtest Results

Historical performance analysis of our trading strategies. All results are based on out-of-sample backtesting with no lookahead bias.

Performance Metrics Overview

Performance Metrics Overview

Comprehensive performance comparison across all strategies

Metric
Position-Sized
Consensus
Buy & Hold
Annual Return
10.07%
11.29%
8.46%
Sharpe Ratio
0.78
0.87
0.47
Sortino Ratio
0.92
0.96
0.60
Calmar Ratio
0.29
0.41
0.15
Max Drawdown
-34.67%
-27.62%
-56.78%
Annual Volatility
12.84%
13.00%
18.08%
Win Rate
65.22%
81.58%
0.00%
Total Trades
23
38
N/A
Backtest Period
35.8 years
35.8 years
35.8 years
Good
Positive performance or ratio > 1
Moderate
Moderate performance or ratio ≤ 1
Poor
Negative performance or high drawdown

Performance Analysis

Cumulative Returns - All Strategies

Comprehensive comparison of strategy performance

Position-Sized

+2991.45%

Consensus

+4480.19%

Buy & Hold

+1750.13%

Drawdown Analysis

Peak-to-trough declines over time

Rolling Returns

252-day rolling returns comparison

Model Analysis

Model Probabilities

Historical model prediction probabilities by horizon

Strategy Performance

Consensus Strategy

Historical backtest performance vs Buy & Hold

Annual Return

11.29%

Sharpe Ratio

0.87

Max Drawdown

-27.62%

Outperformance:+11.29%/year

Win Rate

81.58%

Total Trades

38

Time in Market

0.00%

Position-Sized Strategy

Historical backtest performance vs Buy & Hold

Annual Return

10.07%

Sharpe Ratio

0.78

Max Drawdown

-34.67%

Outperformance:+10.07%/year

Win Rate

65.22%

Total Trades

23

Time in Market

0.00%

Strategy Descriptions

Consensus Strategy

Binary approach that invests 100% when the consensus threshold is met.

Rules:

  • • ≥2 models signal LONG → 100% invested
  • • <2 models signal LONG → 100% cash
  • • Simple, easy to implement

Best for: Traders who prefer binary decisions and can tolerate higher volatility.

Position-Sized Strategy

Gradual approach that varies position size based on model agreement.

Rules:

  • • 3 models LONG → 100% position
  • • 2 models LONG → 90% position
  • • 1 model LONG → 30% position
  • • 0 models LONG → 0% (cash)

Best for: Traders who prefer gradual adjustments and smoother equity curves.

Model Training Insights

Model Performance Metrics

Classification performance on test set and cross-validation results

HorizonAUCAccuracyPrecisionRecallF1 ScoreCV AUCFeatures
3M
0.753
0.659
0.814
0.635
0.714
0.680 ± 0.234
10
6M
0.884
0.774
0.916
0.753
0.827
0.834 ± 0.125
10
12M
0.975
0.912
0.975
0.909
0.941
0.978 ± 0.034
10
Good
Score ≥ target threshold
Fair
Score slightly below threshold
Needs improvement
Score significantly below threshold

AUC: Target ≥ 0.60 (higher = better predictive power)

Precision: Accuracy of LONG predictions (higher = fewer false signals)

Recall: Ability to identify opportunities (higher = catches more good periods)

CV AUC: Cross-validation performance (lower std = more stable)

Confusion Matrices

How well each model classifies LONG vs CASH signals

3M Horizon

Accuracy: 65.9%

True Negative
2114
23.5%
Predicted: CASH
Actual: CASH ✓
False Positive
872
9.7%
Predicted: LONG
Actual: CASH ✗
False Negative
2190
24.4%
Predicted: CASH
Actual: LONG ✗
True Positive
3814
42.4%
Predicted: LONG
Actual: LONG ✓
Total predictions: 8,990

6M Horizon

Accuracy: 77.4%

True Negative
2124
23.6%
Predicted: CASH
Actual: CASH ✓
False Positive
445
4.9%
Predicted: LONG
Actual: CASH ✗
False Negative
1583
17.6%
Predicted: CASH
Actual: LONG ✗
True Positive
4838
53.8%
Predicted: LONG
Actual: LONG ✓
Total predictions: 8,990

12M Horizon

Accuracy: 91.2%

True Negative
1878
20.9%
Predicted: CASH
Actual: CASH ✓
False Positive
160
1.8%
Predicted: LONG
Actual: CASH ✗
False Negative
630
7.0%
Predicted: CASH
Actual: LONG ✗
True Positive
6322
70.3%
Predicted: LONG
Actual: LONG ✓
Total predictions: 8,990

Cross-Validation Results

Train vs Validation AUC per fold - detecting overfitting

3M Horizon CV Results

5 folds completed

Mean Val AUC: 0.680
Gap: 0.1370 (High Overfitting)

6M Horizon CV Results

5 folds completed

Mean Val AUC: 0.834
Gap: 0.0850 (Moderate)

12M Horizon CV Results

3 folds completed

Mean Val AUC: 0.978
Gap: -0.0069 (Good)

Feature Importance

Top 10 most important features for each prediction horizon

Top 10 Features - 3M Horizon

Most important features for prediction

Top 10 Features - 6M Horizon

Most important features for prediction

Top 10 Features - 12M Horizon

Most important features for prediction

Backtest Methodology

Data & Approach

  • Out-of-sample testing (no lookahead bias)
  • Walk-forward validation
  • Models trained on historical data only
  • Signals lagged by 1 day (realistic execution)

Key Metrics

  • Annual Return: Annualized total return
  • Sharpe Ratio: Risk-adjusted returns (higher is better)
  • Max Drawdown: Largest peak-to-trough decline
  • Win Rate: Percentage of profitable trades

Want to learn more about our models and methodology? View detailed methodology →