Backtest Results
Historical performance analysis of our trading strategies. All results are based on out-of-sample backtesting with no lookahead bias.
Performance Metrics Overview
Comprehensive performance comparison across all strategies
| Metric | Position-Sized | Consensus | Buy & Hold |
|---|---|---|---|
| Annual Return | 8.69% | 9.46% | 8.56% |
| Sharpe Ratio | 0.64 | 0.66 | 0.45 |
| Sortino Ratio | 0.73 | 0.70 | 0.59 |
| Calmar Ratio | 0.21 | 0.28 | 0.15 |
| Max Drawdown | -42.05% | -33.92% | -56.78% |
| Annual Volatility | 12.97% | 13.77% | 18.09% |
| Win Rate | 74.19% | 91.67% | 0.00% |
| Total Trades | 31 | 24 | N/A |
| Backtest Period | 35.9 years | 35.9 years | 35.9 years |
Cumulative Returns
Consensus, Position-Sized, and Buy & Hold strategies
Drawdown Comparison
Peak-to-trough declines (lower is worse)
Rolling Returns
252-day rolling annualized returns
Signal Distribution
Distribution of consensus signals across all time periods
Model Analysis
Model Prediction Probabilities
Confidence levels for each time horizon (50% threshold for LONG signal)
When probability > 50%, model signals LONG. When < 50%, model signals CASH.
Historical backtest performance vs Buy & Hold
9.46%
vs 8.56% B&H
0.66
vs 0.45 B&H
-33.92%
vs -56.78% B&H
Win Rate
91.67%
Total Trades
24
Time in Market
0.00%
Historical backtest performance vs Buy & Hold
8.69%
vs 8.56% B&H
0.64
vs 0.45 B&H
-42.05%
vs -56.78% B&H
Win Rate
74.19%
Total Trades
31
Time in Market
0.00%
Position Size History (Position-Sized Strategy)
Strategy allocation over time (0-100%)
Average Position: 66.7%
Position Size History (Consensus Strategy)
Strategy allocation over time (0-100%)
Average Position: 65.2%
Model Voting Pattern & Consensus Strength
Dot color reflects number of models LONG
Strategy Descriptions
Binary approach that invests 100% when the consensus threshold is met.
Rules:
- • ≥2 models signal LONG → 100% invested
- • <2 models signal LONG → 100% cash
- • Simple, easy to implement
Best for: Traders who prefer binary decisions and can tolerate higher volatility.
Gradual approach that varies position size based on model agreement.
Rules:
- • 3 models LONG → 100% position
- • 2 models LONG → 90% position
- • 1 model LONG → 30% position
- • 0 models LONG → 0% (cash)
Best for: Traders who prefer gradual adjustments and smoother equity curves.
Model Training Insights
Classification performance on test set and cross-validation results
| Horizon | AUC | Accuracy | Precision | Recall | F1 Score | CV AUC | Features |
|---|---|---|---|---|---|---|---|
3M | 0.701 | 0.614 | 0.854 | 0.535 | 0.658 | 0.719 ± 0.231 | 10 |
6M | 0.786 | 0.721 | 0.862 | 0.735 | 0.793 | 0.834 ± 0.139 | 5 |
12M | 0.930 | 0.887 | 0.944 | 0.910 | 0.927 | 0.847 ± 0.085 | 10 |
AUC: Target ≥ 0.60 (higher = better predictive power)
Precision: Accuracy of LONG predictions (higher = fewer false signals)
Recall: Ability to identify opportunities (higher = catches more good periods)
CV AUC: Cross-validation performance (lower std = more stable)
Confusion Matrices
How well each model classifies LONG vs CASH signals
3M Horizon
Accuracy: 61.4%
Actual: CASH ✓
Actual: CASH ✗
Actual: LONG ✗
Actual: LONG ✓
6M Horizon
Accuracy: 72.1%
Actual: CASH ✓
Actual: CASH ✗
Actual: LONG ✗
Actual: LONG ✓
12M Horizon
Accuracy: 88.8%
Actual: CASH ✓
Actual: CASH ✗
Actual: LONG ✗
Actual: LONG ✓
Cross-Validation Results
Train vs Validation AUC per fold - detecting overfitting
3M Horizon CV Results
5 folds completed
6M Horizon CV Results
5 folds completed
12M Horizon CV Results
5 folds completed
Feature Importance
Top 10 most important features for each prediction horizon
Top 10 Features - 3M Horizon
Most important features for prediction
Top 10 Features - 6M Horizon
Most important features for prediction
Top 10 Features - 12M Horizon
Most important features for prediction
Backtest Methodology
Data & Approach
- ✓Out-of-sample testing (no lookahead bias)
- ✓Walk-forward validation
- ✓Models trained on historical data only
- ✓Signals lagged by 1 day (realistic execution)
Key Metrics
- Annual Return: Annualized total return
- Sharpe Ratio: Risk-adjusted returns (higher is better)
- Max Drawdown: Largest peak-to-trough decline
- Win Rate: Percentage of profitable trades
Want to learn more about our models and methodology? View detailed methodology →