| Method | Weight | Point | 80% lo | 80% hi | 95% lo | 95% hi |
|---|---|---|---|---|---|---|
| linear | 25.0% | -4.79% | -8.47% | -1.13% | -10.05% | +0.88% |
| monte_carlo | 25.0% | -8.00% | -34.90% | +28.58% | -46.16% | +51.97% |
| ar1 | 25.0% | +0.01% | -5.03% | +5.33% | -7.60% | +8.25% |
| random_walk | 25.0% | +0.00% | -24.66% | +32.73% | -35.15% | +54.20% |
| Factor | Loading | Ticker value | Contribution |
|---|---|---|---|
| value_score | +0.0495 | -3.0000 | -0.1484 |
| quality_score | +0.0192 | -0.6799 | -0.0131 |
| momentum_score | +0.0286 | +3.0000 | +0.0858 |
| lowvol_score | +0.0173 | -2.5204 | -0.0436 |
| revisions_score | +0.0236 | +3.0000 | +0.0709 |
| news_activity_score | +0.0402 | +0.0244 | +0.0010 |
| (intercept) | - | - | -0.0005 |
| Factor | Mean IC | ICIR | % positive | Cumulative attribution | n periods |
|---|---|---|---|---|---|
| value_score | +0.2563 | +2.690 | 100.0% | +0.5383 | 252 |
| quality_score | +0.1105 | +1.096 | 86.5% | +0.2401 | 252 |
| momentum_score | +0.1428 | +1.354 | 92.5% | +0.2460 | 252 |
| lowvol_score | +0.0878 | +0.887 | 82.1% | +0.1870 | 252 |
| revisions_score | +0.1375 | +1.319 | 88.9% | +0.2738 | 252 |
| news_activity_score | +0.3540 | +3.832 | 100.0% | +1.0893 | 252 |
| Horizon | n predictions | Directional accuracy | MAE (return) | RMSE | 80% CI hit rate | Pearson(pred, real) |
|---|---|---|---|---|---|---|
| 1d | 691 | 53.0% | 2.34% | 3.55% | 83.1% | +0.008 |
| 30d | 662 | 62.4% | 17.30% | 22.74% | 72.4% | +0.206 |
| 60d | 632 | 54.7% | 34.01% | 42.39% | 58.7% | +0.261 |
intraday_forecaster_v10_trained_lgbm.
horizon_conflict flag when |1h move| > 1%.
meta_labeler_v10_trained_lgbm.
meta_abstain flag is added to the critique. See AFML Ch.3.6 (Lopez de Prado 2018).
| Factor | Loading wi | Ticker z-value | +1 sd impact | -1 sd impact |
|---|---|---|---|---|
| value_score | +0.0495 | -3.000 | +0.0495 | -0.0495 |
| news_activity_score | +0.0402 | +0.024 | +0.0402 | -0.0402 |
| momentum_score | +0.0286 | +3.000 | +0.0286 | -0.0286 |
| revisions_score | +0.0236 | +3.000 | +0.0236 | -0.0236 |
| quality_score | +0.0192 | -0.680 | +0.0192 | -0.0192 |
| lowvol_score | +0.0173 | -2.520 | +0.0173 | -0.0173 |
| Scenario | Description | Stressed point | Delta vs base |
|---|---|---|---|
| rates_+100bps | Yield curve +100bps (lowvol -1sd, value -0.5sd) | -8.99% | -4.20% |
| recession_risk_off | Recession (momentum -1.5sd, lowvol +1sd, news -1sd) | -11.37% | -6.58% |
| quality_flight | Quality flight (quality +1sd, revisions +1sd, momentum -0.5sd) | -1.93% | +2.86% |
Point return formula (linear factor model):
r_hat = alpha + sum_i (w_i × f_i)
where w_i is the cross-sectional loading for factor i
(estimated from the cross-section of US large-cap names) and f_i is the
z-scored factor value for this ticker (Grinold-Kahn winsorization at ±3).
Per-factor contribution shown in the waterfall above is exactly
w_i × f_i (decimal-return units), so the bars sum (with intercept) to
the linear forecast point. The ensemble point combines linear / Monte Carlo /
AR(1) / random-walk per per-method-comparison table.
Bias mapping (composite z-score → LONG/NEUTRAL/SHORT): weighted sum of value (1.0), quality (0.7), momentum (1.0), low-vol (0.5), revisions (1.0), news-activity (0.6); thresholds ±1.0 with macro override (risk-off + LONG → reduce to NEUTRAL; risk-on-growth + SHORT → reduce to NEUTRAL). LONG → Playbook A (long-call / call-debit-spread / covered-call); SHORT → Playbook B (long-put / put-debit-spread / bear-call-spread); NEUTRAL → cash.
Sensitivity (one-factor-at-a-time tornado above): impact = w_i × 1sd
per factor (MSCI / Two-Sigma Venn convention). Stress scenarios are canned multi-factor
shocks (rates +100bps; recession risk-off; quality flight).
Factor decay uses exponential half-life per factor type:
catalysts 7d, news 14d, revisions 30d, momentum 45d, value 90d, quality 90d
(Tetlock 2007; Cohen-Malloy-Pomorski 2012). The heatmap above shows
contribution × exp(-ln(2) × t / half_life) on a 5-day grid.
Factor IC backtest: Spearman rank correlation between each factor's
cross-sectional ranking on day t and the realized h-day-forward
return, aggregated across the panel (Grinold-Kahn 2000 ch.4 fundamental law of
active management). Source for this report: synthetic_panel_252d_100tickers_seed42 (parallel runner).
Placebo audit (v6, GT-Score):
GT-score = 0.51
SPURIOUS — setup likely lacks real signal
.
Computed from 3 seed(s) across placebo kinds:
shuffled, gaussian_iid, garch.
Method: real directional accuracy = 53.0%;
max placebo accuracy = 53.8%;
AMG = -0.008.
Source: placebo_audit_v6_ar1_runner.
Provenance: arXiv 2604.15531 "Spurious Predictability in Financial ML" (Paper #3).
Risk-flag spurious_predictability_audit_failed added to narrative.
| Date | Type | Source | Confidence | Description |
|---|---|---|---|---|
| 2026-06-04 | earnings | yahoo | high | Earnings announcement (CIEN) |