Phase 6A — Post-Run Evaluation Template¶
Scoring sheet for momentum-aware one-sided participation filter runs. Use this template for every Phase 6A session until Phase 6A is formally closed.
Run identity¶
- Session ID:
- Duration:
- Config:
- bid_offset_bps = 10
- ask_offset_bps = 14
- drift_pct = 0.65
- max_skew_bps = 10
- momentum_filter_enabled = true
- momentum_lookback_ticks = 3
- momentum_threshold_bps = 4.0
Hard startup gate¶
Must confirm from startup log:
- momentum_filter_enabled=True
- momentum_lookback_ticks=3
- momentum_threshold_bps=4.0
If any missing or wrong: Run invalid for Phase 6A evaluation.
1. Core outcome block¶
- Fills total:
- Buy fills:
- Sell fills:
- VW realized spread:
- Avg realized spread:
- Toxic fills / toxicity %:
- Ending inventory %:
- Drift range:
- XRP session %:
- Portfolio end value:
2. Momentum filter activity¶
- Total BUY suppressions:
- Total SELL suppressions:
- First trigger timestamp:
- Last trigger timestamp:
- Total ticks with suppression active (if available):
Activity classification: - Dormant — 0 total suppressions. No meaningful momentum regime. Valid for baseline preservation only. - Moderate / healthy — Suppressions in bursts around directional moves. Bounded counters. Filter active and selective. - Overfiring — Frequent suppressions across noisy tape. Threshold too sensitive or signal too noisy.
3. Side-correctness check¶
For each suppression cluster, record: timestamp, momentum_bps sign, suppressed side, subsequent price path.
Pass rule: - positive momentum → SELL suppressed - negative momentum → BUY suppressed
Fail: any inversion of sign/side logic. Single ambiguous case OK, repeated = Phase 6A fails immediately.
4. Calm-regime baseline preservation¶
Compare against pre-6A baseline sessions.
Expected calm-regime profile: - VW spread roughly positive - fills not collapsed - toxicity near zero
| Outcome | Criteria |
|---|---|
| Pass | VW positive, fills in plausible range, no starvation, toxicity ~0 |
| Warning | VW still positive but fills materially collapse; suppressions active in calm tape |
| Fail | Previously healthy session now broken; VW materially worse in calm; filter fires without momentum |
5. Anchor regime classification¶
Classify anchor over session: - Negative anchor — anchor meaningfully below CLOB mid → SELL structurally advantaged - Neutral anchor — near zero, not directionally dominant → cleanest read of intrinsic filter value - Positive anchor — anchor meaningfully above CLOB mid → BUY structurally advantaged
Record: - anchor mean: - anchor range: - anchor bias direction:
Interpretation rule: do not over-credit filter for strength on the anchor-advantaged side.
6. Interaction and conversion block¶
- Buy near-touch (session):
- Sell near-touch (session):
- Buy conversion %:
- Sell conversion %:
| Signal | Interpretation |
|---|---|
| Healthy | Conversion plausible on at least one side; no complete collapse |
| Warning | One-sided conversion near zero from suppression without momentum justification |
| Useful 6A signal | Suppressed side shows reduced adverse interaction while opposite side still converts |
7. Stress-regime check¶
- Max drift reached:
abs(drift) >= 10%: yes/no- Duration at/above 10%:
- Cap engaged during suppression windows: yes/no
Interpretation: - No stress (drift <10%) → Phase 6A evaluated for calm-regime preservation only, not full stress benefit. - Stress reached → true test of momentum filter at the boundary.
8. Pass / Fail decision tree¶
Phase 6A PASS — all true: - ✓ startup gate passed - ✓ side-correctness passed - ✓ no evidence of overfiring - ✓ calm baseline preserved - ✓ toxicity stays near zero - ✓ either stress behavior improved OR calm regime intact while filter stays selective
Phase 6A PARTIAL PASS: - logic correct, no breakage - but no strong momentum regime occurred OR anchor regime makes attribution difficult - safe to keep testing, benefit not yet isolated
Phase 6A FAIL — any of: - side suppression wrong - filter overfires in calm tape - fill activity collapses without justification - VW degrades materially in otherwise healthy regime - implementation not actually active in startup logs
9. Trigger count thresholds (evaluation, not production)¶
| Range | Interpretation |
|---|---|
| 0–10 per side over 2h | Low / selective / calm regime |
| ~10–40 per side during directional periods | Healthy active range if clustered |
| 50+ both sides in calm-looking run | Overfiring concern — investigate |
| Continuous alternating suppression | Too sensitive |
Soft thresholds. Not hard failure lines.
10. Output format for close¶
Send close in this structure: 1. Session results 2. Startup gate confirmation 3. Momentum filter activity 4. Side-correctness assessment 5. Anchor regime 6. Drift / stress status 7. Conversion and interaction 8. Verdict: PASS / PARTIAL PASS / FAIL 9. Next recommendation
Atlas scoring priority (what matters most)¶
- Side-correctness
- Baseline not broken
- No overfiring
- Stress improvement if stress occurred
- Anchor-adjusted interpretation
For the first Phase 6A run, we need to know:¶
- Did the filter activate correctly?
- Did it stay out of the way when not needed?
- Did it avoid harming the good baseline?
If yes — Phase 6A survives first contact.
Template provided by Atlas, 2026-04-15. Use for all Phase 6A runs.