John-Z's Overlay · Trifecta ATM

User guide

How to use this site

What the numbers mean and how to read a race page. Research-grade methodology, transparent backtest, tipsheet — not betting advice. Walk-forward backtests show the model picks horses well but pari-mutuel takeout is too steep a floor to clear, so treat the picks as one input alongside your own handicapping.

App Developer John B. Clayton · Lexington, KY · Built with Claude Code

For the model's internal methodology — training data, features, validation, lift over baseline — see About.

1. What this is

A model-driven tipsheet for the meets we follow. Every horse on every card runs through a LightGBM speed-fig regressor + LambdaRank ensemble trained on 157,000 historical past-performance rows across 7 tracks. The output: per-horse W/P/S/ITM probabilities, a "shape" classification per race, and the model's most-likely-to-cash trifecta structure with hit probability and pari-mutuel-implied payout — all transparent, all informational.

Walk-forward cross-validation shows the speed-fig regressor reduces MAE 24% over the naive "next race ≈ last race" baseline (6.57 vs. 8.69 fig points) on a held-out 14,000-row test set per fold. The model has real horse-picking skill — top-1 hit rate of 29.6% out-of-sample on 8-horse fields (vs. ~12.5% random).

Honest claim: the model picks horses well, but pari-mutuel takeout (~16% on WPS, ~19–26% on trifecta) is too steep a hurdle to consistently clear at this data scale. Walk-forward backtests show following the model's "most-likely" structures returns roughly the takeout in losses. Use this as one input among many, not as a betting service. The tier badges describe race shape (model confidence), not bet quality.

What it's good for:

2. Reading a race page

The field grid

Pro tip: a horse with both ▲ +Edge AND ↓ Live drift is the strongest signal — the public is starting to agree with the model but the price hasn't fully closed. A horse with +Edge but flat Live odds is an unfinished mispricing.

Pace Shape panel

Classifies the projected early-running mix:

The "Projected speed" line lists which horses are likely to contest the lead, ranked by Quirin Speed Points.

Race-to-race navigation

Top of every race page: ← Race {prev} / [Day card] / Race {next} → arrows let you walk the card without bouncing back through the day index.

3. The Trifecta Shape card

Each race shows the model's most-likely-to-cash trifecta structure with its hit probability and pari-mutuel-implied payout. Behind the scenes: 50,000 Plackett-Luce-sampled finish orders are evaluated against per-track calibrated pari-mutuel payout estimates. Every standard structure is scored:

The card surfaces the structure with the highest predicted hit probability and its informational payout — not a +EV claim. Click Other ticket structures to see alternatives with smaller stakes or different coverage profiles.

Race-shape badges

Tier Top-4 ITM concentration Meaning
TIGHT ≥ 80% Top 4 cover most of the ITM math. Chalky shape — favorites should hit, payouts small. Highest hit-rate tier — best chance to cash a small ticket, though still negative-EV after takeout.
COMPETITIVE 65 – 80% Multiple legitimate contenders, model less concentrated. Mid-spread ITM math. Model has no clear edge — every ticket structure on these races is steeply negative-EV. Skip, or treat as an entertainment bet and size accordingly.
WIDE OPEN < 65% No clear take — anyone could hit; longshots have room. Chaos races. Lottery shape — low hit rates, larger payouts when they cash. Entertainment-only.

These tiers describe the model's confidence on the race shape, not the +EV of any wager. Walk-forward backtests on 7,000+ historical races show pari-mutuel takeout (16% on WPS, 19–26% on trifecta) is too steep to consistently clear at this data scale. Use the model as a tipsheet input alongside your own handicapping — not as automated betting advice.

4. Reading the trifecta shape by race conditions

The simulator picks the structure with the highest hit probability per race. Understanding why it picked what it picked is the real value here — the structure tells you the shape of the race, which sharpens your own pick.

Small field (6–9 horses), top 4 cluster tightly

Typical structure: 4-horse box. Hit rate often >30%; the top 4 cover most realistic 1-2-3 outcomes. Chalky shape; payouts will be small.

Confident favorite (P(win) ≥ 35%), 8–12 horse field

Typical structure: 1×3×3 or 1×4×4 key. Concentrates on combos where the favorite wins. Tells you the model is locking on a single horse — useful even if you bet a different structure.

Wide-open race, several positive-Edge longshots

Typical structure: 5- or 6-horse box or 3×5×5 part wheel. Signals the model thinks chaos is plausible and longshots have a real claim.

Big field (14+ horses), Derby-shape

Top combos collectively hold little probability mass; even the densest structure typically hits <5% of the time. The model surfaces wide boxes to communicate that, not to recommend playing them.

A,B,C structure breakdown

Every race exposes A/B/C horse labels in the Trifecta Shape card. The labels are the model's handicapping read of the race:

5. Beyond trifecta

The same data underwrites other wagers. There's no auto-EV simulator for them yet, but the field-grid probabilities are directly usable.

6. The horse page

Click any horse name on a race page. You get three things:

7. Limits and disclaimers