Transparency

How It Works

Box Box uses a machine learning ensemble trained on F1 race data from 2022 to present. Predictions are regenerated after qualifying each Saturday using the actual grid positions.

2026 Season — New Formula

2026 introduces new aerodynamic and power unit regulations — essentially a new formula. The model now trains on 2026 race results as they come in, weighted 5× more heavily than 2025 data. Early-season predictions carry higher uncertainty until the new competitive order becomes clear.

Data Sources

Jolpica / Ergast

Historical

Historical race results, qualifying positions, driver standings, and constructor standings from 2023 onwards. Used for both training and 2026 race result ingestion.

api.jolpi.ca

OpenF1 API

Real-time

Real-time session data for the current season: practice lap times, qualifying results, race weather (temperature, rainfall). Primary source for 2026 data.

openf1.org

All API responses are cached locally. Qualifying data refreshes every 48 hours; race results are cached permanently. Calendar and standings refresh every 6 hours.

Feature Engineering

Each row in the training dataset represents one driver in one race. 9 features are currently computed per driver. Features with no historical baseline fall back to field averages. Championship standings will be added as a feature from Round 4 onwards once enough 2026 data exists.

High

Qualifying Position

Actual grid position from qualifying. If a driver crashed or was excluded, they're assigned last place.

High

Qualifying Gap to Pole %

Driver's best qualifying time as a % behind pole. More meaningful than raw lap time across different circuits.

Medium

Recent Form (5-race avg)

Average finishing position over the last 5 races. Captures current momentum regardless of which team they're on.

Medium

Team Race Pace Rank

Constructor's average finishing position over recent races, exponentially weighted so the latest races count more.

Medium

Practice Pace Delta

Driver's best FP3 (or FP2) lap vs the session fastest, as a percentage. Proxy for race weekend setup and raw pace.

Medium

Circuit Historical Average

Driver's average finishing position at this specific circuit across all training seasons.

Medium

Positions Gained Average

Average positions gained from grid to finish over recent races. Captures race-craft vs pure qualifying pace.

Medium

Championship Standing

Driver's normalised championship position and points from the prior round. Reflects current-season competitive order.

Medium

Career Win & Podium Rate

All-time win rate and podium rate from training data. Helps correctly rank drivers who recently changed teams (e.g., Hamilton to Ferrari).

Low–Med

Constructor Championship Pos.

Team's normalised constructor standing. Independent signal for overall car performance.

Low–Med

Recent Season Avg Position

Driver's average finish in their most recent complete season. Reflects current car, not career average.

Contextual

Race Weather

Is it raining (0/1) and track temperature at race start, from OpenF1. Changes tyre behaviour and overtaking rates significantly.

Low

Team Reliability Score

Constructor's DNF rate this season. Penalises teams with recent mechanical failures.

Low

Driver DNF Rate

Driver's DNF rate over last 10 races. Penalises error-prone or crash-prone drivers.

Low

Overtake Difficulty Index

Circuit-type index: street circuits score high (hard to overtake), high-speed circuits score low.

Model Architecture

XGBoost Win

Classifier

Win probability

Binary classifier (position = 1). Uses scale_pos_weight to handle class imbalance — only 1 in 20 drivers wins per race.

LightGBM Podium

Classifier

Top-3 probability

Gradient boosting with is_unbalance=True. Trained independently from the win model. Output is normalised so all drivers' podium probabilities sum to 3.0.

XGBoost Position

Regressor

Predicted finish

Regression model for full grid ordering. Resolves ties in probability output and determines predicted finishing rank.

Training: TimeSeriesSplit cross-validation (4 folds) ensures the model is never trained on future race data — no leakage. Season weights: 2026 races are weighted 5× more than 2025, which is weighted 2× more than 2024. Trained on ~1,800 driver-race rows across 2022–2026. Models are retrained as 2026 results accumulate.

Accuracy

Random baseline

Picking any driver from a 20-car grid at random wins 5% of the time.

Pole position baseline

Historically, the pole-sitter converts to a race win about 30% of the time.

~30%

CV winner accuracy

Measured via TimeSeriesSplit on 56 historical races. The model correctly picks the race winner in roughly 1 in 2 races — ~10× better than random.

48.2%

CV podium accuracy

Average overlap between predicted top-3 and actual top-3. Measured across the same 56 races.

64.3%

CV figures are from cross-validation on 2023–2025 data. Live 2026 accuracy is tracked on the home page as each race result comes in.

Known Limitations

Limited training data

The model trains on 2022–2026 race results (~1,800 driver-race rows across 4+ seasons). This is a small dataset for ML — confidence intervals are wide, and the model is most reliable when qualifying position already tells a clear story.

2026 is an entirely new formula

New aerodynamic and power unit regulations mean 2026 cars behave differently from anything in the 2023–2025 training data. The model now incorporates 2026 results as they arrive (weighted 5× more than 2025), but the first few races carry high uncertainty until the new competitive order stabilises.

New driver and team combinations

Hamilton moved to Ferrari, Antonelli replaced him at Mercedes, Cadillac is brand new. The model relies on career stats and constructor standings when team-specific history is thin — some drivers are inherently more uncertain than others.

Mechanical failures and incidents are unpredictable

The model cannot predict Q1 crashes, rear-axle failures, or race-day retirements. Verstappen starting P20 in Australia after a Q1 crash is a perfect example — no historical feature could foresee that outcome.

No tyre strategy or race-day tactics

Compound choice at the start, undercut windows, safety car timing, and pit strategy calls are major race outcome drivers that aren't available pre-race and aren't modelled.

Qualifying fallback for missing data

If qualifying data isn't available from the API yet (pre-qualifying weekend), the model uses a driver's historical average grid position. This is less accurate than actual qualifying results and can significantly skew predictions.