Price Path Information and Sports Match Prediction in Polymarket

A Polymarket Football Case Study

Advanced Research Project University Elite Scholars 2026

Research Team

Junze Yuan

Supervisors: Robert Zhang, Chanjuan Pan

Duration 6 weeks

Department VC Investment Research Team

Program Track Camellia Elite Scholars Program University Track

Research Scope Sports Matches

Market Data Aug 2025 – Mar 2026 · 309 matches

Abstract

This case study asks whether first-half price path information in Polymarket football markets improves probabilistic forecasts of match outcomes beyond the pre-match price P₀. A central secondary question is whether path-based features add value beyond simply observing the halftime level P_HT as a summary of first-half updating. Path features do improve on P₀ on all five cross-validation folds, but P_HT alone achieves the lowest mean CV log loss (0.534) of any specification tested—beating the best explicit path model (P₀ plus aggregate first-half change, CV log loss 0.540) by 0.006 on average. In other words, where the price ends up at halftime is more informative than how it got there. Interval-based path families do not outperform this benchmark on average. Results are predictive (not causal); the sample is modest (n = 927 conditions, 309 events) and fold-level gains are heterogeneous (gain range roughly 0.02–0.08 log loss units across folds).

Keywords

Prediction Markets Market-Implied Probabilities Price Dynamics Information Aggregation Probabilistic Forecasting Sports Prediction Markets Polymarket

Project Snapshot

This project studies whether first-half price movements in Polymarket football markets contain useful information for predicting final match outcomes beyond the pre-match market price.

Using four price snapshots — kickoff, 15 minutes, 30 minutes, and halftime — the study compares a pre-match baseline model against several first-half information models, including aggregate price change, interval-level price paths, volatility-style absolute changes, and a halftime-price benchmark.

927

Conditions

309

Football
Events

Price
Snapshots

5-Fold

Grouped
CV

Primary
Metrics

Does the route to halftime matter, or is the halftime market price enough?

The study asks whether the sequence of first-half market updates adds predictive value beyond the pre-match price P₀. A secondary question compares explicit path-based features with the halftime price level P_HT, which may already summarize the market's accumulated first-half information.

Research Design

The dataset comes from Polymarket football markets. Each row represents one contract-side condition nested under a match event. Because multiple conditions from the same match are dependent, the evaluation groups all conditions from the same event into the same fold to avoid leakage.

P₀

Kickoff

P₁₅

15′

15 min

P₃₀

30′

30 min

P_HT

Halftime

Outcome

Data and Features

Sample structure and feature-family definitions

927 conditions across 309 events; development sample 720 rows (train/validation 510/210); held-out test set 207 rows; mean positive outcome rate ~33% (three-way contract structure); kickoff date range 2025-08-15 to 2026-03-22. Four price snapshots per condition: P₀, P₁₅, P₃₀, P_HT.

Feature families

A — Baseline: P₀ only.
B — Halftime benchmark: P_HT only (no path information).
C — Aggregate path: P₀ + total signed first-half change.
F — Interval signed path: P₀ + three sub-interval signed changes (0–15, 15–30, 30–HT).
G — Interval absolute path: P₀ + absolute interval changes as volatility proxies.

Model and Evaluation

Logistic regression with grouped chronological cross-validation

Estimation

Logistic regression (sklearn, max_iter = 5 000); no standardisation of inputs

Evaluation protocol

Grouped chronological 5-fold CV: entire events ordered by kickoff and assigned to folds; train on past blocks, score later matches
Primary metrics: mean and standard deviation of log loss and Brier score across folds; count of folds beating baseline A
Final model refit on all development data (n = 720) and evaluated once on held-out test (n = 207)
AUC on the test set reported as a secondary discrimination measure

Main Finding

In this sample, where the price ends up at halftime is more informative than how it got there.

Model Performance Comparison — Mean CV Log Loss

Best performer (P_HT only)

Other models

The main result is that first-half information consistently improves prediction over the pre-match baseline, but the halftime price level alone is the strongest parsimonious signal. The P_HT-only benchmark achieves a mean CV log loss of 0.534, beating the best explicit path model (P₀ + aggregate change, 0.540) by 0.006 on average. Models that decompose the first half into finer intervals do not recover this gap.

Key Findings

First-half prices contain signal

All first-half information models beat the P₀-only baseline across all five cross-validation folds, showing that in-match market updates contain useful predictive information.

Halftime price is a compact summary

The P_HT-only benchmark achieves the lowest mean cross-validated log loss (0.534), suggesting that the halftime price acts as a compact summary of first-half information.

More path detail does not necessarily help

Decomposing the first half into interval-level price changes does not outperform the simpler aggregate change model. Absolute interval changes, used as volatility proxies, add little over the baseline.

Calibration improves mainly in the mid-range

Reliability analysis shows that P_HT and P₀ + aggregate change improve mid-range calibration relative to P₀, while tail probabilities remain sparse and less stable.

Discussion & Conclusion

Interpretation

The results suggest that Polymarket prices behave like probability-like signals that update as match information arrives—goals, momentum shifts, expected-goals narratives, red cards, and tactical adjustments all tend to move prices toward contracts that better reflect the evolving match state.

A useful economic intuition is that by halftime, market participants have often already incorporated most of the first-half news into the level P_HT. Explicit path-based features may therefore repeat information that is already absorbed into that endpoint: under simple linear models with four intra-half snapshots, recording where the price lands at halftime can act as a parsimonious summary of first-half updating.

For parsimonious forecasting under the current data and model class, storing P_HT may be sufficient relative to engineering finer-grained path coordinates. More complex path modeling may become valuable once richer microstructure data are available (higher-frequency quotes, order-flow, liquidity, or trading volumes).

These conclusions describe predictive associations under the stated evaluation protocol; they are not causal claims about which features “matter” in an economic sense—only about what forecasts best under this sample and model family.

Limitations

The findings should be interpreted conservatively. The sample covers 309 football events (927 contract-side conditions); performance gains are heterogeneous across grouped chronological folds (mean CV log-loss improvements versus P₀ vary roughly 0.02–0.08 units across folds).

The extract does not include liquidity, trading volume, or order-flow data, so the study cannot distinguish a large price move on thin volume from the same move supported by deeper market activity—both appear as identical price updates in the feature set.

The evaluation uses logistic regression on raw bounded features; richer nonlinear or tree-based specifications were out of scope and might interact differently with path coordinates. The held-out test split can disagree slightly with cross-validated rankings for nearby models (for example, reported test log losses for some path specifications versus the halftime benchmark), underscoring single-split uncertainty.

Contracts follow a three-way structure with a positive-outcome rate near one-third; results may not transfer mechanically to other sports, leagues, or binary-only markets.

Future Extensions

Higher-frequency snapshots (for example one-minute marks or order-book midquotes) could test whether finer path detail recovers incremental signal once the endpoint alone stops being a sufficient statistic.

Liquidity- and volume-weighted price changes would treat a 10-cent move on thin depth differently from the same move with sustained participation—directly addressing the largest blind spot in the current feature set.

Nonlinear models (gradient boosting, shallow neural nets) could capture interactions between interval moves and baseline levels that logistic regression cannot express cleanly.

Larger multi-season and multi-league panels would stabilise fold-level estimates and support cross-sport generalisation—for instance whether a halftime-level-dominates-path pattern holds in basketball, tennis, or other timed formats.

Finally, combining path features with non-market signals (expected-goals models, team-level priors, injury news) would clarify whether market path information remains incremental once conditioning on observable fundamentals outside the order book.

Research Outputs

Full research report — Junze Yuan (PDF)

The GitHub code repository is shared on a case-by-case basis with collaborators who need access.

Back to Publications