Price Path Information and Sports Match Prediction in Polymarket
A Polymarket Football Case Study
Research Team
Abstract
Keywords
Project Snapshot
This project studies whether first-half price movements in Polymarket football markets contain useful information for predicting final match outcomes beyond the pre-match market price.
Using four price snapshots — kickoff, 15 minutes, 30 minutes, and halftime — the study compares a pre-match baseline model against several first-half information models, including aggregate price change, interval-level price paths, volatility-style absolute changes, and a halftime-price benchmark.
Events
Snapshots
CV
Metrics
Does the route to halftime matter, or is the halftime market price enough?
The study asks whether the sequence of first-half market updates adds predictive value beyond the pre-match price P0. A secondary question compares explicit path-based features with the halftime price level PHT, which may already summarize the market's accumulated first-half information.
Research Design
The dataset comes from Polymarket football markets. Each row represents one contract-side condition nested under a match event. Because multiple conditions from the same match are dependent, the evaluation groups all conditions from the same event into the same fold to avoid leakage.
Data and Features
927 conditions across 309 events; development sample 720 rows (train/validation 510/210); held-out test set 207 rows; mean positive outcome rate ~33% (three-way contract structure); kickoff date range 2025-08-15 to 2026-03-22. Four price snapshots per condition: P0, P15, P30, PHT.
- A — Baseline: P0 only.
- B — Halftime benchmark: PHT only (no path information).
- C — Aggregate path: P0 + total signed first-half change.
- F — Interval signed path: P0 + three sub-interval signed changes (0–15, 15–30, 30–HT).
- G — Interval absolute path: P0 + absolute interval changes as volatility proxies.
Model and Evaluation
- Logistic regression (sklearn, max_iter = 5 000); no standardisation of inputs
- Grouped chronological 5-fold CV: entire events ordered by kickoff and assigned to folds; train on past blocks, score later matches
- Primary metrics: mean and standard deviation of log loss and Brier score across folds; count of folds beating baseline A
- Final model refit on all development data (n = 720) and evaluated once on held-out test (n = 207)
- AUC on the test set reported as a secondary discrimination measure
Main Finding
In this sample, where the price ends up at halftime is more informative than how it got there.
Model Performance Comparison — Mean CV Log Loss
The main result is that first-half information consistently improves prediction over the pre-match baseline, but the halftime price level alone is the strongest parsimonious signal. The PHT-only benchmark achieves a mean CV log loss of 0.534, beating the best explicit path model (P0 + aggregate change, 0.540) by 0.006 on average. Models that decompose the first half into finer intervals do not recover this gap.
Key Findings
Discussion & Conclusion
Interpretation
The results suggest that Polymarket prices behave like probability-like signals that update as match information arrives—goals, momentum shifts, expected-goals narratives, red cards, and tactical adjustments all tend to move prices toward contracts that better reflect the evolving match state.
A useful economic intuition is that by halftime, market participants have often already incorporated most of the first-half news into the level PHT. Explicit path-based features may therefore repeat information that is already absorbed into that endpoint: under simple linear models with four intra-half snapshots, recording where the price lands at halftime can act as a parsimonious summary of first-half updating.
For parsimonious forecasting under the current data and model class, storing PHT may be sufficient relative to engineering finer-grained path coordinates. More complex path modeling may become valuable once richer microstructure data are available (higher-frequency quotes, order-flow, liquidity, or trading volumes).
These conclusions describe predictive associations under the stated evaluation protocol; they are not causal claims about which features “matter” in an economic sense—only about what forecasts best under this sample and model family.
Limitations
The findings should be interpreted conservatively. The sample covers 309 football events (927 contract-side conditions); performance gains are heterogeneous across grouped chronological folds (mean CV log-loss improvements versus P0 vary roughly 0.02–0.08 units across folds).
The extract does not include liquidity, trading volume, or order-flow data, so the study cannot distinguish a large price move on thin volume from the same move supported by deeper market activity—both appear as identical price updates in the feature set.
The evaluation uses logistic regression on raw bounded features; richer nonlinear or tree-based specifications were out of scope and might interact differently with path coordinates. The held-out test split can disagree slightly with cross-validated rankings for nearby models (for example, reported test log losses for some path specifications versus the halftime benchmark), underscoring single-split uncertainty.
Contracts follow a three-way structure with a positive-outcome rate near one-third; results may not transfer mechanically to other sports, leagues, or binary-only markets.
Future Extensions
Higher-frequency snapshots (for example one-minute marks or order-book midquotes) could test whether finer path detail recovers incremental signal once the endpoint alone stops being a sufficient statistic.
Liquidity- and volume-weighted price changes would treat a 10-cent move on thin depth differently from the same move with sustained participation—directly addressing the largest blind spot in the current feature set.
Nonlinear models (gradient boosting, shallow neural nets) could capture interactions between interval moves and baseline levels that logistic regression cannot express cleanly.
Larger multi-season and multi-league panels would stabilise fold-level estimates and support cross-sport generalisation—for instance whether a halftime-level-dominates-path pattern holds in basketball, tennis, or other timed formats.
Finally, combining path features with non-market signals (expected-goals models, team-level priors, injury news) would clarify whether market path information remains incremental once conditioning on observable fundamentals outside the order book.
Research Outputs
The GitHub code repository is shared on a case-by-case basis with collaborators who need access.
