Going deep: models for continuous-time within-play valuation of game outcomes in American football with tracking data

football
player-tracking data
Big Data Bowl
expected points
win probability
continuous-time expected value
deep learning
conditional density estimation
An introduction to continuous-time, within-play expected value in football demonstrated with ball-carrier models
Authors
Affiliations

Ronald Yurko

Department of Statistics & Data Science, Carnegie Mellon University

Francesca Matano

Department of Statistics & Data Science, Carnegie Mellon University

Lee F. Richardson

Department of Statistics & Data Science, Carnegie Mellon University

Nicholas Granered

Statistics, University of Pittsburgh

Taylor Pospisil

Department of Statistics & Data Science, Carnegie Mellon University

Konstantinos Pelechrinis

School of Computing and Information, University of Pittsburgh

Samuel L. Ventura

Department of Statistics & Data Science, Carnegie Mellon University

Published

January 13, 2020

JQAS arxiv

@article{yurko2020going,
  title={Going deep: models for continuous-time within-play valuation of game outcomes in American football with tracking data},
  author={Yurko, Ronald and Matano, Francesca and Richardson, Lee F and Granered, Nicholas and Pospisil, Taylor and Pelechrinis, Konstantinos and Ventura, Samuel L},
  journal={Journal of Quantitative Analysis in Sports},
  volume={16},
  number={2},
  pages={163--182},
  year={2020},
  publisher={De Gruyter}
}

Abstract

Continuous-time assessments of game outcomes in sports have become increasingly common in the last decade. In American football, only discrete-time estimates of play value were possible, since the most advanced public football datasets were recorded at the play-by-play level. While measures such as expected points and win probability are useful for evaluating football plays and game situations, there has been no research into how these values change throughout the course of a play. In this work, we make two main contributions: First, we introduce a general framework for continuous-time within-play valuation in the National Football League using player-tracking data. Our modular framework incorporates several modular sub-models, to easily incorporate recent work involving player tracking data in football. Second, we use a long short-term memory recurrent neural network to construct a ball-carrier model to estimate how many yards the ball-carrier is expected to gain from their current position, conditional on the locations and trajectories of the ball-carrier, their teammates and opponents. Additionally, we demonstrate an extension with conditional density estimation so that the expectation of any measure of play value can be calculated in continuous-time, which was never before possible at such a granular level.