Going deep: models for continuous-time within-play valuation of game outcomes in American football with tracking data

NFL

player tracking data

Big Data Bowl

expected points

win probability

continuous-time expected value

deep learning

conditional density estimation

An introduction to continuous-time, within-play expected value in football demonstrated with ball-carrier models

Authors

Affiliations

Ronald Yurko

Department of Statistics & Data Science, Carnegie Mellon University

Francesca Matano

Department of Statistics & Data Science, Carnegie Mellon University

Lee F. Richardson

Department of Statistics & Data Science, Carnegie Mellon University

Nicholas Granered

Statistics, University of Pittsburgh

Taylor Pospisil

Department of Statistics & Data Science, Carnegie Mellon University

Konstantinos Pelechrinis

School of Computing and Information, University of Pittsburgh

Samuel L. Ventura

Department of Statistics & Data Science, Carnegie Mellon University

Published

January 13, 2020

@article{yurko2020going,
  title={Going deep: models for continuous-time within-play valuation of game outcomes in American football with tracking data},
  author={Yurko, Ronald and Matano, Francesca and Richardson, Lee F and Granered, Nicholas and Pospisil, Taylor and Pelechrinis, Konstantinos and Ventura, Samuel L},
  journal={Journal of Quantitative Analysis in Sports},
  volume={16},
  number={2},
  pages={163--182},
  year={2020},
  publisher={De Gruyter}
}

Abstract

Continuous-time assessments of game outcomes in sports have become increasingly common in the last decade. In American football, only discrete-time estimates of play value were possible, since the most advanced public football datasets were recorded at the play-by-play level. While measures such as expected points and win probability are useful for evaluating football plays and game situations, there has been no research into how these values change throughout the course of a play. In this work, we make two main contributions: First, we introduce a general framework for continuous-time within-play valuation in the National Football League using player-tracking data. Our modular framework incorporates several modular sub-models, to easily incorporate recent work involving player tracking data in football. Second, we use a long short-term memory recurrent neural network to construct a ball-carrier model to estimate how many yards the ball-carrier is expected to gain from their current position, conditional on the locations and trajectories of the ball-carrier, their teammates and opponents. Additionally, we demonstrate an extension with conditional density estimation so that the expectation of any measure of play value can be calculated in continuous-time, which was never before possible at such a granular level.