RStudio Sports Analytics Meetup: NFL Big Data Bowl 2022 Winners discuss the Math behind the Path
Transcript
This transcript was generated automatically and may contain errors.
Hi everybody, it's so nice to see you back at today's RStudio community meetup. If we haven't had a chance to meet yet, and this is your first meetup, I'm Rachel, calling in from Boston today. It's so nice to meet you. Feel free to introduce yourselves through the chat window and say hello as well. I love getting to see where people are calling in from all over the world, and to also see people sharing helpful resources with each other there in the chat too. I host these meetups every Tuesday at noon Eastern Time, and they are all recorded and shared on the RStudio YouTube channel, if you ever want to go back and check out past sessions too.
If this is your first meetup, this is a friendly meetup environment for teams to share different use cases with each other and share lessons learned together. We're all dedicated to making this an inclusive and open environment for everyone, no matter your experience, industry, or background.
For a heads up about next week, we will have Julia Silge and Isabel Zimmerman with us here next Tuesday to talk about MLOps with vetiver in Python and R.
Today, I am so excited to have you all here for our sports analytics meetup with the NFL Big Data Bowl 2022 winners, led by Robyn, Brendan, Ryker, and Elijah, all here with me today. I've seen a lot on Twitter and LinkedIn about their team, and I'm excited for us all to be able to ask them questions here today as well.
Robyn, Brendan, Ryker, and Elijah are all graduate students in statistics at Simon Fraser University in Vancouver, Canada. They each have a passion for sports analytics and enjoy applying their stats knowledge to the game. First off, I want to say congratulations. And thank you so much for being here to share your experience with us today.
Introduction and team background
All right. Hi, everyone, and welcome to our presentation. We're extremely excited to be able to share our work with you all today. So today we'll be presenting on our 2022 NFL Big Data Bowl Grand Champion winning project. We'll provide details on the math behind the path and introduce some upgrades we've made since the conclusion of the competition.
So SFU has been a sports analytics powerhouse for years. Professors such as Tim Swartz and Luke Bornn have attracted students who have a passion for statistics and sports. Together they have revolutionized the department and have helped students land jobs with professional teams such as the Seattle Kraken, LA Rams, Sacramento Kings, Vancouver Canucks, and many, many more. SFU's Big Data Bowl success started in 2019, when a team of students won the college competition, and they were again finalists in 2021. Their work really inspired us to continue the SFU tradition, and their mentorship helped us throughout the entire process.
So before we get into the project, let's meet the team. My name is Ryker. I am a current Master of Statistics student at SFU. I formerly interned with the Vancouver Canucks of the NHL, and I'm currently an intern with the Detroit Lions of the NFL.
Hi, everyone. My name is Robyn. I'm a PhD candidate at Simon Fraser University, and I'm focusing on revolutionizing the game of curling with analytics and machine learning tools. In addition to the Big Data Bowl, I was part of the winning team of the Big Data Cup this year, where we used tracking data to evaluate passing in women's Olympic hockey.
Hi, everyone. My name is Brendan. I'm currently a Masters student at Simon Fraser University, and a data analyst intern at Zelus Analytics. I also won the Big Data Cup the year before Robyn, in the 2021 competition, where I worked with a few others on building out an expected possession value model for hockey, evaluating every single event in a play.
And hi, my name is Elijah. I'm also a Masters of Statistics student at SFU, and I'm currently an intern with the Pittsburgh Pirates.
Big Data Bowl overview
All right, for those of you who are unfamiliar, here's a quick overview of last year's Big Data Bowl competition. This was the competition's fourth year, and the subject was chosen to be special teams. Special teams includes plays like punt returns, kick returns, field goals, and extra points. We were provided data from three NFL seasons, including Next Gen Stats tracking data, PFF advanced scouting data, and other basic stats from the NFL. In total, there was $100,000 in prizes awarded to three college and five open finalists. And then we were lucky enough to be named the grand champions out of those eight finalists.
So now for a bit of inspiration behind the project. This is an entry into this year's 2022 Big Data Bowl by Zach Rogers, and it gives good motivation for our work. It puts you in the shoes of the punt returner. If you can imagine that you are the punt returner, what should you look for after receiving the punt? Our project aimed to answer a couple of main questions. Firstly, are returners making good decisions to increase yards? Are they finding optimal gaps and assessing tackle risk? And are they using their teammates and following their blocks? Secondly, by evaluating their decisions, we wanted to determine who the best returners are. Could we rank returners, which could then allow coaches to determine the best returner for a given situation?
Building the optimal path
Cool, thank you. So now that we know what the punt returner needs to consider, and what our goals are, we want to be able to show the best path to the end zone, like we see in green on the figure here. The observed path is shown in black, the returner and his team are shown in yellow and gray, while the kick team players are shown in pink.
So the first step in building our optimal path is to find gaps in the kick team's coverage. For this we use the Delaunay triangulation. This is a mesh-like structure that connects the points such that all players in the hull are connected to form triangles with no lines overlapping. This is shown with the purple lines between any two kick team players. And to account for the punt returner going around all the players and towards the sidelines, we add new gaps that connect the players on the outside of the hull to the boundaries of the zone of interest. These are shown by the vertical and horizontal purple lines here. Some of the interesting functions we used are tri.mesh and neighbours from the tripack package.
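The triangulation step can be sketched in a few lines. This is a hedged illustration rather than the team's actual code: it uses made-up kick-team coordinates and scipy's Delaunay (the team worked in R with tripack), and it treats each triangulation edge as a candidate window between two defenders, without the extra boundary gaps described above.

```python
# Sketch: find coverage "windows" as edges of a Delaunay triangulation.
# Player coordinates are invented for illustration.
import numpy as np
from scipy.spatial import Delaunay

# Hypothetical (x, y) locations of six kick-team players, in yards
kick_team = np.array([
    [30.0, 10.0], [36.0, 25.0], [40.0, 40.0],
    [50.0, 15.0], [54.0, 30.0], [60.0, 45.0],
])

tri = Delaunay(kick_team)

# Each edge of the triangulation is a candidate window between two defenders.
edges = set()
for simplex in tri.simplices:            # each simplex is a triangle (3 vertex indices)
    for i in range(3):
        a, b = sorted((simplex[i], simplex[(i + 1) % 3]))
        edges.add((int(a), int(b)))

print(f"{len(edges)} windows between defenders")
for a, b in sorted(edges):
    midpoint = (kick_team[a] + kick_team[b]) / 2
    print(a, b, midpoint)
```

In a full version you would also add the boundary windows connecting hull players to the sideline and the zone of interest, as the talk describes.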
Penalized expected arrival time
And now that we have these windows defined that the punt returner can move through on his way towards the end zone, we want to move towards quantifying the pressure along each of these windows. And we do this using our penalized expected arrival time algorithm. To give a bit of visual intuition behind this algorithm, if you look at that heat map in the top left of the plot here, you can see that towards the sideline, it's colored green, indicating that there's a lot of open space there, and it's going to take the members of the kicking team a long time to get to that position. Whereas as you move closer and closer to where all the players are, you see that it goes from green to yellow to red, indicating that there's more pressure and less time for the punt returner to move through that area.
And to kind of formalize this a little bit more, the goal of this algorithm is to obtain the minimum time we expect it to take a member of the kicking team in red here to reach a target location as indicated by the X here. And we're going to take a bunch of different target locations. But let's just focus on this single spot right here.
So with that target location, we're going to look at every kicking team player on the field or every player in red, that's trying to stop the punt returner from moving forward. And let's say we just look at number 97, for example, to start off here. So with number 97, what we're going to do is we're going to create a straight line connecting him to that target location. And once we do that, we're going to find all blockers that we can project onto this straight line at a 90 degree angle. And that's going to sort of be all of the blockers that can really reasonably impede his path from getting to that target location. So here we have number 82 and number 19. And for these blockers, we're going to take two different measurements. So first, we're going to take the length of that projection from 82 to the straight line, as indicated by D1 here. And that's going to be a measure of the lateral distance from the kicking team player. And then we're going to take the distance from number 97 to that projection. And that's going to be indicated by D2 here. And that's sort of the forward distance from the kicking team player to the blocker.
And with these two measurements, we're going to attribute time penalties for each of the blockers using what we call a projected Gaussian kernel. And that's characterized by the equation right here and the heat map below. And sort of what we're trying to do here is to create an equation where we're accurately capturing how much a blocker is likely to impede a member of the kicking team's path to that target location. Right, so you can see here that when you're very close and right in front of number 97, you're going to impose a very high penalty. Whereas as we move further and further out, you can be a bit more lateral from the kicking team player or from number 97, but you have less of a penalty overall, right? Because if you're, say, right in front of number 97, like one foot forward from him, but five feet to the left, then you're less likely to actually catch him or impede him than if you're, say, 10 feet forward from him and five feet to the left.
And once we assign these blocker penalties, we're going to simply add them all up. So with number 19 and number 82, they impose a 0.54 and 0.31 second penalty on number 97, respectively, giving us a total penalty of 0.85 seconds by these two blockers. Then we're going to calculate the expected time to the target location for number 97, given that he runs at a straight-line speed of seven yards per second, and that's going to be 2.34 seconds. Then we add these two values up. So the time that we expect him to take to get to that target location unimpeded is 2.34, and the blocker penalty that we expect is 0.85. And that's going to give us 3.19 seconds for number 97 to reach that target. And we're going to repeat this process over and over for every member of the kicking team.
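The per-defender calculation above can be sketched as follows. This is a minimal illustration under stated assumptions: the 7 yards-per-second straight-line speed comes from the talk, but the kernel constants (`sigma`, `max_penalty`) and the exact form of the projected Gaussian kernel are placeholders, since the talk doesn't give the fitted equation.

```python
# Sketch of the penalized expected arrival time (PEAT) for one defender.
# Kernel constants are illustrative assumptions, not the team's fitted values.
import numpy as np

SPEED = 7.0  # assumed straight-line speed of a kick-team player, yards/second

def peat(defender, target, blockers, sigma=2.0, max_penalty=1.0):
    """Expected arrival time of one defender at `target`, penalized by blockers.

    d2 = forward distance from the defender along the defender->target line
    d1 = lateral (perpendicular) distance of a blocker from that line
    Each blocker projectable onto the segment adds a Gaussian-shaped time
    penalty that decays with d1 and d2 (a stand-in for the projected kernel).
    """
    defender, target = np.asarray(defender, float), np.asarray(target, float)
    line = target - defender
    length = np.linalg.norm(line)
    unit = line / length
    normal = np.array([-unit[1], unit[0]])

    penalty = 0.0
    for b in np.asarray(blockers, float):
        rel = b - defender
        d2 = rel @ unit          # forward distance along the line
        d1 = abs(rel @ normal)   # lateral distance from the line
        if 0 <= d2 <= length:    # only blockers we can project onto the segment
            penalty += max_penalty * np.exp(-(d1 ** 2) / (2 * sigma ** 2)) \
                                   * np.exp(-(d2 ** 2) / (2 * (length / 2) ** 2))

    return length / SPEED + penalty

# Toy example: one defender 14 yards from the target, two blockers near the line
t = peat(defender=(0.0, 0.0), target=(14.0, 0.0),
         blockers=[(5.0, 1.0), (9.0, -3.0)])
print(round(t, 2))
```

The unimpeded time here is 14 / 7 = 2 seconds, and the two blockers add fractional-second penalties on top, mirroring the 2.34 + 0.85 example from the talk.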
And on the left here, you can see we have a little table with the top few players. And number 17 is the quickest to get to that target location, at 0.92 seconds in this case. And that's going to be our measure of the time we expect a member of the kicking team to take to get to that location, or our penalized expected arrival time. And we're going to repeat this process over and over for up to 20 evenly spaced points along each of these windows.
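Sampling the evenly spaced target points along a window, and taking the fastest defender at each point, might look like this. For brevity this sketch uses a plain, unpenalized distance-over-speed arrival time rather than the full penalized version above, and all coordinates are invented.

```python
# Sketch: up to 20 evenly spaced targets along a window p->q, and for each
# target the minimum (unpenalized, for brevity) arrival time over defenders.
import numpy as np

SPEED = 7.0  # assumed yards/second

def window_arrival_times(p, q, defenders, n_points=20):
    ts = np.linspace(0.0, 1.0, n_points)
    points = np.outer(1 - ts, p) + np.outer(ts, q)   # evenly spaced along p->q
    defenders = np.asarray(defenders, float)
    # distance from every defender to every sample point, then min per point
    dists = np.linalg.norm(points[:, None, :] - defenders[None, :, :], axis=2)
    return dists.min(axis=1) / SPEED

times = window_arrival_times(p=np.array([20.0, 5.0]), q=np.array([20.0, 45.0]),
                             defenders=[(25.0, 10.0), (30.0, 30.0)])
print(times.shape, round(float(times.min()), 2))
```

Each entry of `times` plays the role of one row of the arrival-time table in the talk: the quickest defender's expected time to that point along the window.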
A* search algorithm and optimal path
Great, so now that we have our mesh structure and a weighting system for it, we need to find a way to get the punt returner to the end zone. And to do this, we use the A* search algorithm.
So it works a little bit like a GPS, and just finds the safest route while avoiding traffic. Our algorithm looks at a heuristic function, which defines how many yards are remaining from the target to the end zone, and a cost function. This cost function describes the distance between the punt returner and the target, the danger associated with the target point, as well as how risky of a target the punt returner may be willing to consider traveling to. The algorithm explores various possible routes, abandoning a route whenever the total of the heuristic and cost functions becomes too large.
We can set a few constants to help us with the trade off between risk and reward for the punt returner. So today, we're going to really focus on the alpha term and see how changing it might adjust this trade off between risk and reward. So for our submission, if you have the chance to look at it, we used alpha of 0.5. So we consider this to be a good mixture of risk and reward for a punt returner wanting to gain yards while still avoiding the tackle, avoiding injury, stuff like that. If we were to change alpha to be one, the path considers only yards remaining to the end zone with no care of the danger of being tackled. So this would mean that you're going basically straight to the end zone and hopefully that you can push everyone out of the way. And then setting alpha to zero finds the safest path available without considering if it's really going to bring you closer to the end zone. So this typically will send you along the sidelines and push you the rest of the way.
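Here is a toy version of the search, offered as a hedged sketch: a textbook A* on a small grid rather than the team's triangulation mesh, with a made-up danger field and an `alpha` knob that blends progress toward the end zone against danger, mirroring the alpha = 0 / 0.5 / 1 behavior described above.

```python
# Toy A* with a risk/reward trade-off. Grid, danger values, and cost form
# are illustrative assumptions, not the team's exact formulation.
import heapq

def a_star(start, goal, danger, alpha=0.5):
    rows, cols = len(danger), len(danger[0])

    def heuristic(node):                       # "yards remaining" analogue
        return abs(goal[0] - node[0]) + abs(goal[1] - node[1])

    frontier = [(alpha * heuristic(start), 0.0, start, [start])]
    seen = set()
    while frontier:
        _, g, node, path = heapq.heappop(frontier)
        if node == goal:
            return path
        if node in seen:
            continue
        seen.add(node)
        r, c = node
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if 0 <= nr < rows and 0 <= nc < cols and (nr, nc) not in seen:
                # step cost blends distance (reward) against danger (risk)
                ng = g + alpha * 1.0 + (1 - alpha) * danger[nr][nc]
                heapq.heappush(frontier, (ng + alpha * heuristic((nr, nc)),
                                          ng, (nr, nc), path + [(nr, nc)]))
    return None

# 3x4 grid: heavy danger blocking the direct route along the top row
danger = [
    [0.0, 9.0, 9.0, 0.0],
    [0.0, 9.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
]
safe = a_star((0, 0), (0, 3), danger, alpha=0.0)    # detours around the danger
direct = a_star((0, 0), (0, 3), danger, alpha=1.0)  # heads straight for the goal
print(len(safe), len(direct))
```

With alpha = 1 the path ignores danger and takes the shortest route; with alpha = 0 it finds a longer route through only zero-danger cells, just as the talk describes for the sideline-hugging safe path.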
Fréchet path deviation
And now that we have this optimal path defined, let's take a look at a case here where we set alpha equal to 0.5. And we want to compare the optimal path versus the actual observed path for the punt returner. And to do that, we look over the next five yards for each of the paths. So as you can see on the top right here in the visual, we have sort of the highlighted black and green lines over the next five yards. And we want to calculate the Fréchet distance over this stretch. And Fréchet distance is a measure of difference between two paths, where intuitively you could think of it as one path being a man and the other path being a dog. And the Fréchet distance is the minimum length of a leash you need to traverse from one end of the paths to the other end.
You can get a bit of visual intuition behind this here. With the blue lines, you can see sort of the steps along each of these paths. And the teal line there is another step, but it's the step where the length is maximized. And that's sort of the minimum length of a leash that you would need to traverse from one end of the paths to the other.
And with this Fréchet distance, we also have to take into account a couple of other things, just in case there's less than five yards remaining in either the observed or the optimal path. If both paths are more than five yards, then we simply calculate the Fréchet distance, and that's our measure of deviation from the optimal path in that moment. But if either of the paths is between one and five yards, then we multiply the Fréchet distance by five and divide it by the minimum length of the two paths, to scale up and account for the fact that we aren't really seeing a full five-yard movement. And if there's less than one yard on either of the paths, then we omit the calculation, since we don't really have enough information to see how the punt returner was moving relative to the optimal path.
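The leash analogy corresponds to the discrete Fréchet distance, which has a standard dynamic-programming form; the sketch below implements it along with the under-five-yards rescaling rule just described. The example paths are invented.

```python
# Discrete Fréchet distance (the "leash length") between two sampled paths,
# plus the <5-yard rescaling rule from the talk.
import numpy as np

def discrete_frechet(P, Q):
    P, Q = np.asarray(P, float), np.asarray(Q, float)
    n, m = len(P), len(Q)
    d = np.linalg.norm(P[:, None, :] - Q[None, :, :], axis=2)
    F = np.full((n, m), np.inf)
    F[0, 0] = d[0, 0]
    for i in range(n):
        for j in range(m):
            if i == 0 and j == 0:
                continue
            prev = min(F[i - 1, j] if i > 0 else np.inf,
                       F[i, j - 1] if j > 0 else np.inf,
                       F[i - 1, j - 1] if i and j else np.inf)
            F[i, j] = max(prev, d[i, j])          # leash must cover this pair too
    return F[-1, -1]

def path_deviation(observed, optimal, obs_len, opt_len):
    """Fréchet deviation with rescaling: between 1 and 5 yards of movement,
    scale by 5 / min(length); under 1 yard, omit the calculation."""
    shortest = min(obs_len, opt_len)
    if shortest < 1.0:
        return None                               # not enough movement to compare
    f = discrete_frechet(observed, optimal)
    return f if shortest >= 5.0 else f * 5.0 / shortest

# Straight observed path vs. an optimal path that cuts 2 yards upfield
observed = [(0, 0), (1, 0), (2, 0), (3, 0), (4, 0), (5, 0)]
optimal  = [(0, 0), (1, 1), (2, 2), (3, 2), (4, 2), (5, 2)]
print(round(path_deviation(observed, optimal, obs_len=5.0, opt_len=5.0), 2))  # → 2.0
```

Here the leash never needs to be longer than the 2-yard gap at the far ends of the paths, so the deviation is 2.0 yards with no rescaling needed.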
Key R functions used
And yeah, with this optimal path algorithm, we also just wanted to give a quick shout out to some of the key R functions that really helped us in this process. So first up, nest and map are super helpful for making this whole process very tidy and clean, and for being able to apply large-scale analysis using very complex operations within just a single data frame. So nest essentially allows you to group the data, say by each frame of the play, where a frame is simply a moment in time. But you have 20-odd rows of player locations. If you group by the play or by the frame, then you can nest the data into a column of data frames, which includes all of the player locations. So you have one row per frame, but within that row, you have sort of a data frame of all of the player locations. Then with the map function, you can start applying different functions to that data.
So you can apply functions to find the windows for the punt returner to move through, or calculate the penalized expected arrival time, or find the optimal path with the A* algorithm, using this map function, all within one data frame. And future_map is just an evolution beyond that, where you can parallelize the code so it runs a bit quicker. And dplyr, of course, was super helpful for us, just with building out sort of a tidyverse pipeline with key functions like mutate, select, and filter, and also the pipe operator, making the code nice and sleek. Then of course, all of our plots here are made with ggplot2 and gganimate, and those packages are just super awesome for making very nice, aesthetically pleasing plots with very little effort.
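The nest/map pattern described above can be sketched in Python with pandas as a stand-in (the team's actual pipeline used tidyr's nest, purrr's map and future_map, and dplyr in R): one element per frame, each nesting that frame's player rows, with a per-frame function mapped over them. The data here are randomly generated.

```python
# Sketch of the nest/map pattern with pandas, standing in for tidyr/purrr.
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
tracking = pd.DataFrame({
    "frame": np.repeat([1, 2, 3], 22),     # 3 frames x 22 player rows
    "player": np.tile(np.arange(22), 3),
    "x": rng.uniform(0, 120, 66),          # field is 120 x 53.3 yards
    "y": rng.uniform(0, 53.3, 66),
})

# "nest": one entry per frame, each holding that frame's player locations
nested = pd.Series({frame: grp[["player", "x", "y"]].reset_index(drop=True)
                    for frame, grp in tracking.groupby("frame")})

# "map": apply a per-frame function over the nested data frames
# (a real pipeline would compute windows, arrival times, or the optimal path)
mean_x = nested.map(lambda df: df["x"].mean())
print(mean_x)
```

The same shape as the R workflow: one row per frame on the outside, a full per-frame data frame on the inside, and arbitrary functions mapped across frames.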
Q&A: graduate programs and methodology
Yeah, I can see a question that came over from Slido. And just a reminder, a reminder that you can ask questions over on Slido and I'll just put up the link right now on the screen. So you can ask questions there anonymously. Or you can ask through YouTube Live too. But one of the questions that came up was, I'll put on the screen, do you think being in a general statistics graduate program was beneficial to your work as opposed to more specific like data science or sports analytics?
That's a good question. There might be debates there. I think, so we'll talk a little bit more about this later, but for our work, a key aspect in building a team is definitely to have a good variety of backgrounds. So having someone who is a statistics graduate, to really understand the models and what's going on behind them, is great. And if you could get a computer scientist on there, they're super efficient and can write code probably a lot faster than a lot of us stats folk. Yeah, it's definitely good to always have someone on your team with a stats background to understand what's going on. But a mixture of various backgrounds and departments and origin stories is always great to see.
That's great. Thank you. One other question I see over on Slido, I'll put on the screen here, is: did you calculate PEAT at just a single point in time, at catch, or at multiple points during the return?
Yeah, that's a great question. So yeah, we recalculated the PEAT basically at every single frame of the play. And then for our Big Data Bowl submission, we were taking the median over the entire play, just because there are a few frames where, say, a punt returner is running a path that's maybe not the optimal path by our algorithm, but is still a very good option. After a few frames, the optimal path should adjust to that, but during the time when they're running a path different from the one the algorithm suggests, it has quite a big Fréchet deviation. So we took the median to account for that, and to adjust for our naivety, in that there could be two different but both very good options. So that's what we were doing for the Big Data Bowl. But we've also played around with metrics looking at it just at the catch, taking the mean, looking at, say, the frame with the first forward movement, and different metrics like that. But yeah, basically, we can calculate it over every frame.
Shiny app demo
OK, so to make our work a little bit more interactive, we built a Shiny app so that everyone here today can have a look at our optimal path on any return that you might want to see throughout all the years of data that we have. So as the user, you can select the season, the returner, the play, and set the risk alpha that you want the punt returner to consider when making his optimal path. You can watch the return play out by hitting the play button in the top right corner, or you can use the slider bar to look at any specific point along the return. You're given a description of what the return looks like and what it actually achieved. And then you have the option to save the image of the individual frame that you're looking at, save a GIF of the entire return, or download the data behind the return if you want to play with it yourself.
So maybe don't all try to log on right away, because we want to make sure the app stays responsive for you guys. We can start off by picking a year, let's pick 2020, maybe. And then, if anyone in the comments has a favorite punt returner that you might want to see from the 2020 season, feel free to comment now.
So any other questions, Rachel, to fill our next five seconds?
One question that I had was, like thinking about people getting started with the Big Data Bowl for 2023. And you just have all this data. How do you narrow down what you want to focus on?
Yeah, so typically, so we're not sure what they're going to announce this year for the Big Data Bowl, but I think they're leaning towards having a general topic each year. So for us, it was punt returns. This year, who knows what it might be? But yeah, definitely start by just looking at the data, seeing what you can find, and playing around. Make some histograms, make some pie charts, anything you can do just to understand the data. It's going to be a lot of data, and really hard to run. So often, just look at a couple plays, or one good play, and make sure everything works for that play before you scale it up, because for some of our stuff, I think Ryker was running things overnight. And then if we had to find one error, you'd fix it, and then you'd go and run everything again overnight. So you don't want to start there; you want to start small, maybe with one good play, and see what you can do.
Yeah, and definitely, you know, we had kind of a broad topic with special teams; there was a lot of different ways we could go with it. But figuring out what's going to be useful as well in a football sense. And then, like Robyn said, kind of getting a general idea how the data looks, what's going to be possible, and then kind of running with it from there.
Yeah, we definitely had a lot of other ideas, like are we going to do, you know, field goals, are we going to do this and that and then it's just kind of trial and error and getting close to what you think is the best idea.
Working with tracking data is definitely a great experience. Even if you're not ready to enter the Big Data Bowl, and maybe you don't have your star lineup ready, play around with the data and get used to working with tracking data. If you want to start a career in sports analytics, just getting those tools in your toolkit is definitely super useful.
McCole Hardman case study
So here, we're going to show Mecole Hardman's week 12 return against Tampa Bay, where we have an alpha of zero, so be as safe as possible; 0.5, kind of a good mix of risk and reward; and one, which basically just says run straight to the end zone and plow through everyone if you need to. So in our Big Data Bowl finals presentation, we concluded that Mecole Hardman was a fairly conservative player. And we can use our app to further analyze just how conservative he is.
So if you play the GIFs, the optimal path updates at each frame, constantly adapting to the situation that Mecole Hardman is facing and suggesting a new optimal path for each situation. And overall, we think Mecole Hardman could use this tool to improve his returns, because a lot of his play is probably closest to a risk level of zero, while, you know, we kind of want to gain yards as we do a return. But it depends on the scenario, right? He's a valuable player in his other role on the team, so you kind of want to keep him safe at the same time. Things like that could influence how you might determine what alpha you might want to use.
Team analysis and post-submission work
Yeah, okay, so I'm going to take over now and talk about what we ended up doing after the initial submission. Go to the next slide. Yeah, so the task that the NFL gave us was to look at a specific team and try to find out how we could use the work we did to improve their results in the coming seasons. So we picked Kansas City. As you can see, in 2018 they were really highly ranked in terms of average punt return yardage, and that kind of fell off in 2019 and 2020, which coincided with the returner being changed from Tyreek Hill to Mecole Hardman, who we just showed. And so after watching some of Mecole Hardman's punt returns, we chose the Chiefs as a team we thought we could improve.
And we can see why in this plot. So if you look at the Mecole Hardman plot on the far right, he often kind of just goes towards the sidelines. He's not cutting up and running up the middle of the field, where there are obviously going to be more defenders, but also more blockers, and there might be more space to maneuver. And we can contrast him with the other three punt returners shown, who rank really highly in our expected return yardage metric, or just on average punt return yardage. They're much stronger returners, and you can see that by the really long returns for some of them.
And so the way we quantify that is by looking at their average yards gained per punt return versus their Fréchet path deviation. So the Fréchet path deviation is kind of a measure of good decision making, so being close to the optimal path, versus average yardage gained per play, which is more of a pure skill metric: it's dependent on how fast you are, and obviously the strength of your blockers and the strength of the kicking team. So we can look at Mecole Hardman, who's kind of on the bottom right of that plot, which means he doesn't have a lot of skill and he's not making great decisions.
And that's, you know, in contrast to the other three highlighted names, who are the returners we saw on the last slide, and who are very strong returners. So the idea is, you know, if you decrease your Fréchet path deviation, that might lead you to gaining more yards per punt return.
Tips for a good Big Data Bowl submission
All right, and then just talking more generally about how to have a good submission for the Big Data Bowl. I think one of the biggest parts is, like Robyn mentioned earlier, having a very diverse team with many different skill sets. You know, I come from more of a math and physics background, very different from the background that maybe Robyn has, doing an undergrad in stats. And so that helped us; we brought different ideas to the table. And then brainstorming, as Robyn mentioned earlier, making visualizations, doing exploratory data analysis, seeing what kind of data you have, what kind of paths you might go down in terms of different projects, just running with something and not being afraid of scrapping it and going in a different direction if you see a better one.
All right. So for any of you who are interested in the 2023 or future Big Data Bowls, this is what we believe makes a good submission. So firstly, we think do a few things great, rather than several things good. The goal is really to stand out to the judges, and we think this is how you do that. Second, have multiple ways to digest ideas. So this could be through visuals, text, equations, titles, or even captions. The third is to be really clear and concise with your project. So have a goal in mind: what problem are you trying to solve? Then determine the method, how you're going to solve this problem, and then apply it. It's really important for your project to be useful to the NFL, and for you to explain how it could be useful. Lastly, have fun and learn as much as you can, not only from your own project, but from all of the many great submissions every year.
All right, so that's really everything we have for you guys today. On the left there, we have QR codes that take you to our original submission, the GitHub page, like we mentioned earlier, as well as the Shiny app that we showed you guys. We really appreciate the opportunity to talk to you about our work, and really want to thank everyone for coming out today. And obviously, thank you, Rachel, for having us. We'd be happy to answer any more questions that you guys may have.
Q&A: real-time use, code sharing, and challenges
Thank you all so much. This is awesome, really impressive. And being able to see it through all the animations in the presentation was great too.
I really liked the way that you included tips for people who are wanting to get started. And I was just thinking, on that last tip about focusing on a few great things instead of trying to do everything: were there a few things that you had to narrow down to get the project to where it is now?
Yeah, I think, like Elijah went over, we kind of built it up from the ground up, and had these convex hulls, and then developed it more into the optimal path. And once we got the idea for the optimal path, and Brendan and Robyn kind of took the lead on that side, that became a real focal point for us. So we did have a lot of those other original ideas, and we used those as features in our expected return yardage model. But they weren't as featured in the project; we really tried to hone in on the optimal path and make that the focal point.
And definitely, while you're working on your submission: I think for us, you're allowed up to 2,000 words, which doesn't go a long way. So definitely having visuals that kind of explain what's going on can save you a lot of time and space. If you do too many small things, you won't be able to explain them to the extent they deserve. So it's nice to be able to do this, where we can actually show people all the hidden elements that they didn't get to see, and would never really know about if we didn't go through this.
One other question that just came through on Slido was: what were the biggest challenges when going through the data?
First of all, the size of it, how much there was. And really trying to utilize every piece that they provided for us, trying to really diversify how we were building out the project. But like was mentioned a few times throughout the presentation, the amount of tracking data is really the challenging part. And then being able to turn that into our own metrics was also a challenge.
Yeah, I can't remember if it was the NFL or somewhere else, but I think they have the ability to go to, like, another 10 times finer grain of tracking data. And people are just like, please don't, we do not have the capacity to even use that, it's too fine. So hopefully they don't go too crazy. But it's an interesting area to work with, the tracking data.
And with that, too, I'd just like to say the NFL tracking data was probably the nicest data I've worked with, in how clean it was and just how easy it was to work with. There weren't really any errors. There might have been one time I saw two players with the same jersey number, but that's about it. It was pretty smooth; there were no discontinuities in the tracking data at all, really. So it just made it so much easier to work with.
Yeah, like Riker was mentioning, size was the biggest issue, just dealing with these millions of rows of data. But actually having to clean up the data was super easy.
Yeah, definitely having a team to work with can alleviate a lot of the challenges. The way we approached our project was we split it up: if you have a strong computer, you're doing the data cleaning, and if you have a weak computer, you're doing the research. We kind of split up the different tasks we had to face so that we could always have everyone doing something and progressing the project.
Getting started with player tracking data
How do you get started working with player tracking data? Were there resources that you turned to?
I think a lot of us had a bit of experience prior, whether it be in school or through internships or other projects. Like was mentioned, Brendan and I worked with the Big Data Cup the year before. For me, it was just kind of diving right into it and figuring it out as you go.
Yeah, I think for me, I'd only worked with event-level data before going into this competition. The key thing that I really focused on and needed was the visual. I won't understand that something's working if you just give me a bunch of numbers, but if I can have a visual to go with it, that definitely helps with the understanding of what's going on, like a million times.
Yeah, along those lines, having good knowledge of the tidyverse and using those nest and unnest functions and gganimate made it so much easier to understand the data. If you can put it into a nice clean pipeline where you can break it down into one row per frame or per player, whatever you need, then you can just visualize it with gganimate. It makes it so much easier to see what's actually going on, rather than just looking at the data and seeing x coordinate 34, y coordinate 43 or something. So that made it way easier to get going and see where issues were happening, if they were happening.
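The workflow described above can be sketched roughly like this. This is not the team's actual code; the data below is made up, and the column names (`frameId`, `nflId`, `x`, `y`) are assumptions that mirror the one-row-per-player-per-frame shape of the NFL tracking data:

```r
library(tidyverse)
library(gganimate)

# Toy tracking data: one row per player per frame (values are made up)
tracking <- tibble(
  frameId = rep(1:3, each = 2),
  nflId   = rep(c(1L, 2L), times = 3),
  x       = c(30, 50, 31, 49, 32, 48),
  y       = c(20, 25, 21, 24, 22, 23)
)

# nest() collapses the data to one row per frame, which makes it easy to
# inspect or map over individual frames in a clean pipeline
by_frame <- tracking %>%
  nest(players = c(nflId, x, y))

# Animate player positions over time with gganimate;
# coord_fixed keeps the field proportions (a 120 x 53.3 yard field)
anim <- ggplot(tracking, aes(x, y, colour = factor(nflId))) +
  geom_point(size = 4) +
  coord_fixed(xlim = c(0, 120), ylim = c(0, 53.3)) +
  transition_time(frameId) +
  labs(title = "Frame: {frame_time}", colour = "player")
```

Calling `animate(anim)` (or printing `anim`) renders the animation, stepping through the frames so you can watch the players move rather than reading raw coordinates.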
Building a team and community
I'm curious, do you all normally win your fantasy football leagues?
We're all in one together that just finished its first weekend. It was a bit rough for some of us.
One other question was, at any point during the project, did you feel like you hit a roadblock?
Yeah, well, with our project, we started in September but didn't really work on it super hard until December when our courses ended, because we had a couple of very difficult courses to get through that were sucking up a lot of our time. Once we got through that, there were a couple of days where I was thinking, man, are we going to get this done in time? And there were a few things, like the penalized expected arrival time algorithm: it took a little while to really think it through and come up with an algorithm to actually do that.
Yeah, I remember. I think it was due on the 7th of January or something, and I remember on January 4 we realized that we had calculated something completely wrong. So that running-the-code-overnight that Robin talked about, he had to redo it three days before it was due. That was a little bit stressful, but we're happy we turned it around and made it work.
That's great. So I know you've mentioned that forming your team with diverse backgrounds is really important. I'm wondering, for people who maybe are working in different industries right now but are also super interested in sports and the Big Data Bowl, if you have any suggestions for meeting people that could possibly be part of their team?
Yeah, so for the Big Data Bowl, I think we typically stayed within SFU. But for the Big Data Cup, maybe Brendan, you can talk about how you formed your team.

Yeah, so like Robin was mentioning, for this one I just emailed all the students who were interested in sports and we formed a team. But for the Big Data Cup, I just met my teammates online, on Twitter. I think Twitter is a great resource for sports analytics, because there are so many people on there sharing their ideas through blogs, videos, and various other forms of media. So that's a great way to get involved and interact with other people who are in the industry or trying to get into the industry. And yeah, I literally just met one of my teammates through Twitter, and then we said, how are we going to get other people, and we emailed one guy that he knew.
