Using Vision Systems,
Generative Models and
Reinforcement Learning for
Sports Analytics
Mehrsan Javan
Chief Technology Officer
Sportlogiq
• Data Generation: Cost efficient,
scalable, and robust data acquisition
from videos using fully automated/AI-
assisted systems
• Insights Generation: Descriptive and
predictive analytics and relevant insights
for users at scale for performance
evaluation, scouting, media content
creation, prediction, etc.
Sport Analytics Is About Insights Extracted from Data
2
© 2024 Sportlogiq
Analytics,
Metrics, Insights
Game Models
Player Tracking
Activity Detection
• Top tier pro leagues (NHL, NFL, EPL)
• Accurate raw datapoints from different sources
• Advanced analytical skills and deep sports knowledge, inhouse data science teams
• 2nd tier pro & draft eligible leagues
• Partial and almost accurate raw datapoints
• Domain expertise with limited analytical knowledge
• 3rd tier leagues and youth games
• Partial, incomplete and inaccurate raw data
• Limited domain expertise and almost no analytical knowledge
Sport Analytics in Different Leagues
3
© 2024 Sportlogiq
• Analytics foundations are based on
uniform and accurate input data
• Catered for scouts, players, coaches,
GMs, and media
• Player/team metrics and physical
data
• Player archetypes, search and
recommendations, scouting tools
• Expected value models and
reinforcement learning tools for
strategy evaluation
Sample NHL Player Card in Sportlogiq app
Analytics in Professional Leagues
4
© 2024 Sportlogiq
• Data incompleteness — regardless of how strong our computer visions systems are, we
are limited to what the camera sees
• Any useful analytical insight requires reliable and uniform input data
• Pro level analytics models don’t work out of the box with youth data
• Insights relevance — what is relevant in the pro leagues may not be relevant for the
youth sports users
• Users neither have the sports knowledge nor analytics backgrounds
• Educating users takes a long time, in the meantime they need a valuable product
Extending Analytics from Pro Leagues to Youth Sports Is
Not Straightforward
5
© 2024 Sportlogiq
• Partial player observations
• Player location data exists for a
subset of players that are visible in
the video
• Not necessarily all players are
observed and tracked for the whole
games
Distribution of the fraction of the ice area
covered in the camera field of view
Data Incompleteness and Inaccuracy — Partial
Observations
6
© 2024 Sportlogiq
Fraction of the ice surface area visible in each frame
Normalized
distribution
density
• Partial game events and play sequence
data
• Players actions and game events have
a higher false detection rates
compared to pro leagues
• Some game events are not collected
in youth games
Data Incompleteness and Inaccuracy — Partial
Observations
7
© 2024 Sportlogiq
Data Incompleteness and Inaccuracy — Partial
Observations
8
© 2024 Sportlogiq
• Use of Generative AI for data creation
• Complete player locations for
physical data and metrics generation
• Plausible sequences of on puck/ball
actions by filling the missing actions
and correcting mistakes
Generating Complete Data from Partial Observations
9
© 2024 Sportlogiq
• Input tracking data is a partial set of
observed location data extracted using
computer vision tracking techniques
• Generated output is a complete set of
player location data
• Auxiliary outputs for instantaneous
velocity and acceleration
• Player physical metrics
• Group behavior modeling to capture
team tactics remains a challenge
Tracking Data Generation
Generative AI for Data Creation — Trajectory Data
10
© 2024 Sportlogiq
Partially
Observed
Location (x, y)
Team Roster
and Match
Metadata
Camera
Parameters
Spatial
Attention
Networks
Complete
Location Data
(x,y) and Their
Derivatives
• Problem Statement: Given a set of partially accurate and incomplete sequences of on
puck/ball actions, generate a plausible play sequence for the whole game
• A play sequence is a complete set of events, with the event type, other attributes,
outcome, time, location on the ice, and global game state
• An abstract example of a play sequence without attributes: {Faceoff, Loose Puck
Recover, Pass, Reception, Shot, Blocked Shot, Loose Puck Recovery, Shot, Goal,
Whistle}
Generative AI for Data Creation — Game Events
11
© 2024 Sportlogiq
• Modelling approach is inspired by
language models
• Each event can be equivalent to a
sentence in a document
• A document is an episode of play in
hockey game starting with a faceoff
and ending with a whistle
Example Event Token
Generative AI for Data Creation — Game Events
12
© 2024 Sportlogiq
{"tag": "playerEvent", "period": 2,
"periodTime": 721,
"teamId": 15, "playerId": 1330,
"xAdjCoord": 37.92, "yAdjCoord": 39.48,
"playersOnIce":
[384,380,1321,1330,1753,11581,627,305,1348,1964
,1968,1977],
"teamInPossession": 15, "scoreDifferential": -
1,
"playZone": "oz", "playSection": "eastPoint",
"eventName": "pass", "outcome": "successful"}
• Base model: Generative Pretrained
Transformer (GPT) which captures the
causality in the play sequence data
• Over 10 seasons of data from 5
professional hockey leagues were
used for training the base model
• The model was then tuned with limited
number of youth games (a few hundred)
• GPT model passed both automated and
the human QA systems
Game Event Sequence Creation
Generative AI for Data Creation — Game Events
13
© 2024 Sportlogiq
14
© 2024 Sportlogiq
We got the data, but what is the end product?
• Userbase Segmentation
• 10% team and personal coaches
• 57% family members who are not
parents and are over 56 or mention
their grandchild
• 33% parents and players
• Users are not analytically oriented
• 27% have no understanding of stats
• 32% understand basic stats
• 48% not interested in learning analytics
Relevant Insights for the Youth Markets
15
© 2024 Sportlogiq
27%
32%
24%
14%
3%
Users' Sport Analytics Knowledge
I don’t understand analytics at all
I can follow basic stats and analytics discussions
I can understand visualizations and chart analytics
I can easily analyze and explain chart analytics and data visualizations
I examine raw data to create detailed analyses and strategic insights
• The most powerful and sophisticated
tool in sport analytics is Inverse Multi-
Agent Reinforcement Learning (MARL)
• A game is a sequence of <state, action>
with a reward signal, each game state
has a quantitative value
• All advanced player metrics and
profiling are based on the MARL
systems
• Inverse MARL can tell us what a
team/player is trying to optimize
What Is Behind Pro Leagues Analytics?
16
© 2024 Sportlogiq
• A tool already exists that can assign
quantitative values to each game state
• Reward signal can get modified to
measure changes in game states for
each team based on offense/defense
objectives
• Significant changes in the game state
values are important moments in the
game – highlight-worthy segments
• Game/player highlights can get
generated automatically from videos
Example: value of shots at different
locations on ice given a specific game
context
The Use of Pro Leagues MARL for Youth Games
17
© 2024 Sportlogiq
• Instead of a generic game highlight, a
user specific highlight can get created
for each player
• Using a variety of reward signals, based
on a player position (offense, defense) a
highlight reel of offensive or defensive
plays can get created
• Highlights are accompanied with light
stats
• Highlighted segments are marked based
on how interesting they are
Highlight Example
User Specific Highlights
18
© 2024 Sportlogiq
• End users in youth sports want simple yet valuable insights
• Pro level insights won’t be useful in the youth sports in their current shape and form
• Producing reliable sport analytics content requires vast amount of uniform and accurate
data
• Generative AI can fill the gaps in the input data
• Complete player location data can get generated from partial observations
• A plausible play sequence for on puck action can get generated from inaccurate and partial
play sequence data
• MARL can be used to generate player specific highlights
• Sport data is multimodal and requires multi-modal generative AI tools
Youth Sports Product Development — Conclusions
19
© 2024 Sportlogiq
For More Information
M. Horton, Learning feature representation
from football tracking, Sloan sport analytics
2021
US Patents and pending applications
11,130,040; 18/529,204; 17/445,354;
17/817,454
20
© 2024 Sportlogiq
Videos
All videos and demos are available at
https://sportlogiq.com
References
Liu et al., Learning agent representation
for ice hockey, NeurIPS2020

More Related Content

PDF
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
PPTX
The Digital Revolution of Sports Data
PDF
Sports Analytics in the Era of Big Data and Data Science
PPTX
Impact of data science on sports arena.pptx
PPTX
Big Data BizViz Sports Analytics
PDF
Sports and Big data
PDF
What is the Role of AI in the Sports Industry_.pdf
PPTX
AI in Sports gen ai uses in sports I willl be use for the sport persons
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
The Digital Revolution of Sports Data
Sports Analytics in the Era of Big Data and Data Science
Impact of data science on sports arena.pptx
Big Data BizViz Sports Analytics
Sports and Big data
What is the Role of AI in the Sports Industry_.pdf
AI in Sports gen ai uses in sports I willl be use for the sport persons

Similar to “Using Vision Systems, Generative Models and Reinforcement Learning for Sports Analytics,” a Presentation from Sportlogiq (20)

PPTX
Sports Biz Tech .ppt
PDF
Big Data for Big Sports
PDF
Super Bowl 2025 The Impact of Real-Time Data on the Ultimate NFL Showdown
PPTX
[DSC DACH 23] From Data to Performance: How Deep Learning is Revolutionizing ...
PDF
Football, Data and Crushing Competition
PPT
Sportz Interactive Product Portfolio
PPTX
Week 1: Introduction to Sports analytics
PDF
Sport Analytics Innovation Summit
PPTX
Data science in baseball
PDF
Sports Analytics Innovation Summit - Data Powered Storytelling
PPTX
Disrupting Sports Tech
PDF
Digital Transformation in Sports & Entertainment Industry Case Study (1).pdf
PPTX
Football Result Prediction using Dixon Coles Algorithm
PDF
The Evolution and Power of Football Data Feeds.pdf
PPTX
Revolutionizing Football Fan Experiences
PDF
Sports Analytics: Market Shares, Strategy, and Forecasts, Worldwide, 2015 to ...
PDF
The Top 10 Cutting-Edge Sports Technology Solution Providers, 2019
PDF
Betegy B2B solutions
PDF
The Future of Coaching in Sport with AI/ML
PDF
Calling the shots with Machine Learning in Sports
Sports Biz Tech .ppt
Big Data for Big Sports
Super Bowl 2025 The Impact of Real-Time Data on the Ultimate NFL Showdown
[DSC DACH 23] From Data to Performance: How Deep Learning is Revolutionizing ...
Football, Data and Crushing Competition
Sportz Interactive Product Portfolio
Week 1: Introduction to Sports analytics
Sport Analytics Innovation Summit
Data science in baseball
Sports Analytics Innovation Summit - Data Powered Storytelling
Disrupting Sports Tech
Digital Transformation in Sports & Entertainment Industry Case Study (1).pdf
Football Result Prediction using Dixon Coles Algorithm
The Evolution and Power of Football Data Feeds.pdf
Revolutionizing Football Fan Experiences
Sports Analytics: Market Shares, Strategy, and Forecasts, Worldwide, 2015 to ...
The Top 10 Cutting-Edge Sports Technology Solution Providers, 2019
Betegy B2B solutions
The Future of Coaching in Sport with AI/ML
Calling the shots with Machine Learning in Sports
Ad

More from Edge AI and Vision Alliance (20)

PDF
“A New Era of 3D Sensing: Transforming Industries and Creating Opportunities,...
PDF
“The New OpenCV 5.0: Added Features, Performance Improvements and Future Dire...
PDF
“Introduction to Shrinking Models with Quantization-aware Training and Post-t...
PDF
“Customizing Vision-language Models for Real-world Applications,” a Presentat...
PDF
“An Introduction to the MIPI CSI-2 Image Sensor Standard and Its Latest Advan...
PDF
“Visual Search: Fine-grained Recognition with Embedding Models for the Edge,”...
PDF
“Optimizing Real-time SLAM Performance for Autonomous Robots with GPU Acceler...
PDF
“LLMs and VLMs for Regulatory Compliance, Quality Control and Safety Applicat...
PDF
“Simplifying Portable Computer Vision with OpenVX 2.0,” a Presentation from AMD
PDF
“Quantization Techniques for Efficient Deployment of Large Language Models: A...
PDF
“Introduction to Data Types for AI: Trade-offs and Trends,” a Presentation fr...
PDF
“Introduction to Radar and Its Use for Machine Perception,” a Presentation fr...
PDF
“NPU IP Hardware Shaped Through Software and Use-case Analysis,” a Presentati...
PDF
“Voice Interfaces on a Budget: Building Real-time Speech Recognition on Low-c...
PDF
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
PDF
“Squinting Vision Pipelines: Detecting and Correcting Errors in Vision Models...
PDF
“ONNX and Python to C++: State-of-the-art Graph Compilation,” a Presentation ...
PDF
“Beyond the Demo: Turning Computer Vision Prototypes into Scalable, Cost-effe...
PDF
“Running Accelerated CNNs on Low-power Microcontrollers Using Arm Ethos-U55, ...
PDF
“Scaling i.MX Applications Processors’ Native Edge AI with Discrete AI Accele...
“A New Era of 3D Sensing: Transforming Industries and Creating Opportunities,...
“The New OpenCV 5.0: Added Features, Performance Improvements and Future Dire...
“Introduction to Shrinking Models with Quantization-aware Training and Post-t...
“Customizing Vision-language Models for Real-world Applications,” a Presentat...
“An Introduction to the MIPI CSI-2 Image Sensor Standard and Its Latest Advan...
“Visual Search: Fine-grained Recognition with Embedding Models for the Edge,”...
“Optimizing Real-time SLAM Performance for Autonomous Robots with GPU Acceler...
“LLMs and VLMs for Regulatory Compliance, Quality Control and Safety Applicat...
“Simplifying Portable Computer Vision with OpenVX 2.0,” a Presentation from AMD
“Quantization Techniques for Efficient Deployment of Large Language Models: A...
“Introduction to Data Types for AI: Trade-offs and Trends,” a Presentation fr...
“Introduction to Radar and Its Use for Machine Perception,” a Presentation fr...
“NPU IP Hardware Shaped Through Software and Use-case Analysis,” a Presentati...
“Voice Interfaces on a Budget: Building Real-time Speech Recognition on Low-c...
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
“Squinting Vision Pipelines: Detecting and Correcting Errors in Vision Models...
“ONNX and Python to C++: State-of-the-art Graph Compilation,” a Presentation ...
“Beyond the Demo: Turning Computer Vision Prototypes into Scalable, Cost-effe...
“Running Accelerated CNNs on Low-power Microcontrollers Using Arm Ethos-U55, ...
“Scaling i.MX Applications Processors’ Native Edge AI with Discrete AI Accele...
Ad

Recently uploaded (20)

PDF
Developing a website for English-speaking practice to English as a foreign la...
PDF
Five Habits of High-Impact Board Members
PPT
Geologic Time for studying geology for geologist
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
CloudStack 4.21: First Look Webinar slides
PPTX
Modernising the Digital Integration Hub
PDF
A contest of sentiment analysis: k-nearest neighbor versus neural network
PPTX
Chapter 5: Probability Theory and Statistics
PDF
DP Operators-handbook-extract for the Mautical Institute
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
STKI Israel Market Study 2025 version august
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
August Patch Tuesday
PDF
A Late Bloomer's Guide to GenAI: Ethics, Bias, and Effective Prompting - Boha...
PPTX
Final SEM Unit 1 for mit wpu at pune .pptx
PPT
Module 1.ppt Iot fundamentals and Architecture
PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PDF
Getting Started with Data Integration: FME Form 101
Developing a website for English-speaking practice to English as a foreign la...
Five Habits of High-Impact Board Members
Geologic Time for studying geology for geologist
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
Group 1 Presentation -Planning and Decision Making .pptx
CloudStack 4.21: First Look Webinar slides
Modernising the Digital Integration Hub
A contest of sentiment analysis: k-nearest neighbor versus neural network
Chapter 5: Probability Theory and Statistics
DP Operators-handbook-extract for the Mautical Institute
NewMind AI Weekly Chronicles – August ’25 Week III
Zenith AI: Advanced Artificial Intelligence
STKI Israel Market Study 2025 version august
1 - Historical Antecedents, Social Consideration.pdf
August Patch Tuesday
A Late Bloomer's Guide to GenAI: Ethics, Bias, and Effective Prompting - Boha...
Final SEM Unit 1 for mit wpu at pune .pptx
Module 1.ppt Iot fundamentals and Architecture
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
Getting Started with Data Integration: FME Form 101

“Using Vision Systems, Generative Models and Reinforcement Learning for Sports Analytics,” a Presentation from Sportlogiq

  • 1. Using Vision Systems, Generative Models and Reinforcement Learning for Sports Analytics Mehrsan Javan Chief Technology Officer Sportlogiq
  • 2. • Data Generation: Cost efficient, scalable, and robust data acquisition from videos using fully automated/AI- assisted systems • Insights Generation: Descriptive and predictive analytics and relevant insights for users at scale for performance evaluation, scouting, media content creation, prediction, etc. Sport Analytics Is About Insights Extracted from Data 2 © 2024 Sportlogiq Analytics, Metrics, Insights Game Models Player Tracking Activity Detection
  • 3. • Top tier pro leagues (NHL, NFL, EPL) • Accurate raw datapoints from different sources • Advanced analytical skills and deep sports knowledge, inhouse data science teams • 2nd tier pro & draft eligible leagues • Partial and almost accurate raw datapoints • Domain expertise with limited analytical knowledge • 3rd tier leagues and youth games • Partial, incomplete and inaccurate raw data • Limited domain expertise and almost no analytical knowledge Sport Analytics in Different Leagues 3 © 2024 Sportlogiq
  • 4. • Analytics foundations are based on uniform and accurate input data • Catered for scouts, players, coaches, GMs, and media • Player/team metrics and physical data • Player archetypes, search and recommendations, scouting tools • Expected value models and reinforcement learning tools for strategy evaluation Sample NHL Player Card in Sportlogiq app Analytics in Professional Leagues 4 © 2024 Sportlogiq
  • 5. • Data incompleteness — regardless of how strong our computer visions systems are, we are limited to what the camera sees • Any useful analytical insight requires reliable and uniform input data • Pro level analytics models don’t work out of the box with youth data • Insights relevance — what is relevant in the pro leagues may not be relevant for the youth sports users • Users neither have the sports knowledge nor analytics backgrounds • Educating users takes a long time, in the meantime they need a valuable product Extending Analytics from Pro Leagues to Youth Sports Is Not Straightforward 5 © 2024 Sportlogiq
  • 6. • Partial player observations • Player location data exists for a subset of players that are visible in the video • Not necessarily all players are observed and tracked for the whole games Distribution of the fraction of the ice area covered in the camera field of view Data Incompleteness and Inaccuracy — Partial Observations 6 © 2024 Sportlogiq Fraction of the ice surface area visible in each frame Normalized distribution density
  • 7. • Partial game events and play sequence data • Players actions and game events have a higher false detection rates compared to pro leagues • Some game events are not collected in youth games Data Incompleteness and Inaccuracy — Partial Observations 7 © 2024 Sportlogiq
  • 8. Data Incompleteness and Inaccuracy — Partial Observations 8 © 2024 Sportlogiq
  • 9. • Use of Generative AI for data creation • Complete player locations for physical data and metrics generation • Plausible sequences of on puck/ball actions by filling the missing actions and correcting mistakes Generating Complete Data from Partial Observations 9 © 2024 Sportlogiq
  • 10. • Input tracking data is a partial set of observed location data extracted using computer vision tracking techniques • Generated output is a complete set of player location data • Auxiliary outputs for instantaneous velocity and acceleration • Player physical metrics • Group behavior modeling to capture team tactics remains a challenge Tracking Data Generation Generative AI for Data Creation — Trajectory Data 10 © 2024 Sportlogiq Partially Observed Location (x, y) Team Roster and Match Metadata Camera Parameters Spatial Attention Networks Complete Location Data (x,y) and Their Derivatives
  • 11. • Problem Statement: Given a set of partially accurate and incomplete sequences of on puck/ball actions, generate a plausible play sequence for the whole game • A play sequence is a complete set of events, with the event type, other attributes, outcome, time, location on the ice, and global game state • An abstract example of a play sequence without attributes: {Faceoff, Loose Puck Recover, Pass, Reception, Shot, Blocked Shot, Loose Puck Recovery, Shot, Goal, Whistle} Generative AI for Data Creation — Game Events 11 © 2024 Sportlogiq
  • 12. • Modelling approach is inspired by language models • Each event can be equivalent to a sentence in a document • A document is an episode of play in hockey game starting with a faceoff and ending with a whistle Example Event Token Generative AI for Data Creation — Game Events 12 © 2024 Sportlogiq {"tag": "playerEvent", "period": 2, "periodTime": 721, "teamId": 15, "playerId": 1330, "xAdjCoord": 37.92, "yAdjCoord": 39.48, "playersOnIce": [384,380,1321,1330,1753,11581,627,305,1348,1964 ,1968,1977], "teamInPossession": 15, "scoreDifferential": - 1, "playZone": "oz", "playSection": "eastPoint", "eventName": "pass", "outcome": "successful"}
  • 13. • Base model: Generative Pretrained Transformer (GPT) which captures the causality in the play sequence data • Over 10 seasons of data from 5 professional hockey leagues were used for training the base model • The model was then tuned with limited number of youth games (a few hundred) • GPT model passed both automated and the human QA systems Game Event Sequence Creation Generative AI for Data Creation — Game Events 13 © 2024 Sportlogiq
  • 14. 14 © 2024 Sportlogiq We got the data, but what is the end product?
  • 15. • Userbase Segmentation • 10% team and personal coaches • 57% family members who are not parents and are over 56 or mention their grandchild • 33% parents and players • Users are not analytically oriented • 27% have no understanding of stats • 32% understand basic stats • 48% not interested in learning analytics Relevant Insights for the Youth Markets 15 © 2024 Sportlogiq 27% 32% 24% 14% 3% Users' Sport Analytics Knowledge I don’t understand analytics at all I can follow basic stats and analytics discussions I can understand visualizations and chart analytics I can easily analyze and explain chart analytics and data visualizations I examine raw data to create detailed analyses and strategic insights
  • 16. • The most powerful and sophisticated tool in sport analytics is Inverse Multi- Agent Reinforcement Learning (MARL) • A game is a sequence of <state, action> with a reward signal, each game state has a quantitative value • All advanced player metrics and profiling are based on the MARL systems • Inverse MARL can tell us what a team/player is trying to optimize What Is Behind Pro Leagues Analytics? 16 © 2024 Sportlogiq
  • 17. • A tool already exists that can assign quantitative values to each game state • Reward signal can get modified to measure changes in game states for each team based on offense/defense objectives • Significant changes in the game state values are important moments in the game – highlight-worthy segments • Game/player highlights can get generated automatically from videos Example: value of shots at different locations on ice given a specific game context The Use of Pro Leagues MARL for Youth Games 17 © 2024 Sportlogiq
  • 18. • Instead of a generic game highlight, a user specific highlight can get created for each player • Using a variety of reward signals, based on a player position (offense, defense) a highlight reel of offensive or defensive plays can get created • Highlights are accompanied with light stats • Highlighted segments are marked based on how interesting they are Highlight Example User Specific Highlights 18 © 2024 Sportlogiq
  • 19. • End users in youth sports want simple yet valuable insights • Pro level insights won’t be useful in the youth sports in their current shape and form • Producing reliable sport analytics content requires vast amount of uniform and accurate data • Generative AI can fill the gaps in the input data • Complete player location data can get generated from partial observations • A plausible play sequence for on puck action can get generated from inaccurate and partial play sequence data • MARL can be used to generate player specific highlights • Sport data is multimodal and requires multi-modal generative AI tools Youth Sports Product Development — Conclusions 19 © 2024 Sportlogiq
  • 20. For More Information M. Horton, Learning feature representation from football tracking, Sloan sport analytics 2021 US Patents and pending applications 11,130,040; 18/529,204; 17/445,354; 17/817,454 20 © 2024 Sportlogiq Videos All videos and demos are available at https://sportlogiq.com References Liu et al., Learning agent representation for ice hockey, NeurIPS2020