TacticGen: Grounding Adaptable and Scalable Generation of Football Tactics
TacticGen reframes football analytics from predicting what may happen to generating what should happen. It models coordinated player and ball movements with a multi-agent diffusion transformer, then guides generation toward tactical objectives expressed as rules, natural language, or learned value functions.
1 The Chinese University of Hong Kong, Shenzhen
2 Real Analytics
3 Birmingham City Football Club
4 University of Liverpool
5 Simon Fraser University
6 Sun Yat-sen University
† Corresponding author.
Highlights
Why this paper matters
TacticGen is positioned as a foundation-oriented model for football tactical design: accurate enough to model realistic movement, adaptable enough to respond to different objectives, and scalable enough to improve with larger models and more data.
From prediction to generation
The paper reframes football analytics from forecasting likely play evolution to generating coordinated tactical alternatives.
Multi-agent football modeling
TacticGen jointly models the ball and 22 players, explicitly handling cooperative and competitive interactions.
Zero-shot flexible tactical guidance
During inference, a single trained generator can be steered by rule functions, natural language prompts, or learned value models.
Validated with practitioners
Case studies with football experts show generated tactics are hard to distinguish from real play and tactically valuable.
Motivation
Why football tactics should be generated, not only predicted
This figure sets up the problem shift behind TacticGen: instead of only forecasting how play may unfold, the system should also produce coordinated tactical alternatives that align with a coach's intent.
From descriptive football analytics to tactical decision support
The motivation figure introduces the paper's core question: how to move from passive trajectory prediction toward controllable generation that can assist tactical planning.
Framework
The TacticGen pipeline
After the method overview, this figure gives a cleaner system-level view of how context encoding, diffusion generation, and tactical guidance fit together inside the complete framework.
Conditioning, generation, and guidance in one model stack
The framework figure ties the paper together by showing how event context, player-ball history, and guidance signals are fused into a unified generation pipeline for tactical control.
Method
A controllable diffusion framework for football tactics
TacticGen builds on diffusion transformers and adapts them to football with explicit agent interaction modeling, context-aware conditioning, and guidance at sampling time.
The core generation pipeline
These three components define the base system: a multi-agent denoising backbone, a context encoder that summarizes match state, and an inference-time objective module that makes the generator steerable.
Backbone
DiT CoreMulti-agent Diffusion Transformer
TacticGen extends DiT into a multi-agent setting so that noisy future trajectories can be denoised while preserving team-level coordination and opponent response.
Joint denoising across players and ball keeps team structure coherent.
Context
Temporal FusionSelf-attentive context encoding
Past trajectories, ball history, event type, goal difference, and timestamps are fused into context embeddings through MLP-based encoding, self-attention, and adaptive conditioning.
Rich match context is compressed into controllable conditioning signals.
Guidance
Inference ControlObjective-driven generation
Classifier guidance steers samples at inference time, letting one pretrained generator adapt to tactical goals without retraining.
One generator, multiple tactical objectives, no extra training pass required.
On top of the shared architecture, TacticGen exposes multiple control interfaces so the same pretrained model can respond to different tactical design goals.
Three ways to steer tactical generation
The guidance layer supports explicit football rules, coach-friendly language prompts, and learned value signals, covering both interpretable control and performance-oriented optimization.
Rule-based guidance
Differentiable tactical rules encode concepts like support width, zone occupation, or pitch control value.
Best for explicit tactical priors and interpretable control.
Language-based guidance
Natural language instructions are turned into executable guidance functions, enabling high-level tactical prompting.
Best for coach-friendly prompts and rapid what-if iteration.
Value-based guidance
A learned value model injects long-term utility into generation, guiding trajectories toward strategically stronger outcomes.
Best for optimizing downstream tactical quality over longer horizons.
The same architecture is deployed in two practical operating modes, letting TacticGen cover both realistic trajectory prediction and more deliberate tactical design workflows.
Two practical operating modes
Both modes share the same football-native generator, but differ in how much future ball information is available at inference time and, as a result, what kind of coaching workflow they support best.
Short-context trajectory prediction
Predicts future player and ball trajectories from a short observed context window, making it a natural fit for realistic generation.
- Uses recent match context without assuming the future ball path is known.
- Best suited to realistic future trajectory prediction and model benchmarking.
Ball-conditioned tactical generation
Conditions on a full ball trajectory so the model can coordinate player movement around an anticipated ball path.
- Provides stronger control when analysts or coaches already have a target ball path in mind.
- Best suited to tactical planning, design exploration, and controlled what-if generation.
Dataset
Large-scale football dataset with aligned event and tracking data
The dataset combines analyst-annotated event logs and full-pitch optical tracking, then standardizes them into event-centric training examples with consistent spatial coordinates, temporal context, and future trajectory targets.
1,432 matches across 2018-2025
The dataset spans multiple top-tier competitions, led by the EFL Championship and Premier League.
From raw match feeds to aligned multi-agent decision points
Each sample starts from two synchronized sources: Opta-style play-by-play annotations for on-ball actions, and optical tracking for the ball plus all players. An adapted Needleman-Wunsch alignment process links the two streams, producing structured football events that can directly condition trajectory generation.
Top-tier matches used to build the aligned dataset.
Annotated football events after alignment.
Processed timesteps containing agent positions.
10 context frames followed by 54 subsequent frames.
What one training example contains
Every event stores match metadata, tactical context, agent states, and the future trajectory target needed for training.
Metadata
Game ID, event ID, episode ID, timestamps, and match bookkeeping.
Match state
Goal difference, possession length, event outcome, and ball-control status.
Action labels
30 unified action types plus the next-event ball destination.
Team indicators
Home vs. away, attacking vs. defending, and episode termination flags.
Context tensors
Past agent positions and features over the fixed context window.
Trajectory tensors
Variable-length future player trajectories with aligned agent features.
Explore the football event data distribution
Inspect spatial density on the normalized pitch
The heatmap is built from 1,692,800 coordinates from a single game, while the overlay dots show a smaller sampled subset for readability.
Results
Accurate, adaptable, and scalable
On the paper's large-scale football dataset, TacticGen improves prediction quality over prior baselines and then uses the same modeling foundation to support controllable tactical generation.
Scaling behavior
From 1.74M to 311.50M parameters, up to 600K training steps, and up to 78M training examples, TacticGen shows consistent scaling gains across model size, training time, and data scale.
Expert validation
In case studies with five experts, generated trajectories achieved a realism F1 of 0.50 ± 0.07, and guided TacticGen samples received a utility score of 0.81 ± 0.04.
Videos
Main Paper Videos
These clips show how TacticGen compares with real trajectories and how generation changes under different tactical objectives, including rule guidance, pitch-control guidance, language prompts, and value-based guidance.
Start With the Reference Play
All videos below are based on the same Pass event. The interactive view here shows the ground-truth play first, so viewers can get familiar with the player layout, ball location, and overall structure before comparing generated tactics.
Red markers indicate the attacking team, blue markers indicate the defending team, and green indicates the ball.
TacticGen-C Prediction
This clip shows conditional trajectory prediction for a pass event, where player movement is generated while following the observed ball trajectory.
Attacking Rule Guidance
The attacking team is guided by composed tactical rules: support the ball carrier, maintain width, create passing angles, and occupy Zone 14.
Defending Rule Guidance
The defending team is guided to press the ball carrier, collapse toward the ball, deepen the block, and reduce passing lanes.
Ground-Truth PCV
This is the reference pitch-control map at the final frame, showing the baseline spatial dominance before any PCV guidance is applied.
Attack High PCV Trajectory
This trajectory view shows pitch-control guidance for the attacking team, encouraging runners to move forward and create more attacking space.
Attack High PCV Map
This final-frame pitch-control map shows how attacking control expands into more dangerous offensive areas under PCV guidance.
Defense High PCV Trajectory
This trajectory view shows PCV guidance for the defending team, increasing pressure on the carrier and improving central coverage.
Defense High PCV Map
This final-frame pitch-control map highlights stronger defensive coverage, with the defending side reclaiming more key space.
LLM Prompt: Move Forward
Prompt: "Make the attacking team move forward more aggressively." The generated players respond by pushing forward faster toward the defending goal.
LLM Prompt: Stretch the Defense
Prompt: "Make the right winger drift into the corner to stretch the defense and open up more space." The right winger follows the instruction by moving wide.
Attack High Value
The attacking side is guided by a learned value model, leading players to push higher and closer to goal-scoring areas.
Defense High Value
The defending side is guided by the value model to reduce attacking payoff, producing faster pressure on the ball carrier.
Human Evaluation
Interactive Expert Evaluation
We study TacticGen with two complementary human-facing tasks: a Realism Test that asks whether a clip looks generated or realistic, and a Utility Preference task that asks which clip demonstrates better tactical value. The clips and pairs shown on this webpage are only showcase examples, not the full set of human evaluation samples.
Realism Test
Can You Tell Generated Plays From Real Ones?
We added 10 expert case-study clips on realism. Watch each play, decide whether it looks generated or realistic, and the interface will reveal the correct answer immediately after your choice.
Select your answer
Select your answer
Select your answer
Select your answer
Select your answer
Select your answer
Select your answer
Select your answer
Select your answer
Select your answer
Realism Test
Task Overview
You will watch a set of short football videos. Each video follows the same pattern:
- Event context (10 frames) - real tracking data showing the ball and all 22 players' x / y positions for the 10 frames immediately before the on-ball event (pass, clearance, ball-touch, etc.).
- Post-event segment (N frames) - after the event occurs, the video continues tracking the ball and all players until one of the following: the next on-ball action occurs, the tracking data ends, or the 64-frame limit is reached. The ball trajectory is always authentic; the 22 player trajectories may come from real tracking data or from AI-generated predictions.
- Freeze frame - the final frame is held for ~3 s to let you inspect the ending.
Additional Notes
- The attacking team always moves left to right.
- Sampling rate: player tracking is captured at 10 Hz, and videos are rendered at 5 fps, so 1 s of video equals 0.5 s of real-world play.
- The ball trajectory is always real; only the post-event segment of 22 players may be synthetic.
- This webpage contains 10 expert case-study clips: 5 generated and 5 realistic. They are presented here as a fixed curated set rather than a randomized annotation batch.
- For each clip, the post-event player movements are always entirely realistic or entirely generated, without mixing the two.
- We provide five realistic clips to get you familiar with the animations here: Google Drive Folder.
Your Task
For each video, decide whether the post-event player movements look realistic (authentic) or generated (AI-predicted). Keep in mind that the ball trajectory is always realistic.
PS
- You may submit your responses after completing at least one video. Clicking the Submit button will record all of your already selected answers to the server at once. For convenience, you might choose to submit your responses every 5 or 10 events.
- If you'd like to take a break and continue later, you're also welcome to temporarily submit your progress. For example, if you complete and submit feedback for the first 10 videos, you can return later and resume directly from page 11.
- When clicking the submit button, it will require you to provide some simple feedback during your annotation progress.
Utility Preference
Which Clip Shows Better Tactical Utility?
These expert case studies compare a Generated clip against a Realistic clip. For each pair, decide whether the left clip, the right clip, or equal shows better utility.
Which clip looks better for the attacking team?
Which clip looks better for the attacking team?
Which clip looks better for the attacking team?
Which clip looks better for the defending team?
Which clip looks better for the defending team?
Which clip looks better for the defending team?
Utility Preference
Task Overview
You will watch a set of short football videos. Each pair follows the same pattern:
- Event context (10 frames) - real tracking data showing the ball and all 22 players' x / y positions for the 10 frames immediately before the on-ball event (pass, clearance, ball-touch, etc.).
- Post-event segment (N frames) - after the event occurs, the video continues to track the ball and all players until one of the following conditions is met: the next on-ball action occurs, the tracking data is completed, or the 64-frame limit is reached. The ball trajectory is always authentic, while the trajectories of the 22 players may either come from real tracking data or from AI adjustments that optimize the movement of one team.
- Freeze frame - the final frame is held for ~3 s so you can inspect the ending.
Additional Notes
- The attacking team always moves left to right.
- Sampling rate: player tracking is captured at 10 Hz, and videos are rendered at 5 fps, so 1 s of video equals 0.5 s of real-world play.
- The ball trajectory is always real. Minor disturbances may come from data inaccuracies, so please ignore those when making your judgment.
- These six pairs are expert case studies: three focus on attacking utility and three focus on defending utility.
Your Task
For each pair of videos, determine which clip of the post-event player trajectories for the highlighted team demonstrates better tactics and positioning strategy. Please treat realism as a least consideration in this task.
After you choose, the interface will reveal whether your selected clip was Generated or Realistic, and will also show how the experts voted.
Paper
Paper and Citation
Paper
Access the manuscript PDF and citation information below.
BibTeX
@article{xu2026tacticgen,
title = {TacticGen: Grounding Adaptable and Scalable Generation of Football Tactics},
author = {Xu, Sheng and Liu, Guiliang and Kharrat, Tarak and Luo, Yudong and Aloulou, Mohamed and Lopez Pena, Javier and Sofeikov, Konstantin and Reid, Adam and Roberts, Paul and Spencer, Steven and Carnall, Joe and McHale, Ian and Schulte, Oliver and Zha, Hongyuan and Zheng, Wei-Shi},
journal = {arXiv preprint arXiv:2604.18210},
year = {2026},
url = {https://arxiv.org/abs/2604.18210}
}