Abstract
This study proposes a novel approach to clustering football teams’ playing styles using a Deep Embedded Clustering (DEC) algorithm applied to large-scale event data. The dataset comprises over 3 million events from 1,826 matches played in the top-tier leagues of England, Spain, Italy, Germany, and France during the 2016/2017 season. Each match event, such as passes, shots, and duels, contributes to a comprehensive representation of team behavior on the field. To capture tactical nuances, the dataset is segmented into four distinct phases of play, and each phase is clustered independently. These intermediate clustering results are aggregated to create a feature representation for each team, which is subsequently clustered to reveal dominant playing styles. A detailed feature engineering process, inspired by recent literature, incorporates spatial and temporal elements of play, including pass motifs, positional tendencies, and graph-based metrics. The resulting clusters are evaluated in terms of their clustering quality, measured by Silhouette, A(C)1, and A(C)2 scores, their predictive utility for match outcomes, and their tactical interpretability. The analysis demonstrates clear performance disparities among styles, offering insights into the effectiveness of specific tactical schemas. This methodology enables data-driven tactical analysis and benchmarking in football, highlighting the potential of unsupervised learning, particularly deep clustering, to inform strategic decision-making in sports analytics. The source code of this study is available at: https://github.com/egecjdemir/how_football_teams_play.
| Original language | English |
|---|---|
| Title of host publication | Sports Analytics - 2nd International Conference, ISACE 2025, Proceedings |
| Editors | Jin-song Dong, Jing Sun, Xiaofei Xie, Kan Jiang |
| Publisher | Springer Science and Business Media Deutschland GmbH |
| Pages | 53-68 |
| Number of pages | 16 |
| ISBN (Print) | 9783032061669 |
| DOIs | |
| Publication status | Published - 2026 |
| Event | 2nd International Sports Analytics Conference and Exhibition, ISACE 2025 - Shanghai, China Duration: 26 Sept 2025 → 27 Sept 2025 |
Publication series
| Name | Lecture Notes in Computer Science |
|---|---|
| Volume | 15925 LNCS |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | 2nd International Sports Analytics Conference and Exhibition, ISACE 2025 |
|---|---|
| Country/Territory | China |
| City | Shanghai |
| Period | 26/09/25 → 27/09/25 |
Bibliographical note
Publisher Copyright:© The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
Keywords
- DEC
- Deep clustering
- Football analytics
- Playing styles
- Sports data mining
- Unsupervised learning