figshare
Browse

Multidimensional match performance dataset - StatsBomb & Transfermarkt

Download (50.54 MB)
dataset
posted on 2024-04-23, 08:51 authored by benjamin vermautbenjamin vermaut

Match performance analysis stands at the forefront of football research, offering profound insights into the dynamics of player and team behaviours. While we encourage those seeking detailed event data to explore the Open Data of StatsBomb or A public data set of spatio temporal match events in soccer.

Our data sets allow to use original event data from StatsBomb or Wyscout, but with our record link tables, they could incorporate information on injuries or market values from Transfermarkt.

In this Multidimensional match performance dataset, we aggregated the event data at a match level by counting the total number of each event type that occured during the match.This option streamlines the initial stages of data collection and aggregation, which is particularly beneficial for those working at a higher level of analysis.

The dataset contains the following fields (categorized by dimension):


  • Match Information(StatsBomb) : match_id, match_date, home_team, away_team, home_score, away_score, kick_off, match_week, stadium, home_managers, away_managers, match_status, match_status_360,
    last_updated, last_updated_360, data_version, shot_fidelity_version, xy_fidelity_version
  • Competition Information(StatsBomb): competition, season, competition_season, competition_stage
  • Team Information(StatsBomb): team_name, formation
  • Player Information(StatsBomb): player_id, player_name, jersey_number, country
  • Player Selection(Created from StatsBomb): selected, played, started, time_played, positions, main_position
  • Match performances(Created from StatsBomb): pass, ball_receipt*, carry, pressure, block, ball_recovery, miscontrol, interception,foul_committed, foul_won, shot, duel, dribble, dribbled_past, clearance, dispossessed, points,nb_cards, card_type_red_card, card_type_second_yellow, card_type_yellow_card
  • Injury(Transfermarkt): was_injured, Injury_index, Injury, Injury_from, Injury_until, Days, Games_missed
  • Market Value(Transfermarkt): last_market_value_at_match_date_eur, last_market_value_at_match_date_index


Here are the sources of the differents datasets:


The scientific article explaining the data set will be available soon.

History