athlete_events.csv (39.58 MB)

Olympic history: longitudinal data scraped from www.sports-reference.com

posted on 28.05.2018, 03:20 by Randi Griffin
Historical data on the modern Olympics from Athens 1896 to Rio 2016. The data was scraped from www.sports-reference.com in May 2018.

The file athlete_events.csv contains 271,116 rows and 15 columns. Each row corresponds to one athlete competing in one Olympic event (an athlete-event), so an athlete who entered several events appears in several rows. The columns are:

1. ID - Unique number for each athlete
2. Name - Athlete's name
3. Sex - M or F
4. Age - Integer
5. Height - In centimeters
6. Weight - In kilograms
7. Team - Team name
8. NOC - National Olympic Committee 3-letter code
9. Games - Year and season
10. Year - Integer
11. Season - Summer or Winter
12. City - Host city
13. Sport - Sport
14. Event - Event
15. Medal - Gold, Silver, Bronze, or NA
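
The column layout above can be sketched as a small reader. This is only a minimal illustration using Python's standard csv module, not the author's own tooling. It assumes the header row of athlete_events.csv uses exactly the 15 column names listed above, in that order, and that missing values are written as the literal string NA. The function names and the three sample rows are hypothetical placeholders, not records from the dataset.

```python
import csv
import io
from collections import Counter

# Column names as documented above (assumed to match the CSV header exactly).
EXPECTED_COLUMNS = [
    "ID", "Name", "Sex", "Age", "Height", "Weight", "Team", "NOC",
    "Games", "Year", "Season", "City", "Sport", "Event", "Medal",
]


def read_athlete_events(handle):
    """Yield one dict per athlete-event row, mapping the string 'NA' to None."""
    reader = csv.DictReader(handle)
    if reader.fieldnames != EXPECTED_COLUMNS:
        raise ValueError(f"unexpected header: {reader.fieldnames}")
    for row in reader:
        yield {key: (None if value == "NA" else value) for key, value in row.items()}


def medal_counts(rows):
    """Count Gold/Silver/Bronze entries, skipping rows without a medal."""
    return Counter(row["Medal"] for row in rows if row["Medal"] is not None)


# Hypothetical sample rows for illustration only (not taken from the dataset).
SAMPLE = """ID,Name,Sex,Age,Height,Weight,Team,NOC,Games,Year,Season,City,Sport,Event,Medal
1,Athlete A,F,24,170,60,Team X,XXX,2016 Summer,2016,Summer,City C,Sport S,Event E1,Gold
2,Athlete B,M,31,182,77,Team X,XXX,2016 Summer,2016,Summer,City C,Sport S,Event E2,NA
3,Athlete C,M,NA,NA,NA,Team Y,YYY,1896 Summer,1896,Summer,City D,Sport S,Event E1,Bronze
"""

rows = list(read_athlete_events(io.StringIO(SAMPLE)))
counts = medal_counts(rows)

# For the real file, something like the following should work
# (the UTF-8 encoding is an assumption):
# with open("athlete_events.csv", newline="", encoding="utf-8") as f:
#     counts = medal_counts(read_athlete_events(f))
```

All values come back as strings (or None), so Age, Height, Weight, and Year would still need converting to numbers before any arithmetic.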

For more information:

About the sports-reference Olympic database: http://olympstats.com/2016/08/21/the-olymadmen-and-olympstats-and-sports-reference/

About how I scraped the data: https://rgriff23.github.io/2018/05/27/olympic-history-1-web-scraping.html

About how I wrangled the data:
https://rgriff23.github.io/2018/05/28/olympic-history-2-data-wrangling-1.html
https://rgriff23.github.io/2018/05/28/olympic-history-3-data-wrangling-2.html

My GitHub repo for this project, including analyses using this data:
https://github.com/rgriff23/Olympic_history
