figshare
Browse
1/1
3 files

Semantic Trails Datasets

Version 2 2019-01-30, 09:40
Version 1 2018-12-10, 17:53
dataset
posted on 2019-01-30, 09:40 authored by Diego MontiDiego Monti, Enrico Palumbo, Giuseppe Rizzo, Raphaël Troncy, Thibault Ehrhart, Maurizio Morisio
This page contains the Semantic Trails Datasets (STDs) created from two different collections of check-ins obtained from the Foursquare platform.

The check-ins have been preprocessed in order to remove suspicious or erroneous check-ins. We constructed the semantic trails by grouping check-ins close in time and by adding various semantic information.

In details, the resulting CSV files have the following fields: trail_id, user_id, venue_id, venue_category, venue_schema, venue_city, venue_country, and timestamp.

The user_id is a numeric identifier and it has been anonymized. The venue_id corresponds to the Foursquare URI of the venue and, therefore, it can be used to obtain additional information. The venue_category is a category identifier from the Foursquare taxonomy, while the venue_schema is the corresponding Schema.org term. The venue_city is the Wikidata entity corresponding to the city in which the venue is located, while the venue_country is the country associated with the city. Finally, the timestamp is expressed in the ISO 8601 format and it has been approximated to the minute.

We also provide a Turtle version of the datasets, created starting from the CSV files.

History