figshare
Browse

CityPropStats: Property statistics by building age (1910-2020) for 795 core-based statistical areas in the United States

dataset
posted on 2025-03-15, 08:11 authored by Johannes H. UhlJohannes H. Uhl, Stefan Leyk, Scott G. Ortman

CityPropStats provides aggregated property statistics for 795 cities and towns (i.e., Metropolitan and Micropolitan statistical areas) in the conterminous United States. These statistics include sum, mean, median, Gini index and entropy of residential floor space, cadastral parcel size, floor-area ratio, and property value, approximately for the reference year 2020, aggregated by building construction year in decadal steps (cumulative and incremental) from 1910 to 2020.

Cumulative statistics: CBSA_Property_Statistics_1910-2020_cumulative.csv

Decadal time slices statistics: CBSA_Property_Statistics_1910-2020_decadal_slices.csv

Data source: Zillow Transaction and Assessment Dataset (ZTRAX), provided to University of Colorado Boulder via a data share agreement (2016-2023).

CityPropStats is a supplementary dataset to:

Ortman S.G., et al. (accepted): "Changes in Agglomeration and Productivity are Poor Predictors of Inequality Across the Archaeological Record". Proceedings of the National Academy of Sciences (2025).

Column description:

cbsa_id

CBSA GEOID

cbsa_name

Full name

cbsa_type

CBSA type (metro vs micropolitan statistical area)

year_from

Earliest year for selection interval of properties based on their construction year

year_to

Latest year for selection interval of properties based on their construction year

cbsa_pop

CBSA population or population change (US Census)

tot_res_props

Total residential properties

tot_res_area_sqkm

Total indoor area of residential properties in sqkm

avg_res_area_sqm

Average indoor area of residential properties in sqm

median_res_area_sqm

Median indoor area of residential properties in sqm

q25_res_area_sqm

25th percentile of indoor area of residential properties in sqm

q75_res_area_sqm

75th percentile of indoor area of residential properties in sqm

gini_res_area

Gini index of residential property indoor area

tot_prop_value_usd

Total residential property value in USD

median_prop_value_usd

Median residential property value in USD

q25_prop_value_usd

25th percentile of residential property values in USD

q75_prop_value_usd

75th percentile of residential property values in USD

gini_prop_value

Gini index of residential property values

tot_lot_area_sqkm

Total lot (cadastral parcel) area in sqkm

avg_lot_area_sqm

Mean lot area in sqm

median_lot_area_sqm

Median lot area in sqm

q25_lot_area_sqm

25th percentile of lot area in sqm

q75_lot_area_sqm

75th percentile of lot area in sqm

gini_lot_area

Gini index of lot area

avg_far

Mean floor-area-ratio (FAR), with FAR being the ratio of building indoor area and lot area, based on residential properties

median_far

Median floor-area-ratio (FAR), with FAR being the ratio of building indoor area and lot area, based on residential properties

q25_far

25th percentile of floor-area-ratio (FAR), with FAR being the ratio of building indoor area and lot area, based on residential properties

q75_far

75th percentile of floor-area-ratio (FAR), with FAR being the ratio of building indoor area and lot area, based on residential properties

entropy_res_area

Shannon entropy of the indoor area of residential properties, based on properties

entropy_prop_value

Shannon entropy of the property value of residential properties, based on properties

entropy_lot_area

Shannon entropy of the lot size of residential properties, based on properties

area_completeness

Ratio of properties with a valid indoor area attribute [0,1]

value_completeness

Ratio of properties with a valid property value attribute [0,1]

lotsize_completeness

Ratio of properties with a valid indoor area, property value, and lot size attribute [0,1]

area_value_completeness

Ratio of properties with a valid lot size attribute [0,1]

area_value_lotsize_completeness

Ratio of properties with both a valid indoor area and property value attribute [0,1]


Funding

Funding for this work was provided through the Humans, Disasters, and the Built Environment and the Human Networks and Data Science –Infrastructure programs of the US National Science Foundation (Award Numbers 1924670 and 2121976, respectively) to the University of Colorado Boulder. This research benefited from support provided to the University of Colorado Population Center (CUPC, Project 2P2CHD066613-06) from the Eunice Kennedy Shriver Institute of Child Health, Human and Human Development. The content is solely the authors’ responsibility and does not necessarily represent the official views of the National Institutes of Health (NIH) or CUPC. We gratefully acknowledge access to the Zillow Transaction and Assessment Dataset (ZTRAX) through a data use agreement between the University of Colorado Boulder and Zillow Group, Inc. The results and opinions are those of the authors and do not reflect the position of Zillow Group. Support by Zillow Group, Inc., is gratefully acknowledged. Moreover, we gratefully acknowledge support by Safe Software, Inc., for providing a Feature Manipulation Engine (FME) license.

History