Population data and extracted features from Canada Census data in DAUID scale
The dataset is based on Statistics Canada census data spanning four census periods (2001, 2006, 2016, and 2021). The dataset captures population statistics disaggregated by ethnicity at the Dissemination Area (DA) level—the smallest standard geographic unit for census data dissemination, covering approximately 400-700 people per unit. For Toronto, this encompasses approximately 3,700 DAs, providing high spatial resolution for analyzing urban dynamics. The dataset includes detailed population counts for the five largest ethnic groups in Toronto: China, India, Philippines, Portugal, and Sri Lanka.
The features are also extracted from census datasets and 298 socioeconomic and demographic features from the census data, organized into 12 categories:
- Demographics: Population age structure, household composition, and family size
- Housing: Dwelling types, ownership status, housing values, and maintenance needs
- Family Structure: Marriage patterns, presence of children, household types
- Income: Median household and individual income, income sources
- Employment: Labor force participation, employment/unemployment rates
- Mobility & Migration: Internal and external migration patterns, non-permanent residents
- Visible Minorities: Population distribution by visible minority status
- Language: Official language use, mother tongue, and multilingual capabilities
- Occupation: Employment categories across economic sectors
- Religion: Religious affiliations and practices
- Industry: Distribution across industry sectors
- Place of Birth: Country of origin information