Multifactorial Zip Code-Year Dataset: Socio-Economic, Demographic, and Environmental Variables in the Contiguous United States (2000-2016) (doi:10.7910/DVN/5XBJBM)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

Document Description

Citation

Title:

Multifactorial Zip Code-Year Dataset: Socio-Economic, Demographic, and Environmental Variables in the Contiguous United States (2000-2016)

Identification Number:

doi:10.7910/DVN/5XBJBM

Distributor:

Harvard Dataverse

Date of Distribution:

2024-07-30

Version:

1

Bibliographic Citation:

Khoshnevis, Naeem; Wu, Xiao; Braun, Danielle, 2024, "Multifactorial Zip Code-Year Dataset: Socio-Economic, Demographic, and Environmental Variables in the Contiguous United States (2000-2016)", https://doi.org/10.7910/DVN/5XBJBM, Harvard Dataverse, V1

Study Description

Citation

Title:

Multifactorial Zip Code-Year Dataset: Socio-Economic, Demographic, and Environmental Variables in the Contiguous United States (2000-2016)

Identification Number:

doi:10.7910/DVN/5XBJBM

Authoring Entity:

Khoshnevis, Naeem (Harvard University)

Wu, Xiao (Columbia University)

Braun, Danielle (Harvard University)

Distributor:

Harvard Dataverse

Access Authority:

Khoshnevis, Naeem

Depositor:

Khoshnevis, Naeem

Date of Deposit:

2023-09-27

Holdings Information:

https://doi.org/10.7910/DVN/5XBJBM

Study Scope

Keywords:

Earth and Environmental Sciences, Social Sciences, Socio-Economic Variables, Environmental Contexts

Abstract:

This dataset aggregates extensive public data corresponding to 34,928 zip codes from the contiguous United States, spanning from 2000 to 2016. It encompasses 580,244 zip code-year observations, capturing a myriad of variables to portray a comprehensive picture of each region. The variables include, but are not limited to, education rate, median household income, median house value, poverty rate, percentages of Hispanic and Black populations, and meteorological variables, offering nuanced insights into the socio-economic conditions, demographic composition, and environmental contexts of each area. This rich, multifaceted dataset serves as a valuable resource for exploratory research, specifically designed to facilitate the evaluation of potential causal relationships, with a focus on educational attainment, although its extensive range of variables allows for a multitude of applications across various domains.

Country:

United States

Notes:

This dataset serves as the empirical foundation for examples and illustrative purposes within the CausalGPS R package.

Methodology and Processing

Sources Statement

Notes:

Dear depositor, as we collect information about your dataset, it's important for us to know what ontology supported your deposits so we can build a better system. please drop ontology URL here:

Data Access

Notes:

<a href="http://creativecommons.org/licenses/by/4.0">CC BY 4.0</a>

Other Study Description Materials

Other Study-Related Materials

Label:

zip_data.RData

Notes:

application/x-rlang-transport