ZIP to County Crosswalk (doi:10.7910/DVN/0U2TCB)

View:

Part 1: Document Description
Part 2: Study Description
Part 3: Data Files Description
Part 4: Variable Description
Entire Codebook

Document Description

Citation

Title:

ZIP to County Crosswalk

Identification Number:

doi:10.7910/DVN/0U2TCB

Distributor:

Harvard Dataverse

Date of Distribution:

2024-08-12

Version:

1

Bibliographic Citation:

Kitch, James, 2024, "ZIP to County Crosswalk", https://doi.org/10.7910/DVN/0U2TCB, Harvard Dataverse, V1, UNF:6:Ille9Yav2FM1Vg8kkPZlgA== [fileUNF]

Study Description

Citation

Title:

ZIP to County Crosswalk

Identification Number:

doi:10.7910/DVN/0U2TCB

Authoring Entity:

Kitch, James (Harvard Data Science Initiative)

Distributor:

Harvard Dataverse

Access Authority:

Kitch, James

Depositor:

Kitch, James

Date of Deposit:

2024-07-02

Holdings Information:

https://doi.org/10.7910/DVN/0U2TCB

Study Scope

Keywords:

Social Sciences, geoboundaries

Abstract:

The following crosswalks are the result of a <a href="https://github.com/NSAPH-Data-Processing/zip_fips_master_xwalk/tree/main">data pipeline</a> that pulls crosswalks from the U.S. Department of Housing and Urban Development (HUD) database, compiling a comprehensive ZIP --> FIPS crosswalk from 2010 to 2023. The crosswalks are available in four different forms: <br><br> "one2one": one row, per ZIP code, per year. Each ZIP is matched to its best matching FIPS code. <br> "one2few": Potentially multiple rows, per ZIP code, per year. All FIPS codes with a non-zero number of addresses for a given ZIP code are returned. <br> "one2one_summy" and "one2few_summy" return the same respective types of data as the above, but summarize across chunks of years. <br><br> Further description of these datasets, as well as code to reproduce and adjust results according to certain parameters, is available at the <a href="https://github.com/NSAPH-Data-Processing/zip_fips_master_xwalk/tree/main">Github repo</a>.

Time Period:

2010-01-01-2023-12-31

Country:

United States

Geographic Unit(s):

county, zipcode

Methodology and Processing

Sources Statement

Data Access

Notes:

<a href="http://creativecommons.org/licenses/by-sa/4.0">CC BY-SA 4.0</a>

Other Study Description Materials

File Description--f10409349

File: zip2county_master_xwalk_2010_2023_tot_ratio_one2few.tab

  • Number of cases: 730592

  • No. of variables per record: 5

  • Type of File: text/tab-separated-values

Notes:

UNF:6:YNOGR6oTCG2yf4nVA7+2eA==

Summary file of all top county matches for every ZIP code, across years.

File Description--f10409351

File: zip2county_master_xwalk_2010_2023_tot_ratio_one2few_summy.tab

  • Number of cases: 59181

  • No. of variables per record: 9

  • Type of File: text/tab-separated-values

Notes:

UNF:6:b/oMGLVf0nekccscUkiOBA==

Summary file of all county matches for a given ZIP code, across years.

File Description--f10409350

File: zip2county_master_xwalk_2010_2023_tot_ratio_one2one.tab

  • Number of cases: 548373

  • No. of variables per record: 4

  • Type of File: text/tab-separated-values

Notes:

UNF:6:qOqro6VH6C8R2zwXTvLBXA==

[most commonly used file] Contains top county match for every matched ZIP code for every year.

File Description--f10409352

File: zip2county_master_xwalk_2010_2023_tot_ratio_one2one_summy.tab

  • Number of cases: 41335

  • No. of variables per record: 7

  • Type of File: text/tab-separated-values

Notes:

UNF:6:J9CBPK96IgxnvcyMb7MMog==

Contains all matches for every matched ZIP code for every year.

Variable Description

List of Variables:

Variables

zip

f10409349 Location:

Summary Statistics: Mean 49334.29582448261; StDev 26427.727455050903; Min. 501.0; Valid 730592.0; Max. 99929.0

Variable Format: numeric

Notes: UNF:6:nUoNEbQvYVIC7j0aSMprnA==

county

f10409349 Location:

Summary Statistics: Valid 730592.0; Min. 1001.0; Mean 29877.82894693632; StDev 15503.990651453321; Max. 99999.0

Variable Format: numeric

Notes: UNF:6:KBAX406EMrpmT3qAE2W4Ww==

year

f10409349 Location:

Summary Statistics: Min. 2010.0; Valid 730592.0; Max. 2023.0; StDev 4.006770696039704; Mean 2016.6583510358985

Variable Format: numeric

Notes: UNF:6:Ud/dwS0jB5b49FcseUGiVA==

tot_ratio

f10409349 Location:

Summary Statistics: Valid 730592.0; Max. 1.0; Min. 2.2353361945636624E-5; Mean 0.7505871949155731; StDev 0.3917026779783573;

Variable Format: numeric

Notes: UNF:6:czSCr81iKzof9Y/ZjdpAiA==

top_match

f10409349 Location:

Variable Format: character

Notes: UNF:6:zPm6N4rqTNobHlGmzKdq0g==

zip

f10409351 Location:

Summary Statistics: Mean 49210.078842865725; Max. 99929.0; Valid 59181.0; StDev 26464.577738647717; Min. 501.0

Variable Format: numeric

Notes: UNF:6:rpV/5k+muUEdJoyhlJLQfg==

county

f10409351 Location:

Summary Statistics: Mean 29885.195417447623; Valid 59181.0; Min. 1001.0; Max. 99999.0; StDev 15640.591238721583

Variable Format: numeric

Notes: UNF:6:zynJEdI/X92UJMTKYa5PSw==

top_match

f10409351 Location:

Variable Format: character

Notes: UNF:6:pE/iXvaJ17vvh4Dh9NK4QA==

min_year

f10409351 Location:

Summary Statistics: Max. 2023.0; Valid 59181.0; Mean 2011.1080583295313; Min. 2010.0; StDev 2.7721212853686077

Variable Format: numeric

Notes: UNF:6:01DIVATRNQRL9bB9+HUUMw==

max_year

f10409351 Location:

Summary Statistics: Mean 2022.4531015021712; StDev 2.16026938735474; Min. 2010.0; Max. 2023.0; Valid 59181.0

Variable Format: numeric

Notes: UNF:6:zO7NXw1D7zOADdH1sWRXpQ==

total_matches

f10409351 Location:

Summary Statistics: Valid 59181.0; Max. 14.0; StDev 3.619383386750999; Mean 12.345043172639803; Min. 1.0

Variable Format: numeric

Notes: UNF:6:7jtqrSvvTxGqjAevkyGuNg==

tot_ratio_avg

f10409351 Location:

Summary Statistics: Max. 1.0; Mean 0.699645422509859; StDev 0.4160865841160535; Valid 59181.0; Min. 2.2353361945636624E-5;

Variable Format: numeric

Notes: UNF:6:TsAcYRhLYEhvFttGP+xarA==

tot_ratio_min

f10409351 Location:

Summary Statistics: StDev 0.42061065454951185; Valid 59181.0; Max. 1.0; Min. 2.2353361945636624E-5; Mean 0.6903003635435991

Variable Format: numeric

Notes: UNF:6:noic2ZjQBJhqESx+d3gzSA==

tot_ratio_max

f10409351 Location:

Summary Statistics: StDev 0.4129254191892216; Mean 0.7083454514030233; Max. 1.0; Valid 59181.0; Min. 2.2353361945636624E-5

Variable Format: numeric

Notes: UNF:6:wuSIyhMi6OjQ6abBboSP9Q==

zip

f10409350 Location:

Summary Statistics: Max. 99929.0; Mean 49558.27422199447; Valid 548373.0; Min. 501.0; StDev 27879.903295663145;

Variable Format: numeric

Notes: UNF:6:dWkFzptt144WgFYwPiUE6A==

county

f10409350 Location:

Summary Statistics: Mean 29626.874185987326; Max. 78030.0; Valid 548373.0; Min. 1001.0; StDev 15718.314692748363;

Variable Format: numeric

Notes: UNF:6:LlWWXu3LYKNIuwjIYkswjg==

year

f10409350 Location:

Summary Statistics: Mean 2016.546436458396; StDev 4.011496823020403; Min. 2010.0; Valid 548373.0; Max. 2023.0;

Variable Format: numeric

Notes: UNF:6:FTRQT5BeugGloysQo0gJPw==

tot_ratio

f10409350 Location:

Summary Statistics: Valid 548373.0; Min. 0.2573385518590998; Max. 1.0; Mean 0.9693127671478015; StDev 0.0882437439918503;

Variable Format: numeric

Notes: UNF:6:wmv6PHO5cw/A0zSZpbJjFw==

zip

f10409352 Location:

Summary Statistics: Mean 49381.080053235004; StDev 28034.416451793084; Valid 41335.0; Max. 99929.0; Min. 501.0;

Variable Format: numeric

Notes: UNF:6:2I9FqCh0hsiCxJ8LEzf9QA==

county

f10409352 Location:

Summary Statistics: Valid 41335.0; StDev 15827.02602653045; Mean 29443.38891980241; Max. 78030.0; Min. 1001.0;

Variable Format: numeric

Notes: UNF:6:ZyTyUBQavRIT8LE0se5l8Q==

min_year

f10409352 Location:

Summary Statistics: Mean 2010.4425305431232; Min. 2010.0; Valid 41335.0; Max. 2023.0; StDev 1.844824686048626

Variable Format: numeric

Notes: UNF:6:42j6TbwkOdMZ82o5UepR6A==

max_year

f10409352 Location:

Summary Statistics: Max. 2023.0; StDev 1.6198938844451212; Min. 2010.0; Mean 2022.7090843111164; Valid 41335.0

Variable Format: numeric

Notes: UNF:6:JSoXK1OvGhCZZxD1C7Nh5g==

tot_ratio_avg

f10409352 Location:

Summary Statistics: Max. 1.0; StDev 0.09910173476117279; Valid 41335.0; Mean 0.963650794064434; Min. 0.2668132111305724

Variable Format: numeric

Notes: UNF:6:e6ZNTyyEPXZCFOSpeqbUqg==

tot_ratio_min

f10409352 Location:

Summary Statistics: Min. 0.2573385518590998; Mean 0.9563924109866098; Max. 1.0; Valid 41335.0; StDev 0.11138075051873815;

Variable Format: numeric

Notes: UNF:6:T9nOUrZilVp+CVo3BvzvCw==

tot_ratio_max

f10409352 Location:

Summary Statistics: Min. 0.2729145211122554; Max. 1.0; Valid 41335.0; StDev 0.0912172622940618; Mean 0.9702527814537023

Variable Format: numeric

Notes: UNF:6:sncVQfQ63FiilwoTiYbRyQ==