National Health and Nutrition Examination Survey (NHANES) (doi:10.7910/DVN/IMWQPJ)

Document Description
Citation
Title:	National Health and Nutrition Examination Survey (NHANES)
Identification Number:	doi:10.7910/DVN/IMWQPJ
Distributor:	Harvard Dataverse
Date of Distribution:	2013-05-21
Version:	2
Bibliographic Citation:	Damico, Anthony, 2013, "National Health and Nutrition Examination Survey (NHANES)", https://doi.org/10.7910/DVN/IMWQPJ, Harvard Dataverse, V2
Study Description
Citation
Title:	National Health and Nutrition Examination Survey (NHANES)
Identification Number:	doi:10.7910/DVN/IMWQPJ
Authoring Entity:	Damico, Anthony
Producer:	Anthony Damico
Distributor:	Harvard Dataverse
Distributor:	IQSS Dataverse Network
Date of Deposit:	2013
Date of Distribution:	2013
Holdings Information:	https://doi.org/10.7910/DVN/IMWQPJ
Study Scope
Abstract:	<h3 class="post-title entry-title" itemprop="name"> analyze the national health and nutrition examination survey (nhanes) with r </h3> nhanes is this fascinating survey where doctors and dentists accompany survey interviewers in a little mobile medical center that drives around the country. while the survey folks are interviewing people, the medical professionals administer laboratory tests and conduct a real doctor's examination. <a href="http://www.cdc.gov/nchs/data/nhanes/meccomp.pdf">the blood work and medical exam</a> allow researchers like you and me to answer tough questions like, "how many people have diabetes but don't know they have diabetes?"<br /> <br /> conducting the lab tests and the physical isn't cheap, so a new nhanes data set becomes available once every two years and only includes about twelve thousand respondents. since the number of respondents is so small, analysts often pool multiple years of data together. the replication scripts below give a few different examples of how multiple years of data can be pooled with r.<br /> <br /> the survey gets conducted by <a href="http://www.cdc.gov/">the centers for disease control and prevention (cdc)</a>, and generalizes to the united states non-institutional, non-active duty military population.<br /> <br /> most of the data tables produced by the cdc include only a small number of variables, so importation with the foreign package's <a href="http://stat.ethz.ch/R-manual/R-devel/library/foreign/html/read.xport.html">read.xport</a> function is pretty straightforward. but that makes merging the appropriate data sets trickier, since it might not be clear what to pull for which variables. for every analysis, start with the table with 'demo' in the name -- this file includes basic demographics, weighting, and complex sample survey design variables. since it's quick to download the files directly from the cdc's ftp site, there's no massive ftp download automation script. 
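the import-then-merge step described above can be sketched roughly as follows. this is a minimal illustration, not the repository's actual script: the data frames are made-up stand-ins for imported tables, the file names in the comments are hypothetical local paths, and the column names (SEQN as the respondent id, HIQ011 for health insurance coverage, WTINT2YR, SDMVPSU, SDMVSTRA) are assumed from the cdc's public documentation rather than stated in this abstract.

```r
# a minimal sketch under stated assumptions, not the repository's script.
# with real files saved locally, the import step would use the foreign package:
#   demo <- foreign::read.xport( "DEMO_F.XPT" )   # hypothetical local path
#   hiq  <- foreign::read.xport( "HIQ_F.XPT" )    # hypothetical local path

# toy stand-in for the 'demo' table: demographics, weights, design variables
demo <- data.frame(
  SEQN     = 1:6 ,                                        # respondent id
  RIAGENDR = c( 1 , 2 , 1 , 2 , 1 , 2 ) ,                 # gender
  RIDAGEYR = c( 34 , 51 , 8 , 67 , 45 , 29 ) ,            # age in years
  WTINT2YR = c( 12000 , 8500 , 15000 , 9700 , 11000 , 13500 ) ,
  SDMVPSU  = c( 1 , 2 , 1 , 2 , 1 , 2 ) ,
  SDMVSTRA = c( 75 , 75 , 76 , 76 , 77 , 77 ) ,
  UNUSED   = letters[ 1:6 ]                               # a column we do not need
)

# toy stand-in for the health insurance table (respondent 4 is absent)
hiq <- data.frame(
  SEQN    = c( 1 , 2 , 3 , 5 , 6 ) ,
  HIQ011  = c( 1 , 1 , 2 , 2 , 1 ) ,
  UNUSED2 = c( 9 , 9 , 9 , 9 , 9 )
)

# limit each table to the variables needed for the analysis
demo_vars <- c( "SEQN" , "RIAGENDR" , "RIDAGEYR" , "WTINT2YR" , "SDMVPSU" , "SDMVSTRA" )
hiq_vars  <- c( "SEQN" , "HIQ011" )
demo <- demo[ , demo_vars ]
hiq  <- hiq[ , hiq_vars ]

# merge on the respondent id, keeping every record from the demo table
x <- merge( demo , hiq , by = "SEQN" , all.x = TRUE )

# the merge should never add or drop demo records
stopifnot( nrow( x ) == nrow( demo ) )

print( x )
```

starting from the demo table and using all.x = TRUE keeps every sampled respondent in the merged file; anyone missing from the second table simply gets NA in its columns.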
this new github repository contains five scripts:<br /> <br /> <b>2009-2010 interview only - download and analyze.R</b><br /> <ul> <li>download, import, save the demographics and health insurance files onto your local computer</li> <li>load both files, limit them to the variables needed for the analysis, merge them together</li> <li>perform a few example variable recodes</li> <li>create the complex sample survey object, using the interview weights</li> <li>run a series of pretty generic analyses on the health insurance questions</li> </ul> <br /> <b>2009-2010 interview plus laboratory - download and analyze.R</b><br /> <ul> <li>download, import, save the demographics and cholesterol files onto your local computer</li> <li>load both files, limit them to the variables needed for the analysis, merge them together</li> <li>perform a few example variable recodes</li> <li>create the complex sample survey object, using the mobile examination component (mec) weights</li> <li>perform a direct-method age-adjustment and match <a href="http://www.cdc.gov/nchs/data/databriefs/db92.pdf">figure 1 of this cdc cholesterol brief</a></li> </ul> <br /> <b>replicate 2005-2008 pooled cdc oral examination figure.R</b><br /> <ul> <li>download, import, save, pool, recode, create a survey object, run some basic analyses</li> <li>replicate <a href="http://www.cdc.gov/nchs/data/databriefs/db96.pdf">figure 3 from this cdc oral health databrief</a> - the whole <a href="http://stat.ethz.ch/R-manual/R-devel/library/graphics/html/barplot.html">barplot</a></li> </ul> <br /> <b>replicate cdc publications.R</b><br /> <ul> <li>download, import, save, pool, merge, and recode the demographics file plus cholesterol laboratory, blood pressure questionnaire, and blood pressure laboratory files</li> <li>match the cdc's example sas and sudaan syntax file's <a href="http://www.cdc.gov/nchs/tutorials/nhanes/downloads/Continuous/descriptive_means_sas.pdf">output for descriptive means</a></li> <li>match the 
cdc's example sas and sudaan syntax file's <a href="http://www.cdc.gov/nchs/tutorials/nhanes/downloads/Continuous/descriptive_proportion.pdf">output for descriptive proportions</a></li> <li>match the cdc's example sas and sudaan syntax file's <a href="http://www.cdc.gov/nchs/tutorials/nhanes/downloads/Continuous/descriptive_percentile.pdf">output for descriptive percentiles</a></li> </ul> <br /> <b>replicate human exposure to chemicals report.R</b> <span style="background-color: yellow;">(user-contributed)</span><br /> <ul> <li> download, import, save, pool, merge, and recode the demographics file plus urinary bisphenol a (bpa) laboratory files </li> <li>log-transform some of the columns to calculate the geometric means and quantiles</li> <li>match the 2007-2008 statistics shown on pdf page 21 of <a href="http://www.cdc.gov/exposurereport/pdf/FourthReport_UpdatedTables_Sep2012.pdf#page=21">the cdc's fourth edition of the report</a></li> </ul> <br /> <br /> <a href="https://github.com/ajdamico/usgsd/tree/master/National%20Health%20and%20Nutrition%20Examination%20Survey">click here to view these five scripts </a><br /> <br /> <br /> for more detail about the national health and nutrition examination survey (nhanes), visit:<br /> <ul> <li><a href="http://www.cdc.gov/nchs/nhanes.htm">the cdc's nhanes homepage</a></li> <li><a href="http://riskfactor.cancer.gov/tools/nhanes_tutorial.html">the national cancer institute's page of nhanes web tutorials</a></li> </ul> <br /> <br /> notes:<br /> <br /> nhanes includes interview-only weights and interview + mobile examination component (mec) weights. if you only use questions from the basic interview in your analysis, use the interview-only weights (the sample size is a bit larger). 
i haven't really figured out a use for the interview-only weights -- nhanes draws most of its power from the combination of the interview and the mobile examination component variables. if you're only using variables from the interview, see if you can use a data set with a larger sample size like the <a href="http://usgsd.blogspot.com/search/label/current%20population%20survey%20%28cps%29">current population survey (cps)</a>, <a href="http://usgsd.blogspot.com/search/label/national%20health%20interview%20survey%20%28nhis%29">national health interview survey (nhis)</a>, or <a href="http://usgsd.blogspot.com/search/label/medical%20expenditure%20panel%20survey%20%28meps%29">medical expenditure panel survey (meps)</a> instead.<br /> <br /> <br /> confidential to sas, spss, stata, sudaan users: why are you still riding around on a donkey after we've invented the internal combustion engine? time to transition to r. :D
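the weight choice and the pooling of two-year cycles described in this abstract can be sketched roughly as follows. this is a hedged illustration rather than the repository's code: the data are made up, the column names (WTINT2YR, WTMEC2YR, SDMVPSU, SDMVSTRA, and LBXTC for total cholesterol) are assumed from the cdc's public documentation, WT4YR is a hypothetical name introduced here, and halving the two-year weight when pooling two cycles assumes the cdc's analytic guidance for cycles from 2001-2002 onward.

```r
# a minimal sketch under stated assumptions, not the repository's script.

# toy stand-ins for two already-merged two-year cycles (made-up values)
cycle_a <- data.frame(
  SEQN     = 1:4 ,
  SDMVSTRA = c( 44 , 44 , 45 , 45 ) ,
  SDMVPSU  = c( 1 , 2 , 1 , 2 ) ,
  WTINT2YR = c( 9000 , 11000 , 8000 , 12000 ) ,
  WTMEC2YR = c( 9500 , 11500 , 8400 , 12600 ) ,
  LBXTC    = c( 180 , 210 , 195 , 240 )
)
cycle_b <- data.frame(
  SEQN     = 5:8 ,
  SDMVSTRA = c( 59 , 59 , 60 , 60 ) ,
  SDMVPSU  = c( 1 , 2 , 1 , 2 ) ,
  WTINT2YR = c( 10000 , 9000 , 13000 , 7000 ) ,
  WTMEC2YR = c( 10400 , 9300 , 13700 , 7200 ) ,
  LBXTC    = c( 170 , 225 , 205 , 190 )
)

# pool the cycles by stacking them
pooled <- rbind( cycle_a , cycle_b )

# any examination or laboratory variable in the analysis calls for the mec weight;
# an interview-only analysis would use the interview weight instead
uses_exam_data <- TRUE
weight_var <- if ( uses_exam_data ) "WTMEC2YR" else "WTINT2YR"

# assumption: with two cycles pooled, the four-year weight is half the two-year weight
n_cycles <- 2
pooled$WT4YR <- pooled[[ weight_var ]] / n_cycles   # WT4YR is a hypothetical name

# build the complex sample survey object only if the survey package is installed
if ( requireNamespace( "survey" , quietly = TRUE ) ) {
  nhanes_design <- survey::svydesign(
    ids     = ~ SDMVPSU ,
    strata  = ~ SDMVSTRA ,
    weights = ~ WT4YR ,
    nest    = TRUE ,
    data    = pooled
  )
  # weighted mean of total cholesterol with a design-based standard error
  print( survey::svymean( ~ LBXTC , nhanes_design ) )
}
```

nest = TRUE is there because the psu codes repeat within each stratum. in a real analysis, respondents who skipped the exam would be excluded before estimating anything from laboratory variables.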
Methodology and Processing
Sources Statement
Data Access
Notes:	<a href="http://creativecommons.org/publicdomain/zero/1.0">CC0 1.0</a>
Other Study Description Materials