Replication Data for: Race, Legislative Speech, and Symbolic Representation in Congress (doi:10.7910/DVN/6RL6ID)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

Document Description
Citation
Title:	Replication Data for: Race, Legislative Speech, and Symbolic Representation in Congress
Identification Number:	doi:10.7910/DVN/6RL6ID
Distributor:	Harvard Dataverse
Date of Distribution:	2024-05-01
Version:	1
Bibliographic Citation:	Vishwanath, Arjun, 2024, "Replication Data for: Race, Legislative Speech, and Symbolic Representation in Congress", https://doi.org/10.7910/DVN/6RL6ID, Harvard Dataverse, V1
Study Description
Citation
Title:	Replication Data for: Race, Legislative Speech, and Symbolic Representation in Congress
Identification Number:	doi:10.7910/DVN/6RL6ID
Authoring Entity:	Vishwanath, Arjun (Vanderbilt University)
Producer:	Arjun Vishwanath
Distributor:	Harvard Dataverse
Access Authority:	Arjun Vishwanath
Depositor:	Vishwanath, Arjun
Date of Deposit:	2023-12-01
Holdings Information:	https://doi.org/10.7910/DVN/6RL6ID
Study Scope
Keywords:	Social Sciences, congress; race; representation
Abstract:	We know little about the extent to which racial minorities are symbolically represented by members of Congress. This stands in contrast to a wealth of research analyzing the extent to which minorities are substantively and descriptively represented. This article provides the most comprehensive analysis of symbolic representation to date. Using data on legislators' speech from 105,875 newsletters and 620,838 floor speeches, I find that white legislators of both parties are more likely to symbolically represent blacks, Hispanics, and Asians if those groups are more populous in their constituency. However, these effects only hold cross-sectionally; using a difference-in-differences setup from redistricting shocks, I find that there is little within-legislator variation in speech patterns as their constituencies change. Lastly, I show that, unlike on the symbolic dimension, legislators' substantive representation is not influenced by group size. I conclude that white legislators are symbolically responsive to their constituents' identities in their speech patterns.
Notes:	This dataset underwent an independent verification process, complying with the AJPS Verification Policy updated June 2023, which replicated the tables and figures in the primary article. For the supplementary materials, verification was performed solely for the successful execution of the code. The verification process was carried out by the Cornell Center for Social Sciences at Cornell University. <br></br> The associated article has been awarded the Open Materials Badge. Learn more about the Open Practice Badges from the <a href="https://www.cos.io/">Center for Open Science</a>. <br></br> <img src="https://socialsciences.cornell.edu/sites/default/files/2024-04/materials_large_color.png" alt="Open Materials Badge " width="60" height="60"> <br></br> Open Materials Badge
Methodology and Processing
Sources Statement
Data Sources:	Ansolabehere, Stephen; Schaffner, Brian, 2013, "CCES Common Content, 2012", https://doi.org/10.7910/DVN/HQEVPK, Harvard Dataverse, V9, UNF:5:Eg5SQysFZaPiXc8tEbmmRA== [fileUNF]
	<br></br> Ansolabehere, Stephen; Schaffner, Brian F., 2017, "CCES Common Content, 2016", https://doi.org/10.7910/DVN/GDF6Z0, Harvard Dataverse, V4, UNF:6:WhtR8dNtMzReHC295hA4cg== [fileUNF]
	<br></br> Cormack, Lindsey (2017). “DCInbox – Capturing Every Constituent E-newsletter from 2009 Onwards”. The Legislative Scholar 2 (1): 2-36
	<br></br> Gentzkow, Matthew, Jesse M. Shapiro, and Matt Taddy (2019). “Measuring Group Differences in High-Dimensional Choices: Method and Application to Congressional Speech”. Econometrica 87 (4): 1307-1340
	<br></br> Schaffner, Brian; Ansolabehere, Stephen, 2015, "CCES Common Content, 2014", https://doi.org/10.7910/DVN/XFXJVY, Harvard Dataverse, V5, UNF:6:WvvlTX+E+iNraxwbaWNVdg== [fileUNF]
	<br></br> Schaffner, Brian; Stephen Ansolabehere; Sam Luks, 2019, "CCES Common Content, 2018", https://doi.org/10.7910/DVN/ZSBZ7K, Harvard Dataverse, V6, UNF:6:hFVU8vQ/SLTMUXPgmUw3JQ== [fileUNF]
Data Access
Disclaimer:	The <i>American Journal of Political Science</i> and the Cornell Center for Social Sciences are not responsible for the accuracy or quality of data uploaded within the <i>AJPS</i> Dataverse, for the use of those data, or for interpretations or conclusions based on their use.
Other Study Description Materials
Other Study-Related Materials
Label:	cces_lccr_bound.rds
Text:	contains data from the 2012, 2014, 2016, and 2018 CCES
Notes:	application/gzip
Other Study-Related Materials
Label:	Codebook.pdf
Text:	describes the variables used in the analysis and contained in the datasets provided here
Notes:	application/pdf
Other Study-Related Materials
Label:	cs_analyzed.rds
Text:	contains floor speech data at the document-sentence level
Notes:	application/gzip
Other Study-Related Materials
Label:	cs_analyzed_doc_level.rds
Text:	contains floor speech data from cs_analyzed.rds aggregated at the document level
Notes:	application/gzip
Other Study-Related Materials
Label:	cs_analyzed_legislator_term.rds
Text:	contains the data from cs_analyzed.rds aggregated at the member-congress-chamber level
Notes:	application/gzip
Other Study-Related Materials
Label:	cs_analyzed_legislator_term_all.rds
Text:	contains the data from cs_analyzed.rds aggregated at the member-congress-chamber level but also includes speech on defense, immigration, and trade
Notes:	application/gzip
Other Study-Related Materials
Label:	cs_lasso_out_of_sample_validation.rds
Text:	contains a sample of floor speech sentences from the raw codings. Used to compare the Lasso model's predictions to the coders' classifications.
Notes:	application/gzip
Other Study-Related Materials
Label:	generate_tables_and_figures.R
Text:	contains the code necessary to reproduce all figures and tables reported in the main text and appendix
Notes:	type/x-r-syntax
Other Study-Related Materials
Label:	ideology_data.RDS
Text:	contains data on MCs' roll-call based ideology
Notes:	application/gzip
Other Study-Related Materials
Label:	mturk_codings.rds
Text:	contains pilot codings from MTurk
Notes:	application/gzip
Other Study-Related Materials
Label:	newsletters_analyzed.rds
Text:	contains newsletter data at the document-sentence level
Notes:	application/gzip
Other Study-Related Materials
Label:	newsletters_analyzed_doc_level.rds
Text:	contains newsletter data from newsletters_analyzed.rds aggregated at the document level
Notes:	application/gzip
Other Study-Related Materials
Label:	newsletters_analyzed_legislator_term.rds
Text:	contains the data from newsletters_analyzed.rds aggregated at the member-congress-chamber level
Notes:	application/gzip
Other Study-Related Materials
Label:	newsletters_analyzed_legislator_term_all.rds
Notes:	application/gzip
Other Study-Related Materials
Label:	newsletters_lasso_out_of_sample_validation.rds
Text:	contains a sample of newsletter sentences from the raw codings. Used to compare the Lasso model's predictions to the coders' classifications.
Notes:	application/gzip
Other Study-Related Materials
Label:	newsletter_count.rds
Text:	contains newsletter data at the member-congress-chamber level
Notes:	application/gzip
Other Study-Related Materials
Label:	ra_codings.rds
Text:	contains pilot codings from research assistants
Notes:	application/gzip
Other Study-Related Materials
Label:	README.txt
Text:	README file
Notes:	text/plain