Replication Data for: Race, Legislative Speech, and Symbolic Representation in Congress (doi:10.7910/DVN/6RL6ID)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

Document Description

Citation

Title:

Replication Data for: Race, Legislative Speech, and Symbolic Representation in Congress

Identification Number:

doi:10.7910/DVN/6RL6ID

Distributor:

Harvard Dataverse

Date of Distribution:

2024-05-01

Version:

1

Bibliographic Citation:

Vishwanath, Arjun, 2024, "Replication Data for: Race, Legislative Speech, and Symbolic Representation in Congress", https://doi.org/10.7910/DVN/6RL6ID, Harvard Dataverse, V1

Study Description

Citation

Title:

Replication Data for: Race, Legislative Speech, and Symbolic Representation in Congress

Identification Number:

doi:10.7910/DVN/6RL6ID

Authoring Entity:

Vishwanath, Arjun (Vanderbilt University)

Producer:

Arjun Vishwanath

Distributor:

Harvard Dataverse

Access Authority:

Arjun Vishwanath

Depositor:

Vishwanath, Arjun

Date of Deposit:

2023-12-01

Holdings Information:

https://doi.org/10.7910/DVN/6RL6ID

Study Scope

Keywords:

Social Sciences, congress; race; representation

Abstract:

We know little about the extent to which racial minorities are symbolically represented by members of Congress. This stands in contrast to a wealth of research analyzing the extent to which minorities are substantively and descriptively represented. This article provides the most comprehensive analysis of symbolic representation to date. Using data on legislators' speech from 105,875 newsletters and 620,838 floor speeches, I find that white legislators of both parties are more likely to symbolically represent blacks, Hispanics, and Asians if those groups are more populous in their constituency. However, these effects only hold cross-sectionally; using a difference-in-differences setup from redistricting shocks, I find that there is little within-legislator variation in speech patterns as their constituencies change. Lastly, I show that, unlike on the symbolic dimension, legislators' substantive representation is not influenced by group size. I conclude that white legislators are symbolically responsive to their constituents' identities in their speech patterns.

Notes:

This dataset underwent an independent verification process, complying with the AJPS Verification Policy updated June 2023, which replicated the tables and figures in the primary article. For the supplementary materials, verification was performed solely for the successful execution of the code. The verification process was carried out by the Cornell Center for Social Sciences at Cornell University. <br></br> The associated article has been awarded the Open Materials Badge. Learn more about the Open Practice Badges from the <a href="https://www.cos.io/">Center for Open Science</a>. <br></br> <img src="https://socialsciences.cornell.edu/sites/default/files/2024-04/materials_large_color.png" alt="Open Materials Badge " width="60" height="60"> <br></br> Open Materials Badge

Methodology and Processing

Sources Statement

Data Sources:

Ansolabehere, Stephen; Schaffner, Brian, 2013, "CCES Common Content, 2012", https://doi.org/10.7910/DVN/HQEVPK, Harvard Dataverse, V9, UNF:5:Eg5SQysFZaPiXc8tEbmmRA== [fileUNF]

<br></br> Ansolabehere, Stephen; Schaffner, Brian F., 2017, "CCES Common Content, 2016", https://doi.org/10.7910/DVN/GDF6Z0, Harvard Dataverse, V4, UNF:6:WhtR8dNtMzReHC295hA4cg== [fileUNF]

<br></br> Cormack, Lindsey (2017). “DCInbox – Capturing Every Constituent E-newsletter from 2009 Onwards”. The Legislative Scholar 2 (1): 2-36

<br></br> Gentzkow, Matthew, Jesse M. Shapiro, and Matt Taddy (2019). “Measuring Group Differences in High-Dimensional Choices: Method and Application to Congressional Speech”. Econometrica 87 (4): 1307-1340

<br></br> Schaffner, Brian; Ansolabehere, Stephen, 2015, "CCES Common Content, 2014", https://doi.org/10.7910/DVN/XFXJVY, Harvard Dataverse, V5, UNF:6:WvvlTX+E+iNraxwbaWNVdg== [fileUNF]

<br></br> Schaffner, Brian; Stephen Ansolabehere; Sam Luks, 2019, "CCES Common Content, 2018", https://doi.org/10.7910/DVN/ZSBZ7K, Harvard Dataverse, V6, UNF:6:hFVU8vQ/SLTMUXPgmUw3JQ== [fileUNF]

Data Access

Disclaimer:

The <i>American Journal of Political Science</i> and the Cornell Center for Social Sciences are not responsible for the accuracy or quality of data uploaded within the <i>AJPS</i> Dataverse, for the use of those data, or for interpretations or conclusions based on their use.

Other Study Description Materials

Other Study-Related Materials

Label:

cces_lccr_bound.rds

Text:

contains data from the 2012, 2014, 2016, and 2018 CCES

Notes:

application/gzip

Other Study-Related Materials

Label:

Codebook.pdf

Text:

describes the variables used in the analysis and contained in the datasets provided here

Notes:

application/pdf

Other Study-Related Materials

Label:

cs_analyzed.rds

Text:

contains floor speech data at the document-sentence level

Notes:

application/gzip

Other Study-Related Materials

Label:

cs_analyzed_doc_level.rds

Text:

contains floor speech data from cs_analyzed.rds aggregated at the document level

Notes:

application/gzip

Other Study-Related Materials

Label:

cs_analyzed_legislator_term.rds

Text:

contains the data from cs_analyzed.rds aggregated at the member-congress-chamber level

Notes:

application/gzip

Other Study-Related Materials

Label:

cs_analyzed_legislator_term_all.rds

Text:

contains the data from cs_analyzed.rds aggregated at the member-congress-chamber level but also includes speech on defense, immigration, and trade

Notes:

application/gzip

Other Study-Related Materials

Label:

cs_lasso_out_of_sample_validation.rds

Text:

contains a sample of floor speech sentences from the raw codings. Used to compare the Lasso model's predictions to the coders' classifications.

Notes:

application/gzip

Other Study-Related Materials

Label:

generate_tables_and_figures.R

Text:

contains the code necessary to reproduce all figures and tables reported in the main text and appendix

Notes:

type/x-r-syntax

Other Study-Related Materials

Label:

ideology_data.RDS

Text:

contains data on MCs' roll-call based ideology

Notes:

application/gzip

Other Study-Related Materials

Label:

mturk_codings.rds

Text:

contains pilot codings from MTurk

Notes:

application/gzip

Other Study-Related Materials

Label:

newsletters_analyzed.rds

Text:

contains newsletter data at the document-sentence level

Notes:

application/gzip

Other Study-Related Materials

Label:

newsletters_analyzed_doc_level.rds

Text:

contains newsletter data from newsletters_analyzed.rds aggregated at the document level

Notes:

application/gzip

Other Study-Related Materials

Label:

newsletters_analyzed_legislator_term.rds

Text:

contains the data from newsletters_analyzed.rds aggregated at the member-congress-chamber level

Notes:

application/gzip

Other Study-Related Materials

Label:

newsletters_analyzed_legislator_term_all.rds

Notes:

application/gzip

Other Study-Related Materials

Label:

newsletters_lasso_out_of_sample_validation.rds

Text:

contains a sample of newsletter sentences from the raw codings. Used to compare the Lasso model's predictions to the coders' classifications.

Notes:

application/gzip

Other Study-Related Materials

Label:

newsletter_count.rds

Text:

contains newsletter data at the member-congress-chamber level

Notes:

application/gzip

Other Study-Related Materials

Label:

ra_codings.rds

Text:

contains pilot codings from research assistants

Notes:

application/gzip

Other Study-Related Materials

Label:

README.txt

Text:

README file

Notes:

text/plain