Replication data for: Computer-Assisted Text Analysis for Comparative Politics (doi:10.7910/DVN/MPU019)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

(external link)

Document Description

Citation

Title:

Replication data for: Computer-Assisted Text Analysis for Comparative Politics

Identification Number:

doi:10.7910/DVN/MPU019

Distributor:

Harvard Dataverse

Date of Distribution:

2015-04-23

Version:

1

Bibliographic Citation:

Lucas, Christopher; Nielsen, Richard A.; Roberts, Margaret E.; Stewart, Brandon M.; Storer, Alex; Tingley, Dustin, 2015, "Replication data for: Computer-Assisted Text Analysis for Comparative Politics", https://doi.org/10.7910/DVN/MPU019, Harvard Dataverse, V1

Study Description

Citation

Title:

Replication data for: Computer-Assisted Text Analysis for Comparative Politics

Identification Number:

doi:10.7910/DVN/MPU019

Authoring Entity:

Lucas, Christopher (Harvard University)

Nielsen, Richard A. (MIT)

Roberts, Margaret E. (UCSD)

Stewart, Brandon M. (Harvard University)

Storer, Alex (Stanford GSB)

Tingley, Dustin (Harvard University)

Date of Production:

2015

Distributor:

Harvard Dataverse

Distributor:

Dataverse

Access Authority:

Dustin Tingley

Date of Deposit:

2015-03-08

Series Name:

Forthcoming

Holdings Information:

https://doi.org/10.7910/DVN/MPU019

Study Scope

Keywords:

stm, text, comparative

Abstract:

Recent advances in research tools for the systematic analysis of textual data are enabling exciting new research throughout the social sciences. For comparative politics, scholars who are often interested in non-English and possibly multilingual textual datasets, these advances may be difficult to access. This article discusses practical issues that arise in the processing, management, translation, and analysis of textual data with a particular focus on how procedures differ across languages. These procedures are combined in two applied examples of automated text analysis using the recently introduced Structural Topic Model. We also show how the model can be used to analyze data that have been translated into a single language via machine translation tools. All the methods we describe here are implemented in open-source software packages available from the authors.

Time Period:

2015-2015

Methodology and Processing

Sources Statement

Data Access

Notes:

<a href="http://creativecommons.org/publicdomain/zero/1.0">CC0 1.0</a>

Other Study Description Materials

Related Publications

Citation

Title:

Forthcoming, Political Analysis

Identification Number:

10.1093/pan/mpu019

Bibliographic Citation:

Forthcoming, Political Analysis

Other Study-Related Materials

Label:

analysis_replication.R

Text:

Notes:

text/plain; charset=US-ASCII

Other Study-Related Materials

Label:

CombinedLuceneREPLICATION.RData

Text:

Notes:

application/x-rlang-transport

Other Study-Related Materials

Label:

fatwa_replication_public.R

Text:

Notes:

text/plain; charset=US-ASCII

Other Study-Related Materials

Label:

jihad_metadata_edited.csv

Text:

Notes:

text/plain; charset=US-ASCII

Other Study-Related Materials

Label:

Prepped.Translated.Docs.RData

Text:

Notes:

application/x-rlang-transport

Other Study-Related Materials

Label:

Prepped.Translated.Docs_TermByTerm.RData

Text:

Notes:

application/x-rlang-transport

Other Study-Related Materials

Label:

README.txt

Text:

Notes:

text/plain; charset=US-ASCII

Other Study-Related Materials

Label:

results_replication.R

Text:

Notes:

text/plain; charset=US-ASCII

Other Study-Related Materials

Label:

RichCheck102314-SM.RData

Text:

Notes:

application/x-rlang-transport

Other Study-Related Materials

Label:

SnowdenC-noRT-TermByTerm.RData

Text:

Notes:

application/x-rlang-transport

Other Study-Related Materials

Label:

SnowdenC-noRT.RData

Text:

Notes:

application/x-rlang-transport

Other Study-Related Materials

Label:

SnowdennoC-noRT.RData

Text:

Notes:

application/x-rlang-transport