Cultural Analytics Dataverse

Featured Dataverses

In order to use this feature you must have at least one published or linked dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

81 to 90 of 111 Results

Replication Data for: Fleshing Out Models of Gender in English-Language Novels Jan 6, 2020 Cheng, Jonathan, 2020, "Replication Data for: Fleshing Out Models of Gender in English-Language Novels", https://doi.org/10.7910/DVN/QUGW8V, Harvard Dataverse, V1, UNF:6:x2iDNeioSI9VzG8UcbRYIg== [fileUNF] Code and data for research on characterization published as "Fleshing Out Models of Gender in English-Language Novels," Jonathan Cheng, Cultural Analytics 2019. The compressed character folder is a frozen version of the code. The metadata for ~13000 novels are contained in the article_metadata folder. All word frequencies and code used to produce e...
Replication Data for: "The Canon of Dutch Literature According to Google" Sep 26, 2019 Smeets, Roel; Deijl, Lucas van der; Bosch, Antal van den, 2019, "Replication Data for: "The Canon of Dutch Literature According to Google"", https://doi.org/10.7910/DVN/T79QZE, Harvard Dataverse, V1, UNF:6:qNTnSGrQXSNmVnkAygi4/Q== [fileUNF] Data used to reconstruct the Google related searches network of Dutch authors. The comparison data on the Dutch authors in the registers in the Dutch literary histories (GNT) cannot published because of copyright issues.
Replication data for: "A Visual Style in Two Network Sitcoms" by Taylor Arnold, Lauren Tilton, and Annie Berke. Jul 11, 2019 Arnold, Taylor, 2019, "Replication data for: "A Visual Style in Two Network Sitcoms" by Taylor Arnold, Lauren Tilton, and Annie Berke.", https://doi.org/10.7910/DVN/S84TSX, Harvard Dataverse, V1, UNF:6:17SsmaMc+98S967++7Pr7w== [fileUNF] Datasets to support "A Visual Style in Two Network Sitcoms" by Taylor Arnold, Lauren Tilton, and Annie Berke. Data available as CSV file and code provided in R.
Replication Data for: A BLAST-based, Language-agnostic Text Reuse Algorithm with a MARKUS Implementation and Sequence Alignment Optimized for Large Chinese Corpora Mar 19, 2019 Vierthaler, Paul, 2019, "Replication Data for: A BLAST-based, Language-agnostic Text Reuse Algorithm with a MARKUS Implementation and Sequence Alignment Optimized for Large Chinese Corpora", https://doi.org/10.7910/DVN/2YYJ2B, Harvard Dataverse, V1 Code and sample corpus used for this article, which introduces a BLAST-based text reuse algorithm optimized for Chinese corpora. The code in this repository is under active development. The code assumes you are using the Anaconda distribution of Python 3.6 or later, and have installed the python-Levenshtein library. The sample corpus comes from Chr...
Replication Data for: "Could Fiction Have an Information History? Statistical Probability and the Rise of the Novel" Feb 20, 2019 Liddle, Dallas, 2019, "Replication Data for: "Could Fiction Have an Information History? Statistical Probability and the Rise of the Novel"", https://doi.org/10.7910/DVN/M7AGWJ, Harvard Dataverse, V1, UNF:6:ZnqE0YBsVHbEZIVNu80UQg== [fileUNF] Datasets to support "Could Fiction Have an Information History? Statistical Probability and the Rise of the Novel" by Dallas Liddle in Journal of Cultural Analytics (2019)
Replication Data for: Race, Writing, and Computation: Racial Difference and the US Novel, 1880-2000 Jan 10, 2019 Long, Hoyt; So, Richard Jean; Zhu, Yuancheng, 2019, "Replication Data for: Race, Writing, and Computation: Racial Difference and the US Novel, 1880-2000", https://doi.org/10.7910/DVN/6ANTB8, Harvard Dataverse, V1 Code, metadata, extracted features, and figures for this study. Also contains a technical appendix referenced in the study.
Replication Data for: "Social Characters: The Hierarchy of Gender in Contemporary English-Language Fiction" Jan 4, 2019 Kraicer, Eve; Piper, Andrew, 2019, "Replication Data for: "Social Characters: The Hierarchy of Gender in Contemporary English-Language Fiction"", https://doi.org/10.7910/DVN/NFPJAQ, Harvard Dataverse, V1, UNF:6:WzI1ydrK0/WeHjFPWl/ErQ== [fileUNF] Code, metadata and data for "Social Characters: The Hierarchy of Gender in Contemporary English-Language Fiction" by Eve Kraicer and Andrew Piper.
Critical Search: A procedure for guided reading in large-scale textual corpora Jan 4, 2019 Guldi, Jo, 2019, "Critical Search: A procedure for guided reading in large-scale textual corpora", https://doi.org/10.7910/DVN/BJNAPD, Harvard Dataverse, V1, UNF:6:1wmPrNSwQHXqXCV3htlX7Q== [fileUNF] This dataset contains full-scale visualizations as well as original data and code (in R and Python) to reproduce the figures and tables for "Critical Search." The data includes full-text data for the Hansard debates, and the code employs keyword search, topic modeling, and KL measurement.
Detecting Footnotes in 32 million pages of ECCO Dec 2, 2018 Abuelwafa, Sherif, 2018, "Detecting Footnotes in 32 million pages of ECCO", https://doi.org/10.7910/DVN/FMZYFP, Harvard Dataverse, V1, UNF:6:DL6X/3z1H4y9aH9Np0hXzA== [fileUNF] This dataset contains the metadata for both the training and test sets, in addition to the obtained results for ECCO I & II for “Detecting Footnotes in 32 million pages of ECCO,” Journal of Cultural Analytics (2018).
Replication Data for: Historiography’s Two Voices: Data Infrastructure and History at Scale in the Oxford Dictionary of National Biography (ODNB) Nov 19, 2018 Warren, Christopher, 2018, "Replication Data for: Historiography’s Two Voices: Data Infrastructure and History at Scale in the Oxford Dictionary of National Biography (ODNB)", https://doi.org/10.7910/DVN/D3KFLP, Harvard Dataverse, V1, UNF:6:lR4DZP3El4yXbOApDx0z9g== [fileUNF] Code and data to support “Historiography’s Two Voices: Data Infrastructure and History at Scale in the Oxford Dictionary of National Biography (ODNB),” appearing in the Journal of Cultural Analytics (2018). Repository includes Jupyter Notebook, Python code, CSVs, pickles, and Open Refine transformation JSONS. Raw Oxford Dictionary of National Biogr...

Replication Data for: Fleshing Out Models of Gender in English-Language Novels

Jan 6, 2020

Cheng, Jonathan, 2020, "Replication Data for: Fleshing Out Models of Gender in English-Language Novels", https://doi.org/10.7910/DVN/QUGW8V, Harvard Dataverse, V1, UNF:6:x2iDNeioSI9VzG8UcbRYIg== [fileUNF]

Code and data for research on characterization published as "Fleshing Out Models of Gender in English-Language Novels," Jonathan Cheng, Cultural Analytics 2019. The compressed character folder is a frozen version of the code. The metadata for ~13000 novels are contained in the article_metadata folder. All word frequencies and code used to produce e...

Replication Data for: "The Canon of Dutch Literature According to Google"

Sep 26, 2019

Smeets, Roel; Deijl, Lucas van der; Bosch, Antal van den, 2019, "Replication Data for: "The Canon of Dutch Literature According to Google"", https://doi.org/10.7910/DVN/T79QZE, Harvard Dataverse, V1, UNF:6:qNTnSGrQXSNmVnkAygi4/Q== [fileUNF]

Data used to reconstruct the Google related searches network of Dutch authors. The comparison data on the Dutch authors in the registers in the Dutch literary histories (GNT) cannot published because of copyright issues.

Replication data for: "A Visual Style in Two Network Sitcoms" by Taylor Arnold, Lauren Tilton, and Annie Berke.

Jul 11, 2019

Arnold, Taylor, 2019, "Replication data for: "A Visual Style in Two Network Sitcoms" by Taylor Arnold, Lauren Tilton, and Annie Berke.", https://doi.org/10.7910/DVN/S84TSX, Harvard Dataverse, V1, UNF:6:17SsmaMc+98S967++7Pr7w== [fileUNF]

Datasets to support "A Visual Style in Two Network Sitcoms" by Taylor Arnold, Lauren Tilton, and Annie Berke. Data available as CSV file and code provided in R.

Replication Data for: A BLAST-based, Language-agnostic Text Reuse Algorithm with a MARKUS Implementation and Sequence Alignment Optimized for Large Chinese Corpora

Mar 19, 2019

Vierthaler, Paul, 2019, "Replication Data for: A BLAST-based, Language-agnostic Text Reuse Algorithm with a MARKUS Implementation and Sequence Alignment Optimized for Large Chinese Corpora", https://doi.org/10.7910/DVN/2YYJ2B, Harvard Dataverse, V1

Code and sample corpus used for this article, which introduces a BLAST-based text reuse algorithm optimized for Chinese corpora. The code in this repository is under active development. The code assumes you are using the Anaconda distribution of Python 3.6 or later, and have installed the python-Levenshtein library. The sample corpus comes from Chr...

Replication Data for: "Could Fiction Have an Information History? Statistical Probability and the Rise of the Novel"

Feb 20, 2019

Liddle, Dallas, 2019, "Replication Data for: "Could Fiction Have an Information History? Statistical Probability and the Rise of the Novel"", https://doi.org/10.7910/DVN/M7AGWJ, Harvard Dataverse, V1, UNF:6:ZnqE0YBsVHbEZIVNu80UQg== [fileUNF]

Datasets to support "Could Fiction Have an Information History? Statistical Probability and the Rise of the Novel" by Dallas Liddle in Journal of Cultural Analytics (2019)

Replication Data for: Race, Writing, and Computation: Racial Difference and the US Novel, 1880-2000

Jan 10, 2019

Long, Hoyt; So, Richard Jean; Zhu, Yuancheng, 2019, "Replication Data for: Race, Writing, and Computation: Racial Difference and the US Novel, 1880-2000", https://doi.org/10.7910/DVN/6ANTB8, Harvard Dataverse, V1

Code, metadata, extracted features, and figures for this study. Also contains a technical appendix referenced in the study.

Replication Data for: "Social Characters: The Hierarchy of Gender in Contemporary English-Language Fiction"

Jan 4, 2019

Kraicer, Eve; Piper, Andrew, 2019, "Replication Data for: "Social Characters: The Hierarchy of Gender in Contemporary English-Language Fiction"", https://doi.org/10.7910/DVN/NFPJAQ, Harvard Dataverse, V1, UNF:6:WzI1ydrK0/WeHjFPWl/ErQ== [fileUNF]

Code, metadata and data for "Social Characters: The Hierarchy of Gender in Contemporary English-Language Fiction" by Eve Kraicer and Andrew Piper.

Critical Search: A procedure for guided reading in large-scale textual corpora

Jan 4, 2019

Guldi, Jo, 2019, "Critical Search: A procedure for guided reading in large-scale textual corpora", https://doi.org/10.7910/DVN/BJNAPD, Harvard Dataverse, V1, UNF:6:1wmPrNSwQHXqXCV3htlX7Q== [fileUNF]

This dataset contains full-scale visualizations as well as original data and code (in R and Python) to reproduce the figures and tables for "Critical Search." The data includes full-text data for the Hansard debates, and the code employs keyword search, topic modeling, and KL measurement.

Detecting Footnotes in 32 million pages of ECCO

Dec 2, 2018

Abuelwafa, Sherif, 2018, "Detecting Footnotes in 32 million pages of ECCO", https://doi.org/10.7910/DVN/FMZYFP, Harvard Dataverse, V1, UNF:6:DL6X/3z1H4y9aH9Np0hXzA== [fileUNF]

This dataset contains the metadata for both the training and test sets, in addition to the obtained results for ECCO I & II for “Detecting Footnotes in 32 million pages of ECCO,” Journal of Cultural Analytics (2018).

Replication Data for: Historiography’s Two Voices: Data Infrastructure and History at Scale in the Oxford Dictionary of National Biography (ODNB)

Nov 19, 2018

Warren, Christopher, 2018, "Replication Data for: Historiography’s Two Voices: Data Infrastructure and History at Scale in the Oxford Dictionary of National Biography (ODNB)", https://doi.org/10.7910/DVN/D3KFLP, Harvard Dataverse, V1, UNF:6:lR4DZP3El4yXbOApDx0z9g== [fileUNF]

Code and data to support “Historiography’s Two Voices: Data Infrastructure and History at Scale in the Oxford Dictionary of National Biography (ODNB),” appearing in the Journal of Cultural Analytics (2018). Repository includes Jupyter Notebook, Python code, CSVs, pickles, and Open Refine transformation JSONS. Raw Oxford Dictionary of National Biogr...

Add Data

Share Dataverse

Link Dataverse

Reset Modifications