81 to 90 of 111 Results
Jan 6, 2020
Cheng, Jonathan, 2020, "Replication Data for: Fleshing Out Models of Gender in English-Language Novels", https://doi.org/10.7910/DVN/QUGW8V, Harvard Dataverse, V1, UNF:6:x2iDNeioSI9VzG8UcbRYIg== [fileUNF]
Code and data for research on characterization published as "Fleshing Out Models of Gender in English-Language Novels," Jonathan Cheng, Cultural Analytics 2019. The compressed character folder is a frozen version of the code. The metadata for ~13000 novels are contained in the article_metadata folder. All word frequencies and code used to produce e...
Sep 26, 2019
Smeets, Roel; Deijl, Lucas van der; Bosch, Antal van den, 2019, "Replication Data for: "The Canon of Dutch Literature According to Google"", https://doi.org/10.7910/DVN/T79QZE, Harvard Dataverse, V1, UNF:6:qNTnSGrQXSNmVnkAygi4/Q== [fileUNF]
Data used to reconstruct the Google related searches network of Dutch authors. The comparison data on the Dutch authors in the registers of the Dutch literary histories (GNT) cannot be published because of copyright issues.
Jul 11, 2019
Arnold, Taylor, 2019, "Replication data for: "A Visual Style in Two Network Sitcoms" by Taylor Arnold, Lauren Tilton, and Annie Berke.", https://doi.org/10.7910/DVN/S84TSX, Harvard Dataverse, V1, UNF:6:17SsmaMc+98S967++7Pr7w== [fileUNF]
Datasets to support "A Visual Style in Two Network Sitcoms" by Taylor Arnold, Lauren Tilton, and Annie Berke. Data are available as CSV, and code is provided in R.
Mar 19, 2019
Vierthaler, Paul, 2019, "Replication Data for: A BLAST-based, Language-agnostic Text Reuse Algorithm with a MARKUS Implementation and Sequence Alignment Optimized for Large Chinese Corpora", https://doi.org/10.7910/DVN/2YYJ2B, Harvard Dataverse, V1
Code and sample corpus used for this article, which introduces a BLAST-based text reuse algorithm optimized for Chinese corpora. The code in this repository is under active development. The code assumes you are using the Anaconda distribution of Python 3.6 or later, and have installed the python-Levenshtein library. The sample corpus comes from Chr...
Feb 20, 2019
Liddle, Dallas, 2019, "Replication Data for: "Could Fiction Have an Information History? Statistical Probability and the Rise of the Novel"", https://doi.org/10.7910/DVN/M7AGWJ, Harvard Dataverse, V1, UNF:6:ZnqE0YBsVHbEZIVNu80UQg== [fileUNF]
Datasets to support "Could Fiction Have an Information History? Statistical Probability and the Rise of the Novel" by Dallas Liddle in Journal of Cultural Analytics (2019)
Jan 10, 2019
Long, Hoyt; So, Richard Jean; Zhu, Yuancheng, 2019, "Replication Data for: Race, Writing, and Computation: Racial Difference and the US Novel, 1880-2000", https://doi.org/10.7910/DVN/6ANTB8, Harvard Dataverse, V1
Code, metadata, extracted features, and figures for this study. Also contains a technical appendix referenced in the study.
Jan 4, 2019
Kraicer, Eve; Piper, Andrew, 2019, "Replication Data for: "Social Characters: The Hierarchy of Gender in Contemporary English-Language Fiction"", https://doi.org/10.7910/DVN/NFPJAQ, Harvard Dataverse, V1, UNF:6:WzI1ydrK0/WeHjFPWl/ErQ== [fileUNF]
Code, metadata and data for "Social Characters: The Hierarchy of Gender in Contemporary English-Language Fiction" by Eve Kraicer and Andrew Piper.
Jan 4, 2019
Guldi, Jo, 2019, "Critical Search: A procedure for guided reading in large-scale textual corpora", https://doi.org/10.7910/DVN/BJNAPD, Harvard Dataverse, V1, UNF:6:1wmPrNSwQHXqXCV3htlX7Q== [fileUNF]
This dataset contains full-scale visualizations as well as original data and code (in R and Python) to reproduce the figures and tables for "Critical Search." The data includes full-text data for the Hansard debates, and the code employs keyword search, topic modeling, and KL measurement.
Dec 2, 2018
Abuelwafa, Sherif, 2018, "Detecting Footnotes in 32 million pages of ECCO", https://doi.org/10.7910/DVN/FMZYFP, Harvard Dataverse, V1, UNF:6:DL6X/3z1H4y9aH9Np0hXzA== [fileUNF]
This dataset contains the metadata for both the training and test sets, in addition to the obtained results for ECCO I & II for “Detecting Footnotes in 32 million pages of ECCO,” Journal of Cultural Analytics (2018).
Nov 19, 2018
Warren, Christopher, 2018, "Replication Data for: Historiography’s Two Voices: Data Infrastructure and History at Scale in the Oxford Dictionary of National Biography (ODNB)", https://doi.org/10.7910/DVN/D3KFLP, Harvard Dataverse, V1, UNF:6:lR4DZP3El4yXbOApDx0z9g== [fileUNF]
Code and data to support “Historiography’s Two Voices: Data Infrastructure and History at Scale in the Oxford Dictionary of National Biography (ODNB),” appearing in the Journal of Cultural Analytics (2018). Repository includes Jupyter Notebook, Python code, CSVs, pickles, and Open Refine transformation JSONS. Raw Oxford Dictionary of National Biogr...