1830.txt

This file is part of "Corpus of Historical American English (COHA)".

Version 1.3
File Citation
Davies, Mark, 2015, "1830.txt", Corpus of Historical American English (COHA), https://doi.org/10.7910/DVN/8SRSYK/HWHNRV, Harvard Dataverse, V1
Dataset Citation
Davies, Mark, 2015, "Corpus of Historical American English (COHA)", https://doi.org/10.7910/DVN/8SRSYK, Harvard Dataverse, V1
File Metrics
30 Downloads

This file has already been deleted (or replaced) in the current version. It may not be edited.

Dataset Terms

This dataset is made available under the following terms.

Our Community Norms as well as good scientific practices expect that proper credit is given via citation. Please use the data citation shown on the dataset page.

Custom Dataset Terms - the following Custom Dataset Terms have been defined for this dataset.

Licensed electronic resources are restricted to members of the MIT community and for the purposes of research, education, and scholarship. Under MIT's licenses for electronic resources, users generally may not:
- redistribute the materials or permit anyone other than a member of the MIT community to use them
- remove, obscure or modify any copyright or other notices included in the materials
- use the materials for commercial purposes.

Users are individually responsible for compliance with these terms. This data is restricted to members of the MIT community for educational, scholarly, and research purposes. In no case can the data be distributed beyond the MIT community, even in joint research with individuals at other institutions.

1. In no case can substantial amounts of the full-text data (typically, a total of 50,000 words or more) be distributed outside the organization listed on the license agreement. For example, you cannot create a large word list or set of n-grams and then distribute this to others, and you could not copy 70,000 words from different texts and then place this on a website where users from outside your organization would have access to the data.
2. The data cannot be placed on a network (including the Internet), unless access to the data is limited (via restricted login, password, etc.) just to those from the MIT community. In addition to the full-text data itself, #2 also applies to derived frequency, collocates, n-grams, concordance and similar data that is based on the corpus.
3. If portions of the derived data are made available to others, they cannot include substantial portions of the raw frequency of words (e.g. the word occurs 3,403 times in the corpus) or the rank order (e.g. it is the 304th most common word). (Note: it is acceptable to use the frequency data to place words and phrases in "frequency bands", e.g. words 1-1000, 1001-3000, 3001-10,000, etc. However, there should not be more than about 20 frequency bands in your application.)
4. Any publications or products that are based on the data should contain a reference to the source of the data: http://corpus.byu.edu/full-text.
5. Note that a small, unique change will be made to each set of data, and this will serve as a "fingerprint" to identify you as the source of this data. Automated Google searches are run daily to find copies of the data on the Web. If the data that is sent to you is found outside of your organization, you will make a reasonable effort to contact the administrators for that web page or website, to have the data removed.