Featured Dataverses

In order to use this feature you must have at least one published or linked dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

51 to 60 of 2,445 Results
Oct 18, 2023
Delgado, Dana; Jones, Karen; Walker, Kevin; Strassel, Stephanie; Caruso, Christopher; Graff, David, 2023, "2019 OpenSAT Public Safety Communications Simulation", https://doi.org/10.35111/7Z20-JG48
Abstract Introduction 2019 OpenSAT Public Safety Communications Simulation was developed by the Linguistic Data Consortium (LDC) and contains approximately 141 hours of speech recordings and transcripts used in the used in the National Institute of Standards and Technology (NIST) Open Speech Analytic Technologies (OpenSAT) 2019 evaluation's automat...
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Oct 17, 2023
Miller, David; Walker, Kevin; Graff, David; Canavan, Alexandra, 2023, "CALLFRIEND Russian Speech", https://doi.org/10.35111/X1S2-XV64
Abstract Introduction CALLFRIEND Russian Speech (LDC2023S08) was developed by the Linguistic Data Consortium (LDC) and consists of approximately 48 hours of telephone conversations (100 recordings) between native speakers of Russian. The calls were recorded in 1999 as part of the CALLFRIEND collection. One hundred native Russian speakers living in...
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Oct 12, 2023
Statistics Canada, 2023, "National Graduates Survey - Public Use Microdata File, 2015 (time of graduation), 2018 (time of interview)", https://doi.org/10.25318/81M0011X-ENG, Statistics Canada
Data from this survey will be used to better understand the experiences and outcomes of graduates, and to improve government programs. The survey is designed to collect details on topics such as: i) the extent to which graduates of postsecondary programs have been successful in obtaining employment since graduation; ii) the relationship between the...
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Oct 11, 2023
Statistics Canada, 2023, "General Social Survey Cycle 34: Canadians' Safety (Victimization), 2019", https://hdl.handle.net/11272.1/AB2/TY08CB, Statistics Canada
The main objective of the GSS on Canadians' Safety is to better understand how Canadians perceive crime and the justice system and to capture information on their experiences of victimization. This survey is the only national survey of self-reported victimization and is collected in all provinces and territories. The survey allows for estimates of...
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Sep 29, 2023
Statistics Canada, 2022, "Canadian Income Survey, 2020", https://hdl.handle.net/11272.1/AB2/I6BDAC, Statistics Canada
The primary objective of the Canadian Income Survey (CIS) is to provide information on the income and income sources of Canadians, along with their individual and household characteristics. The data collected in the CIS is combined with Labour Force Survey (LFS, record number 3701) and tax data. The survey gathers information on labour market activ...
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Sep 13, 2023
Statistics Canada, 2022, "2021 Census Geographic Attribute File", https://hdl.handle.net/11272.1/AB2/BXLPEP, Statistics Canada
The 2021 Geographic Attribute File contains all the 2021 Census DBs and their selected attributes, such as standard geographic areas’ unique identifiers (UIDs), DGUIDs, population and dwelling counts, land area, 2021 Census incompletely enumerated Indian reserves and Indian settlements, and the corresponding DAs’ representative point coordinates. H...
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Aug 30, 2023
Li, Xuansong; Strassel, Stephanie; Jones, Karen; Antonishek, Brian; Fiscus, Jonathan G., 2022, "HAVIC MED Novel 1 Test -- Videos, Metadata and Annotation", https://doi.org/10.35111/1YBT-ZQ79
Abstract Introduction HAVIC MED Novel 1 Test -- Videos, Metadata and Annotation was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 3,800 hours of user-generated videos with annotation and metadata. To advance multimodal event detection and related technologies, LDC developed, in collaboration with NIST (the Nati...
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Aug 30, 2023
Morris, Amanda; Strassel, Stephanie; Li, Xuansong; Antonishek, Brian; Fiscus, Jonathan G., 2021, "HAVIC MED Training Data -- Videos, Metadata and Annotation", https://doi.org/10.35111/RAK4-XF36
Abstract Introduction HAVIC MED Training Data -- Videos, Metadata and Annotation was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 2,100 hours of user-generated videos with annotation and metadata. To advance multimodal event detection and related technologies, LDC developed, in collaboration with NIST (the Nat...
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Aug 30, 2023
Graff, David; Ma, Xiaoyi; Strassel, Stephanie; Walker, Kevin; Jones, Karen, 2021, "RATS Speaker Identification", https://doi.org/10.35111/ZQET-2102
Abstract Introduction RATS Speaker Identification was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 1,900 hours of Levantine Arabic, Farsi, Dari, Pashto and Urdu conversational telephone speech with annotations of speech segments. The audio was retransmitted over eight channels, making 17,000 hours of total aud...
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Aug 30, 2023
Brandschain, Linda; Walker, Kevin; Graff, David; Cieri, Christopher; Neely, Abby; Mirghafori, Nikki; Peskin, Barbara; Godfrey, Jack; Strassel, Stephanie; Goodman, Fred; Doddington, George R.; King, Mike, 2020, "Mixer 4 and 5 Speech", https://doi.org/10.35111/XQ98-YJ91
Abstract Introduction Mixer 4 and 5 Speech was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 14,185 hours of audio recordings of conversational telephone speech, interviews, elicitation exercises and transcript readings involving 616 distinct speakers. The material was collected in 2007 as part of the Mixer pro...
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Add Data

Sign up or log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.