101 to 110 of 2,445 Results
Jun 17, 2023
Ma, Xiaoyi, 2005, "Chinese News Translation Text Part 1", https://doi.org/10.35111/9N1N-0Q43
Abstract Introduction Chinese News Translation Text Part 1 was developed by the Linguistic Data Consortium (LDC) and contains approximately 474,000 characters of Chinese text and corresponding English translations, totalling approximately 285,000 words. All the stories in this corpus were collected and all translations made as Machine Translation (...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Jun 17, 2023
NIST Multimodal Information Group, 2010, "NIST 2002 Open Machine Translation (OpenMT) Evaluation", https://doi.org/10.35111/63W9-A726
Abstract Introduction NIST 2002 Open Machine Translation (OpenMT) Evaluation is a package containing source data, reference translations, and scoring software used in the NIST 2002 OpenMT evaluation. It is designed to help evaluate the effectiveness of machine translation systems. The package was compiled and scoring software was developed by resea...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Jun 15, 2023
Statistics Canada, 2024, "National Travel Survey, 2021", https://hdl.handle.net/11272.1/AB2/VQFEXG, Statistics Canada
The National Travel Survey was developed to fully replace the Travel Survey of Residents of Canada (record number 3810) and replace the Canadian resident component of the International Travel Survey (record number 3152). The National Travel Survey collects information about the domestic and international travel of Canadian residents. The National T...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
May 11, 2023
Statistics Canada, 2024, "Survey on Before and After School Care in Canada, 2022", https://hdl.handle.net/11272.1/AB2/2VB9XT, Statistics Canada
The purpose of this survey is to address child care in Canada for children who are attending school (i.e. ages 4 to 12). The survey will ask about the different types of learning and child care arrangements used by families, difficulties some families may face when looking for care, as well as reasons for not using child care.This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Apr 27, 2023
Huang, Shudong; Walker, Kevin; Graff, David, 2023, "Mixer 3 Speech", https://doi.org/10.35111/S9JZ-3210
Abstract Introduction Mixer 3 Speech was developed by the Linguistic Data Consortium (LDC) and comprises 3,200 hours of audio recordings of conversational telephone speech involving 3,875 speakers and 26 distinct languages. This material was collected by LDC from 2005-2007 as part of the Mixer project, and recordings in this corpus were used in NIS...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Apr 27, 2023
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2023, "LORELEI Tamil Representative Language Pack", https://doi.org/10.35111/T18N-PY11
Abstract Introduction LORELEI Tamil Representative Language Pack (LDC2023T03) consists of Tamil monolingual text, Tamil-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium (LDC) for the DARPA LORELEI program. The LORELEI (Low Resource Languages for Emergent Incidents) pro...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Apr 27, 2023
Chen, Song; Bies, Ann; Griffitt, Kira; Ellis, Joe; Strassel, Stephanie, 2023, "DEFT English Light and Rich ERE Annotation", https://hdl.handle.net/11272.1/AB2/7KH7V4
Abstract Introduction DEFT English Light and Rich ERE Annotation was developed by the Linguistic Data Consortium (LDC) and consists of 1190 English discussion forum, newswire and proxy documents annotated for entities, relations and events (ERE). DARPA's Deep Exploration and Filtering of Text (DEFT) program aimed to address remaining capability gap...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Apr 27, 2023
Choi, Jinho D.; Han, Na-Rae; Hwang, Jena D.; Kim, Hansaem, 2023, "Penn Korean Universal Dependency Treebank", https://hdl.handle.net/11272.1/AB2/ZW25WL
Abstract Introduction Penn Korean Universal Dependency Treebank contains 5,010 sentences and 132,041 tokens annotated in dependency format under the Universal Dependencies framework. It is a conversion of Korean Treebank Annotations Version 2.0 (LDC2006T09) which was produced in constituency format. In general, dependency grammar is based on the id...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Apr 22, 2023
Statistics Canada, 2023, "Canada’s Core Public Infrastructure Survey (CCPI), 2020 Data tables", https://hdl.handle.net/11272.1/AB2/ZUPPXQ, Statistics Canada
The purpose of this survey is to collect statistical information on the inventory, condition, performance and asset management strategies of core public infrastructure assets owned or leased by various levels of Canadian government. The following 9 core public infrastructure assets are assessed: Bridge and tunnel assets Culture, recreation and spor...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Apr 13, 2023
Fisheries and Oceans Canada, 2024, "Canadian Hydrographic Service Non-Navigational (NONNA) Bathymetric Data [10m resolution]", https://hdl.handle.net/11272.1/AB2/YJPER2
The Canadian Hydrographic Service (CHS) offers a complete inventory of bathymetric data free to the general public for non-navigational use called ‘CHS NONNA’ for the ‘NON-NAvigational’ purpose of the data. The product is available in a spatial resolution of 10 metres. The CHS NONNA-P10 Packages are ZIP files that contain product coverage (resoluti...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |