Chinese Ministry of Foreign Affairs Press Conferences Corpus (CMFA PressCon) (doi:10.7910/DVN/BAKGET)

Document Description

Citation

Title:

Chinese Ministry of Foreign Affairs Press Conferences Corpus (CMFA PressCon)

Identification Number:

doi:10.7910/DVN/BAKGET

Distributor:

Harvard Dataverse

Date of Distribution:

2021-09-01

Version:

5

Bibliographic Citation:

Mochtak, Michal; Turcsanyi, Richard Q., 2021, "Chinese Ministry of Foreign Affairs Press Conferences Corpus (CMFA PressCon)", https://doi.org/10.7910/DVN/BAKGET, Harvard Dataverse, V5, UNF:6:5aIjqRFvBBIPqxGQtNC/aA== [fileUNF]

Study Description

Citation

Title:

Chinese Ministry of Foreign Affairs Press Conferences Corpus (CMFA PressCon)

Identification Number:

doi:10.7910/DVN/BAKGET

Authoring Entity:

Mochtak, Michal (Radboud University)

Turcsanyi, Richard Q. (Palacky University Olomouc)

Date of Production:

2025-01-02

Distributor:

Harvard Dataverse

Access Authority:

Mochtak, Michal

Depositor:

Mochtak, Michal

Date of Deposit:

2025-01-02

Date of Distribution:

2025-01-02

Holdings Information:

https://doi.org/10.7910/DVN/BAKGET

Study Scope

Keywords:

Computer and Information Science, Social Sciences, corpus, China, foreign affairs, text as data, discourse

Topic Classification:

international relations, political science, natural language processing, quantitative text analysis

Abstract:

The repository contains an original corpus of Chinese Ministry of Foreign Affairs press conferences (CMFA PressCon), mapping two decades of Chinese diplomatic discourse and foreign policy priorities. The dataset is organized around a question/response structure extracted from the official transcripts of press conferences held between 15 October 2002 and 31 December 2024. The current version (v5) contains 33 199 data points.

Time Period:

2002-10-15 to 2024-12-31

Notes:

We were not able to check all 33 199 question/answer dyads manually, so the dataset may contain errors. As we plan to keep maintaining and updating the dataset, please contact us if you spot anything problematic or anything that needs to be fixed systematically. We will include the changes in the next release.
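Because the dyads were not all checked by hand, users may want to run basic consistency checks after loading the data. The following is a minimal sketch in Python, not part of the dataset or its codebook: it only tests records against the time period and the v5 row count stated above. The function name check_records and the column name "date" are hypothetical; consult CODEBOOK_CMFA_PressCon_v5.pdf for the actual variable names and date format.

```python
from datetime import date

# Values documented in the study description for v5 of the corpus.
PERIOD_START = date(2002, 10, 15)
PERIOD_END = date(2024, 12, 31)
EXPECTED_ROWS = 33199


def check_records(records, date_field="date"):
    """Return a list of human-readable problems found in `records`.

    `records` is an iterable of dicts (one per question/answer dyad).
    `date_field` is a HYPOTHETICAL column name assumed to hold an
    ISO date string (YYYY-MM-DD); adjust it to the real codebook name.
    """
    problems = []
    count = 0
    for i, rec in enumerate(records):
        count += 1
        raw = rec.get(date_field)
        try:
            day = date.fromisoformat(raw)
        except (TypeError, ValueError):
            # Missing value or a string that is not an ISO date.
            problems.append(f"row {i}: unparseable date {raw!r}")
            continue
        if not (PERIOD_START <= day <= PERIOD_END):
            problems.append(f"row {i}: date {day} outside documented period")
    if count != EXPECTED_ROWS:
        problems.append(f"expected {EXPECTED_ROWS} rows, found {count}")
    return problems
```

The records can come from any loader that yields one dict per row, for example rows read from CMFA_PressCon_v5.xlsx with a spreadsheet library of your choice. An empty result means only that these two checks passed, not that the data are free of errors.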

Methodology and Processing

Sources Statement

Data Sources:

Chinese Ministry of Foreign Affairs (https://www.fmprc.gov.cn/mfa_eng/); Wayback Machine (http://web.archive.org/)

Data Access

Conditions:

https://creativecommons.org/licenses/by-sa/4.0/

Notes:

Creative Commons Attribution-ShareAlike 4.0 International Public License (CC BY-SA 4.0)

Other Study Description Materials

Related Publications

Citation

Title:

When using the CMFA PressCon data, please cite: Mochtak, Michal and Richard Q. Turcsanyi (2021): "Studying Chinese Foreign Policy Narratives: Introducing the Ministry of Foreign Affairs Press Conferences Corpus". Journal of Chinese Political Science, 26 (4): 743-761.

Identification Number:

10.1007/s11366-021-09762-3

Bibliographic Citation:

When using the CMFA PressCon data, please cite: Mochtak, Michal and Richard Q. Turcsanyi (2021): "Studying Chinese Foreign Policy Narratives: Introducing the Ministry of Foreign Affairs Press Conferences Corpus". Journal of Chinese Political Science, 26 (4): 743-761.

Other Study-Related Materials

Label:

changelog.txt

Notes:

text/plain

Other Study-Related Materials

Label:

CMFA_PressCon_annotated_corpus_answers_v5.RDS

Notes:

application/gzip

Other Study-Related Materials

Label:

CMFA_PressCon_annotated_corpus_questions_v5.RDS

Notes:

application/gzip

Other Study-Related Materials

Label:

CMFA_PressCon_v5.xlsx

Notes:

application/vnd.openxmlformats-officedocument.spreadsheetml.sheet

Other Study-Related Materials

Label:

CODEBOOK_CMFA_PressCon_v5.pdf

Notes:

application/pdf

Other Study-Related Materials

Label:

readme.txt

Notes:

text/plain