The China Biographical Database (CBDB) is a freely accessible relational database with biographical information about approximately 521,442 individuals as of August 2022, primarily from the 7th through 19th centuries. It is available in both online and offline versions, and the data is meant to support statistical, social network, and spatial analysis as well as to serve as a kind of biographical reference. The image below shows the spatial distribution of a cross-dynastic subset of 190,000 people in CBDB by basic affiliation (籍貫).

The long-term goal of CBDB is to include systematically all significant biographical material from China’s historical record and to make the contents available free of charge, without restriction, for academic use. The data is regularly enriched, and new biographical entries are being created for Tang, Five Dynasties, Liao, Song, Jin, Yuan, Ming, and Qing figures.

CBDB originates with the work of Robert M. Hartwell (1932–1996). Professor Hartwell bequeathed his estate, including the first version of this database, to the Harvard-Yenching Institute, which ceded its ownership.

The development of CBDB is now a joint project of:

Fairbank Center for Chinese Studies at Harvard University (費正清中國研究中心)
Institute of History and Philology of Academia Sinica (中研院歷史語言研究所)
Center for Research on Ancient Chinese History at Peking University (北京大學中國古代史研究中心)

1 to 10 of 37 Results
Sep 9, 2024
Wang, Hongsu, 2021, "ACCESS and SQLite DB Version (latest)", https://doi.org/10.7910/DVN/PAGGQS, Harvard Dataverse, V7
CBDB development log: https://projects.iq.harvard.edu/cbdb/cbdb-sources
Compressed Archive - 215.8 MB - MD5: 923e1e9a7b929f917d610f4f490ed17c
Compressed Archive - 231.0 MB - MD5: 420b8e3622286eee2b82d1e6935f1610
CBDB Access standalone database
Unknown - 837.2 MB - MD5: 22941352277549ead20e8e6faab9900b
Jun 13, 2023
Wang, Hongsu, 2023, "Index of the Complete Prose of the Yuan Dynasty(全元文)", https://doi.org/10.7910/DVN/HTGBQ3, Harvard Dataverse, V1, UNF:6:3XGdmZzwMCHJA2aZyBa0ww== [fileUNF]
Prepared for CBDB by Doctor Chen Wen-yi https://www1.ihp.sinica.edu.tw/en/Fellows/Wen-yi_Chen
Tabular Data - 2.5 MB - 5 Variables, 40199 Observations - UNF:6:3XGdmZzwMCHJA2aZyBa0ww==
7Z Archive - 209.7 MB - MD5: 52aaf9e065cbdc37c04917a04835aa21
CBDB be_20220727 version Access database
7Z Archive - 98.3 MB - MD5: 0a3fd957b0c613099c2bcb93860bb9a7
CBDB 20220727 version SQLite database
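Once the SQLite archive has been extracted, the database can be explored offline with Python's built-in `sqlite3` module. The sketch below makes no assumptions about the CBDB schema: it only lists whatever tables the file contains and counts the rows in one of them. The function names and the database path in the usage comment are hypothetical.

```python
import sqlite3
from contextlib import closing


def list_tables(db_path):
    """Return the names of all tables in a SQLite file, sorted alphabetically."""
    with closing(sqlite3.connect(db_path)) as conn:
        rows = conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
    return [name for (name,) in rows]


def count_rows(db_path, table):
    """Count the rows of one table; the name must be a table present in the file."""
    if table not in list_tables(db_path):
        raise ValueError(f"unknown table: {table!r}")
    # Quote the identifier, since table names cannot be bound as parameters.
    quoted = '"' + table.replace('"', '""') + '"'
    with closing(sqlite3.connect(db_path)) as conn:
        (total,) = conn.execute(f"SELECT COUNT(*) FROM {quoted}").fetchone()
    return total


# Usage (hypothetical path to the extracted database file):
# for name in list_tables("cbdb_20220727.sqlite3"):
#     print(name, count_rows("cbdb_20220727.sqlite3", name))
```

Listing the tables first is a cautious way to learn the actual schema before writing queries against specific tables or columns.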
Jan 20, 2023
Luo, Queenie, 2023, "korean-romanization-transformer", https://doi.org/10.7910/DVN/16PSZE, Harvard Dataverse, V2
This dataset stores the korean-romanization-transformer model, and two tokenizer files.
Unknown - 104.7 MB - MD5: d299e2ad761c5296a3720f4e8119c343