Description
|
This is the main bulk data repository for the Digitally Accountable Public Representation (DAPR) Database, an innovative archive that systematically tracks and analyzes the online communications of federal, state, and local officials in the U.S. Focusing on X/Twitter and Facebook, the current database includes 28,834 public officials, their demographic information, and 5,769,904 Tweets along with 450,972 Facebook posts, dating from January 2020 to December 2024 for X/Twitter and January 2020 to December 2021 for Facebook, offering a rich historical perspective on digital political discourse by elected officials in the U.S.. To comply with the terms of data access on platform APIs, the raw post-level data is aggregated to the week level, and we disseminate content information as bags-of-words rather than the original raw text of posts. The data does include URLs to the original posts, which can be used to "rehydrate" the original data. We also distribute metadata on the officials in the DAPR database. Due to the size of the individual files, we disseminate each post data file in a compressed format. Note that due to changes in the Twitter/X API in 2023, as well as funding for the DAPR project, the scope and coverage of data collected from X shifted between 2024. See readme.pdf for details on the data as well as a description of how the data collection was affected by changes in the X API.
|