1 to 10 of 21 Results
Apr 30, 2025
Sood, Gaurav, 2017, "CNN Transcripts 2000--2025", https://doi.org/10.7910/DVN/ISDPJU, Harvard Dataverse, V5
GitHub Repository: https://github.com/notnews/cnn_transcripts Related Datasets CNN Transcripts 2000—2025 Top News: Story URLs and Text from News Feeds of Major National News Sites (2022 to 03/2025) MSNBC Transcripts: 2003—2022 Fox News Transcripts (2003—2025) Closed Caption News Transcripts from the Internet Archive (2014—2023) |
Apr 28, 2025
Sood, Gaurav, 2016, "Warren’s TV and Cable Factbook data (1997--2002)", https://doi.org/10.7910/DVN/GTSTEY, Harvard Dataverse, V3
Warren’s TV and Cable Factbook Data (1997--2002). Data on cable operators and channels offered in each town in the US. Please read: https://www.gojiberries.io/town-level-data-on-cable-operators-and-cable-channels/ Note: Those who found this data interesting may also be interested in: 1996 Scans of TV Stations |
Mar 21, 2025
Sood, Gaurav, 2022, "Fox News Transcripts (2003--2025)", https://doi.org/10.7910/DVN/Q2KIES, Harvard Dataverse, V2
Transcripts for Fox News Scripts: FNC Related Data CNN Transcripts MSNBC Transcripts Fox News Transcripts |
Aug 20, 2023
Sood, Gaurav; Laohaprapanon, Suriyan, 2022, "Closed Caption News Transcripts from the Internet Archive (2014--2023)", https://doi.org/10.7910/DVN/OAJJHI, Harvard Dataverse, V3, UNF:6:QDjFCPMzxIXpoNWCI+1eNQ== [fileUNF]
Closed Caption News Transcripts from the Internet Archive (2014--2023). The nc- files are ones where the commercials have been stripped out using the data from https://tvnews.stanford.edu/export/commercial For scripts underlying the data pull, see: https://github.com/notnews/archive_news_cc |
Apr 28, 2023
Sood, Gaurav; Chintalapati, Rajashekar, 2022, "Piedomains: Predict the kind of content hosted by a domain using domain name and homepage text", https://doi.org/10.7910/DVN/YHWCDC, Harvard Dataverse, V5
Piedomains: Predict the kind of content hosted by a domain using domain name and homepage text. See: https://github.com/themains/piedomains |
Dec 14, 2022
Sood, Gaurav; Laohaprapanon, Suriyan; Sanjeevi, Madhu; Tran, Khanh, 2018, "Parsed Indian Electoral Rolls", https://doi.org/10.7910/DVN/MUEGDT, Harvard Dataverse, V25, UNF:6:Jul7B4cP7aiLbKdAC1sM5g== [fileUNF]
For getting the pdfs of electoral rolls, see https://github.com/in-rolls/electoral_rolls. For scripts used to parse the electoral rolls, see https://github.com/in-rolls/parse_elex_rolls If you would like access to the electoral rolls, please fill out the following form. You will need to also get IRB approval from your respective university or insti... |
Dec 9, 2022
Sood, Gaurav, 2022, "YouGov Pulse Data for 1200 people for June 2022", https://doi.org/10.7910/DVN/VIV4TS, Harvard Dataverse, V3, UNF:6:d9TUehTh/88K88T3gUX5IA== [fileUNF]
YouGov Pulse Data on 1200 people for June 2022. (This data seems to have come from RealityMine.) YouGov had initially accidentally sent data on only 900 respondents. On December 9th, they fixed the issue. I have kept both sets of files up. |
Nov 16, 2022
Sood, Gaurav, 2022, "Bihar Ration Card Data 2022", https://doi.org/10.7910/DVN/CAVZMQ, Harvard Dataverse, V1
Bihar Ration Card Data. See the scripts used to collect the data: https://github.com/in-rolls/ration_bihar |
Oct 8, 2022
Sood, Gaurav; Laohaprapanon, Suriyan, 2022, "Shallalist Web Page Data", https://doi.org/10.7910/DVN/ZXTQ7V, Harvard Dataverse, V5, UNF:6:5fL+1RYgg1AUNwuRDqlQOg== [fileUNF]
Web pages of domains in shallalist |
Jul 20, 2022
Green, Don; Metzger, Oliver; Sood, Gaurav; Zee, Michelle, 2022, "AAUW Data 97th--116th Congress", https://doi.org/10.7910/DVN/HD5VHI, Harvard Dataverse, V2, UNF:6:fXUVpUTN1n3HBL8BuFXUOg== [fileUNF]
AAUW positions + rollcalls and cosponsorships + bills directly related to women's issues. |