Persistent Identifier
|
doi:10.7910/DVN/QBJE3V |
Publication Date
|
2025-02-03 |
Title
| Indexing status of journals using Open Journal Systems and related properties |
Author
| Chavarro, DiegoSchool of Publishing, Simon Fraser University, Burnaby, BC, CanadaORCIDhttps://orcid.org/0000-0001-9116-0891
Alperin, Juan PabloSchool of Publishing, Simon Fraser University, Burnaby, BC, CanadaORCIDhttps://orcid.org/0000-0002-9344-7439
Willinsky, JohnGraduate School of Education, Stanford University, Stanford, California, USAORCIDhttps://orcid.org/0000-0001-6192-8687 |
Point of Contact
|
Use email button above to contact.
Chavarro, Diego (School of Publishing, Simon Fraser University, Burnaby, BC, Canada) |
Description
| This dataset is a comprehensive, journal-level collection of metadata for 47,625 active journals that publish using the Open Journal Systems (OJS) platform. It covers the period from 2020 to 2023 and aggregates information from multiple sources, including the PKP Beacon, ISSN.org, DOI resolution services, and bibliographic indices such as OpenAlex, DOAJ, and Scopus. The dataset not only captures the basic journal identifiers and descriptive metadata but also a rich set of indicators that support multifaceted analysis of scholarly publishing practices. Key characteristics of the dataset include:
Identifiers and Journal Metadata: – Primary and secondary ISSNs validated against the official registry. – Journal titles as registered within OJS, with standardization measures applied (e.g., transliteration and phonetic comparisons). – A consolidated country of publication determined through multiple sources such as ISSN records, DOAJ listings, and IP-address based geolocation.
Publication Activity: – Annual record counts for the years 2020 through 2023 along with cumulative document counts. – Detailed measures of scholarly output per journal that allow evaluation of publication volume, which serves as a proxy for journal activity and editorial engagement.
Indexing and DOI Usage: – Indicators showing whether a journal is indexed in key bibliographic databases like OpenAlex, Scopus, and DOAJ. – Variables indicating whether the journal assigns Digital Object Identifiers (DOIs) through registration agencies (with specific fields for Crossref, DataCite, Medra, JALC, Airiti, etc.). – Matched counts of DOIs verified against external resolvers, highlighting the reliability and completeness of a journal's metadata.
Economic and Regional Context: – Data on the country’s income group and GDP per capita, which serve as proxies for the resource environment and infrastructural capacity available to each journal. – The total number of JUOJS identified per country, providing a measure of the national landscape of scholarly publishing.
Digital Presence and Repository Characteristics: – Web visibility metrics provided by Open PageRank scores for both the individual journal’s webpage and its hosting repository’s endpoint. – The size of the OJS repository (i.e., the number of journals hosted on the same installation), offering insight into shared infrastructure and editorial scale.
Linguistic and Disciplinary Classification: – Automated language detection results and aggregated language proportions, highlighting the degree to which journals publish in English versus non‑English languages. – A machine-learning derived subject classification assigning each journal to a main scholarly discipline, which enables discipline-specific analysis.
Designed for bibliometric and scientometric research, the dataset enables users to explore the relationships between a journal’s editorial practices, its digital identifier usage, national and economic contexts, and its likelihood of being indexed in inclusive scholarly databases. The extensive metadata and derived metrics support complex analyses, such as classification modeling to identify determinants of indexing in OpenAlex and factors associated with the adoption of Crossref DOIs. The dataset is contributes to exploring trends in global scholarly communication and assess structural disparities in the digital dissemination of knowledge. (2025-01-01) |
Subject
| Computer and Information Science; Social Sciences; Other |
Keyword
| Open Journal Systems (OJS)
OpenAlex
Scholarly Indexing
Digital Object Identifiers (DOIs)
Global Scholarly Visibility
Equitable Knowledge Representation |
Topic Classification
| Equitable Knowledge Representation |
Related Publication
| Is Supplement To: Chavarro, D., Alperin, J. P., Willinsky, J. (2025). On the Open Road to Universal Indexing: OpenAlex and Open Journal Systems. |
Notes
| The dataset is a snapshot of OJS until February April 2023. Related information not available in the snapshot was built in 2024. This includes World Bank's data, CrossRef, DOI.ORG, DOAJ, Scopus, among others. |
Language
| English |
Production Date
| 2025-01-31 |
Production Location
| Canada; Spain |
Contributor
| Data Collector: Chavarro, Diego
Data Curator: Chavarro, Diego
Project Leader: Alperin, Juan Pablo
Hosting Institution: Scholcommlab, Simon Fraser University, Canada |
Funding Information
| This research was supported by the Social Sciences and Humanities Research Council (SSHRC) Canada |
Distributor
| Scholcommlab (Scholcommlab, Simon Fraser University, Canada) |
Depositor
| Chavarro, Diego |
Deposit Date
| 2025-01-31 |
Time Period
| Start Date: 1966; End Date: 2023 |
Date of Collection
| Start Date: 2024-01-01; End Date: 2024-11-30 |
Data Type
| bibliographic data |
Software
| R
MySQL
Python |
Related Dataset
| doi:10.7910/DVN/OCZNVY |