SilicoData: An Annotated Benchmark CXR Dataset for Silicosis Detection

Version 3.0
Akhter, Yasmeena; Ranjan, Rishabh; Vatsa, Mayank; Singh, Richa; chaudhury, santanu; Anjali Agrawal; Shruti Aggarwal; Arjun Kalyanpur; Anurita Menon, 2025, "SilicoData: An Annotated Benchmark CXR Dataset for Silicosis Detection", https://doi.org/10.7910/DVN/QH199J, Harvard Dataverse, V3
Dataset Metrics
6 Downloads
Request Access
Edit File

This file has already been deleted (or replaced) in the current version. It may not be edited.

Restrict Access

Restricting limits access to published files. People who want to use the restricted files can request access by default. If you disable request access, you must add information about access to the Terms of Access field.

Learn about restricting files and dataset access in the User Guide.

Enable access request
You must enable request access or add terms of access to restrict file access.
Save Changes
Edit Embargo

The selected file or files have already been published. Contact an administrator to change the embargo date or reason of the file or files.

Edit Retention Period

The selected file or files have already been published. Contact an administrator to change the retention period date or reason of the file or files.

Delete Files

The file will be deleted after you click on the Delete button.

Files will not be removed from previously published versions of the dataset.


Select File(s)

Please select one or more files.

Share Dataset

Share this dataset on your favorite social media networks.

Continue
Dataset Citations

Citations for this dataset are retrieved from Crossref via DataCite using Make Data Count standards. For more information about dataset metrics, please refer to the User Guide.

Sorry, no citations were found.
Inaccessible Files Selected

The selected file(s) may not be downloaded because you have not been granted access or the file(s) have a retention period that has expired or the files can only be transferred via Globus.

You may request access to any restricted file(s) by clicking the Request Access button.

Ineligible Files Selected

The selected file(s) may not be transferred because you have not been granted access or the file(s) have a retention period that has expired or the files are not Globus accessible.

You may request access to any restricted file(s) by clicking the Request Access button.

Download Options

The files selected are too large to download as a ZIP.

You can select individual files that are below the 15.0 GB download limit from the files table, or use the Data Access API for programmatic access to the files.

Select File(s)

Please select a file or files to be downloaded.

Inaccessible Files Selected

The selected file(s) may not be downloaded because you have not been granted access or the file(s) have a retention period that has expired.

Click Continue to download the files you have access to download.

Ineligible Files Selected

Some file(s) cannot be transferred. (They are restricted, embargoed, with an expired retention period, or not Globus accessible.)

Click Continue to transfer the elligible files.

Delete Dataset

Are you sure you want to delete this dataset and all of its files? You cannot undelete this dataset.

Delete Draft Version

Are you sure you want to delete this draft version? Files will be reverted to the most recently published version. You cannot undelete this draft.

Unpublished Dataset Preview URL

Preview URL can only be used with unpublished versions of datasets.

Unpublished Dataset Preview URL

Are you sure you want to disable the Preview URL? If you have shared the Preview URL with others they will no longer be able to use it to access your unpublished dataset.

Delete Files

The file(s) will be deleted after you click on the Delete button.

Files will not be removed from previously published versions of the dataset.

Compute

This dataset contains restricted files you may not compute on because you have not been granted access.

Deaccession Dataset

Are you sure you want to deaccession? This is permanent and the selected version(s) will no longer be viewable by the public.

Deaccession Dataset

Are you sure you want to deaccession this dataset? This is permanent an it will no longer be viewable by the public.

Version Differences Details

Please select two versions to view the differences.

Version Differences Details
 
Version:
Last Updated:
Version:
Last Updated:
Select File(s)

Please select a file or files for access request.

Select File(s)

Embargoed files cannot be accessed. Please select an unembargoed file or files for your access request.

Edit Tags

Select existing file tags or create new tags to describe your files. Each file can have more than one tag.

Request Access

  You need to Sign Up or Log In to request access.

Dataset Terms

Please confirm and/or complete the information needed below in order to request access to files in this dataset.

This dataset is made available under the following terms. Please confirm and/or complete the information needed below in order to continue.

Our Community Norms as well as good scientific practices expect that proper credit is given via citation. Please use the data citation shown on the dataset page.

Custom Dataset Terms - the following Custom Dataset Terms have been defined for this dataset.

The researcher(s) agree to the following conditions on the Silicodata dataset: 1. The researcher(s) shall have no rights with respect to the Database or any portion thereof and shall not use the Database except as expressly outlined in this Agreement. 2. Re-identification is strictly prohibited. All recipients agree that they will not attempt to re-identify any individual data subjects from the Dataset. The recipients will not publish any information on an individual patient if the individual patient can be identified. Any re-identification of any individual data subject shall be immediately reported to the authors at databases@iab-rubric.org. 3. Subject to the terms and conditions of this agreement, the Silicodata dataset is available for academic and research use only, with a royalty-free, nonexclusive, non-transferable license subject to the following conditions: a. The Database must not be copied, distributed, published, or reproduced in any form except for creating a secure backup by the registered user. Sharing, transferring, or disclosing any part or the entirety of the dataset to third parties, in any form, is strictly prohibited without prior written authorization from the IAB Lab. Any individual or organization wishing to access or use the Dataset must independently register and agree to all terms and conditions outlined in this DUA. b. The Dataset must not be used for clinical purposes, including diagnosis or patient care, without appropriate approvals from relevant authorities. IIT Jodhpur and the IAB Lab assume no liability for any outcomes resulting from clinical use. c. Any violation of this DUA or other impermissible use shall be grounds for immediate termination of use of the Dataset d. Any work made public, whatever the form, based directly or indirectly on any part of the Database will include the following reference: Reference: (To be updated) Bibtex: @article{Akhter2024, author = "Yasmeena Akhter and Rishabh Ranjan and Mayank Vatsa and Richa Singh and Santanu Chaudhury and Anjali Agarwal and Shruti Aggarwal and Arjun Kalyanpur and Anurita Menon", title = "{Silicodata: An Annotated Benchmark CXR Dataset for Silicosis Detection}", year = "2025", month = " ", url="", doi = " "} Endnote: Akhter, Y. et al. Silicodata: An Annotated Benchmark CXR Dataset for Silicosis Detection.
To obtain access to the dataset, please email the duly filled-out license agreement to databases@iab-rubric.org with the subject line "Licence Agreement for the Silicodata Dataset."
The license agreement has to be signed by someone having the legal authority to sign on behalf of the institute, such as the head of the institution or registrar. If a license agreement is signed by someone else, it will not be processed further.
1. The researcher(s) shall have no rights with respect to the Database or any portion thereof and shall not use the Database except as expressly outlined in this Agreement. 2. Re-identification is strictly prohibited. All recipients agree that they will not attempt to re-identify any individual data subjects from the Dataset. The recipients will not publish any information on an individual patient if the individual patient can be identified. Any re-identification of any individual data subject shall be immediately reported to the authors at databases@iab-rubric.org. 3. Subject to the terms and conditions of this agreement, the Silicodata dataset is available for academic and research use only, with a royalty-free, nonexclusive, non-transferable license subject to the following conditions: a. The Database must not be copied, distributed, published, or reproduced in any form except for creating a secure backup by the registered user. Sharing, transferring, or disclosing any part or the entirety of the dataset to third parties, in any form, is strictly prohibited without prior written authorization from the IAB Lab. Any individual or organization wishing to access or use the Dataset must independently register and agree to all terms and conditions outlined in this DUA. b. The Dataset must not be used for clinical purposes, including diagnosis or patient care, without appropriate approvals from relevant authorities. IIT Jodhpur and the IAB Lab assume no liability for any outcomes resulting from clinical use. c. Any violation of this DUA or other impermissible use shall be grounds for immediate termination of use of the Dataset d. Any work made public, whatever the form, based directly or indirectly on any part of the Database will include the following reference:
Reference: (To be updated) Bibtex: @article{Akhter2024, author = "Yasmeena Akhter and Rishabh Ranjan and Mayank Vatsa and Richa Singh and Santanu Chaudhury and Anjali Agarwal and Shruti Aggarwal and Arjun Kalyanpur and Anurita Menon", title = "{Silicodata: An Annotated Benchmark CXR Dataset for Silicosis Detection}", year = "2025", month = " ", url="", doi = " "} Endnote: Akhter, Y. et al. Silicodata: An Annotated Benchmark CXR Dataset for Silicosis Detection.
Write to access the password of the zip file via databases@iab-rubric.org
1. The researcher(s) shall have no rights with respect to the Database or any portion thereof and shall not use the Database except as expressly outlined in this Agreement. 2. Re-identification is strictly prohibited. All recipients agree that they will not attempt to re-identify any individual data subjects from the Dataset. The recipients will not publish any information on an individual patient if the individual patient can be identified. Any re-identification of any individual data subject shall be immediately reported to the authors at databases@iab-rubric.org. 3. Subject to the terms and conditions of this agreement, the Silicodata dataset is available for academic and research use only, with a royalty-free, nonexclusive, non-transferable license subject to the following conditions: a. The Database must not be copied, distributed, published, or reproduced in any form except for creating a secure backup by the registered user. Sharing, transferring, or disclosing any part or the entirety of the dataset to third parties, in any form, is strictly prohibited without prior written authorization from the IAB Lab. Any individual or organization wishing to access or use the Dataset must independently register and agree to all terms and conditions outlined in this DUA. b. The Dataset must not be used for clinical purposes, including diagnosis or patient care, without appropriate approvals from relevant authorities. IIT Jodhpur and the IAB Lab assume no liability for any outcomes resulting from clinical use. c. Any violation of this DUA or other impermissible use shall be grounds for immediate termination of use of the Dataset d. Any work made public, whatever the form, based directly or indirectly on any part of the Database will include the following reference: Reference: (To be updated) Bibtex: @article{Akhter2024, author = "Yasmeena Akhter and Rishabh Ranjan and Mayank Vatsa and Richa Singh and Santanu Chaudhury and Anjali Agarwal and Shruti Aggarwal and Arjun Kalyanpur and Anurita Menon", title = "{Silicodata: An Annotated Benchmark CXR Dataset for Silicosis Detection}", year = "2025", month = " ", url="", doi = " "} Endnote: Akhter, Y. et al. Silicodata: An Annotated Benchmark CXR Dataset for Silicosis Detection.
Preview Guestbook

Upon downloading files the guestbook asks for the following information.

Account Information

Package File Download

Use the Download URL in a Wget command or a download manager to download this package file. Download via web browser is not recommended. User Guide - Downloading a Dataverse Package via URL

https://qa.dataverse.org/api/access/datafile/

Compute Batch
Clear Batch
Dataset Persistent Identifier Change Compute Batch
Submit for Review

You will not be able to make changes to this dataset while it is in review.

Publish Dataset

Are you sure you want to republish this dataset?

By default datasets are published with the CC0-“Public Domain Dedication” waiver. Learn more about the CC0 waiver here.

To publish with custom Terms of Use, click the Cancel button and go to the Terms tab for this dataset.

Select if this is a minor or major version update.

Publish Dataset

This dataset cannot be published until SilicoData is published by its administrator.

Publish Dataset

This dataset cannot be published until SilicoData and Harvard Dataverse are published.

Return to Author

Return this dataset to contributor for modification. The reason for return entered below will be sent by email to the author.

Curation Status History
StatusDateAssigner
No records found.
Add/Edit a Version Note
Styled Citation