Cryo2StructData : Full Dataset (doi:10.7910/DVN/FCDG0W)
(Cryo2StructData)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

Document Description

Citation

Title:

Cryo2StructData : Full Dataset

Identification Number:

doi:10.7910/DVN/FCDG0W

Distributor:

Harvard Dataverse

Date of Distribution:

2023-10-26

Version:

2

Bibliographic Citation:

Giri, Nabin; Wang, Liguo; Cheng, Jianlin, 2023, "Cryo2StructData : Full Dataset", https://doi.org/10.7910/DVN/FCDG0W, Harvard Dataverse, V2

Study Description

Citation

Title:

Cryo2StructData : Full Dataset

Subtitle:

Cryo2StructData: A Large Labeled Cryo-EM Density Map Dataset for AI-based Modeling of Protein Structures

Alternative Title:

Cryo2StructData

Identification Number:

doi:10.7910/DVN/FCDG0W

Authoring Entity:

Giri, Nabin (University of Missouri System)

Wang, Liguo (Brookhaven National Laboratory)

Cheng, Jianlin (University of Missouri System)

Other identifications and acknowledgements:

Giri, Nabin

Producer:

Cheng, Jianlin

Date of Production:

2023-03-20

Grant Number:

R01GM146340

Distributor:

Harvard Dataverse

Distributor:

Giri, Nabin

Access Authority:

Giri, Nabin

Access Authority:

Cheng, Jianlin

Depositor:

Giri, Nabin

Date of Deposit:

2023-10-24

Date of Distribution:

2023-10-24

Holdings Information:

https://doi.org/10.7910/DVN/FCDG0W

Study Scope

Keywords:

Computer and Information Science, Medicine, Health and Life Sciences, Other, cryo-electron microscopy

Abstract:

This repository contains the complete Cryo2StructData dataset, which encompasses 7,392 density maps. The data are organized based on EMD-IDs.

Notes:

Please note that this repository does not include labeled maps. We have provided a collection of scripts in our GitHub repository specifically designed to generate the label maps (amino acid types, atom types, and secondary structure types). These label maps, in combination with the normalized density map, are intended for training and testing AI-based models to build protein structures from cryo-EM density maps, as detailed in the Cryo2StructData paper (https://doi.org/10.1038/s41597-024-03299-9) .

Methodology and Processing

Sources Statement

Documentation and Access to Sources:

The source code and instructions to reproduce our results are freely available at https://github.com/BioinfoMachineLearning/cryo2struct

Data Access

Other Study Description Materials

Related Materials

The source code and instructions to reproduce our results are freely available at https://github.com/BioinfoMachineLearning/cryo2struct

Related Studies

The original cryo-EM density map are available in EMDataBank (https://www.emdataresource.org/) and the corresponding protein structure are available in Protein Data Bank (https://www.rcsb.org/) .

Other Study-Related Materials

Label:

EMD_0.zip

Text:

Cryo-EM density maps of EMD-IDs starting with 0.

Notes:

application/zip

Other Study-Related Materials

Label:

EMD_1.zip

Text:

Cryo-EM density maps of EMD-IDs starting with 1.

Notes:

application/zip

Other Study-Related Materials

Label:

EMD_2.zip

Text:

Cryo-EM density maps of EMD-IDs starting with 2.

Notes:

application/zip

Other Study-Related Materials

Label:

EMD_3.zip

Text:

Cryo-EM density maps of EMD-IDs starting with 3.

Notes:

application/zip

Other Study-Related Materials

Label:

EMD_4.zip

Text:

Cryo-EM density maps of EMD-IDs starting with 4.

Notes:

application/zip

Other Study-Related Materials

Label:

EMD_5.zip

Text:

Cryo-EM density maps of EMD-IDs starting with 5.

Notes:

application/zip

Other Study-Related Materials

Label:

EMD_6.zip

Text:

Cryo-EM density maps of EMD-IDs starting with 6.

Notes:

application/zip

Other Study-Related Materials

Label:

EMD_7.zip

Text:

Cryo-EM density maps of EMD-IDs starting with 7.

Notes:

application/zip

Other Study-Related Materials

Label:

EMD_8.zip

Text:

Cryo-EM density maps of EMD-IDs starting with 8.

Notes:

application/zip

Other Study-Related Materials

Label:

EMD_9.zip

Text:

Cryo-EM density maps of EMD-IDs starting with 9.

Notes:

application/zip