Overview

The TBHubbard dataset is a collection of data with a tight-binding description of metal organic frameworks (MOFs). The structures are derived from the QMOF database, where first-principles calculations are performed to obtain the electronic density which is then projected into a localized atomic basis set using PAOFLOW. The data collection is divided into two sub-sets: tight_binding_model and extended_hubbard_model.

Tight-Binding Model

The Tight-Binding Model offers a comprehensive dataset for 10,435 metal-organic frameworks (MOFs), providing key electronic structure data. The electronic density for each MOF is projected onto a localized atomic basis set, generating a tight-binding lattice Hamiltonian. This allows for the study of the electronic properties and interactions within the MOF structures. Additionally, Smooth Overlap of Atomic Positions (SOAP) descriptors are computed for 20,375 MOFs, enriching the dataset with detailed topology information about the local atomic environments.

  • tb_dft/: This directory contains the Quantum ESPRESSO (QE) calculations used for the tight-binding projections. It includes all relevant input and output files, the tight-binding Hamiltonian, and detailed results from PAOFLOW projections (e.g., arry.pkl, paoflow.out). The bader.out, ACF.dat, and other related files provide further insights into the charge distribution and electronic structure. SCF calculation outputs such as rho.cube and scf files are also included to allow for a deeper understanding of the electronic density. For detailed instructions on the tight-binding projection workflow, please refer to the tight_binding_model/README.md.

  • soap_of_mofs/: This folder includes the SOAP descriptors, which are essential for understanding the local atomic environments within the MOFs. SOAP descriptors come in two variations: SOAP-3 Å and SOAP-5 Å. These descriptors capture the atomic structure at different length scales, offering both detailed and broader topological information. The filenames are given appending to the MOFs name the suffix _soap.npz. Each file contains these descriptors and allows for easy extraction of essential data. For further information on computing SOAP descriptors, please refer to the tight_binding_model/scripts/compute_soap-descriptors/README.md.

  • scripts/: A collection of helper tools for visualizing the data and generating necessary inputs for further analysis. These scripts make it easier to manipulate, visualize, and utilize the tight-binding and SOAP data for subsequent computational studies and modeling. To learn how to compute tight-binding embeddings from QE, check the tight_binding_model/scripts/compute_tight-binding_embeddings_from_qe/README.md. To set up and run SCF calculations with QE, follow the instructions in tight_binding_model/scripts/setup_qe_scf/README.md.

For more detailed guidance, please refer to the appropriate README.md files in each directory.

Extended Hubbard Model

Electronic structure calculations for 242 MOFs. The electronic density is projected onto a localized atomic basis set, providing a tight-binding lattice Hamiltonian of MOFs. A set of 428 calculations are also provided for the self-consistent computation of Hubbard parameters U and V of 242 MOFs. The set is divided according to the manifold chosen for U and V, where d and s orbitals corresponds to ds_perturbations; and d and p orbitals corresponds to dp_perturbations. The tight-binding projection along with the Hubbard parameters constructs the Extended Hubbard model lattice Hamiltonian.

  • dp_perturbations: QE calculation with the tight-binding projection, including input and output files, as well as the tight-binding Hamiltonian. Hubbard parameter are also provided for d and p orbitals.
  • ds_perturbations: QE calculation with the tight-binding projection, including input and output files, as well as the tight-binding Hamiltonian. Hubbard parameter are also provided for d and s orbitals.
  • extend_hubb_data.json: Tabulated property containing the main input and output QE information for each MOF, divided in dp and ds perturbations.
  • scripts/: helper tools for visualization and input generation.

License

All dataset files are distributed under the CDLA-Permissive-2.0 license, while the source code files are distributed under the BSD-3-Clause license.

Copyright (c) 2025, International Business Machines All rights reserved.

Featured Dataverses

In order to use this feature you must have at least one published or linked dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

161 to 170 of 205 Results
Mar 14, 2025 - TBHubbard Dataset
Unknown - 2.3 GB - MD5: 9c1c255d81c750e224b2bfe260e89cb4
Mar 14, 2025 - TBHubbard Dataset
Unknown - 2.3 GB - MD5: bec1c46104243025fc31df01ba285259
Mar 14, 2025 - TBHubbard Dataset
Unknown - 2.3 GB - MD5: b0afd2b128c7c56e4d4e71a4492b4ac5
Mar 14, 2025 - TBHubbard Dataset
Unknown - 2.3 GB - MD5: 5717a3b2e5e229fd083e20fe320a34ad
Mar 14, 2025 - TBHubbard Dataset
Unknown - 2.3 GB - MD5: c4f3ba4fbbfee21c6e2dc1b09d253871
Mar 14, 2025 - TBHubbard Dataset
Unknown - 2.3 GB - MD5: 16b80d052dfab74efc21fd044c0ca417
Mar 14, 2025 - TBHubbard Dataset
Unknown - 2.3 GB - MD5: c025398cc0bebc983c909e8ce6ac4abf
Mar 14, 2025 - TBHubbard Dataset
Unknown - 2.3 GB - MD5: 62cbf18273702abcbf0a03d0855189e7
Mar 14, 2025 - TBHubbard Dataset
Unknown - 2.3 GB - MD5: 9090728e719addfc00ddcc6e599dea52
Mar 14, 2025 - TBHubbard Dataset
Unknown - 2.3 GB - MD5: 8289cfe43fe2c91ff75a8de206d6c8e5
Add Data

Sign up or log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.