Vector-QM24 (VQM24) dataset
Creators
Description
Quantum chemistry dataset of ~836 thousand small organic and inorganic molecules.
Density Functional Theory (DFT) properties for all 784,875 conformers in local minima, for the 258,242 constitutional isomers (most stable conformer of each), and for 51,072 saddle point structures are available in the DFT_all.npz, DFT_uniques.npz and DFT_saddles.npz files, respectively.
Diffusion quantum Monte Carlo (DMC) data for 10,793 constitutional isomers is available in the DMC.npz file.
All molecules are ordered in the same way across every array.
Keys for accessing each property are tabulated in the paper.
Usage example:
import numpy as np
data = np.load('DFT_all.npz', allow_pickle=True)
print(data.files)  # list all available property keys
key = 'freqs'
freqs = data[key]  # DFT vibrational frequencies of all molecules
print(freqs[42])  # frequencies of molecule number 42 in the array (HSCl, thiohypochlorous acid)
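Because all molecules are ordered in the same way across every array, a single index can be used to gather all properties of one molecule. The following is a minimal sketch of that idea, not part of the dataset tooling: the helper name `molecule_record` is hypothetical, the toy dictionary stands in for a loaded .npz archive with made-up values, and it assumes every array in the file has one entry per molecule along its first axis.

```python
def molecule_record(data, index):
    """Collect the entry at `index` from every array in `data` into a dict.

    `data` can be the object returned by np.load (which exposes `.files`)
    or any plain mapping of key -> sequence.
    Assumption: every array is indexed by molecule along its first axis.
    """
    keys = getattr(data, "files", None) or list(data)
    return {key: data[key][index] for key in keys}


# Toy stand-in for a loaded archive (hypothetical values, not taken from VQM24)
toy = {
    "compounds": ["A_0/conformer_1", "B_0/conformer_1"],
    "freqs": [[100.0, 200.0], [300.0, 400.0]],
}

record = molecule_record(toy, 1)
print(record["compounds"], record["freqs"])
```

With the real file, `molecule_record(np.load('DFT_all.npz', allow_pickle=True), 42)` would be the analogous call, provided the assumption about the first axis holds for every key.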
Input file samples, tools, and kernel ridge regression and graph neural network models: https://github.com/dkhan42/VQM24
Atomic energies (in Hartree) used to calculate the atomization energies :
wavefunctions.tar.gz
The .molden file for a specific molecule from the dataset can be found using the 'compounds' array in the 'DFT_all.npz' file.
For instance: the 0-th entry in the 'compounds' array of DFT_all.npz corresponds to 'SH2_0/conformer_1'.
The wavefunction file for this molecule will be found at 'wavefunctions/SH2_0/conformer_1.molden' after untarring wavefunctions.tar.gz.
Multiwfn (http://sobereva.com/multiwfn/) can be used to read the .molden wavefunction files.
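The mapping from a 'compounds' entry to its wavefunction file can be sketched as below. The helper name `wavefunction_path` and its `root` parameter are hypothetical. The path layout follows the single example given in this description and is assumed to hold for the other entries as well.

```python
from pathlib import PurePosixPath


def wavefunction_path(compound, root="wavefunctions"):
    """Build the expected .molden path for one entry of the 'compounds' array.

    Assumption: every entry follows the '<root>/<compound>.molden' pattern
    of the 'SH2_0/conformer_1' example.
    """
    return str(PurePosixPath(root) / f"{compound}.molden")


path = wavefunction_path("SH2_0/conformer_1")
print(path)
```

In practice the argument would come from the dataset itself, for example `str(data['compounds'][0])` after loading DFT_all.npz.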
The dataset is described in the paper: https://www.nature.com/articles/s41597-025-05428-4
Files
(108.2 GB)
Additional details
Identifiers
Software
- Repository URL: https://github.com/dkhan42/VQM24
- Programming language: Python
- Development Status: Active