githubEdit

MPtrj

The Materials Project contains a large set of relaxation data across its entire database. These relaxation "trajectories," along with single-point data, for the PBE GGA and GGA+U calculations are stored in the tasksarrow-up-right collection of Materials Project. Deng et al. collated the energies, forces, stresses, structures, and on-site magnetic moments (when available) from this ionic step data, and used it to train a graph neural network, CHGNet [1], for the potential energy surface of most of the periodic table.

The collected dataset is called MPtrj in the CHGNetarrow-up-right [1]. Subsets of MPtrj were used in the earlier universal machine learning interatomic potentials (MLIPs) M3GNetarrow-up-right [2] and later ones such as MACE-MP-0 [3].

The original MPtrj dataset is hosted on MPContribs (explorerarrow-up-right) (bulk downloadarrow-up-right) with a parquet format bulk download. An example notebook for working with parquet data is also availablearrow-up-right.

References

[1] B. Deng, P. Zhong, K.J. Jun, J. Riebesell, K. Han, C. J. Bartel, and G. Ceder. "CHGNet as a pretrained universal neural network potential for charge-informed atomistic modelling". Nature Mach. Intell., vol. 5, pp. 1031–1041, yr. 2023. (DOIarrow-up-right) (original figsharearrow-up-right)

[2] C. Chen and S.P. Ong, "A universal graph deep learning interatomic potential for the periodic table". Nature Comput. Sci., vol. 2, pp. 718–728, yr. 2022. (DOIarrow-up-right)

[3] I. Batatia et al., "A foundation model for atomistic materials chemistry". arXiv:2401.00096v3, yr. 2025. (DOIarrow-up-right)

Last updated

Was this helpful?