Database Versions
A changelog of Materials Project (MP) database releases.
This page contains a summary of major changes for each version of the Materials Project database.
We are aware of a community need for more detailed change logs, and hope to improve our reporting for future database versions.
Database versions are labelled via the date they become generally available to the public. For advanced users, the public database label is mapped to an internal pull request number, and this is accessible in the API via the builder_meta
key.
You can verify the current database version powering the website on the footer of every page. If you are using the API, there is a get_database_version
method available.
v2024.12.18
This version went live on December 20th, 2024 at about 11pm Pacific.
Major updates include the addition of r2SCAN calculations and improvements to thermodynamic data handling.
New Content:
Added 15,483 GNoME-originated materials calculated using r2SCAN. We remind our users that the GNoME structures are licensed BY-NC (non-commerical purposes). Explicitly accepting theBY-NC license is now required to access the GNoME dataset in the Materials/GNoME explorers and the API. A further release of an additional ~100k GNoME materials is in preparation.
Core Changes:
Modified the
run_type
requirement for the definition of a valid material (emmet-core
622f2e3):Previously: A material was required to have at least one
GGA(+U)
calculationNow: MP accepts materials with only r2SCAN calculations, as going forward MP will be prioritizing r2SCAN workflows
This change restored 736 previously deprecated materials
Thermodynamic Data Updates:
New hierarchy for
thermo
data presentation:Affects Materials Explorer and
MPRester().summary
endpointResolves display issues on the Materials Explorer for 586 materials with valid thermodynamic data that were found to have failed to generate thermodynamic stability data using MP's GGA/GGA+U/r2SCAN Mixing scheme
These values were not passed through to the
summary
endpoint as a strictthermo_type
ofGGA_GGA+U_R2SCAN
was requiredNew preference order for
thermo_type:
GGA_GGA+U_R2SCAN
>r2SCAN
>GGA_GGA+U
The
thermo_type
for a material can be found on the material's Material Detail Page under Properties in the Thermodynamic Stability tab
v2024.11.14
This version went live on December 12th, 2024 around noon Pacific.
Transition in document schemas for the
tasks
collection:Part of forward-looking transition from
atomate
toatomate2
workflow orchestration packagePrevious:
emmet.core.vasp.task_valid.TaskDocument
Current:
emmet.core.tasks.TaskDoc
Accessing fields is slightly different with
TaskDoc
(ex: eachCalculation
incalcs_reversed
is no longer accessed like a dict, but as aCalculation
object), butTaskDoc
should be fully backwards compatible with operations onTaskDocument
.TaskDoc
has some advantages overTaskDocument
, such as dynamically updatingtask_type
,run_type
, andcalc_type
. This would avoid long-term errors such as noted below for certain NSCF calculations, or noted issues with incorrectly parsing r²SCAN meta-GGA calcs as (PBE) GGA
21,144
tasks
were incorrectly assigned atask_type
ofNSCF Uniform
when they were reallyNSCF Line.
NSCF Uniform
tasks are used to calculate DOSes,NSCF Line
tasks are used to generate band structure scans. These and associated properties inmaterials/summary
(band gaps, DOS, etc.) have been corrected.39,374 materials were mistakenly assigned a DOS from a deprecated
NSCF Uniform
task. These have been corrected and removedThe current set of 2,047 GNoME-originated materials has been deprecated in preparation for a release of about 120,000 GNoME materials
While the XAS data has not changed, be sure to update to the newest version of
pymatgen
to avoid issues parsing certain XAS tasks
v2023.11.1
Improved and expanded set of elasticity data. Note that there are schema changes with how it is accessed in
SummaryDoc
andElasticityDoc
.Conversion electrode data added alongside existing insertion electrode data.
~10k new materials added, with ~5k deprecated. This includes a temporary deprecation of all compounds containing
Yb
while they are being re-run. This is in response to pseudopotential issues identified which were providing incorrect energies.
v2022.10.28
This database build incorporates Materials Project’s (R2)SCAN calculations as pre-release data. The default fields returned by the website and API will remain unchanged from the previous release at the GGA(+U) level of theory, but the (R2)SCAN data is now available for advanced users. Either see the “Pre-release Data” section of a relevant material details page, generate an R2(SCAN) phase diagram with the Phase Diagram app, or access the data via the thermo API endpoint. This database release also incorporates several new perovskite materials from a collaboration with Zachary Bare, University of Colorado.
v2021.11.10
This will be the first release with our new website and API. It does not contain any new data but is built using our new database building methods and is largely consistent with the previous database release. Some changes exist to the previous release due to improvements to detection of multi-anion systems leading to changes in the applied formation energy corrections.
Be aware, database version v2021.11.10 onwards is only available on the new Materials Project website and API. The legacy website and legacy API are frozen to the v2021.05.13 database release.
v2021.05.13
This release updates the energy correction scheme we use to generate phase diagrams and compute formation energies. As with any new database release, formation energies for many compounds have changed; however in this case the change is due only to our new energy correction scheme and not to any new data. We are proud to report that the new correction scheme has reduced the overall error in formation energy in our database by 7% compared to experiment.
You can see details of each correction that has been applied by inspecting the energy_adjustments
attribute of a ComputedEntry
retrieved via the API. In addition, the new correction scheme is available for manual use via the MaterialsProject2020Compatibility
class in pymatgen.
We realize that this change may be disruptive to ongoing work, and want to assure you that the historical corrections are still available in pymatgen if needed. They may be recovered by manually reprocessing ComputedEntry
using the legacy MaterialsProjectCompatibility
class. An example notebook demonstrating how to do this available on matgenb 25.
Below we summarize the most significant changes associated with the new MaterialsProject2020Compatibility
correction scheme. For complete details and documentation, please refer to this manuscript 32.
1. Refitted corrections for legacy species Corrections applied to oxygen compounds, diatomic gases, and transition metal oxides and fluorides have been refit using more up to date DFT calculations and a larger compilation of computed and experimental formation enthalpy data.
2. Corrections for additional species We have added corrections for Br, I, Se, Si, Sb, and Te, which did not previously have energy corrections. As a result, formation energies for materials containing these species will generally be lower than they were previously.
3. Diatomic gas corrections moved to compounds
Previously, corrections for H, F, Cl, and N were applied to the elements. One consequence of this was that polymorphs of H2, N2, Cl2 and F2 were alwaysassigned a zero energy above hull, even if some polymorphs were higher in energy. This made interpretation of these values confusing. With this release, energy corrections are applied to the material (e.g., LiH) and not the element. This also means that unstable polymorphs of diatomic gases will now have non-zero e_above_hull
4. Oxidation state based corrections
Our build process now estimates the likely oxidation states of each species in a material, and uses this information to intelligently apply corrections to anionic species only when their estimated oxidation state is negative. For example, in the compound MoCl3O
, estimated oxidation states for both Cl and O are negative, so both anions receive corrections.
Our algorithms are not always successful in predicting the oxidation state. When this occurs, we apply anion corrections to only the most electronegative element in the material. As a result, some ternary or higher compounds in the database may be destabilized in this release because their oxidation states could not be determined. This is the case for MoCl5O (mp-1196724) for example, which does not receive a Cl correction because O is more electronegative.
If this affects your work, you can manually assign oxidation states by populating the oxidation_states
key of the .data
attribute of any ComputedEntry
and then reprocessing the data using MaterialsProject2020Compatibility
.
5. Uncertainty Quantification We now compute the estimated uncertainty associated with the energy corrections on a material. Uncertainties reflect the measured uncertainty in the underlying experimental data that we use to determine the corrections, as well as uncertainty associated with the fitting procedure itself. This information enables new methods of assessing phase stability, as described in this manuscript 32
A Note for API and MPRester Users
For API users, if you are retrieving formation energies directly via the API, you will get the correct, latest formation energies from the current database release. However, if you are using get_entries
or get_pourbaix_entries
which apply the correction scheme on-the-fly, make sure to update to the latest version of pymatgen (v2022.0.8 or later) to get the correct values. If you are using pymatgen v2021 or earlier, this will use the old correction scheme by default when using get_entries
and get_pourbaix_entries
.
v2021.03.22
This release updates some older materials with new calculations, and adjusts our rules for deprecating older calculations. It does not contain any new materials. Thanks to the new calculations many materials that were previously deprecated are now accessible again. This release is in preparation for a switch to our new compatibility scheme which will improve our predictions of formation energy.
v2021.02.08
We had a small new database release today, this introduces new higher-quality calculations for around 30,000 materials. It also deprecates 78 materials since we currently do not have calculations for these materials that match our current quality standards; we hope to restore these 78 materials in a subsequent release. For an exact list, please see the attached file.
As a reminder, all historical calculation tasks remain available via our API and the task detail pages, and information on deprecated materials also remain available via the API. More information on our deprecation policy is in our documentation. We continue our work on better ways communicate database diffs and to more easily provide access to historical information, so stay tuned for future announcements here.
db_v2020_09_08_to_v2021_02_08_diff.yaml (376.9 KB)
v2020.09.08
This releases addresses issues noticed in the previous release with formation energies and updates the energies of approximately 6k materials where this error was greatest. We are planning a further supplemental release.
We’re also looking at ways to put in place a process to be more transparent with database changes and updates to share more specifically what has changed, as well as providing means to access historical versions of the database, since we know this is a common requirement.
Note that, wherever possible, we continue to keep individual historical calculation data available via its task_id even in cases where the aggregated information (such as that presented on the materials detail page) might change.
V2020.08.20 Released
In this release we have added thousands of new band structure and density of states calculations, improving our overall material coverage and data quality. Additionally, we have overhauled the plotting for these quantities on the material details page. This is a first step in improving the electronic structure data within the Materials Project as part of our new tool set 46 for band structure calculations.
We are also working through an on-going issue affecting the energies of a small number of materials. In the previous release, we added a large batch of higher-quality calculations for our energetics as well as fixing numerous bugs. However, we discovered an error in our calculation parameters leading to larger energies than expected for a minority of materials and issues such as those discussed here 27. We are currently re-running these calculations and will be fixing this data in a supplemental update in the next few weeks. We advise anybody performing large screening studies to do so with caution or wait until this supplemental update has been released.
v2020.06
In this database release, we have added several thousand materials and many magnetic ground states, improved the quality of our energetics, and fixed many bugs. This database release is part of on-going efforts in 2020 to improve database reliability and quality, following the introduction of our deprecation process last year. There are still known issues with this release which we are working to address, please let us know if you encounter any in our forum.
v2019.12.05
The issue mentioned in 2019-12-04 has now been addressed, however approximately 7% of materials saw errors in their reported energies above hull of greater than 0.05 eV/atom. Values calculated via pymatgen or via the phase diagram app on the website during this time were correct, while values reported on the materials details page and via the e_above_hull
API key were incorrect.
We encourage users who accessed convex hulls from the website between the latest database release and 2019-12-05 to re-check any values obtained from the website.
We apologize for the error, and will be incorporating additional checks into our automated testing to prevent similar errors in the future.
v2019.12.04
We are aware of an on-going issue with the reported energies above hull on the materials detail pages. We will update this thread when a fix has been fully implemented and with further details.
Until this issue is fully resolved, correct energies above hull can be retrieved using pymatgen as follows:
If you have not previously used pymatgen, it is a Python code and can be installed using pip install pymatgen
or conda install --channel conda-forge pymatgen
.
Note: the above information for v2019-12-04 is now out of date.
v2019.11.21
During deployment of the new v2019.11 database, there was temporary issue with generating interactive phase diagrams leading to incorrect formation enthalpies for a small number of chemical systems. This has now been fixed. Data presented on the materials detail pages was unaffected by this issue.
v2019.11
Introduced 3,971 new materials
Amorphous materials added with
amorphous
tagAdded
theoretical
which is True when the material matches no known experimental structure from ICSDFixed several inconsistency bugs for
band_gap
, piezo tensors, elastic warnings, and total magnetic moment.
v2019.05
Introduced a new
deprecated
field to materials. By default the website and API only search for materials that are not deprecated: {“deprecated”: false}.Deprecated 15,000 and added 3,600 new materials. We will be recomputing the deprecated materials to fill these spaces back up. Some of these new relaxations may end up matching current materials, so the total number of materials is not guaranteed to be the same as in V2019.02. This also affects downstream properties. Most notably, ~3k elastic tensors associated with the deprecated materials have been removed from the database and are no longer accessible.
Fixed an issue with sandboxes not properly building the whole hull. Previously, only the sandboxed chemical systems were being recalculated for energy_above_hull searches
v2019.02
Added over 47,000 new materials from orderings of disordered ICSD as well as compounds from the Pauling File
Finalized enforcing symmetry on piezo tensors
Moved third order elastic data to elasticity_third_order so that people are not swamped by the mountain of information associated with it.
v2018.12
Adjusted the mp-id naming scheme to fix “mvc” ids taking over old mp-ids.
Fixed piezoeletric max_direction to be a miller index rather than a unit vector.
v2018.11
Changed the grouping of magnetic materials to aggregate all magnetic orderings of a given material into a single material-id, and report the lowest energy ordering
Fixed incorrect calculation and display of polycrystalline dielectric constants
Fixed labeling of all materials as high-pressure. Note we’re parsing ICSD tags for this labeling so while some materials may not conventionally be considered high-pressure, a single matching ICSD entry can tag a material as such. We would love to hear comments on how we could better tag high-pressure materials
Begun enforcing the symmetry of the structure on piezo tensors. In general, this reduces the expected piezo value.
Last updated