. 2022 Dec 31;3(1):113–123. doi: 10.1021/jacsau.2c00540

Upper-Bound Energy Minimization to Search for Stable Functional Materials with Graph Neural Networks

Jeffrey N Law ^†,^*, Shubham Pandey ^‡, Prashun Gorai ^‡,^§,^*, Peter C St John ^†,^*

PMCID: PMC9875372 PMID: 36711088

Abstract

graphic file with name au2c00540_0008.jpg

The discovery of new materials in unexplored chemical spaces necessitates quick and accurate prediction of thermodynamic stability, often assessed using density functional theory (DFT), and efficient search strategies. Here, we develop a new approach to finding stable inorganic functional materials. We start by defining an upper bound to the fully relaxed energy obtained via DFT as the energy resulting from a constrained optimization over only cell volume. Because the fractional atomic coordinates for these calculations are known a priori, this upper bound energy can be quickly and accurately predicted with a scale-invariant graph neural network (GNN). We generate new structures via ionic substitution of known prototypes, and train our GNN on a new database of 128 000 DFT calculations comprising both fully relaxed and volume-only relaxed structures. By minimizing the predicted upper-bound energy, we discover new stable structures with over 99% accuracy (versus DFT). We demonstrate the method by finding promising new candidates for solid-state battery (SSB) electrolytes that not only possess the required stability, but also additional functional properties such as large electrochemical stability windows and high conduction ion fraction. We expect this proposed framework to be directly applicable to a wide range of design challenges in materials science.

Keywords: materials discovery, structure prediction, graph neural networks, solid state batteries, reinforcement learning

1. Introduction

Overcoming critical barriers in materials science will require the discovery of yet unknown structures with precisely tailored properties. Computational searches driven by quantum chemistry calculations have accelerated materials exploration over large compositional and structural spaces, but are still limited to few tens of thousands of materials.¹⁻⁴ Often, such searches are restricted to structures previously documented in crystallographic databases such as the Inorganic Crystal Structure Database (ICSD)⁵ and Cambridge Structural Database (CSD),⁶ which predominantly comprise experimentally synthesized inorganic materials. Their use in discovery campaigns for novel materials is therefore limited. Solid-state batteries (SSBs) are one such application, where the use of solid electrolytes makes SSBs safer and more energy dense than traditional Li-ion technology. SSBs require materials that meet several performance metrics,⁷ and remain limited by the thermodynamic instability of electrode–electrolyte interfaces.^8,9 For instance, well-known solid electrolytes such as Li₁₀GeP₂S₁₂, Li₆PS₅Cl (argyrodite), and Li₇La₃Zr₂O₁₂ decompose at the interface with Li-metal anode forming solid-electrolyte interphases that are difficult to control and cause performance degradation.^10,11 Therefore, further improvements to SSB design require searching for new materials that are stable at suitable reduction and oxidation potentials.

A central problem in using machine learning (ML) methods to accelerate the search for novel crystal structures is finding structures that are thermodynamically stable, i.e., stable against decomposition into competing phases. Compositional models are not sufficiently accurate to reliably predict thermodynamic stability.^12,13 Graph neural network (GNN) models have achieved impressive results in predicting formation energy and decomposition energy with mean absolute error (MAE) close to chemical accuracy (0.03–0.05 eV/atom).¹⁴⁻¹⁷ However, GNN models require the crystal structure as inputs, which are available only after performing expensive density functional theory (DFT) relaxations. A considerably harder problem is predicting a priori whether a hypothetical crystal structure input will be stable before performing DFT relaxation. Recent studies have made some progress in this direction through the use of scale-invariant approaches, where the volume of input structures is normalized to make GNN models less sensitive to volume changes that often occur during relaxation.^18,19 It has been previously shown that including high-energy structures as training data is critical to developing models intended to rank the stability of potential crystal structures for a given composition.¹³ Analogously, training GNN models only on fully relaxed structures and their DFT total energies may bias the models to under-predict the energies of high-energy structures in unfavorable arrangements.

In this work, we develop a generalized approach for finding new inorganic crystal structures that are likely to be stable. We first demonstrate that the success of scale-invariant approaches in predicting the DFT-relaxed total energy of a crystal structure depends heavily on the degree to which the structure relaxes away from the initial unrelaxed structure. In a wide search over structures created by ionic substitution,²¹ DFT often alters the initial structures drastically during relaxation to a local energy minimum. Predictive models for energy trained with these unrelaxed structures as inputs are therefore inaccurate and unsuitable for screening potentially stable decorations.

We present an alternate approach to finding new stable structures over a large decoration space that is compositionally and structurally diverse. First, we construct a database of constrained DFT relaxations over only the unit cell volume, which by design yields an upper bound to the total energy of the unconstrained (full) relaxation. This upper-bound energy can be predicted to a high accuracy by scale-invariant ML models (MAE ∼ 0.05 eV/atom), since fractional coordinates in the unit cell are known precisely. By subsequently searching for decorations that minimize this predicted upper-bound energy, we find novel crystal structures that are highly likely to be stable. Out of 14.3 million decorated structures, this approach predicted a stable structure for 2003 compositions. Validating these top candidates with DFT confirmed >99% of them to be thermodynamically stable, i.e., having negative decomposition energy. We find many of these stable candidates also have suitable functional properties for SSBs, with structural similarities to previously explored solid electrolytes, electrodes, and coatings.

Expanding our search to an even larger number of compositions and prototypes will exponentially increase the search space, making it computationally intractable to exhaustively assess the stability of every structure. To address this future need, we demonstrate a reinforcement learning (RL)-augmented search strategy that finds stable structures using our surrogate stability function at a fraction of the computational cost. Overall, this study shows that ML strategies are able to drastically reduce the computational cost and time to find promising inorganic functional materials.

2. Results and Discussion

2.1. Challenges in Predicting Thermodynamic Stability

To be useful in screening candidate structures for stability, a machine learning (ML) surrogate model must be able to predict the total energy of a relaxed structure using only information available before the relaxation is performed. To provide our surrogate model with relevant training examples, we first constructed a database of example hypothetical structures through ionic substitution with compositions suitable for SSBs. We selected 67 489 candidate structures for full DFT relaxation by decorating prototype ICSD structures with new compositions (Section 4.3.2). We refer to this as the fully relaxed data set. The corresponding total energies in this data set are denoted by E_tot.

We first trained a scale-invariant GNN model (Section 4.1) on the ICSD and fully relaxed data sets, where we paired the unrelaxed structures with their corresponding total energy after relaxation. We withheld ∼5% of structures in each data set for the validation and test sets. While the model performed well on ICSD structures (gray points in Figure 1c, MAE = 0.05 eV/atom), we found that the error was much larger on the fully relaxed data set (orange points in Figure 1c, MAE = 0.13 eV/atom). The higher MAE for the fully relaxed data set is likely due to many of the input structures starting in highly unfavorable configurations, such that DFT relaxation significantly alters their volume, cell shape geometry, and fractional atomic coordinates in finding a local energy minimum.

Initial surrogate model development. (a) Cosine distances between initial and DFT-relaxed structures in the Open Quantum Materials Database (OQMD).^18,20 (b) Cosine distances between initial and DFT-relaxed structures for the 53 000 hypothetical decorated structures used in this study (Figure 2). (c) Predicted vs DFT total energy of a GNN trained on ICSD and *unrelaxed* hypothetical structures and evaluated on the energy of the DFT-relaxed structures. The prediction accuracy for ICSD structures (gray) is high, but low for unrelaxed hypothetical structures (orange).

To quantify the structural change during DFT relaxation, we computed the cosine distance between the unrelaxed and relaxed structure pairs using Matminer fingerprints (see Section 4.4).²² Here, a cosine distance of zero indicates high structural similarity, ignoring any volume changes. With our wide range of prototypes and decorations, we found that over 86% of pairs had a cosine distance above 0.1, meaning the vast majority of structures change quite dramatically after relaxation (Figure 1b). This is in stark contrast to the distribution of distances for structures in the Open Quantum Materials Database (OQMD)²⁰ (Figure 1a), where similar GNN models are able to achieve high accuracy for unrelaxed structures, i.e., MAE < 0.05 eV/atom.¹⁸ This result implies that while scale-invariant methods are robust to changes in the cell volume during DFT relaxation, they cannot account for large changes in fractional coordinates and cell shape geometry when DFT relaxes a structure far away from a high-energy starting configuration.

2.2. Volume-Only Structure Relaxations

Rather than attempt to directly predict the energy resulting from a full DFT relaxation, we developed an alternate approach. We noticed in our fully relaxed data set that when a prototype decoration was favorable for a new composition, the structure tends to relax with minimal changes in its fractional coordinates. A surrogate model with the goal of differentiating between favorable and unfavorable decorations would need examples of each to perform well. We therefore constructed a second database of DFT relaxed structures where we fix the unit cell geometry and fractional atomic coordinates and only relax their volumes. By design, the cosine distances between the unrelaxed and volume relaxed pairs are zero.

The success of scale-invariant GNNs in previous applications suggests that the optimal volume and energy for a given structure can be predicted by its fractional coordinates and cell shape geometry.^18,19 By constraining these features during a volume-only relaxation, we are able to augment our training set with high-energy examples, and provide a better foundation to distinguish favorable from unfavorable structure decorations. The volume relaxation also provides us with an accurate upper bound to the total energy calculated by the unconstrained relaxation, since the energy must stay the same or decrease when DFT is allowed to fully relax the structure. Note that this upper bound is not a theoretical limit to the total energy of the structure, but serves as a useful reference point. While searching for stable structures in a large decoration space, if a volume relaxed structure is not predicted to be stable w.r.t. competing phases, the fully relaxed structure may or may not be stable. However, if a volume relaxed structure is stable, the fully relaxed structure will be stable as well. Assuming at least some of the volume relaxed structures are stable, we can efficiently screen for them in the unrelaxed decoration space using an accurate surrogate model (Figure 2a).

Overview of approach and data set generation. (a) Starting from an unrelaxed structure, we predict an upper bound for the total energy i.e., energy after a constrained relaxation. We then evaluate the thermodynamic stability relative to competing phases and prioritize structures predicted to be stable, meaning the upper bound of the decomposition energy is <0 eV/atom. If the decomposition energy upper bound is >0 eV/atom, the structure could still be stable after full relaxation. (b) Our element library consists of conducting ions (C), framework cations (F), and anions (A). We build valence-balanced compositions of the general form , where x, y_i, and z_j are the stoichiometries corresponding to C, F, and A, respectively. Here, i and j are 1–2; *i.e.*, we consider up to 2 framework cations and 2 anions. For a given composition, we decorate the elements in prototype structures (from a prototype library) via ionic substitution. These structures are then relaxed with DFT in two ways: (i) full relaxation and (ii) volume-only relaxation, where the cell shape and atom positions are held constant.

Inline graphic — Overview of approach and data set generation. (a) Starting from an unrelaxed structure, we predict an upper bound for the total energy i.e., energy after a constrained relaxation. We then evaluate the thermodynamic stability relative to competing phases and prioritize structures predicted to be stable, meaning the upper bound of the decomposition energy is <0 eV/atom. If the decomposition energy upper bound is >0 eV/atom, the structure could still be stable after full relaxation. (b) Our element library consists of conducting ions (C), framework cations (F), and anions (A). We build valence-balanced compositions of the general form , where x, y_i, and z_j are the stoichiometries corresponding to C, F, and A, respectively. Here, i and j are 1–2; *i.e.*, we consider up to 2 framework cations and 2 anions. For a given composition, we decorate the elements in prototype structures (from a prototype library) via ionic substitution. These structures are then relaxed with DFT in two ways: (i) full relaxation and (ii) volume-only relaxation, where the cell shape and atom positions are held constant.

We performed a volume-only relaxation on each of the ∼68 000 unrelaxed structures used as inputs to the fully relaxed data set. We pruned ∼9000 that did not pass quality control filters (Section 4.3.4) and refer to these ∼58 700 structures as the volume relaxed data set (Figure 2b). Figure 3a shows the differences in total energy between the two data sets, which confirms that the volume-only relaxations are indeed an upper bound. These volume-only relaxations augment the original fully relaxed data set, providing examples of high-energy decorations that will serve to guide an ML model toward choosing low-energy initial structures.

Effect of the volume relaxed data set on energy and surrogate model performance. (a) DFT energy differences between volume relaxed and fully relaxed structures. (b) Predicted vs DFT total energy of the model trained and evaluated on the three data sets. (c) Learning curves of the model’s prediction error by data set size.

With the addition of the volume relaxed data set, we trained a scale-invariant GNN on all three data sets (Figure 2b). The model uses a scale-invariant approach, scaling the input crystal’s volume to make the minimum edge length 1 Å (section 4.1). In mixing the volume relaxed with the ICSD and fully relaxed data sets, we use the relaxed geometry for the fully relaxed structures instead of their input geometry. In this way, the model is tasked with predicting the total energy of a structure in its given, scale-invariant configuration, rather than attempt to predict the energy to which an input geometry might ultimately relax. The prediction accuracy for all structures improved substantially (Figure 3b), as the GNN had access to the correct fractional coordinates for all inputs. Learning curves of the prediction error as a function of the data set size show the model benefits from additional data up to the full data set size (Figure 3c).

2.3. Surrogate Model Predictions

With the trained GNN surrogate model, we predicted the upper bound energy of all 14.3 million possible decorations (Section 4.2). Because predicting a candidate structure’s upper-bound energy requires only a single forward pass through our trained ML model, evaluation of all 14.3 million took only 2 h using a single Tesla V100 GPU accelerator. We estimated the thermodynamic phase stability of each of these structures by computing their decomposition energy obtained through a convex hull analysis. Here, the convex hull is constructed by considering competing phases from the ICSD. The total energy of ICSD structures is taken from the NREL Materials Database,²³ as explained in Section 4.3.3. The decomposition energy (E_decomp) is a measure of the thermodynamic stability of a structure against chemical decomposition into competing phases.¹²E_decomp is the minimum energy that the formation energy of an unstable material has to be lowered (more negative) before it becomes stable. Similarly, for a stable compound, E_decomp is the maximum energy that the formation energy can be increased (less negative) before it becomes unstable.¹² For each composition, we selected the structure with the lowest predicted energy. About 1.7% of compositions (3719) had a structure with a negative E_decomp < 0.001 eV/atom. To account for potential errors in the model predictions, we applied a more stringent cutoff of a negative E_decomp < −0.1 eV/atom, which resulted in 2003 compositions. Before analyzing these structures, we first validate the predicted stable structures with DFT.

2.4. DFT Confirmation of Predicted Stable Structures

We performed both full DFT relaxation and volume-only relaxation for the 2003 predicted stable structures. Each of these 2003 structures has a unique composition because we chose the lowest-energy structure for a given composition. Of the 1707 structures where the DFT calculations successfully converged, we find the model predicts the energy upper bound, i.e., volume relaxed total energy, to a high accuracy (MAE = 0.045 eV/atom, Figure 4a). Nearly all predicted upper-bound total energies are larger than the fully relaxed DFT energies (Figure 4b), consistent with our hypothesis that the volume relaxed energies are an upper bound. We confirm that 99% (1700/1707) of the predicted structures are in fact stable, as determined from a convex hull analysis (Figure 4c). The DFT decomposition energies (calculated using fully relaxed DFT total energy) are more negative i.e., more stable, than the predicted values, demonstrating the success of our upper-bound approach.

DFT confirmation of predicted stable structures. (a) Predicted vs volume-only DFT relaxation total energy. (b) Predicted vs full DFT relaxation total energy. (c) Predicted vs full DFT relaxation decomposition energy. Points in blue indicate structures that remain stable after evaluation of the self-consistent decomposition energy (Section 2.4).

E_decomp in Figure 4c are calculated from a convex hull construction by considering competing phases from the ICSD. However, to be self-consistent, the newly predicted stable structures should also be considered as competing phases. Therefore, we supplemented the ICSD structures with the 1707 fully relaxed structures as well as the lowest predicted energy for each of the ∼220 000 compositions in our decoration space. Of these 1707 compositions, 31 are found in the ICSD and therefore, are not considered further in our analysis. We reconstructed the convex hulls with this combined data set and calculated what we term “self-consistent (SC) decomposition energy”. After this re-evaluation, 285 structures had a SC E_decomp < 0 eV/atom. We provide the structures, predicted, volume relaxed, and fully relaxed total energies, as well as the SC decomposition energies (see Section 4.5).

2.5. Novel Stable Structures

Thermodynamic stability is a prerequisite in the search for novel functional materials. Beyond stability, such materials must also exhibit specific functional properties. Our motivation for this study is to find new battery materials, which inspired our choice of chemistries to build the training data set. We evaluate the suitability of the 285 newly predicted compositions/structures as solid-electrolytes in SSBs.¹⁰ For application in metal-anode SSBs, the solid electrolyte should have a low reduction potential, i.e., close to 0 V w.r.t. Li/Li⁺. For compatibility with high-voltage cathode, the oxidation potential should be large, ideally > 4.0 V. In addition, a large electrochemical stability window (ESW) is desired. The volume fraction of the structure available to the conduction ions is a rough measure of the ionic conductivity, although more refined descriptors have been proposed.^24,25 In summary, we sought structures with the following features: (F1) SC decomposition energy < −0.1 eV/atom, (F2) low reduction potential < 2.0 V w.r.t. Li/Li⁺, (F3) high oxidation potential > 4.0 V w.r.t. Li/Li⁺, (F4) large ESW > 2.0 V, and (F5) large volume fraction available to conduction ion ≥ 30%. These criteria (F1–F5) represent a set of choices that can be easily adjusted for further analysis. Here, ESW is calculated as the difference between the oxidation and reduction potentials, which depend only on thermodynamic stability. While ESW calculated in this manner provides a useful guide, recent studies have highlighted the need to consider electronic band alignment between the electrolyte and electrodes to rigorously determine ESW.²⁶ These band alignment calculations are computationally intensive and beyond the scope of our study.

Figure 5b shows the number of structures that pass each feature cutoff, as well as combinations of feature cutoffs. Structures that pass all cutoffs would be of particular interest. While we did not find any such structures that pass all feature cutoffs, several structures passed 3–4 cutoffs, as shown in Figure 5b. Some of these structures and family of structures are labeled in Figure 5b and their DFT-relaxed crystal structures are shown in Figure 6. As no structure simultaneously possessed all the desired features, we examined the Pareto surface of all five features. We identified 61 structures lying on the Pareto frontier, which are included in the supplemental data (see Data Availability Statement). The most interesting Pareto front occurred between conducting ion volume and the electronic stability window, which we have included as Figure S1. Here, structures with a high conducting ion volume seem to have a lower ESW and vice versa, possibly indicating a trade-off between battery lifetime (i.e., stability) and performance (charge transfer rates). Six structures lie on this Pareto curve, four of which we discuss below.

Functional features of the predicted stable structures relevant for battery applications. (a) Histogram showing the distribution of self-consistent decomposition energies for the 2003 structures originally predicted to be stable. (b) UpSet plot of the 285 candidate structures. Combinations of feature cutoffs with less than five members are not visualized. Example compositions are listed for several sets.

Example crystal structures predicted to be stable and exhibit certain features (Figure 5) suitable for application as solid electrolytes. (a) LiSc₂F₇. (b) LiY₂Br₇. (c) Li₂HfBr₆. (d) LiW₂Zn₄N₇. (e) NaLaP₄N₈. (f) LiSc₂HfBr₁₁. (g) Li₂HfN₂. (h) KScS₂. (i) Na₂ZrO₃.

A family of compounds with the general formulas CM₂X₇ (C = Li, Na; M = Sc, Y; X = halogens) are identified as stable structures that exhibit features F1 through F4, but are C-poor compositions, which contributes to their low conducting ion volume fraction. LiSc₂F₇ (Figure 6a, space group P2) and LiY₂Br₇ (Figure 6b, space group Pnma) are two examples from the CM₂X₇ family of compounds. These structures are derived from the K and In rare-earth phyllochlorides,²⁷ which feature unique 7-fold coordinated trivalent rare-earths. In the DFT-relaxed structures of LiSc₂F₇ and LiY₂Br₇, shown in Figure 6, Sc and Y form edge- and corner-shared [ScF₇] and [YBr₇] polyhedra.

Li₂HfBr₆ (Figure 6c) and Li₂ZrBr₆ also pass the feature F1–F4 cutoffs. The predicted stable structures (space group R3̅) contain isolated [HfBr₆] and [ZrBr₆] octahedra with interspersed Li. LiW₂Zn₄N₇ (Figure 6d, space group C2) forms a tetrahedrally bonded structure consisting of edge-connected [ZnN₄] and [WN₄] tetrahedra. The structure is derived from Cu₄NiSi₂S₇, which crystallizes in monoclinic distorted sphalerite superlattice.²⁸ NaLaP₄N₈ is yet another structure that fulfills F1–F4 cutoffs. The initial structure of NaLaP₄N₈ is created by decorating the BaSrFe₄O₈ trigonal (space group P3̅1m) structure,²⁹ with P occupying the tetrahedrally bonded Fe sites and La on the octahedrally coordinated Sr sites.

LiHfSc₂Br₁₁ (Figure 6f) satisfies feature cutoffs F1, F2, and F4 and is derived from the NaZnZr₂F₁₁ structure (space group C2m). NaZnZr₂F₁₁ is a known stable compound that has been experimentally realized and contains octahedrally coordinated Zn, which is uncommon. [ScBr₄] octahedra are highly distorted while [HfBr₄] octahedra are less so. Li₂HfN₂ has a layered structure consisting of face-sharing [HfN₆] octahedra interspersed with Li (Figure 6g). Interestingly, Li₂HfN₂ is predicted to be stable at the interface with Li-metal anode (reduction potential 0.0 V), but stable only up to an oxidation potential of 1.2 V. It also has a high conducting ion volume fraction (0.41). Together, these features make Li₂HfN₂ a promising candidate for Li-anode coatings. In fact, Li₂HfN₂ is a hypothetical structure that is predicted to be stable in the Materials Project database³⁰ and has been previously proposed as a candidate material for Li-anode coatings.³¹

KScS₂ (Figure 6h) and KAlS₂ pass feature cutoffs F1 and F3–F5 and adopt the α-NaFeO₂ structure (space group R3̅m). KScS₂ is thermodynamically stable with K-metal anode (reduction potential 0.0 V) and has an oxidation potential of 2.7 V. The layered structure of KScS₂ lends itself to a high K ion volume fraction of 0.31 and possibly facile K⁺ ion diffusion. The layered Na₂ZrO₃ (Figure 6i) and Na₂HfO₃ also fulfill F1 and F3–F5 and possess the Li₂SnO₃-type structure (space group C2/c). Due to its layered structure, Li₂SnO₃ has been studied as a promising cathode material.³² In fact, Na₂ZrO₃ is predicted to be a stable structure in the Materials Project database³⁰ and Y-doped Na₂ZrO₃ has been theoretically investigated as a Na-rich cathode material.³³ We predict that Na₂ZrO₃ should be stable against Na-metal anode, which is also confirmed by a phase stability analysis of the ternary Na–Zr–O chemical space on Materials Project.

Overall, we find that many of the 285 structures that are predicted to be stable contain group-3 (Sc, Y, La) and group-4 (Zr, Hf) elements. Most of the structures are halides, but we also find some oxides, chalcogenides, nitrides, and mixed-anion chemistries among the stable structures. The dominance of halides can be attributed to the ionic nature of the compounds containing alkali elements (Li, Na, K) and halogens—a direct consequence of the large electronegativity differences between them. Valence-balanced ionic compounds tend to have high formation enthalpies and therefore, are generally stable. Furthermore, we observe that the cations in the predicted structures adopt their preferred coordination with anions, e.g., Sc, Y, La in 6-fold coordination and 4-fold coordinated Zn in tetrahedral geometry.

2.6. Reinforcement Learning Optimization of Structures

Although in this study we were able to predict the stability of all 14.3 million decorated structures, other structure searches where a brute-force computation would be intractable require a more efficient approach. Examples include (i) cases where the prototype and composition libraries are much larger, leading to an explosion of potential decorated structures, (ii) a costlier evaluation function, and (iii) allowing decorated structures to go “off-prototype,” meaning structure parameters (e.g., cell shape, atomic positions) are allowed to change, leading to a potentially infinite search space. Here we demonstrate the use of reinforcement learning (RL) to improve the search efficiency in such applications.

RL, particularly methods based on a directed tree search such as Monte Carlo Tree Search (MCTS), enable precise control over the search space and function(s) to optimize. MCTS has previously been demonstrated to solve complex optimization problems on both organic^34,35 and inorganic materials.³⁶ We developed an action space for the crystal structure design problem based on the steps for generating a decorated structure through ionic substitution (see the Supporting Information). We then implemented an MCTS optimization framework to find structures with desired properties, similar to the implementation by Sowndarya et al. for designing organic molecules.^35,37 Following the approach of AlphaZero, this MCTS framework is augmented with a policy model that replaces the simulation phase (using a random policy) of MCTS with a predicted value score.

As AlphaZero was originally designed for competitive games, we used a ranked reward strategy to enable tabula rasa self-play for the single-player combinatorial optimization problem.³⁸ In this strategy, the final reward of a rollout is rescaled to 0 or 1 depending on whether the reward is greater than the 90th percentile of the last 500 results. Thus, starting from an initially random walk over structure search space, the rollouts are guided by the policy to higher-reward structures.

To search for optimal candidate structures, we also implemented a weighted reward function based on the desired features discussed in Section 2.5 (see the Supporting Information). We employed 90 rollout workers split across 5 CPU nodes for 4 h, with a single node equipped with dual Tesla V100 GPUs handling the continual training of the policy model. To improve the efficiency of the search, we continuously restricted the action space to structures not yet evaluated by any of the rollout workers. This resulted in 38 000 rollouts and ∼4.2 million structures evaluated (see Figure 7a and b for the rollout rewards and policy training losses).

Reinforcement learning (RL) structure optimization. (a) Crystal building rollout rewards and (b) losses for the policy model vs time as the optimization proceeds. In (a), the r₉₀ line in orange represents the cutoff at which the result was considered a win or loss. (c) Improvement in efficiency of RL compared to a brute-force search. Top candidates are structures with a decomposition energy < −0.1 eV/atom.

We examined the improvement in efficiency for finding top candidate structures (i.e., predicted decomposition energy < −0.1) relative to the number of structures explored during the search. The largest improvement in efficiency was achieved during the beginning of the search; 34% of top-candidates were found after exploring 8% of decorations—a 4× increase relative to a brute-force search (Figure 7c). The reason the RL agent is unable to continually improve the reward and search efficiency may be because it exhausts branches of the search tree that are more densely populated with high-reward structures and is forced into continual exploration through generally low-reward spaces.

3. Conclusions

In this study, we have demonstrated an approach to finding stable, functional inorganic crystal structures by combining a GNN surrogate model with RL-based structural search. These results confirm the success of our upper-bound minimization strategy, where starting from a low-energy but unrelaxed starting point yields even lower energy stable crystal structures. We therefore rely on upper-bound optimization to find low-energy structure inputs, sidestepping the difficult problem of mapping unrelaxed structures to their corresponding fully relaxed energies. We demonstrate the method by searching for candidates for solid electrolytes, a demanding application that requires materials to simultaneously satisfy several competing criteria. Our method reveals 285 novel structures, many of which appear reasonable when compared with structures currently being explored for this application. In an independent but related effort, Chen and Ong predicted the stability of 31 million hypothetical structures using the M3gnet framework.³⁹ Of our 285 DFT-confirmed compositions, 102 appear in the matterverse.ai web platform, with 47 of these predicted to have a negative decomposition energy. This agreement between methods with separate training data and computational approaches lends further evidence to the potential stability of these materials.

While the method presented here was demonstrated by searching over a discrete space defined by the available structural prototypes, combining our upper-bound optimization with more flexible generative methods^40,41 is likely to result in even more stable candidate structures. Even with our current element substitution approach, by continuing to feed fully relaxed structures back to the RL agent as possible new prototypes to be decorated, we continually expand the search space considered by the optimization. Future work will also expand the number of variables optimized by the RL algorithm, targeting the search toward functional battery materials.

4. Computational Methods

We start with an overview of the approach, then provide specific details. To find stable, functional structures, we first generate a pool of candidate structures from a set of prototype structures and a set of suitable compositions, second, train a GNN model to predict the total energy of each of these structures, and third, calculate their thermodynamic stability w.r.t. competing phases. An implicit assumption is that some of the structures in the pool are in a stable configuration already, and our approach is simply to identify them. For our SSB application, we calculate additional features and filter the stable structures identified by the model to those with features of interest.

4.1. GNN Architecture

We utilized a similar GNN architecture as was developed by Pandey et al.¹³ To input crystal structures to the model, each structure is converted into a graph where each atom site is a node and the 12 nearest sites of each atom in terms of raw distances (taking periodicity into consideration) constitute the edges. For node features, we use only the identity of the elements at each site, and for edges, we use the distance (in Å) between the two sites. We use six message passing layers in the GNN. One important difference from the GNN used by Pandey et al. is that we scale each structure such that the minimum distance between atoms is 1 Å, as was done in Pal et al.¹⁸ Thus, the model learns a scale-invariant version of the structures.

4.2. Data Sets

Here we describe the prototype structures, the battery compositions, and how we decorated the prototype structures.

4.2.1. Prototype Structures from ICSD

Inspired by prior studies on ionic substitution,^21,42,43 we constructed a library of prototype structures from the ICSD by first classifying them into composition types. Here, composition type is defined as the sorted stoichiometry that is agnostic of the element type. For example, structures with compositions A₃B₁C₂, A₁B₂C₃, and A₁B₃C₂ are categorized into the composition type “1–2–3”. The prototypes are limited to ordered (fully occupied lattice sites) and stoichiometric ICSD structures. At this stage, we also filtered out erroneous ICSD structures with multiple atoms occupying the same lattice site in a way that the total site occupation is larger than 1. The structures within each composition type were then deduplicated by comparing their space groups and sorted list of Wyckoff site labels. With this procedure, we constructed a prototype library containing 4000 composition types spanning > 13 000 structures.

4.2.2. Battery Compositions

Most well-known solid-state battery materials, including solid electrolytes and electrodes, are ternary and multinary compounds with a distinct conducting ion (C), and a structural framework composed of cations (F) and anions (A). For example, the solid-electrolyte Li₃ScCl₆ structure contains Li⁺ ions interspersed within a framework composed of [ScCl₆] octahedra. Therefore, we chose compositions of the general form C_xF_yA_z, where x, y, z are the number of C, F, and A per formula unit. For computational tractability, we limit the search to ternary, quaternary, and quinary compositions such that x + y + z ≤ 15. To summarize, each composition is composed of 3–5 elements: (1) one conducting ion, C, (2) 1–2 framework cations, F, and (3) 1–2 anions, A. Informed by common battery chemistries, the following elements and their oxidation states are chosen: (1) C = Li⁺, Na⁺, K⁺, (2) F = Sc³⁺, Y³⁺, La³⁺, Ti⁴⁺, Zr⁴⁺, Hf⁴⁺, W⁶⁺, Zn²⁺, Cd²⁺, Hg²⁺, B³⁺, Al³⁺, Si⁴⁺, Ge⁴⁺, Sn⁴⁺, P⁵⁺, Sb⁵⁺, and (3) A = F^–, Cl^–, Br^–, I^–, O^2–, S^2–, N^3–, P^3–. We only consider valence-balanced compositions in our search. In total, there are 220 824 valence-balanced compositions spanning 174 composition types.

4.2.3. Decorated Structures

For each composition, we generated decorated structures by considering all prototypes for the corresponding composition type (Section 4.2.1). For each prototype, we perform all possible decorations using ionic substitution where the elements in the prototype structure are replaced with the elements from the battery compositions (Section 4.2.2). A typical approach to generate new structures from prototypes is to establish a set of substitution rules where elements in the prototype are replaced with similar elements. In our application, we sought to remove potential human biases in the substitution process and instead constrained our substitutions solely based on valence-balanced stoichiometries. While this approach generates some structures that are very unstable and high in energy, these structures help the trained GNN to differentiate between high-energy and low-energy decorations during high-throughput screening.

For example, if the battery composition is Mg₂ZrO₄ and the chosen prototype is BaAl₂S₄ (ICSD # 35136, space group Pa3̅), we substitute Al with Mg, Ba with Zr, and S with O in the structure. To keep the decoration space tractable, we do not perform decorations by Wyckoff sites and simply decorate all Wyckoff sites associated with an element with the substituting element. We also consider all possible stoichiometric combinations when performing the decorations. For example, the “1–2–4” composition type allows only one unique decoration while the “1–1–1–1–1” composition type accommodates 120 unique stoichiometric decorations on the same prototype structure. We also limit the search to prototypes with less than 50 atoms in the unit cell. We note that although C_xF_yA_z compositions are inspired by the structure of well-known battery materials, we do not explicitly impose any bonding constraints (e.g., F bonded to A) in constructing the decorated structures. In total, the 220 824 valence-balanced compositions result in 14.3 million hypothetical decorated structures.

4.3. Training Data Set

We train a GNN model to predict the total energy of a given structure, which acts as a surrogate model for DFT volume-only relaxations. To train the GNN model, we selected a subset of hypothetical decorated structures for DFT relaxation to build a training data set. Here, we describe the training data sets comprising ICSD and hypothetical structures, and the two types of DFT relaxations performed on the hypothetical structures.

4.3.1. ICSD Structures

We used the same data set of ICSD structures and their total energies as was used by Pandey et al.,¹³ which consists of ∼14 000 structures available in the NREL Materials Database (NRELMatDB)^23,44 as well as ∼2500 structures for which additional DFT calculations were performed.

4.3.2. Decorated Structures for DFT Relaxation

We sampled a subset of the decorated structures (Section 4.2.3) to perform DFT calculations. For computational tractability, we sampled ternary, quaternary, and quinary compositions such that x + y + z ≤ 10. With these constraints, there are 914 valence-balanced battery compositions spanning 72 composition types and 150 345 decorated structures. For each composition type, we randomly selected up to 10 compositions in way that every element accommodated by that composition type (condition of valence balance) are sampled. For each composition, we then consider all prototype structures (Section 4.2.3) in the corresponding composition type. We selected ∼68 000 structures for full DFT relaxation and volume-only relaxation (Section 4.3.3).

4.3.3. DFT Relaxations

DFT relaxations for the hypothetical structures were performed with VASP.⁴⁵ Details of the calculations are previously published in refs (13) and (44). The constrained volume relaxation was also performed with VASP,⁴⁵ using the Atomic Simulation Environment Python package.⁴⁶ The optimization of the scalar volume was performed in a gradient-free fashion through repeated one-shot, self-consistent DFT calculations, using the Brent method implemented in scipy. Volumes were bounded between 10 Å³ (to prevent negative volumes) and two times a volume predicted with the data-mined lattice scheme (DLS) as implemented in Pymatgen.⁴⁷ A rough initial volume guess (prior to DLS volume prediction) was generated with a linear model on composition trained on ICSD structures. Structures that ran into the upper bound volume during the bounded optimization tended to be unstable, i.e., the energy continues to decrease as the volume increases, and were pruned from the database. Finally, we ensured the volume-energy curve was sufficiently smooth and removed structures where the minimum energy was more than 10 meV/atom lower than the second-lowest energy on the volume-energy curve.

4.3.4. Data Quality Control

A number of data quality control checks were performed to remove problematic structures and relaxations from the DFT database prior to fitting the GNN model. First, we removed calculations derived from different ICSD prototypes that relax to the same final structure upon full DFT relaxation. We observed that despite being given different initial prototypes, multiple relaxations for the same composition would occasionally converge to the same fully relaxed structure. As this complicates the accurate splitting of train and validation structures, we removed duplicated results by comparing their fingerprints after relaxation (Section 4.4). We used scikit-learn to recursively cluster all fully relaxed structures, using a cosine distance of 0.01 as the distance threshold and the maximum distance between clusters as the merging criterion. For each composition, only a single fully relaxed structure per cluster was kept (one with the lowest energy), resulting in 13 133 fewer DFT data points. Next, we removed 1391 unconstrained DFT relaxations where a lower energy was obtained from the constrained relaxation. These calculations indicated that the full DFT relaxation reached a local energy minimum.

We next removed crystals with energies and volumes well outside the expected range. We fit a robust linear model to predict total energy as a function of crystal composition using scikit-learn, and removed 1551 calculations (10 full-relaxed, 1,541 volume-relaxed) with either a residual energy less than −2 eV/atom or greater than 5 eV/atom, or a residual volume less than −20 Å³/atom or greater than 50 Å³/atom. The final data set sizes are as follows: 16 409 ICSD, 52 949 fully relaxed, and 58 669 volume-relaxed structures.

4.4. Structure Fingerprints and Distances

Similarity between structures was calculated using the Matminer Python package.²² Fingerprints for each site were calculated using a local order parameter fingerprint, and converted to a crystal-level fingerprint by taking the mean and standard deviation over each site. Notably, the fingerprint method did not consider the overall volume of the unit cell, nor the chemical identity of the element at each site. Distances between crystal structures were then calculated using the cosine distance method as implemented in scikit-learn.

4.5. Surrogate Model Training

Of the ∼128 000 training structures, we used stratified random sampling to hold out 1500 structures for validation and 1500 for testing. We also selected 100 compositions uniformly at random and held out their structures (1492). We trained the model with a batch size of 64 structures for 100 epochs over the training data. To optimize training, we used the AdamW algorithm with an initial learning rate of 10^–4, decayed by ∼10^–5 each update step. We set the weight decay to an initial value of 10^–5, also decayed by ∼10^–5 each update step.

Here we also provide details of the learning curve evaluation. For each of five repeats, we first held out 1500 structures using stratified sampling for testing. Then, for each of the 10 training set sizes, we subsampled the training data to that size using stratified sampling, and held out 5% of that data for validation when training the model.

Acknowledgments

We thank Vladan Stevanović for inspiring the creation of the structural prototype library. This work was authored in part by the National Renewable Energy Laboratory, operated by Alliance for Sustainable Energy, LLC, for the US Department of Energy (DOE) under Contract No. DE-AC36-08GO28308. The information, data, or work presented herein was funded in part by the Advanced Research Projects Agency-Energy (ARPA-E), U.S. Department of Energy, under Award Number DE-AR0001205. P.G. also acknowledges support from NSF through award DMR-2102409. The views expressed in the article do not necessarily represent the views of the DOE or the U.S. Government. The U.S. Government retains and the publisher, by accepting the article for publication, acknowledges that the U.S. Government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this work, or allow others to do so, for U.S. Government purposes.

Data Availability Statement

The code and data used to train the GNN models, the main results, and structures presented in this work are available at github.com/jlaw9/upper-bound-energy-gnn (see also doi.org/10.5281/zenodo.7089031). Code used to run the RL optimization is available at github.com/jlaw9/rl_materials.

Supporting Information Available

The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/jacsau.2c00540.

Plot comparing the conducting ion volume and ESW; description of the action space and reward function for the RL optimization of structures (PDF)

Author Contributions

Jeffrey N. Law: Investigation, Data Curation, Writing (Original Draft), Writing (Editing). Shubham Pandey: Data Curation, Writing (Editing). Prashun Gorai: Conceptualization, Investigation, Data Curation, Writing (Editing), Supervision, Project Administration. Peter C. St. John: Conceptualization, Investigation, Data Curation, Writing (Editing), Supervision, Project Administration.

The authors declare no competing financial interest.

Supplementary Material

au2c00540_si_001.pdf^{(189.4KB, pdf)}

References

Curtarolo S.; Hart G. L.; Nardelli M. B.; Mingo N.; Sanvito S.; Levy O. The High-throughput Highway to Computational Materials Design. Nat. Mater. 2013, 12, 191. 10.1038/nmat3568. [DOI] [PubMed] [Google Scholar]
Hautier G.; Jain A.; Ong S. P. From the Computer to the Laboratory: Materials Discovery and Design using First-principles Calculations. J. Mater. Sci. 2012, 47, 7317. 10.1007/s10853-012-6424-0. [DOI] [Google Scholar]
Jain A.; Shin Y.; Persson K. A. Computational Predictions of Energy Materials using Density Functional Theory. Nature Reviews Materials 2016, 1, 15004. 10.1038/natrevmats.2015.4. [DOI] [Google Scholar]
Gorai P.; Stevanovic V.; Toberer E. S. Computationally Guided Discovery of Thermoelectric Materials. Nature Reviews Materials 2017, 2, 17053. 10.1038/natrevmats.2017.53. [DOI] [Google Scholar]
Belsky A.; Hellenbrandt M.; Karen V. L.; Luksch P. New Developments in the Inorganic Crystal Structure Database (ICSD): Accessibility in Support of Materials Research and Design. Acta Crystallographica B 2002, 58, 364. 10.1107/S0108768102006948. [DOI] [PubMed] [Google Scholar]
Groom C. R.; Bruno I. J.; Lightfoot M. P.; Ward S. C. The Cambridge Structural Database. Acta Crystallographica Section B 2016, 72, 171–179. 10.1107/S2052520616003954. [DOI] [PMC free article] [PubMed] [Google Scholar]
Famprikis T.; Canepa P.; Dawson J. A.; Islam M. S.; Masquelier C. Fundamentals of inorganic solid-state electrolytes for batteries. Nature materials 2019, 18 (12), 1278–1291. 10.1038/s41563-019-0431-3. [DOI] [PubMed] [Google Scholar]
Wang S.; Xu H.; Li W.; Dolocan A.; Manthiram A. Interfacial chemistry in solid-state batteries: Formation of interphase and its consequences. J. Am. Chem. Soc. 2018, 140 (1), 250–257. 10.1021/jacs.7b09531. [DOI] [PubMed] [Google Scholar]
Koerver R.; Aygün I.; Leichtweiß T.; Dietrich C.; Zhang W.; Binder J. O.; Hartmann P.; Zeier W. G.; Janek J. Capacity fade in solid-state batteries: Interphase formation and chemomechanical processes in nickel-rich layered oxide cathodes and lithium thiophosphate solid electrolytes. Chem. Mater. 2017, 29 (13), 5574–5582. 10.1021/acs.chemmater.7b00931. [DOI] [Google Scholar]
Richards W. D.; Miara L. J.; Wang Y.; Kim J. C.; Ceder G. Interface stability in solid-state batteries. Chem. Mater. 2016, 28 (1), 266. 10.1021/acs.chemmater.5b04082. [DOI] [Google Scholar]
Li Y.; Canepa P.; Gorai P. Role of electronic passivation in stabilizing the lithium-li_xpo_yn_z solid-electrolyte interphase. PRX Energy 2022, 1, 023004. 10.1103/PRXEnergy.1.023004. [DOI] [Google Scholar]
Bartel C. J.; Trewartha A.; Wang Q.; Dunn A.; Jain A.; Ceder G. A critical examination of compound stability predictions from machine-learned formation energies. npj Computational Materials 2020, 6 (1), 97. 10.1038/s41524-020-00362-y. [DOI] [Google Scholar]
Pandey S.; Qu J.; Stevanović V.; John P. S.; Gorai P. Predicting energy and stability of known and hypothetical crystals using graph neural network. Patterns 2021, 2 (11), 100361. 10.1016/j.patter.2021.100361. [DOI] [PMC free article] [PubMed] [Google Scholar]
Xie T.; Grossman J. C. Crystal graph convolutional neural networks for an accurate and interpretable prediction of material properties. Phys. Rev. Lett. 2018, 120 (14), 145301. 10.1103/PhysRevLett.120.145301. [DOI] [PubMed] [Google Scholar]
Park C. W.; Wolverton C. Developing an improved crystal graph convolutional neural network framework for accelerated materials discovery. Physical Review Materials 2020, 4 (6), 063801. 10.1103/PhysRevMaterials.4.063801. [DOI] [Google Scholar]
Chen C.; Ye W.; Zuo Y.; Zheng C.; Ong S. P. Graph networks as a universal machine learning framework for molecules and crystals. Chem. Mater. 2019, 31, 3564–3572. 10.1021/acs.chemmater.9b01294. [DOI] [Google Scholar]
Chen C.; Zuo Y.; Ye W.; Li X.; Ong S. P. Learning properties of ordered and disordered materials from multi-fidelity data. Nat. Comp. Sci. 2021, 1 (1), 46–53. 10.1038/s43588-020-00002-x. [DOI] [PubMed] [Google Scholar]
Pal K.; Park C. W.; Xia Y.; Shen J.; Wolverton C. Scale-invariant machine-learning model accelerates the discovery of quaternary chalcogenides with ultralow lattice thermal conductivity. npj Computational Materials 2022, 8 (1), 48. 10.1038/s41524-022-00732-8. [DOI] [Google Scholar]
Gibson J. B.; Hire A. C.; Hennig R. G. Data-augmentation for graph neural network learning of the relaxed energies of unrelaxed structures. npj Comput. Mater. 2022, 211. 10.1038/s41524-022-00891-8. [DOI] [Google Scholar]
Saal J. E.; Kirklin S.; Aykol M.; Meredig B.; Wolverton C. Materials design and discovery with high-throughput density functional theory: the open quantum materials database (oqmd). Jom 2013, 65, 1501–1509. 10.1007/s11837-013-0755-4. [DOI] [Google Scholar]
Hautier G.; Fischer C.; Ehrlacher V.; Jain A.; Ceder G. Data mined ionic substitutions for the discovery of new compounds. Inorg. Chem. 2011, 50 (2), 656–663. 10.1021/ic102031h. [DOI] [PubMed] [Google Scholar]
Ward L.; Dunn A.; Faghaninia A.; Zimmermann N. E.; Bajaj S.; Wang Q.; Montoya J.; Chen J.; Bystrom K.; Dylla M.; Chard K.; Asta M.; Persson K. A.; Snyder G. J.; Foster I.; Jain A. Matminer: An open source toolkit for materials data mining. Comput. Mater. Sci. 2018, 152, 60–69. 10.1016/j.commatsci.2018.05.018. [DOI] [Google Scholar]
NREL Materials Database. materials.nrel.gov.
Ohno S.; Banik A.; Dewald G. F.; Kraft M. A.; Krauskopf T.; Minafra N.; Till P.; Weiss M.; Zeier W. G. Materials design of ionic conductors for solid state batteries. Progress in Energy 2020, 2 (2), 022001. 10.1088/2516-1083/ab73dd. [DOI] [Google Scholar]
Sendek A. D.; Yang Q.; Cubuk E. D.; Duerloo K.-A. N.; Cui Y.; Reed E. J. Holistic computational structure screening of more than 12,000 candidates for solid lithium-ion conductor materials. Energy Environ. Sci. 2017, 10, 306–320. 10.1039/C6EE02697D. [DOI] [Google Scholar]
Peljo P.; Girault H. H. Electrochemical potential window of battery electrolytes: the homo–lumo misconception. Energy Environ. Sci. 2018, 11, 2306–2309. 10.1039/C8EE01286E. [DOI] [Google Scholar]
Meyer G.; Ax P.; Cromm A.; Linzmeier H. Seven-coordinate trivalent rare earths: the phyllochlorides are2cl7 (a k, rb, cs; re sm, lu, y) and the crystal structure of iny₂cl₇. Journal of the Less Common Metals 1984, 98, 323–337. 10.1016/0022-5088(84)90306-0. [DOI] [Google Scholar]
Schäfer W.; Scheunemann K.; Nitsche R. Crystal structure and magnetic properties of cu₄nisi₂s₇. Mater. Res. Bull. 1980, 15, 933–937. 10.1016/0025-5408(80)90218-4. [DOI] [Google Scholar]
Hermann-Ronzaud D.; Bacmann M. La structure cristalline de BaSrFe₄O₈. Acta Crystallographica Section B 1975, 31, 665–668. 10.1107/S0567740875003585. [DOI] [Google Scholar]
Jain A.; Ong S. P.; Hautier G.; Chen W.; Richards W. D.; Dacek S.; Cholia S.; Gunter D.; Skinner D.; Ceder G.; Persson K. A. Commentary: The materials project: A materials genome approach to accelerating materials innovation. APL Materials 2013, 1 (1), 011002. 10.1063/1.4812323. [DOI] [Google Scholar]
Zhu Y.; He X.; Mo Y. Strategies based on nitride materials chemistry to stabilize li metal anode. Advanced Science 2017, 4, 1600517. 10.1002/advs.201600517. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zulueta Y. A.; Nguyen M. T.; Dawson J. A. Boosting li-ion transport in transition-metal-doped li₂sno₃. Inorg. Chem. 2020, 59, 11841–11846. 10.1021/acs.inorgchem.0c01923. [DOI] [PubMed] [Google Scholar]
Yin H.; Huang J.; Luo N.; Zhang Y.; Huang S. Theoretical study on y-doped na₂zro₃ as a high-capacity na-rich cathode material based on anionic redox. Phys. Chem. Chem. Phys. 2022, 24, 16183–16192. 10.1039/D2CP02219B. [DOI] [PubMed] [Google Scholar]
Zhou Z.; Kearnes S.; Li L.; Zare R. N.; Riley P. Optimization of molecules via deep reinforcement learning. Sci. Rep. 2019, 9, 10752. 10.1038/s41598-019-47148-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
S. V S. S.; Law J. N.; Tripp C. E.; Duplyakin D.; Skordilis E.; Biagioni D.; Paton R. S.; John P. C. S. Multi-objective goal-directed optimization of de novo stable organic radicals for aqueous redox flow batteries. Nature Machine Intelligence 2022, 4 (8), 720–730. 10.1038/s42256-022-00506-3. [DOI] [Google Scholar]
Dieb T. M.; Ju S.; Shiomi J.; Tsuda K. Monte carlo tree search for materials design and discovery. MRS Commun. 2019, 9 (2), 532–536. 10.1557/mrc.2019.40. [DOI] [Google Scholar]
Biagioni D.; Tripp C. E.; Clark S.; Duplyakin D.; Law J.; John P. C. S. graphenv: a python library for reinforcement learning on graph search spaces. Journal of Open Source Software 2022, 7 (77), 4621. 10.21105/joss.04621. [DOI] [Google Scholar]
Laterre A.; Fu Y.; Jabri M. K.; Cohen A.-S.; Kas D.; Hajjar K.; Dahl T. S.; Kerkeni A.; Beguir K.. Ranked reward: Enabling self-play reinforcement learning for combinatorial optimization. arXiv (Machine Learning), December 6, 2018, 1807.01672, ver. 3. 10.48550/arXiv.1807.01672 [DOI] [Google Scholar]
Chen C.; Ong S. P. A universal graph deep learning interatomic potential for the periodic table. Nat. Comput. Sci. 2022, 2, 718. 10.1038/s43588-022-00349-3. [DOI] [PubMed] [Google Scholar]
Dan Y.; Zhao Y.; Li X.; Li S.; Hu M.; Hu J. Generative adversarial networks (GAN) based efficient sampling of chemical composition space for inverse design of inorganic materials. npj Computational Materials 2020, 6 (1), 84. 10.1038/s41524-020-00352-0. [DOI] [Google Scholar]
Zhao Y.; Al-Fahdi M.; Hu M.; Siriwardane E. M. D.; Song Y.; Nasiri A.; Hu J. High-throughput discovery of novel cubic crystal materials using deep generative neural networks. Advanced Science 2021, 8 (20), 2100566. 10.1002/advs.202100566. [DOI] [PMC free article] [PubMed] [Google Scholar]
Gorai P.; Ganose A. M.; Faghaninia A.; Jain A.; Stevanovic V. Computational Discovery of Promising New n-type Dopable ABX Zintl Thermoelectric Materials. Mater. Horiz. 2020, 7, 1809–1818. 10.1039/D0MH00197J. [DOI] [Google Scholar]
Qu J.; Stevanovic V.; Ertekin E.; Gorai P. Doping by design: finding new n-type dopable abx₄ zintl phases for thermoelectrics. J. Mater. Chem. A 2020, 8, 25306–25315. 10.1039/D0TA08238D. [DOI] [Google Scholar]
Stevanović V.; Lany S.; Zhang X.; Zunger A. Correcting density functional theory for accurate predictions of compound enthalpies of formation: Fitted elemental-phase reference energies. Phys. Rev. B 2012, 85 (11), 115104. 10.1103/PhysRevB.85.115104. [DOI] [Google Scholar]
Kresse G.; Furthmüller J. Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. Phys. Rev. B 1996, 54 (16), 11169–11186. 10.1103/PhysRevB.54.11169. [DOI] [PubMed] [Google Scholar]
Hjorth Larsen A.; Mortensen J. J.; Blomqvist J.; Castelli I. E.; Christensen R.; Dułak M.; Friis J.; Groves M. N.; Hammer B.; Hargus C.; Hermes E. D.; Jennings P. C.; Jensen P. B.; Kermode J.; Kitchin J. R.; Kolsbjerg E. L.; Kubal J.; Kaasbjerg K.; Lysgaard S.; Maronsson J. B.; Maxson T.; Olsen T.; Pastewka L.; Peterson A.; Rostgaard C.; Schiøtz J.; Schütt O.; Strange M.; Thygesen K. S.; Vegge T.; Vilhelmsen L.; Walter M.; Zeng Z.; Jacobsen K. W. The atomic simulation environment—a python library for working with atoms. J. Phys.: Condens. Matter 2017, 29 (27), 273002. 10.1088/1361-648X/aa680e. [DOI] [PubMed] [Google Scholar]
Chu I.-H.; Roychowdhury S.; Han D.; Jain A.; Ong S. P. Predicting the volumes of crystals. Comput. Mater. Sci. 2018, 146, 184–192. 10.1016/j.commatsci.2018.01.040. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

au2c00540_si_001.pdf^{(189.4KB, pdf)}

PERMALINK

Upper-Bound Energy Minimization to Search for Stable Functional Materials with Graph Neural Networks

Jeffrey N Law

Shubham Pandey

Prashun Gorai

Peter C St John

Abstract

1. Introduction

2. Results and Discussion

2.1. Challenges in Predicting Thermodynamic Stability

Figure 1.

2.2. Volume-Only Structure Relaxations

Figure 2.

Figure 3.

2.3. Surrogate Model Predictions

2.4. DFT Confirmation of Predicted Stable Structures

Figure 4.

2.5. Novel Stable Structures

Figure 5.

Figure 6.

2.6. Reinforcement Learning Optimization of Structures

Figure 7.

3. Conclusions

4. Computational Methods

4.1. GNN Architecture

4.2. Data Sets

4.2.1. Prototype Structures from ICSD

4.2.2. Battery Compositions

4.2.3. Decorated Structures

4.3. Training Data Set

4.3.1. ICSD Structures

4.3.2. Decorated Structures for DFT Relaxation

4.3.3. DFT Relaxations

4.3.4. Data Quality Control

4.4. Structure Fingerprints and Distances

4.5. Surrogate Model Training

Acknowledgments

Data Availability Statement

Supporting Information Available

Author Contributions

Supplementary Material

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases