Data-guided design of double-atom catalysts for enhanced electrocatalytic performance

Chenyang Wei; Wenbo Mu; Hongyuan Zhang; Zhenghui Liu; Tiancheng Mu

doi:10.1039/D5TA03021H

View PDF Version

DOI: 10.1039/D5TA03021H (Paper) J. Mater. Chem. A, 2025, Advance Article

Data-guided design of double-atom catalysts for enhanced electrocatalytic performance†

Chenyang Wei^a, Wenbo Mu*^b, Hongyuan Zhang^a, Zhenghui Liu*^c and Tiancheng Mu*^ad
^aSchool of Chemistry and Life Resources, Renmin University of China, Beijing 100872, P.R. China. E-mail: tcmu@ruc.edu.cn
^bDepartment of Computer Science and Engineering, University of California San Diego, La Jolla, CA 92093-0404, USA. E-mail: wmu@uscd.edu
^cSchool of Pharmaceutical and Chemical Engineering, Taizhou University, Taizhou 318000, Zhejiang, China. E-mail: liuzhenghui@iccas.ac.cn
^dKey Laboratory of Green Chemical Media and Reactions, Ministry of Education, School of Chemistry and Chemical Engineering, Henan Normal University, Xinxiang, Henan 453007, P. R. China

Received 16th April 2025 , Accepted 10th July 2025

First published on 10th July 2025

Abstract

Double-atom catalysts (DACs) are promising electrocatalysts due to their synergistic metal–metal interactions and high atom utilization. However, the vast chemical space arising from diverse metal pairs and substrates presents a major challenge for rational design. Here, we combine high-throughput density functional theory (DFT) calculations with machine learning (ML) analysis to systematically investigate DACs for the CO₂ reduction reaction (CO₂RR), hydrogen evolution reaction (HER), and oxygen evolution reaction (OER). We establish a predictive ML framework capable of rapidly screening DAC candidates with near-DFT accuracy, enabling efficient evaluation across a wide range of substrates. Guided by ML and DFT approaches, we identify PtZn/N-C₃N₄ as a highly active OER catalyst with a theoretical overpotential of ∼0.15 eV, and CuNi/N-C₃N₄ as a top-performing bifunctional catalyst for overall water splitting. For CO₂RR, VTi/N-C₃N₄ shows a limiting potential approaching ∼0.15 V, close to the optimal volcano plot peak, along with strong HER suppression. In summary, this work offers key insights for the design of ACs, providing substantial time savings and demonstrating the immense potential of ML as a universally applicable tool in diverse energy-related fields.

Introduction

The increasing concentration of atmospheric CO₂, coupled with the demand for sustainable growth, has driven significant interest in electrocatalysis as a promising solution.^1–3 Electrocatalysis offers advantages such as high energy efficiency, excellent atom economy, and a reduced environmental footprint. However, the current catalyst design methodologies face challenges, particularly in achieving optimal reaction selectivity and mitigating material degradation under electrochemical conditions. These challenges necessitate the development of advanced catalysts that combine high activity with specificity to enhance electrocatalysis for widespread adoption.

Atomic catalysts (ACs), particularly double-atom catalysts (DACs), have garnered attention due to their exceptional metal atom utilization and enhanced catalytic activity.^4–9 By dispersing metal atoms across two-dimensional (2D) substrates, ACs offer a high density of active sites. DACs, which introduce a second metal atom, further enhance catalytic versatility and promote complex inter-metallic interactions, modifying the electronic and geometric properties of active sites. These changes allow for targeted optimization of catalytic performance, highlighting DACs as a promising platform for advancing catalytic processes. However, DACs face a significant challenge: the vast chemical space of possible combinations (Fig. 1), especially when considering diverse 2D substrates. Within this vast chemical landscape, some DAC configurations may suffer from thermodynamic or electrochemical instability, while others may fail to meet desired catalytic criteria. This brings us to a key question: How can we effectively predict the properties of DACs within such an expansive chemical space? Traditional density functional theory (DFT) calculations, while accurate, are computationally expensive, and relying solely on chemists' intuition often lacks the precision when predicting the effects of complex substrate–metal interactions.


	Fig. 1 Schematic representation of the theoretical design and analytical workflow for DACs, employing the integration of DFT calculations and ML methodologies: (a) illustration of the chemical design space of DACs; (b) data analysis using ML and DFT; (c) high-throughput DFT calculations and ML-based screening.

The rapid advancement of machine learning (ML) has positioned it as a transformative tool in material discovery, synthesis, and characterization.^10–17 ML efficiently detects patterns and relationships within large datasets, enabling rapid predictions and accelerating research. By uncovering complex relationships that are often beyond traditional models, ML has the potential to revolutionize scientific research. For instance, Google DeepMind's Graph Networks for Materials Exploration (GNoME) has enabled the creation of an extensive database of stable crystals, predicting approximately 220 million structures.¹⁸ Likewise, Amir Kotobi's team used graph neural networks (GNNs) to predict X-ray absorption spectra (XAS) of organic molecules, enhancing interpretability through class activation maps (CAM).¹⁹ Another example includes Tongtong Yang and collaborators, who proposed a ML approach leveraging spectroscopic descriptors to predict catalytic properties and achieve structural inversion.²⁰ These advancements highlight ML's potential to replace traditional labor-intensive methods and deliver substantial time efficiencies. Specifically, some instances indicate that ML has the promise in the design of ACs.^4,5,21 However, most studies have focused on transition-metal-based ACs on specific substrates or employed conservative doping strategies that minimally alter the substrate's structural framework (Fig. S1†). This trend is even more pronounced in DACs research, where simplified approaches facilitate the analysis of metal–metal interactions and the development of straightforward predictive frameworks. While these methods are effective in idealized scenarios, they diverge from real-world complexities. In practical applications, optimal ACs require exploring diverse substrate–metal combinations, as substrates can significantly alter the electron density and structure of metals, imparting unique properties to the catalysts.^22,23 These complexities pose significant challenges for ML, which must account for the dynamic interactions between substrates and metals to accurately reflect real-world conditions.

To address these challenges, this study employs four prototypical 2D graphitic carbon nitride (C_xN_y) substrates, which, to our knowledge, have not been extensively used in DACs. These substrates introduce significant modifications to the substrate structure and coordination environment surrounding the metal atoms, bringing the study closer to realistic material design processes. Using high-throughput DFT calculations, we initially investigate DAC-driven CO₂ reduction reaction (CO₂RR) activity across 46 distinct DACs, revealing that conventional descriptor-the Gibbs free energy change for CO adsorbates (ΔG_CO*) does not adequately capture CO₂RR performance. Consequently, we leverage ML to predict the limiting potential (U_L) of CO₂RR and estimate key stability metrics-binding energy (ΔE_bind), dissolution potential (U_diss), and the Gibbs free energy for H adsorbates (ΔG_H*)-which serve as proxies for thermal, electrochemical stability and competing hydrogen evolution reaction (HER) activity. In subsequent analyses, ML further dissects the adsorption behavior of intermediates, revealing that the d-electron count within bimetallic systems-via alterations in the d-band center and electron transfer-is crucial for modulating adsorption, while DAC stability is influenced by both substrate and intrinsic metal properties. Building on preceding insights, we identify the N vacancy in graphitic carbon nitride (N-C₃N₄) as a promising substrate for stabilizing bimetallic atoms and explore the oxygen evolution reaction (OER) activity, noting that traditional descriptors like OH adsorption Gibbs free energy (ΔG_OH*) fall short, thus prompting ML-based prediction of the theoretical overpotential (η^OER). Finally, we present an innovative framework that integrates high-throughput DFT with advanced ML techniques to predict DAC performance in key catalytic reactions. By employing symbolic regression via the PySR library, we enhance the interpretability of adsorption predictions for key intermediates-thereby laying the groundwork for future descriptor development-while reducing computation time by approximately 3750 times. Ultimately, our approach streamlines the DAC discovery process and offers an efficient pathway for identifying high-performance catalysts for sustainable energy applications.

Experimental

DFT calculations

In this study, spin-polarized DFT calculations were performed using version 6.3.0 of the Vienna Ab initio Simulation Package (VASP),²⁴ utilizing the projector augmented wave (PAW) method. All calculations were conducted with a set cutoff energy for plane waves of 500 eV. Electronic interactions were described utilizing the generalized gradient approximation (GGA) in conjunction with the Perdew–Burke–Ernzerhof (PBE) functional.²⁵ DFT+U corrections were not applied, consistent with common practice in high-throughput catalyst screening involving diverse transition metals. This decision ensures methodological consistency and computational efficiency across 3d, 4d, and 5d systems. This approach has been commonly adopted in recent high-throughput studies, where GGA-PBE functionals have been used without Hubbard U corrections to efficiently evaluate catalyst performance trends across a broad chemical space.^26–30 The Grimme's semiempirical DFT-D3 method was employed for dispersion correction to address long-range van der Waals (vdW) interactions.³¹ To preclude interactions between neighboring layers, a vacuum buffer of 15 Å was maintained along the z-axis. The Brillouin zone was sampled using a 2 × 2 × 1 Monkhorst–Pack k-point grid for geometric optimization, while a finer 4 × 4 × 1 k-point grid was selected for electronic structure calculations requiring higher precision. The convergence criterion for energy was established at 10⁻⁵ eV, and for forces, it was set at 0.02 eV Å⁻¹. The free energy change for each reaction step was computed utilizing the computational hydrogen electrode (CHE) model,³² as described in the ESI.† Solvent effects were not explicitly modelled in our DFT calculations, which were conducted under vacuum conditions within the CHE framework. This approximation is widely adopted in high-throughput electrocatalysis studies to enable efficient screening of large catalyst libraries, and has been shown to preserve meaningful energetic trends across diverse catalyst systems.^33–38 The thermodynamic stability of superior DACs was carried out by ab initio molecular dynamics (AIMD) simulation within the NVT ensemble at 400 K up to 10 ps with a time step of 2 fs.

ML models

Analysis models. In this study, all models for analyzing DFT data were developed using the Random Forest (RF) algorithm implemented via the Scikit-Learn package.³⁹ We selected RF for its relatively high interpretability and accuracy. Each model was fine-tuned using grid search and ten-fold cross-validation, with optimal hyperparameters listed in Table S3.†

The innovative prediction framework. Our innovative prediction framework (or ‘the prediction framework’) comprises four modules designed for distinct purposes: (1) and (2) evaluate whether DACs meet the prerequisites for thermal and electrochemical stability as well as high HER activity; (3) determine if DACs that satisfy both stability and low HER activity criteria exhibit high CO₂RR activity, making them viable CO₂RR candidates; and (4) recommend DACs with the optimal substrate among four candidate substrates, then use ML to assess whether they possess both high OER and HER activity for potential dual-functional water splitting. In Module 1, DAC stability is evaluated via multi-objective prediction. In Modules 2–4, we use the XGBoost algorithm enhanced with active learning to continuously improve model performance, supplementing limited DFT data with generated models to achieve accurate predictions. More details can be found in the ESI.†

The overall selection criteria are as follows:

(1) stability: ΔE_bind < 0, while U_diss > 0;

(2) high CO₂RR activity: l U_L > −0.4 eV;^40,41

(3) high HER activity: |ΔG_H*| < 0.15 eV;

(4) high OER activity:η^OER < 0.66 eV (using IrO₂ as the benchmark⁴²).

Feature engineering

In this study, we integrated both elemental and structural descriptors in our ML models to capture the characteristics of DACs. All models incorporate elemental information-including atomic mass, atomic number, and valence electronic configurations-to represent the intrinsic chemical properties of each atom (see Tables S1 and S2†). Formal oxidation states were not included as descriptors, as they are often ill-defined under DFT and strongly dependent on the local bonding environment, particularly in supported bimetallic systems. Instead, tabulated elemental properties such as d-electron count serve as robust and physically meaningful proxies for redox behavior, consistent with established practices in ML-based catalyst discovery.^26,43 To further probe oxidation-related effects, we conducted additional DFT analyses using Bader charge and projected density of states (PDOS), which provided insight into charge transfer and local electronic structure for selected DAC configurations. For structural descriptors, two distinct sets were employed for different purposes. For our analysis models, we used bond lengths around the bimetallic centers (i.e., distances between the two metals, between metals and surrounding atoms, and to adsorbates) to characterize the local bonding environment (Table S1†). In contrast, for predicting the catalytic properties of DACs, we employed two global descriptors: Coulomb Matrix (CM) and Smooth Overlap of Atomic Positions (SOAP).

Due to the high dimensionality of CM and SOAP, we applied dimensionality reduction techniques (PCA or t-SNE) to retain essential information while reducing the computational burden.

Coulomb matrix (CM). The Coulomb matrix is defined as follows:


	(1)

where Z_i is the atomic number of atom i, and R_i is its coordinate vector. This descriptor captures both the atomic identity and the interatomic distances, thereby representing the electrostatic environment of the system.

Smooth overlap of atomic positions (SOAP). The SOAP descriptor characterizes the local atomic environment by expanding the atomic density around each atom in a basis of radial functions and spherical harmonics. For an atom i the neighbor density is expanded as:


	(2)

where R_n(r) are radial basis functions, Y_l^m(θ, ϕ) are spherical harmonics, and c_nlm⁽ⁱ⁾ are the expansion coefficients. The rotationally invariant SOAP power spectrum is then computed as:


	(3)

which serves as a robust representation of the local atomic environment in a manner that is invariant to rotations.

Above is a table summarizing the descriptors and their respective applications (Table 1).

Table 1 The summary of each type of descriptors used in different models

Feature category	Description	Usage
Elemental info	Atomic number, or d-electrons number, et al.	Used in all models
Local structural info	Bond lengths around bimetallic centers	Used in analysis models only
Global structural info	CM	Used in the prediction framework only
Global structural info	SOAP	Used in the prediction framework only

Model interpretation and analysis

Correlation analysis. The Pearson correlation coefficient, denoted by p was utilized to quantify the linear relationship between two features. It is defined as:


	(4)

where v and w represent the two features under consideration, and [v with combining macron]

and

are their respective mean values. The coefficient p ranges from −1 to 1; a value with an absolute magnitude close to 1 indicates a strong linear correlation.

Feature importance analysis. Shapley Additive exPlanations (SHAP) is an established method for analyzing, interpreting, and visualizing feature contributions.^44–46 SHAP assigns an importance score to each feature according to:

y_pred = ȳ + f(x₁) + f(x₂) + ⋯ + f(x_n)

where y_pred symbolizes the predicted outcome based on the feature x, ȳ denotes the mean prediction across all data points, and f(x_n) quantifies the prediction contribution of the nth feature in x. Generally, the magnitude of a feature's SHAP absolute value correlates with its predictive importance; positive SHAP values suggest a positive contribution to the prediction, whereas negative values indicate a detractive effect.

Results and discussion

Setting the stage for DACs selection in CO₂RR applications

Before rapidly screening DACs for CO₂RR activity, it is essential to define the selection prerequisites. A common strategy is to employ descriptors that can quickly differentiate catalysts with high CO₂RR activity from those with significant hydrogen evolution reaction (HER) activity. For HER, ΔG_H* is widely recognized as a robust descriptor due to HER's relatively simple mechanism involving only H adsorption and desorption.⁴⁷ In contrast, CO₂RR proceeds via multiple steps involving intermediates such as COOH* and CO*, complicating the choice of a descriptor, a subject of ongoing debate.^48,49 To assess whether the widely accepted ΔG_CO* can reliably predict CO₂RR performance in our case, we analyzed its correlation with U_L, which directly quantify CO₂RR activity, on DACs across the selected four distinct substrates (Fig. 2). The analysis revealed an approximate volcano relationship between ΔG_CO* and U_L. However, beyond the volcano's peak (approximately −0.105 eV), the correlation weakens, likely due to variations in CO binding strength. This divergence originates from the fact that ΔG_CO*, as a single-intermediate descriptor, only reflects the thermodynamics of one step in the overall pathway. It fails to capture the energy landscape of other critical elementary steps-particularly COOH formation and protonation-which often act as the potential-determining steps (PDS) depending on the local binding environment. For DACs with strong CO adsorption (i.e., more negative ΔG_CO*), CO desorption becomes more energetically demanding, shifting the PDS to step 3 of CO₂RR-namely, CO release (CO* + H₂O → Slab + CO(g) + H₂O). In this regime, ΔG_CO* remains a relevant descriptor for U_L. Conversely, when CO binds weakly to the DAC surface, the PDS shifts to either step 1-protonation of CO₂ (Slab + CO₂(g) + 2H⁺ + e⁻ → COOH* + H⁺)-or step 2-protonation of COOH (COOH* + H⁺ + e⁻ → CO* + H₂O). In these cases, ΔG_CO* no longer governs the kinetics, leading to a diminished correlation with U_L and limiting its predictive utility.


	Fig. 2 Free energy diagrams and volcano plots for CO₂RR on various DACs; panels (a)–(d) display the relative free energy profiles for CO₂RR catalyzed by DACs on different substrate configurations: (a) CN, (b) C₂N, (c) g-C₃N₄, and (d) N-C₃N₄. Panel (e) shows the volcano plot illustrating the relationship between U_L and ΔG_CO* across the studied DAC systems.

In contrast, the limiting potential U_L inherently captures the thermodynamic span of all elementary steps and explicitly identifies the highest-energy barrier across the entire reaction pathway. This makes it a more comprehensive and physically meaningful descriptor for multi-step reactions like CO₂RR, where the PDS can shift dynamically depending on catalyst composition or coordination environment, making it a more robust descriptor for CO₂RR. To improve high-throughput prediction in this context, we further leveraged ML to construct composite descriptors that integrate electronic and geometric, features-beyond single-adsorbate energetics-to more accurately represent the complex multi-step nature of CO₂RR on diverse DAC surfaces.

To support this framework, we constructed a comprehensive dataset encompassing both stability and reactivity metrics. In addition to U_L, we computed the ΔE_bind and U_diss of DACs to evaluate their stability. A more negative ΔE_bind indicates enhanced thermal stability and a higher likelihood of experimental validation, while a positive U_diss suggests metal resistance to dissolution during electrochemical processes. An illustrative heatmap (Fig. S2†) compares these stability metrics across various DACs. Additionally, we computed ΔG_H* for approximately 120 DACs to quantify their HER activity, thereby providing further data for subsequent ML modeling.

Elucidating DAC parameters: a symbiosis of ML and DFT insights

ML transcends basic prediction; when interpreted properly, it provides profound insights into data. By employing SHAP, we can identify the features most influential in our ML models. Although our previous analysis indicates that ΔG_CO* is not a flawless descriptor for CO₂RR activity, it remains crucial for two reasons: first, ΔG_CO* reflects CO binding strength and thus helps pinpoint the PDS in CO₂RR; second, when ΔG_CO* falls below approximately −0.105 eV, it correlates reliably with CO₂RR performance. This underscores the importance of examining CO adsorption properties, so we focus our ML models on this intermediate with SHAP analysis guiding our interpretation.

SHAP analysis (Fig. 3b) highlights that descriptors related to the coordination environment-such as the ratio of nitrogen atoms near the bimetallic center (denoted ‘number of N’)-are highly influential. The high SHAP value associated with the nitrogen ratio indicates that nitrogen atoms are essential for modulating the electronic configuration of the bimetallic centers, which directly influences their ability to adsorb intermediates like CO. Previous studies have demonstrated how nitrogen atoms in the substrate can interact with the metal's d-electrons, enhancing or reducing CO adsorption strength.^50–52 Furthermore, the contribution of transition metal d-electrons to predicting ΔG_CO* suggests that both the nitrogen environment and the d-electron configuration of the metals are crucial for optimizing catalytic performance. By selecting appropriate substrates and metal pairs based on these features, we can design more efficient catalysts for reactions like CO₂ reduction.


	Fig. 3 Comprehensive analysis of electrocatalytic descriptors for the CO₂RR process on DACs; (a) heatmap of p among various electron-related features. The color gradient (ranging from red to blue) indicates the magnitude of p, with intense colors at the extremes reflecting strong correlations (the complete heatmap is shown in ESI Fig. S8a†). (b) Bar plot of the top 11 features' relative importance based on SHAP values from the ML analysis of ΔG_CO* for DACs, highlighting the most influential descriptors (full SHAP values for all features are provided in ESI Fig. S8b†). (c) Heatmap depicting the relationship between l_C–O and the d-electron counts of Metal 1 (M1) and Metal 2 (M2). The accompanying color bar represents the range of l_C–O, with the red region indicating elevated CO activation. (d) The four selected DACs for CO₂RR discussed in this study: (i) PtPt_C₂N, (ii) PtNi_C₂N, (iii) NiSc_g-C₃N₄, and (iv) NiW_g-C₃N₄. (e) & (f) Correlation analyses of ε_d, \|e\|, and ΔG_CO*: panels (e) and (f) display data points colored red, green, blue, and orange for DACs with CN, C₂N, g-C₃N₄, and N-C₃N₄ substrates, respectively. The shaded area in (f) highlights DACs with ε_d near E_F. Together, panels (e) & (f) elucidate the interrelationships among these key catalytic descriptors.

To substantiate the impact of d-electrons on CO adsorption, we applied the Pearson correlation coefficient (p) to evaluate the relationships between the number of outermost electrons (N_e), s-electrons (θ_s), d-electrons (θ_d), and the C–O bond length (l_C–O) (Fig. 3a). Our findings reveal a strong correlation between the number of d-electrons in transition metals and l_C–O-a trend not observed for s-electrons-suggesting that d-electrons uniquely promote CO activation by elongating the C–O bond. Bader charge analysis (Table S4†) further confirms that transition metal d-electrons predominantly donate charge to CO, enhancing its activation. Moreover, correlating Bader charge transfer (|e|) with l_C–O shows that greater d-electron transfer corresponds to longer C–O bonds, signaling amplified CO activation (Fig. S3†). A heatmap (Fig. 3c) further illustrates a critical balance in the electronic configuration of bimetallic systems: neither an excess nor a deficiency of d-electrons favors optimal CO activation, as fully occupied d-orbitals impede activation while too few d-electrons lead to insufficient electron donation, and consequently, is detrimental to CO₂RR efficiency.

Subsequently, to elucidate how bimetallic atoms influence adsorption behavior and electronic structure, we computed the PDOS (Fig. S4† and 3d). These PDOS plots reveal the interaction between the two metals-evident from overlapping regions-and demonstrate how bimetallic structures modulate the d-band center (ε_d), as summarized in Table S5.† For example, in Pt–Pt and Pt–Ni combinations on a C₂N substrate (Fig. 3di and ii), replacing one Pt atom with Ni shifts the remaining Pt's ε_d closer to the Fermi level (E_F), resulting in the overall ε_d being adjusted toward E_F, thereby altering ΔG_CO*. A similar trend is observed in other metal pairings (e.g., NiSc and NiW on g-C₃N₄ substrates), underscoring the mutual influence of the metal partners. Such ε_d shifts, by modulating the adsorption strength of CO, directly influence CO₂RR activity through changes in intermediate stabilization and desorption energetics. Similar effects on OH adsorption are expected in OER, as discussed later.

To gain a more integrated understanding of the factors influencing ΔG_CO*, we examined the correlations among ΔG_CO*, ε_d, and |e| (Fig. S5†, 3e and f). Fig. S5† presents a synthesized view of these parameters, revealing a volcano relationship between ΔG_CO* and ε_d. Notably, ε_d has a more substantial impact on ΔG_CO* than |e|, in line with the majority of related works that position ε_d as an informative descriptor for adsorption energetics for transition metals; although higher ε_d generally corresponds to increased |e|, the relationship is not strictly linear. Fig. 3e further illustrates a volcano-shaped dependence, where a smaller gap between ε_d and E_F facilitates electron migration from the metal site to the adsorbate, thereby enhancing CO adsorption. This observation is supported by Fig. 3f, which shows that DACs with ε_d near E_F exhibit significant charge transfer |e|, emphasizing the role of electron displacement in CO activation.

Interestingly, certain DACs enriched with pre-transition metals (e.g., Zr, Ti, Sc) display a positive ε_d due to their incompletely occupied d-orbitals, resulting in empty d-bands above E_F. In DACs such as NbZr_g-C₃N₄, VZr_CN, and FeSc_CN, these unpaired d-electrons are more available, leading to higher Bader charge transfer values (Fig. 3f) and affect ΔG_CO* and CO₂RR activity. Aside from these anomalies, the overall volcanic trend between ε_d and ΔG_CO* is evident. Moreover, we hypothesize that these exceptions may partly arise from alternative CO adsorption modes: while most DACs adopt an end-on conformation (with C as the adsorption site), the anomalous DACs favor side-on adsorption (Fig. S6†).

Finally, we evaluated ΔG_H* for a subset of 12 DACs along with their ε_d and |e| values (Fig. S7†). These results indicate that a ε_d closer to E_F enhances hydrogen adsorption and that d-electron migration modulates ΔG_H* in a manner similar to CO adsorption. The high SHAP values associated with d-electrons further reinforce these findings.

For analyzing ΔE_bind, our ML model (Fig. 4b) indicates that both the spatial distance between transition metals and substrate atoms, and the distance between the two metal atoms themselves, are crucial. This is intuitively plausible: if the metal atoms are too close or if they possess excessively large atomic radii, strong repulsive forces may arise, compromising DAC stability. Concurrently, substrate-related features are also highly influential, underscoring the critical role of substrate type and structure. To clearly present these findings and reinforce our ML analysis, we detail ΔE_bind in Fig. 4a. Both the substrate and the transition metals type significantly affect ΔE_bind, consistent with our ML insights. Among the substrates, N-C₃N₄ exhibits a notably lower ΔE_bind compared to others, suggesting a stronger potential for securely anchoring dual metal atoms.⁵³ We postulate that this enhanced binding is attributed not only to the large pore structures and surface area of N-C₃N₄, but also critically to the presence of nitrogen vacancies, which introduce undercoordinated carbon atoms and locally distorted triazine units that provide flexible, asymmetric binding pockets for dual-metal anchoring. These defect-induced coordination environments enable diverse metal–support interactions and facilitate charge transfer from the substrate to the metal centers, thereby stabilizing the metal pair and modulating their oxidation states.^54,55 This dual effect-structural adaptability and electronic enrichment-underlies the superior anchoring capability of N-C₃N₄ observed in our ΔE_bind analysis.


	Fig. 4 Detailed evaluation of ΔE_bind and descriptor significance in DACs; (a) compilation of ΔE_bind values for DACs with various substrate structures. Data points are color-coded: red, green, blue, and orange represent DACs on CN, C2N, g-C₃N₄, and N-C₃N₄ substrates, respectively, facilitating comparison across different substrates. (b) Bar chart of the top 11 features ranked by relative importance, as determined by SHAP values from the ML model for ΔE_bind analysis. This chart highlights the most influential factors affecting ΔE_bind predictions. For a full comparison of SHAP values for all features, refer to ESI Fig. S9.†

Furthermore, our analysis reveals that the atomic radii of the di-metal pairs impact ΔE_bind. Metal pairs with similar radii tend to have comparable ΔE_bind values, while those with significant radii differences show pronounced disparities. For instance, on the CN substrate, metal pairs such as NiFe, NiCo, CoFe, and FeFe-having similar atomic radii-exhibit similar ΔE_bind values. A similar pattern is observed for NiCo and NiFe on N-C₃N₄ and for CoPt, CrPt, and CuPt on C₂N. Conversely, metal pairs with large radii differences (e.g., ZrV and ZrNb on N-C₃N₄, or NbSc, NiSc, VFe, and VW on g-C₃N₄) display substantial variations in ΔE_bind.

Finally, the d-electron configuration of transition metals also influences DAC thermal stability. Transition metals such as Zn, Cd, Ag, and Au typically have a stable configuration with 10 d-electrons, which limits their ability to bond with surrounding atoms and hinders DAC formation, often resulting in altered ΔE_bind values and distorted configurations.

Probing the OER facilitated by DACs

The oxygen evolution reaction (OER) is critical for sustainable energy, and DACs with both strong OER and HER activity are particularly promising for overall water splitting.^56–58 Based on previous analyses, we identified DACs with N-C₃N₄ substrates as exceptionally stable; hence, our focus here is on these systems.

Since OER activity is closely linked to the adsorption behavior of intermediates (OH*, O*, and OOH*) within the associative mechanism⁵⁹ and ΔG_OH* is a common descriptor for OER,^60,61 we first investigated ΔG_OH* in DACs with N-C₃N₄. Given that the OH adsorption strength also depends on the d electron count in transition metals,^14,61 we performed DFT calculations across diverse DACs to correlate ΔG_OH* with the number of d electrons. However, the presence of dual metal sites in DACs makes it challenging to determine the exact d electron count, unlike in single-atom catalysts' system. To address this, we developed a preliminary method to identify the dominant metal for OH adsorption by measuring the distances between the adsorbed OH and each metal atom. For example, in a MoFe DAC, the proximity of OH to Fe allowed us to assign a d electron count of 6 based on Fe's configuration (Fig. S10†). Cases with ambiguous adsorption preference were excluded from the initial analysis. The resulting violin plot in Fig. 5a shows that OH adsorption strength generally decreases as the d electron count increases, although potential biases from data exclusion warrant cautious consideration about this conclusion.


	Fig. 5 Correlative analysis, electronic structure characterization, and scaling relationship of OH adsorption on DACs; (a) & (b) relationship between ΔG_OH* and d-electrons: (a) shows ΔG_OH* plotted against the number of d-electrons from the transition metal closest to the adsorbate, which is assumed to predominantly influence adsorption. (b) Extends this analysis by incorporating the descriptor ϕ, which considers the combined contributions of both transition metals to provide a more comprehensive view of their impact on binding strength. (c) Graphical summary of the correlations among ε_d, \|e\|, and ΔG_OH, illustrating the interplay between electronic structure descriptors and adsorption energetics. (d) & (e) Scaling relations for oxygenated intermediates in the OER process: (d) shows the correlation between ΔG_OOH and ΔG_OH, and (e) ΔG_O and ΔG_OH, revealing linear trends among the adsorption Gibbs free energies. (f) Correlation between η^OER and ΔG_OH, depicting a volcano-type trend, albeit subdued, for OER activity in these DACs. (g) PDOS plots for selected DACs discussed in the manuscript, with the corresponding calculated ε_d indicated on the right.

To more comprehensively probe the relationship between ΔG_OH* and d electron count, we refined our method by introducing a weighted descriptor ϕ. In this approach, weights are assigned to the d electron count of each metal based on its proximity to the adsorbed O atom, since OH typically adsorbs with the O atom closest to the metal. Detailed methodology is provided in the ESI note accompanying Fig. S10.† This refined approach allowed us to include DACs where preferential OH adsorption was previously ambiguous. As illustrated in Fig. 5b, the data now more clearly reveal that a higher average d electron count correlates with weaker OH adsorption. Complementary SHAP analysis (Fig. S11†) further supports the critical role of d electron count and the spatial relationship between the adsorbed OH and the metal sites in determining ΔG_OH*.

To gain a refined understanding of adsorption in DACs, we selected six representative systems for detailed analysis to re-establish the relationship between ΔG_OH*, ε_d, and |e| (Fig. 5c and g). Our findings are twofold: (1) a ε_d closer to E_F correlates with stronger OH adsorption; and (2) increased d-electron transfer-as indicated by a higher |e|-correlates with a lower ΔG_OH*. However, these parameters do not exhibit a strict linear correlation, suggesting that additional factors (e.g., varied adsorption modes and steric hindrances in different DAC structures) also influence OH adsorption. An R² of approximately 0.6 (Fig. 5b) further indicates that, while the d-electron count is a key determinant, it alone cannot fully capture OH adsorption strength. Ultimately, to validate ϕ's rationality for essential adsorption characteristics, we examined its correlations with ε_d and |e| (Fig. S12†). We found that a higher ϕ corresponds to a more negative ε_d (i.e., shifted further from E_F), which attenuates OH adsorption-a trend consistent with previous findings linking increased d-electron presence to larger ΔG_OH*. Moreover, a higher ϕ is associated with reduced charge transfer, reflecting diminished electron donation as ε_d moves away from E_F. Thus, ϕ not only reflects the overall adsorption energy but also encapsulates intrinsic factors governing adsorption, validating it as generally adept and rational parameter for assessing OER activity in DACs with single substrate.

Following our examination of the correlation between ΔG_OH* and d electrons, we extended our investigation to the remaining OER intermediates-namely, O* and OOH* and their relationship with OH*. Accordingly, we computed the energy barriers along the OER pathway for approximately 30 DACs (Fig. S13†). Concurrently, we calculated the Gibbs free energy changes for O* and OOH* (ΔG_O* and ΔG_OOH*, respectively) and visualized their relationships with ΔG_OH* in Fig. 5d and e. The results indicate a linear relationship between ΔG_OH* and both ΔG_O* and ΔG_OOH*, although the correlation for ΔG_O* is notably weaker than that for ΔG_OOH*. Traditionally, on transition metal surfaces, the robust linear relationships of both yield a characteristic volcano plot correlating ΔG_OH* with η^OER, thereby establishing ΔG_OH* as a reliable descriptor of OER activity.²² However, in the DACs studied here, these relationships are obscured; when examining the correlation between η^OER and ΔG_OH* (Fig. 5f), the expected volcano trend is not clearly observed. This obscured trend underscores the complexity of intermediate adsorption in DACs, where adsorbates may bind to one or both metal atoms or even to the substrate, deviating from conventional transition metal surface models.

Constructing an integrative ML paradigm for DAC blueprinting

In our previous analysis, we examined the CO₂RR, HER, stability, and OER activities of DACs on various substrates. Our results indicated that relying solely on simple descriptors-such as adsorption energies of key intermediates or elemental properties only-fails to predict catalyst performance accurately. Consequently, integrating ML became essential. However, representing the complex structural information of DACs poses a challenge: manually inputting data for numerous atoms is impractical, while focusing only on atoms near the central bimetallic sites sacrifices critical structural detail. Our initial, basic analysis ML models could not capture such complexity, limiting their regression performance. To overcome this, we developed an advanced ML framework that incorporates DFT-derived data, enabling fast and accurate predictions for DAC material design while reducing reliance on labor-intensive DFT calculations across chemical space.

In summary, our workflow (Fig. 6a) consists of several modules built on the XGBoost algorithm, which performs well on small to medium-sized datasets and avoids overfitting common with neural networks. XGBoost's numerous hyperparameters allow extensive optimization for superior performance. We extract optimized atomic coordinates and unit cell information in batches and convert these data into either CM or SOAP-two widely used feature construction methods in DFT. Our comparison reveals that SOAP outperforms CM (Fig. 6b) because it accounts for crystal structure repeatability and complex van der Waals interactions, thereby better simulating the true atomic environment for crystal materials. t-SNE visualization of SOAP features (Fig. 6c) shows that the four clusters correspond to the four substrate types in our dataset, with both g-C₃N₄ and N-C₃N₄ grouped distinctly on the right, and CN and C₂N on the left, indicating that SOAP effectively retains the key structural information of these materials. We also tested the impact of hyperparameter tuning on model performance, investigating whether CM could outperform SOAP. However, even after additional hyperparameter tuning (using Optuna for optimization, Table S6†), the XGBoost model with SOAP consistently outperforms the one using CM. This confirms that capturing structural repeatability is crucial for materials feature engineering. Nonetheless, SOAP has limitations due to its high storage requirements and longer computational times, highlighting the need for future advancements in feature engineering methods.


	Fig. 6 The innovative prediction framework, module performance, and PySR-generated formula; (a) overview of the innovative prediction framework. (b) Comparison of accuracy between CM and SOAP feature construction methods. (c) t-SNE visualization of SOAP features for the four substrates studied, with different metal pairs. (d) Prediction results for each metric (ΔE_bind, U_diss, ΔG_H, U_L and η^OER), with the blue-grey shaded area indicating satisfaction of the screening criteria. (e) Prediction performance of each ML model. (f) Selected examples of 5 N-C₃N₄ DACs with the comparison between DFT and ML prediction. Blue shaded area indicates ideal η^OER or U_L, while red shaded area denotes high ΔG_H activity. (g) Summary of PySR-generated formula and the corresponding key intermediates' adsorption energies. This figure presents simplified versions of the formulas, with more detailed and high-precision fitting results provided in Table S10 and Fig. S14.†

In our framework, we employed an active learning strategy that iteratively selects the most uncertain examples (uncertainty sampling) for each learning loop. This approach, effective even when data are generated via generative diffusion models (Fig. S15†), continuously refines model predictions by focusing on uncertain cases. To ensure the reliability of the generated data, we performed a comprehensive analysis of the samples produced by the diffusion mode (Table S7 and Fig. S15†). The analysis involved three key metrics: Maximum Mean Discrepancy (MMD), Average Cosine Similarity (ACS), and Nearest-Neighbor Consistency (NNC), along with t-SNE visualization of generated data. These results confirm that the generated data align closely with the real data, thereby ensuring their suitability for model training and validation. In large-scale DFT predictions, active learning significantly reduces computational costs and broadens the model's applicability. Despite its reliance on uncertainty estimation, its iterative nature provides a strong theoretical and practical foundation for automated materials exploration. Using this strategy, we achieved high-precision predictions for DAC properties (Fig. 6d, e and Table S8†) sequentially: first assessing DAC stability (ΔE_bind and U_diss) using multi-objective predictions, then evaluating HER activity (ΔG_H*) and the CO₂RR limiting potential (U_L) with single-objective predictions to rapidly screen promising CO₂RR electrocatalytic DACs. Furthermore, by integrating these predictions with ranking recommendations (Table S9†), we quickly identified the most stable catalysts, particularly those with potential dual functionality for overall water splitting (the purple dots located in the blue shadow in Fig. 6d). Additionally, we selected five DACs with the N-C₃N₄ substrate, which were proved to be stable with AIMD analysis (Fig. S16†), and compared DFT-calculated and ML-predicted values (Fig. 6f). The results show minimal discrepancies between the calculated and predicted values. Notably, CuNi emerges as the most promising bifunctional water splitting catalyst, while CuPd and PtZn metal pairs excel only in OER performance. Among them, PtZn demonstrates an η^OER close to 0.15 eV, significantly outperforming other materials in OER. Both PtMn and VTi exhibit reasonable U_L values, but VTi stands out with strong CO₂RR activity and HER resistance, with its CO₂RR limiting potential approaching −0.15 V, nearly the peak of the theoretical volcano plot, while PtMn significantly underperforms in CO₂RR for its low HER resistance.

However, we still encountered a challenge: while high-dimensional descriptors such as SOAP and CM enhance accuracy by retaining more information, they also compromise interpretability for their high dimension. To address this, we incorporated High-Performance Symbolic Regression (PySR) into our framework. By integrating prior knowledge from SHAP analysis and chemical logic, we provided PySR with key descriptors most likely to influence key intermediate adsorption, achieving high-precision fits (Table S10† and Fig. 6g). The interpretability offered by PySR yields valuable insights into adsorption patterns; furthermore, focusing on atomic features in the immediate vicinity of the adsorbate suggests that the adsorption strength in DACs is primarily determined by the local bimetallic environment. This makes the selection of such features a more efficient approach for ML model construction, with minimal loss in predictive accuracy. In particular, we emphasize that the “simple” versions of the PySR models rely solely on elemental properties such as electronegativity, electron affinity, and the number of valence d-electrons-features that can be directly obtained from publicly available databases (e.g., the NIST database) without requiring prior DFT calculations. This enables rapid, interpretable estimation of catalytic behavior without incurring the computational cost of quantum mechanical simulations. Consequently, these compact symbolic expressions can serve as efficient surrogates for predicting adsorption energetics of key intermediates, allowing researchers to pre-screen candidate DACs at negligible cost. This balance between transparency and computational simplicity makes the simple models particularly suitable for early-stage catalyst discovery.

Importantly, our ML framework significantly expedites the identification of potential CO₂RR and water-splitting DACs. As shown in Fig. S17,† our strategy approximately outperforms complex DFT calculations by a factor of 3750, underscoring its efficiency. Note that the DFT computational time referenced pertains only to the DACs in our dataset; expanding DFT analysis to cover the entire chemical space would lead to an exponential increase in computational time and cost, rendering it impractical for real-world applications. We therefore anticipate that ML will increasingly supplant less efficient DFT methods and become increasingly prevalent in the discovery and development of novel materials systems.

Conclusions

In summary, our study provides pivotal insights into the design and optimization of DACs for various electrocatalytic processes. By integrating high-throughput DFT calculations with advanced ML algorithms for analysis, we studied numerous metal/substrate combinations and identified key parameters that govern catalytic efficiency. Regarding DACs for CO₂RR, our research challenges the traditional reliance on ΔG_CO* as the sole determinant of catalytic activity, highlighting the inherent complexity of these processes. We found that the number of d-electrons in transition metals critically influences CO adsorption by modulating ε_d, as evidenced by variations in Bader charge analyses. These variations lead to distinct C–O bond lengths and diverse CO binding strengths, which in turn profoundly impact CO₂RR activity-a trend that parallels observations in DAC-mediated HER processes. Furthermore, our ML approaches reveal that DAC stability is largely governed by the nature of the supporting substrate and the metal's atomic radius, primarily due to atomic repulsion. Notably, configurations with a fully occupied outer d-shell (10 d-electrons) exhibit diminished bonding strength. In the context of OER, our study demonstrates that ΔG_OH*, despite being influenced by the transition metals' d-electrons, is insufficient as a standalone metric for OER activity because complex adsorption modes introduce discrepancies that weaken the expected volcano relationship.

Moreover, in the latter part of our study, we established a ML framework not only delivers precise catalytic predictions consistent with theoretical results but also substantially reduces computational costs, underscoring ML's transformative potential in catalyst development. Ultimately, the methodologies and insights from this study chart a definitive course for using ML to advance the design of atomic catalysts for energy conversion and storage, a key step toward a sustainable energy paradigm.

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Author contributions

Chenyang Wei: writing – original draft, validation, methodology, investigation, visualization, data curation. Wenbo Mu: conceptualization, methodology, formal analysis, investigation. Hongyuan Zhang: formal analysis, visualization, data curation. Zhenghui Liu: supervision, resources, conceptualization. Tiancheng Mu: writing – review & editing, supervision, resources, project administration, funding acquisition.

Conflicts of interest

There are no conflicts to declare.

Acknowledgements

The authors thank the National Natural Science Foundation of China (22238011) and Open Research Fund of School of Chemistry and Chemical Engineering, Henan Normal University for financial support.

Notes and references

N. J. Harmon and H. Wang, Angew. Chem., Int. Ed., 2022, 61, e202213782 CrossRef CAS PubMed .
Y. Li, H. Wang, C. Priest, S. Li, P. Xu and G. Wu, Adv. Mater., 2021, 33, e2000381 CrossRef PubMed .
S. Navarro-Jaen, M. Virginie, J. Bonin, M. Robert, R. Wojcieszak and A. Y. Khodakov, Nat. Rev. Chem., 2021, 5, 564–579 CrossRef CAS PubMed .
L. Wu, T. Guo and T. Li, Adv. Funct. Mater., 2022, 32, 2203439 CrossRef CAS .
M. Sun, A. W. Dougherty, B. Huang, Y. Li and C. H. Yan, Adv. Energy Mater., 2020, 10, 1903949 CrossRef CAS .
Q.-P. Zhao, W.-X. Shi, J. Zhang, Z.-Y. Tian, Z.-M. Zhang, P. Zhang, Y. Wang, S.-Z. Qiao and T.-B. Lu, Nat. Synth., 2024, 3, 497–506 CrossRef CAS .
L. Peng, L. Shang, T. Zhang and G. I. N. Waterhouse, Adv. Energy Mater., 2020, 10, 2003018 CrossRef CAS .
C. Liu, T. Li, X. Dai, J. Zhao, D. He, G. Li, B. Wang and X. Cui, J. Am. Chem. Soc., 2022, 144, 4913–4924 CrossRef CAS PubMed .
Y. Ying, X. Luo, J. Qiao and H. Huang, Adv. Funct. Mater., 2020, 31, 2007423 CrossRef .
C. Gao, X. Min, M. Fang, T. Tao, X. Zheng, Y. Liu, X. Wu and Z. Huang, Adv. Funct. Mater., 2021, 32, 2108044 CrossRef .
J. Chen, J. B. Luo, M. Y. Hu, J. Zhou, C. Z. Huang and H. Liu, Adv. Funct. Mater., 2022, 33, 2210095 CrossRef .
Y. Liu, X. Tan, J. Liang, H. Han, P. Xiang and W. Yan, Adv. Funct. Mater., 2023, 33, 2214271 CrossRef CAS .
Y. Shen, C. Fu, W. Luo, Z. Liang, Z.-R. Wang and Q. Huang, Green Chem., 2023, 25, 7605–7611 RSC .
J. Moon, W. Beker, M. Siek, J. Kim, H. S. Lee, T. Hyeon and B. A. Grzybowski, Nat. Mater., 2024, 23, 108–115 CrossRef CAS PubMed .
Z. Wang, X. Tan, Z. Ye, S. Chen, G. Li, Q. Wang and S. Yuan, Green Chem., 2024, 26, 9569–9598 RSC .
Y. Sun, H. Liao, J. Wang, B. Chen, S. Sun, S. J. H. Ong, S. Xi, C. Diao, Y. Du, J.-O. Wang, M. B. H. Breese, S. Li, H. Zhang and Z. J. Xu, Nat. Catal., 2020, 3, 554–563 CrossRef CAS .
M. Wang and H. Zhu, ACS Catal., 2021, 11, 3930–3937 CrossRef CAS .
A. Merchant, S. Batzner, S. S. Schoenholz, M. Aykol, G. Cheon and E. D. Cubuk, Nature, 2023, 624, 80–85 CrossRef CAS PubMed .
A. Kotobi, K. Singh, D. Hoche, S. Bari, R. H. Meissner and A. Bande, J. Am. Chem. Soc., 2023, 145, 22584–22598 CrossRef CAS PubMed .
T. Yang, D. Zhou, S. Ye, X. Li, H. Li, Y. Feng, Z. Jiang, L. Yang, K. Ye, Y. Shen, S. Jiang, S. Feng, G. Zhang, Y. Huang, S. Wang and J. Jiang, J. Am. Chem. Soc., 2023, 145, 26817–26823 CrossRef CAS PubMed .
L. Zhang, X. Guo, S. Zhang, T. Frauenheim and S. Huang, Adv. Energy Mater., 2023, 14, 2302754 CrossRef .
C. Cao, S. Zhou, S. Zuo, H. Zhang, B. Chen, J. Huang, X. T. Wu, Q. Xu and Q. L. Zhu, Research, 2023, 6, 0079 CrossRef CAS PubMed .
L. Yuan, S. Zeng, G. Li, Y. Wang, K. Peng, J. Feng, X. Zhang and S. Zhang, Adv. Funct. Mater., 2023, 33, 2306994 CrossRef CAS .
A. Fonari and S. Stauffer, vasp_raman.py, 2013, https://github.com/raman-sc/VASP/.
J. P. Perdew, K. Burke and M. Ernzerhof, Phys. Rev. Lett., 1996, 77, 3865–3868 CrossRef CAS PubMed .
L. Yu, F. Li, J. Huang, B. G. Sumpter, W. E. Mustain and Z. Chen, ACS Catal., 2023, 13, 9616–9628 CrossRef CAS .
A. Das, S. C. Mandal, A. S. Nair and B. Pathak, ACS Phys. Chem. Au, 2021, 2, 125–135 CrossRef PubMed .
H. Zhu, Z. Guo, D. Lan, S. Liu, M. Liu, J. Zhang, X. Luo, J. Yu and T. Wu, J. Energy Chem., 2024, 99, 627–635 CrossRef CAS .
C. Fang, J. Zhou, L. Zhang, W. Wan, Y. Ding and X. Sun, Nat. Commun., 2023, 14, 4449 CrossRef CAS PubMed .
D. Misra, G. Di Liberto and G. Pacchioni, Phys. Chem. Chem. Phys., 2024, 26, 10746–10756 RSC .
S. Grimme, J. Antony, S. Ehrlich and H. Krieg, J. Chem. Phys., 2010, 132, 154104 CrossRef PubMed .
C. Ren, Y. Cui, Q. Li, C. Ling and J. Wang, J. Am. Chem. Soc., 2025, 147, 13610–13617 CrossRef CAS PubMed .
X. Jiang, G. Liu, L. Zhang and Z. Hu, Catalysts, 2025, 15, 309 CrossRef CAS .
J. A. Keith, V. Vassilev-Galindo, B. Cheng, S. Chmiela, M. Gastegger, K.-R. Müller and A. Tkatchenko, Chem. Rev., 2021, 121, 9816–9872 CrossRef CAS PubMed .
Z. Shu, H. Yan, H. Chen and Y. Cai, J. Mater. Chem. A, 2022, 10, 5470–5478 RSC .
M. Zafari, D. Kumar, M. Umer and K. S. Kim, J. Mater. Chem. A, 2020, 8, 5209–5216 RSC .
X. Lin, X. Du, S. Wu, S. Zhen, W. Liu, C. Pei, P. Zhang, Z.-J. Zhao and J. Gong, Nat. Commun., 2024, 15, 8169 CrossRef CAS PubMed .
H. Wang, J. Sun, Y. Li and W. Deng, Sci. Data, 2025, 12, 648 CrossRef CAS PubMed .
F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot and E. Duchesnay, J. Mach. Learn. Res., 2011, 12, 2825–2830 Search PubMed .
W. Ma, X. He, W. Wang, S. Xie, Q. Zhang and Y. Wang, Chem. Soc. Rev., 2021, 50, 12897–12914 RSC .
Q. Zhu, Y. Gu, X. Wang, Y. Gu and J. Ma, JACS Au, 2024, 4, 125–138 CrossRef CAS PubMed .
F. Liao, K. Yin, Y. Ji, W. Zhu, Z. Fan, Y. Li, J. Zhong, M. Shao, Z. Kang and Q. Shao, Nat. Commun., 2023, 14, 1248 CrossRef CAS PubMed .
R. Ding, J. Chen, Y. Chen, J. Liu, Y. Bando and X. Wang, Chem. Soc. Rev., 2024, 53, 11390–11461 RSC .
D. Shi, F. Zhou, W. Mu, C. Ling, T. Mu, G. Yu and R. Li, Phys. Chem. Chem. Phys., 2022, 24, 26029–26036 RSC .
S. M. Lundberg and S.-I. Lee, Adv. Neural Inf. Process. Syst., 2017, 30, 4765–4774 Search PubMed .
W. Qiu, H. Chen, A. B. Dincer, S. Lundberg, M. Kaeberlein and S. I. Lee, Commun. Med., 2022, 2, 125 CrossRef PubMed .
J. Greeley, T. F. Jaramillo, J. Bonde, I. B. Chorkendorff and J. K. Norskov, Nat. Mater., 2006, 5, 909–913 CrossRef CAS PubMed .
H. Ma, E. Ibáñez-Alé, F. You, N. López and B. S. Yeo, J. Am. Chem. Soc., 2024, 146, 30183–30193 CrossRef CAS PubMed .
X. Liu, J. Xiao, H. Peng, X. Hong, K. Chan and J. K. Norskov, Nat. Commun., 2017, 8, 15438 CrossRef CAS PubMed .
G. Na, W. Hwang, H. Shin, S. Park, J. E. Park, J. Lee, Y. Shin, H. Choi, J. Shim, K. Yeom and Y. E. Sung, Adv. Energy Mater., 2024, 14, 2400565 CrossRef CAS .
Z. Jin, M. Yang, Y. Dong, X. Ma, Y. Wang, J. Wu, J. Fan, D. Wang, R. Xi, X. Zhao, T. Xu, J. Zhao, L. Zhang, D. J. Singh, W. Zheng and X. Cui, Nano-Micro Lett., 2023, 16, 4 CrossRef PubMed .
S. Zhang, M. Li, J. Li, Q. Song and X. Liu, Proc. Natl. Acad. Sci. U. S. A., 2023, 120, e2207080119 CrossRef CAS PubMed .
C. Zhang, D. Qin, Y. Zhou, F. Qin, H. Wang, W. Wang, Y. Yang and G. Zeng, Appl. Catal., B, 2022, 303, 120904 CrossRef CAS .
L. Li, Z. Yang, H. Xiong, M. Ma, R. Zhang and Z. Jiang, ACS Appl. Mater. Interfaces, 2025, 17, 15287–15300 CrossRef CAS PubMed .
N. Yang, Y. Zhao, P. Wu, G. Liu, F. Sun, J. Ma, Z. Jiang, Y. Sun and G. Zeng, Appl. Catal., B, 2021, 299, 120682 CrossRef CAS .
P. Zhai, M. Xia, Y. Wu, G. Zhang, J. Gao, B. Zhang, S. Cao, Y. Zhang, Z. Li, Z. Fan, C. Wang, X. Zhang, J. T. Miller, L. Sun and J. Hou, Nat. Commun., 2021, 12, 4587 CrossRef CAS PubMed .
W. Shi, B. Ge, P. Jiang, Q. Wang, L. He and C. Huang, Appl. Catal., B, 2024, 354, 124121 CrossRef CAS .
Y. Jia, L. Zhang, G. Gao, H. Chen, B. Wang, J. Zhou, M. T. Soo, M. Hong, X. Yan, G. Qian, J. Zou, A. Du and X. Yao, Adv. Mater., 2017, 29, 1700017 CrossRef PubMed .
X. Wu, Z. Yang, C. Li, S. Shao, G. Qin and X. Meng, ACS Catal., 2024, 15, 432–446 CrossRef .
A. Kulkarni, S. Siahrostami, A. Patel and J. K. Nørskov, Chem. Rev., 2018, 118, 2302–2312 CrossRef CAS PubMed .
S. Shen, H. Zhang, K. Song, Z. Wang, T. Shang, A. Gao, Q. Zhang, L. Gu and W. Zhong, Angew. Chem., Int. Ed., 2023, 63, e202315340 CrossRef PubMed .

Footnote

† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d5ta03021h

Click here to see how this site uses Cookies. View our privacy policy here.