Xiurong Yang‡
ab,
Ying Wang‡ab,
Linhu Panab,
Ruihui Wangab,
Yi Wangab,
Honglei Xiaab,
Siwei Song*ab and
Qinghua Zhang
*ab
aNational Key Laboratory of Solid Propulsion, Northwestern Polytechnical University, Xi'an, Shanxi 710072, China. E-mail: ssw_sv@nwpu.edu.cn
bSchool of Astronautics, Northwestern Polytechnical University, Xi'an, Shanxi 710065, China. E-mail: qinghuazhang@nwpu.edu.cn
First published on 27th June 2025
Co-crystal engineering has emerged as a promising strategy to tailor the properties of organic materials, yet the mechanistic principles governing co-crystallization remain underexplored. Here, we systematically analyze co-crystal formation using intermolecular interaction as the criterion from a thermodynamic perspective. Building on these energetic insights, a computational framework integrating intermolecular interaction analysis and machine learning methods to enable rational design of co-crystals was established. First, anomaly detection via local outlier factor (LOF) algorithms preliminarily screened out abnormally designed co-crystals based on energy decomposition analysis of existing co-crystals. Next, a comparative energy benchmark was established, requiring that the intermolecular interaction between the two designed molecules is more stable than that in their individual component, ensuring thermodynamic feasibility. Furthermore, a graph neural network (GNN) model was implemented to rapidly predict intermolecular interaction profiles, enabling efficient co-crystal design. This machine learning-enhanced workflow accelerated the identification of two novel urea-based co-crystals, which were experimentally synthesized and characterized. Our findings not only quantify the energy steering supramolecular assembly but also break through the traditional method of co-crystal classification, providing new perspectives for the mechanism and prediction of co-crystal formation.
The scarcity of experimentally confirmed negative samples stems from publication bias favoring successful cases in crystallization studies. This imbalance of the positive and negative samples is evident in early methodologies. Vriza et al. exclusively trained models using positive π–π co-crystal data,13 while Soroush Ahmadi's dataset exhibited a severe 5394:
994 positive-to-negative ratio.16 To address this limitation, computational strategies like network-based link prediction17 and molecular similarity metrics14 have been employed to generate synthetic negative samples. However, these rule-based approaches lack experimental validation, potentially compromising model reliability in real-world applications. Furthermore, it is challenging to determine whether these negative samples (failed co-crystals) result from experimental technical limitations or an inherent inability of the components. This ambiguity underscores the critical need for establishing a mechanism-based criterion to evaluate the co-crystallization potential of selected coformers.
Over the past few decades, the development of supramolecular chemistry has substantially advanced our understanding of the co-crystallization mechanism, defining co-crystals as multicomponent systems stabilized by non-covalent interactions.18,19 Particularly, the supramolecular synthon concept, derived from statistical analyses of molecular interactions and involved functional groups, provides a theoretical foundation for coformer pairing.20 The supramolecular assembly of co-crystals is predominantly governed by directional non-covalent interactions, including hydrogen bonding, van der Waals forces, π–π stacking, and electrostatic forces. For instance, molecular structures featuring complementary hydrogen bond donors/acceptors (e.g., –COOH and –NH2) and π-conjugated systems (e.g., benzene rings) exhibit enhanced co-crystallization propensity, as validated across pharmaceutical and energetic material systems.21–25 Advanced computational techniques, including symmetry adapted perturbation theory (SAPT),26–28 Hirshfeld surface analysis and electrostatic potential mapping, have been used to elucidate the critical role of these interactions in co-crystalline systems.29–32 These methods have also been successfully applied to guide the design of co-crystals and unveil their formation mechanism.16,33 Furthermore, the analysis of the attention mechanism and feature importance in classification models also highlights the significance of hydrogen bond acceptors and donors. These insights into interactions shed light on a theoretical inspiration for the rational selection of coformers from a thermodynamic viewpoint.
Given the great potential of intermolecular interaction for predicting co-crystal formation, this work intends to establish an energy-based evaluation criterion for co-crystallization processes in energetic materials. To comprehensively analyze intermolecular interactions in co-crystals, we adopted a molecular cluster approach by resolving the crystal structure into multiple dimers, following the methodology established in our previous work.34–36 Then, the SAPT energy decomposition analysis was employed to quantify weak interactions between components of dimers, including electrostatic and dispersion components.37–39 By conducting statistical analysis of the distribution of energy values, we investigated the correlation between intermolecular interactions and co-crystal architectures, identifying dominant energy contributions during crystallization. Thereby, we proposed a thermodynamic criterion for evaluating the formation of designed co-crystals through a multi-step computational framework. First, Local Outlier Factor (LOF) algorithms were employed for outlier detection to identify abnormal co-crystal candidates. Then, an energetic benchmark was developed by comparing the intermolecular interaction between designed molecular pairs and their individual crystals, validating thermodynamic feasibility of co-crystallization. Finally, a graph neural network (GNN)-based framework was implemented to rapidly predict intermolecular interactions between co-crystal components (Fig. 1). With the help of the framework, two novel urea-based co-crystals were experimentally synthesized and characterized. This ML energy-based methodology avoids the inherent limitations of sample imbalance in traditional classification models, offering a new strategy for the design and screening of energetic co-crystals.
1. Only containing two chemically distinct polyatomic units.
2. Having three-dimensional structures and excluding structures exhibiting positional disorder.
3. The density range of samples is 1.3–2.2 g cm−3.
4. Only containing C, H, O, N, F, Cl, Br, and I elements.
5. Being neutral components to exclude salts.
6. Some unrecognized structures were deleted.
7. Excluding crystals with nitrogen mass fractions below 20%.
8. Discarded systems containing >30 heavy atoms per molecule.
Since our team mainly focuses on the application of co-crystal technology in the field of energetic materials, the screening conditions we set are mainly to cover energetic co-crystals as much as possible, and most drug-like co-crystals are excluded. Consequently, 2100 crystal structures were retrieved using the CSD Python API (Data S1†). This dataset includes 1034 solvates due to their analogous formation mechanisms with co-crystals. The solvates containing H2O were classified as hydrates, while those incorporating organic solvents such as methanol or acetone were categorized as non-aqueous solvates. There are 687 hydrates and 347 non-aqueous solvates.
It is also observed that in the structure with the strongest interaction, parts of the structures are the same molecules rotating and displacing. With analysis of the crystal cell of a co-crystal, this displacement structure was produced mainly due to crystal cell expansion, and the rotation structure was produced due to the asymmetric units in the unit cell, as illustrated in Fig. S1.† This demonstrates that interaction between the same components is critical for crystallization. As a result, we studies whether the most stable structures (with more powerful intermolecular interactions) were composed of similar or different molecules, and the distribution of energy is shown in Fig. S2.† Specifically, these interacting molecules with strong intermolecular interactions play a key role in the nucleation period of crystal growth.40,41
The extracted dimer structures and their corresponding intermolecular interactions were systematically analyzed to find regular patterns and correlations. As illustrated in Fig. S3,† the dominant intermolecular interactions are hydrogen bonding and π–π stacking in most structures. To further elucidate the relationship between energy contributions and structures, we examined the energy partitioning (Edis, Eele, and Eind) for the most stable dimer in each co-crystal, as shown in Fig. 3. Edis% and Eele% represent the value of Edis/(Edis + Eele + Eind) and Eele/(Edis + Eele + Eind), respectively. The analysis reveals that dispersion Edis (Edis% > 50%) dominates in 277 out of 938 co-crystals, while electrostatic interactions (Eele% > 50%) play a more significant role in 606 cases. This trend extends to solvates and hydrates; the dimer with (Edis% > 50%) and (Eele% > 50%) has a total ratio of (264:
306) and (502
:
563), respectively. Notably, dimers with Edis contributions between 50% and 90% universally exhibit parallel molecular packing, whereas those with Eele contributions in the 50–80% range consistently feature hydrogen bonding. These findings are further supported by the four strongest interaction configurations shown in Fig. S4,† where the maximum Edis and Eele values correspond to π–π stacking and hydrogen bonding structure, respectively. Even in systems where neither Edis nor Eele exceeds 50% as shown in Fig. S5,† the majority of structures retain both hydrogen bonding and stacking interactions. This comprehensive energy-structure correlation analysis demonstrates that the structural features of co-crystals can be effectively characterized through their intermolecular interaction profiles.
To gain deeper insights into the initial stages of co-crystal formation, we investigated the most stable configuration of molecular pairs (A and B), which likely represents the first step in co-crystal nucleation. Using the Molclus program,42 we generated the energetically most favorable dimer configurations, denoted as AB*, with a detailed methodology provided in ESI Methods.† Notably, as shown in Fig. S6,† some AB* configurations optimized by Molclus differ from their counterparts AB in co-crystals, suggesting that structural rearrangements occur during co-crystallization due to factors such as lattice constraints, multi-body interactions, and steric effects. This discrepancy is particularly pronounced for CL-20, where the cage-like structure is highly sensitive to surrounding molecular environments, leading to significant deviations between the optimized AB* and AB in co-crystals.
The intermolecular interaction EAB* of these optimized dimers was compared with EAA and EBB extracted from single-component crystals, as summarized in Table S1.† As shown in Fig. 4C, intermolecular interaction EAB* of the optimized dimers is generally comparable to or stronger than the corresponding single-component energies (EAA and EBB). This trend is further supported by the quantity 2EAB* − EAA − EBB, which demonstrates the thermodynamic stability of EAB* in most systems. However, certain CL-20 and nitrofurantoin co-crystals exhibit lower-than-expected EAB*, which may arise from the neglect of experimental conditions, incomplete sampling of the most stable AB* configurations and unexpected binding sites between A and B molecule. For instance, molecule A may combine with molecule B at sites exhibiting weaker EBB interactions. Furthermore, the elevated temperatures and polymorphic transformations may disrupt the native structures of A and B crystals, thereby promoting the formation of AB*.43 Additionally, the failed co-crystal systems reported by Jiang et al.12 were analyzed to validate the reliability of our co-crystal formation criterion; more details are shown in ESI Methods.† Fig. 4D demonstrates that in certain failed co-crystals, the intermolecular interactions of single-component (EAA and EBB) significantly exceed EAB*. These results further validate the feasibility and rationality of our proposed co-crystal formation criterion from a complementary perspective. In general, both EAB or EAB* are expected to significantly exceed the intermolecular interaction of the single-component EAA and EBB. At a minimum, EAB should be greater than either EAA or EBB, with a maximum energy gap not exceeding 50 kJ mol−1, while the EAB* should ideally surpass the intermolecular interaction of all single components. Based on these principles, we established a guideline for co-crystal design. However, since the calculation via PSI4 is computationally intensive and EAB for newly designed components is unknown, we propose leveraging ML models to rapidly predict these intermolecular interactions, thereby accelerating the co-crystal screening and design process.
The second model is designed to predict intermolecular interaction for dimers extracted from co-crystals. A large dataset comprising 18204 calculated energy values was split into training and test sets in an 8
:
2 ratio. To distinguish between dimers with identical components but differing intermolecular interactions, descriptors encoding relative positional information were incorporated as the global state descriptor u and concatenated with molecular graph features to form comprehensive descriptors. To evaluate the impact of these features, we conducted four comparative experiments with different feature combinations. The first model which only uses the molecular graph as features is labeled as Graph. The second model uses a feature combination of the molecular graph and smooth overlap of atomic positions (SOAP) descriptors, marked as Graph + SOAP, while the third model combines the molecular graph and intermolecular distance descriptors, labeled as Graph + DIST. At last, a combination of all descriptors was marked as Graph + SOAP + DIST. In addition, neural network models without graph features are also discussed. This systematic approach allows us to assess the contribution of each feature type to the predictive accuracy of the model.
Table 1 summarizes the performance of the models in predicting intermolecular interaction using different feature combinations. The Graph model, which relies solely on molecular graph features, exhibits poor predictive capability, achieving an R2 of 0.13 and an MAE of 12.22 kJ mol−1 on the test set. By augmenting the graph features with SOAP descriptors, the model's performance significantly improves, yielding an R2 of 0.88 and an MAE of 2.68 kJ mol−1. Similarly, incorporating DIST descriptors further enhances the model, resulting in an R2 of 0.97 and an MAE of 1.18 kJ mol−1. The most accurate Graph + SOAP + DIST model combines all feature types and achieves exceptional performance, and the R2 and MAE on the test set are 0.97 and 1.17 kJ mol−1, respectively. This enhancement is particularly critical given that most structures share similar node and edge features, making it essential to capture subtle differences in molecular configurations through comprehensive descriptors. This also shows that the relative position information of the components in the dimer configuration is very important for accurately predicting the intermolecular interactions. The neural network model trained without graph features achieved an R2 of 0.97 and an MAE of 1.26 kJ mol−1 on the test set.
Descriptors | Train | Test | ||
---|---|---|---|---|
R2 | MAE (kJ mol−1) | R2 | MAE (kJ mol−1) | |
Graph | 0.18 | 12.14 | 0.13 | 12.22 |
Graph + SOAP | 0.97 | 2.02 | 0.88 | 2.68 |
Graph + DIST | 0.99 | 1.03 | 0.97 | 1.18 |
SOAP + DIST | 0.99 | 1.23 | 0.97 | 1.26 |
Graph + DIST + SOAP | 0.99 | 1.06 | 0.97 | 1.17 |
As shown in Fig. 5A, the performance of the Graph + SOAP + DIST model in predicting individual energy components (Edis, Eele, Eind, Eexc and Etot) was evaluated using the kernel density estimation (KDE) to analyze the data distribution. The R2 between predicted and true values was calculated for each energy component. The dispersion energy Edis model demonstrates the highest accuracy, achieving an R2 value of 0.99 on the test set. The Eexc, Eele and Eind models also perform well, with R2 values of 0.97, 0.97 and 0.96, respectively, and larger deviations are observed for few points. Overall, the predicted values closely match the true values in high-density regions of the data distribution, demonstrating the high accuracy and reliability of the model. Furthermore, compared to other models in the literature,44,45 our approach, which is based on true structural descriptors, exhibits superior accuracy and computational efficiency.
![]() | ||
Fig. 5 Model performance evaluation. (A) Density distribution of true and predicted values on the test set. (B) Error distribution of predicted values on the test set. |
To enhance the capability and interpretability of the GNN, we incorporated an attention mechanism, which identifies and prioritizes critical structural features from complex molecular data. In graph-level tasks, global attention scores are computed based on node features, enabling the optimization of the feature space and providing insights into the model's decision-making process. For the dimers in co-crystal ACETAF01, as illustrated in Fig. 6A, the deeper color intensities on the benzene ring, hydroxyl group, and amino group highlight the focus of the model on regions involved in π–π conjugation and hydrogen bonding. In contrast, Fig. 6B shows dimers lacking significant hydrogen bonds, where the dark green regions emphasize the importance of π–π stacking interactions. These results demonstrate that the attention mechanism effectively captures key structural motifs governing intermolecular interactions, thereby significantly enhancing the model's predictive accuracy and interpretability.
To validate the reliability and applicability of the model, we systematically adjusted the intermolecular distance in the co-crystal ACETAF01 and predicted the corresponding intermolecular interaction. The distance between molecules was incrementally stretched by 0.05 Å along the vector of the shortest intermolecular contact, generating 100 configurations over a range of 0–5 Å. As shown in Fig. 6C, the predicted intermolecular interactions exhibit a gradual decrease as the distance increases by 0.25 Å. However, within the range of 0.1–0.25 Å, the energy remains nearly constant, indicating that the model is insensitive to small variations in intermolecular distance. As a result, the model shows significant deviations when predicting intermolecular interactions for structures optimized by Molclus, as evidenced by the comparison in Table S2.† This limitation suggests that the model is best suited for predicting intermolecular interactions within co-crystal environments, while the PSI4 module remains the preferred method for calculating energies of Molclus-optimized dimers.
Further analysis of weak intermolecular interactions in the synthesized co-crystals is presented in Fig. 7c–e. Molecular electrostatic potential (MESP) mapping of components within the crystalline lattice reveals distinct electrostatic complementarity,48,49 as shown in Fig. 7c, and significant van der Waals surface penetration is observed in hydrogen bond formation regions, where red (electron-rich) and blue (electron-deficient) areas interpenetrate, reflecting the electrostatic attraction nature of hydrogen bonding. Hirshfeld surface analysis further quantifies these interactions, demonstrating that hydrogen bonds constitute 48.11% and 62.08% of total surface contacts in the NTO/urea and DFP/urea co-crystals, respectively (Fig. 7d and S8†). This underscores the dominant role of hydrogen bonding in structural stabilization. Energy decomposition analysis via PSI4 calculations corroborates these findings, as shown in Fig. 7e. Electrostatic contributions (Eele) account for a high proportion of the total intermolecular interaction in most structures, confirming the prevalence of hydrogen-bonding. The predictive capability of our GNN model is validated through comparative analysis of the strongest intermolecular interaction shown in Fig. 7e and predicted EABML in Table S2.† The model achieves prediction errors of 8.37 kJ mol−1 and 3.79 kJ mol−1 for NTO/urea and DFP/urea, respectively; these deviations fall within acceptable thresholds for screening applications, highlighting the utility of the model in guiding experimental synthesis.
For energy decomposition analysis, we employed the open-source quantum chemistry package PSI4 to perform SAPT calculations.50 This method partitions the intermolecular interaction Etot into four distinct components: dispersion Edis, electrostatics Eele, induction Eind, and exchange repulsion Eexc values, as shown in eqn (1), enabling quantitative analysis of individual intermolecular interaction types. The SAPT0/jun-cc-pVDZ basis set was selected for calculation.
Etot = Edis + Eele + Eind + Eexc | (1) |
The fundamental principle of this methodology lies in comparative density analysis.51 The algorithm posits that non-outlier data points exhibit local density characteristics consistent with their neighboring observations, whereas outliers demonstrate significant density deviations within their local neighborhoods. This intrinsic mechanism renders the LOF particularly suitable for detecting subtle anomalies in complex energy datasets, as it accounts for both global distribution patterns and localized structural variations.
For a given sample point P, the distances to all other points were calculated and sorted in ascending order, where the distance to the k-th nearest point is denoted as dk(P); the points within the region centered at P with a radius of dk(P) are denoted as Nk(P). When determining the accessible distance between P and another sample point O, the maximum value between dk(O) and d(P,O) is selected, as expressed in eqn (2). The k-local accessible density of P is expressed as eqn (3):
reach_distk(P,O) = max{dk(O),d(P,O)} | (2) |
![]() | (3) |
The numerator represents the sum of the k-accessible distances from point P to all points in Nk(P).
The LOF is defined as the ratio of the average local accessible density of all points in Nk(P) to the local accessible density of P, as expressed in eqn (4):
![]() | (4) |
A ratio significantly greater than 1 indicates that P has a lower density compared to its neighbors, suggesting that it is an outlier. Conversely, a ratio close to or less than 1 implies that P has a higher density, classifying it as a normal point. Based on this principle, the LOF algorithm from the scikit-learn library was employed to calculate outlier scores for unknown components combining co-crystal datasets, enabling the evaluation of their co-crystal formation potential. The parameters n_neighbors and contamination were assigned values of 40 and 0.01, respectively.
The GNN model is specifically designed to deal with non-Euclidean domains and represented as a graph with a complex relationship and interdependency between objects.53 The graph of a molecule can be expressed as a combination of nodes and edges, in which nodes are represented as atoms and edges are represented as bonds. In molecular modeling, a co-crystal system can be represented as eqn (5):
G = (V, E, u) | (5) |
G = {(V(V1, V2), E(E1, E2), u)} | (6) |
The GNN architecture employs iterative message-passing operations to propagate and update atomic information. As shown in Fig. 1f, this process updates and aggregates the nodes along with their neighbor information via an adjacency matrix. By repeating this procedure, the model progressively captures information from more distant neighbors, corresponding to the feature extraction process of each convolutional layer.
The message-passing phase is followed by the readout phase, which is designed to generate a comprehensive graph-level representation by aggregating the hidden embeddings of subgraphs. To enhance the interpretability and performance of the model, we incorporate an attention mechanism during the readout phase. This mechanism optimizes the hidden information by assigning attention coefficients to each atom, which are computed based on their node features.
Following feature vector extraction in the pooling layer, the global state descriptor u is concatenated into these features, as shown in Fig. 1g. A four-layer fully connected (FC) neural network is constructed to establish the mapping between structural features and intermolecular interactions. Rectified Linear Unit (ReLU) activation functions are implemented between FC layers to enhance model expressivity while mitigating gradient vanishing issues. To further optimize predictive performance, we adopted a multitask learning paradigm to predict five energies. Hyperparameter optimization was systematically conducted through Bayesian optimization, with the parameter space and final configurations documented in Table S5.†
The mean square error (MSE) was chosen as the loss function for backpropagation. The MAE and R2 are used to evaluate the prediction performance of the GNN model. The MSE, MAE and R2 values are defined using eqn (7)–(9):
![]() | (7) |
![]() | (8) |
![]() | (9) |
Single crystals of urea co-crystals were obtained via slow evaporation in acetonitrile at room temperature (25 ± 2 °C). First, 10 mmol of DFP or NTO and 10 mmol of urea (1:
1 molar ratio) were dissolved in 10 mL of acetonitrile in a glass vial. After complete dissolution, the solution was filtered through a 0.22 μm membrane filter into a designated container to remove any undissolved particles. The filtrate was then left to evaporate slowly at room temperature. After approximately 1–2 days, single crystals of urea co-crystals were obtained. The multiscan absorption, Lorentz and polarization correction were carried out using the SADABS. The structures were solved by direct methods using SHELXS-2013 and OLEX2 software,57 and the refinement was further carried out by the full-matrix least-squares technique. The H atoms were refined isotropically, and the non-H atoms were refined anisotropically.
Single crystal X-ray diffraction (SCXRD) was used to determine the crystal structures for DFP/urea and NTO/urea, collected using an XtaLAB diffractometer (Rigaku Co.) with Mo-Kα (λ = 0.71073 Å) radiation at 170 K or 100 K, as shown in Table S6.†
Based on these insights, we established the criterion for evaluating co-crystal formation. At first, the intermolecular interaction EAB for designed coformers must reside within the dense energy region of known co-crystals, as indicated by a LOF score below 1.3. Then, from the perspective of thermodynamics, the EAB is expected to exceed the intermolecular interaction (EAA or EBB) in single-component crystals, with the energy difference between them not more than 50 kJ mol−1. And it is best for the intermolecular interaction EAB* from the most stable structure of two molecules to exceed the energy EAA and EBB for single components.
To accelerate the co-crystal screening and design process, two distinct graph neural network (GNN) frameworks were established for different intermolecular interaction predictions. The first model, which relies solely on molecular graph representations without global state descriptors u, achieves an MAE of 5.80 kJ mol−1 and is specifically designed to predict the strongest intermolecular interaction (EAB) in designed co-crystals. The second model for predicting intermolecular interactions for dimers extracted from co-crystals can reach a high accuracy with an MAE value of 1.69 kJ mol−1, which could be used for extensive co-crystal intermolecular interaction exploration. Guided by this framework, two urea co-crystals (NTO/urea and DFP/urea) were synthesized successfully. However, limitations such as the under-sampling of dimers extracted from co-crystals and AB* configurations from Molclus, discrepancies between computational and real-world conditions, and the fact that the energy-based criterion primarily applies to the initial stages of co-crystal formation may lead to the ineffective identification of potential co-crystals. Despite these challenges, this work provides a new strategy for predicting co-crystal formation. While concentrated on energetic co-crystals, the framework grounded in fundamental physicochemical principles demonstrates significant potential for broader application in pharmaceutical co-crystal discovery.
Footnotes |
† Electronic supplementary information (ESI) available. CCDC 2434054 and 2434055. For ESI and crystallographic data in CIF or other electronic format see DOI: https://doi.org/10.1039/d5ta02975a |
‡ These authors contributed equally to this work and should be considered co-first authors. |
This journal is © The Royal Society of Chemistry 2025 |