A generative diffusion model enables multi-objective on-demand inverse design of piezoelectric metamaterials

Chun-Yu Lei; Jian Wang; Run-Lin Liu; Meng-Jun Zhou; Zhong-Hui Shen

doi:10.1039/D5NR01669J

View PDF Version

DOI: 10.1039/D5NR01669J (Paper) Nanoscale, 2025, Advance Article

A generative diffusion model enables multi-objective on-demand inverse design of piezoelectric metamaterials†

Chun-Yu Lei^ab, Jian Wang^a, Run-Lin Liu^ab, Meng-Jun Zhou^a and Zhong-Hui Shen*^ab
^aState Key Laboratory of Advanced Technology for Materials Synthesis and Processing, Center of Smart Materials and Devices, Wuhan University of Technology, Wuhan 430070, China. E-mail: zhshen@whut.edu.cn
^bSchool of Materials and Microelectronics, Wuhan University of Technology, Wuhan 430070, China

Received 24th April 2025 , Accepted 3rd July 2025

First published on 22nd July 2025

Abstract

Piezoelectric metamaterials have attracted increasing interest in areas of mechanoelectric conversion, such as robotics and medical treatment, due to their powerful performance programmability. However, how to design the metamaterial structure to achieve on-demand regulation among mutually exclusive metrics such as electrical, mechanical, and acoustic properties remains a major challenge. Here, we present a multi-objective design strategy based on latent diffusion models to achieve inverse design of piezoelectric metamaterials under different scenario requirements. This method effectively decouples the interdependencies of four different target parameters, enabling the generation of piezoelectric metamaterials that overcome the limitations of existing datasets and significantly enhance the overall piezoelectric response. By simply inputting the desired electrical, mechanical, and acoustic performance criteria, our method is able to output the ideal metamaterial structures whose properties deviate from the input targets by only 1.06% (mean absolute percentage error, MAPE). This study introduces a versatile framework for the multi-objective, on-demand inverse design of metamaterials, which not only shortens the material development cycle but also opens up new perspectives for the on-demand design of diverse functional materials.

1. Introduction

Piezoelectric materials are multifunctional electronic materials with the capacity to convert mechanical energy into electrical energy,^1,2 which have broad applications in wireless communications,³ ultrasonic sensing,^4,5 wearable sensing devices,^6,7 and robotics.^8,9 In addition to the piezoelectric coefficient, d₃₃, other parameters must be specifically tailored to meet the multiple performance requirements of different applications. For example, acoustic applications such as hydroacoustic transducers and medical ultrasonic probes require acoustic impedance (

, where ρ is the relative density, ρ₀ is the intrinsic density and E is Young's modulus) that matches with that of water and biological tissues,¹⁰ respectively. Mechanically, different sensing components within a robot demand tailored mechanical responses, while wearable devices must adapt to the body's ergonomic characteristics.¹¹ In sensing applications, a high piezoelectric voltage coefficient (

, where d₃₃ is the piezoelectric coefficient, ε^T₃₃ is the relative permittivity and ε₀ is the vacuum permittivity) is highly desirable for generating higher output voltage, while a high figure of merit

is critical for maximizing power output.¹² As illustrated in Fig. 1a, the above performance metrics are basically governed by four fundamental parameters: the piezoelectric coefficient d₃₃, relative permittivity ε^T₃₃, Young's modulus E and relative density ρ. However, these four parameters exhibit a robust interdependence, whereby any one property exerts a constraining influence on the others.¹³ As a result, traditional dense piezoelectric ceramics are unable to achieve on-demand tuning of multiple performance metrics by balancing the piezoelectric response and mechanical compatibility.


	Fig. 1 Overview of a multi-objective inverse design strategy for piezoelectric metamaterials. (a) Schematic diagram of different multi-application requirements and the corresponding multi-objective parameter demands in piezoelectric metamaterials. (b) Comparison of the designability and multi-performance tunability among bulk ceramic materials, conventionally fabricated porous materials, and 3D printed metamaterials. (c) Flowchart of the multi-objective inverse design process for piezoelectric metamaterials consisting of four steps: data generation, generator, filter and validator.

To meet the multi-performance demands of various application scenarios, introducing pores into piezoelectric materials has proved effective to break the strong coupling relationship between these four parameters.^13–15 Although pores simultaneously reduce both d₃₃ and ε^T₃₃, d₃₃ decreases more slowly, resulting in a higher g₃₃ and FOM₃₃. Moreover, pores lower the Z, expanding the application in the field of ultrasound with a lower ρ and enhancing the tunability of the piezoelectric material's mechanical properties.¹⁰ Therefore, the design of porous structures represents a pivotal aspect in the tailoring of the piezoelectric properties of porous ceramics. However, some common methods, such as the freeze-drying method,¹⁶ have inherent limitations in producing pores with diverse shapes, which in turn restricts structural design flexibility. The advances in additive manufacturing have enabled scientists to fabricate intricate and meticulously manipulable porous structures,^8,17,18 commonly referred to as metamaterials. Metamaterials, often realized through the self-assembly of small-scale blocks, exhibit properties unattainable in natural materials.¹⁹ As illustrated in Fig. 1b, piezoelectric metamaterials exhibit enhanced multi-performance tunability and expanded design flexibility in comparison with bulk ceramics and conventional porous ceramics.¹⁷ To date, researchers have achieved novel piezoelectric properties, including fully non-zero, negative,¹⁸ and twist piezoelectric coefficients,⁸ which show the potential to revolutionise robotics and intelligent sensing applications. However, the vast multidimensional structure–property space of piezoelectric metamaterials renders it nearly impossible for human intuition to precisely guide experiments that meet the demands of diverse scenarios. Furthermore, the structure–performance relationships required for multi-scenario and multi-objective designs are more complex than ever before.^20,21 Tailoring metamaterial architectures to achieve specific multi-objective behaviors across various application scenarios remains a significant challenge.^22,23

The ongoing developments in artificial intelligence (AI) and machine learning (ML)²⁴ are making the on-demand design of piezoelectric metamaterials increasingly attainable.^25–27 Nevertheless, the prevailing machine learning methods mainly achieve optimization of design indirectly by optimizing key parameters for the generation of metamaterial structures.^27–29 These methods often rely on specific configurations of training data and may encounter difficulties when applied to entirely new scenarios. In recent years, there has been growing interest in some generative models such as variational autoencoders (VAEs) and generative adversarial networks (GANs).²⁷ However, these models are largely limited to single-target designs³⁰ and may exhibit issues such as mode collapse and limited diversity in generated outcomes.³¹ The primary challenges in multi-objective design lie in the complex coupling and trade-offs among multiple properties, particularly when simultaneously optimizing the electrical, mechanical, and acoustic performance.¹³ Enhancing one property may significantly impact others, resulting in a high-dimensional, nonlinear optimization problem with potentially conflicting objectives.²⁰ The traditional ML models struggle to efficiently navigate and represent such nonlinear, multi-dimensional trade-offs. To date, there has been no satisfactory approach for addressing the on-demand design of piezoelectric metamaterials with coupled electrical, mechanical, and acoustic multi-objective properties.

The advent of text-to-image large models, such as latent diffusion models,³² has introduced a good approach towards multi-objective on-demand design. In this model, designers only need to input text descriptions containing multiple objective conditions (such as geometry, size, color, surface texture, etc.), and the model can generate images that meet these specifications.³² Compared with traditional generative methods such as VAEs and GANs, as well as optimization-based approaches like Genetic Algorithms (GAs), diffusion models offer superior generation fidelity, higher training stability, better controllability, and significantly improved efficiency, especially in high-dimensional and multi-modal design spaces. Furthermore, when conflicts or incompatibilities arise between multiple design objectives, the model is capable of generating innovative solutions, such as images depicting individuals under extreme conditions (e.g., advanced age incompatible with high athletic ability). This is enabled by the stochastic sampling mechanism and the rich latent space exploration capability of diffusion models, which allow diverse and flexible outputs while satisfying constraints. This emergent ability³³ (defined as novel features and behaviors that manifest as the model's scale increases) opens up new possibilities for solving complex coupling and trade-off problems between multiple features.

In this work, we propose a multi-objective inverse design method, designated as GFV (including a generator, filter, and validator), which is based on latent diffusion models (LDMs), convolutional neural networks (CNNs) and finite element simulations (FEMs). It could rapidly tailor metamaterial structures to achieve multi-objective on-demand designs across different application scenarios, as illustrated in Fig. 1c. This approach does not necessitate the input of expert knowledge, as designers are only required to input the desired performance indicators (d₃₃, ε^T₃₃, E, and ρ) to generate the corresponding metamaterial structure. Furthermore, this strategy is also applicable to other metamaterials with on-demand multi-objective design requirements, including those in mechanical, optical, and electromagnetic domains.

2. Results

As shown in Fig. 1c, we first generated a dataset of 17 [thin space (1/6-em)]

000 structure–performance pairs (ESI S1 and Fig. S1†) using high-throughput finite element simulations (see Methods for details) for GFV training, taking the piezoelectric ceramic lead zirconate titanate (PZT) as the scaffold material for the metamaterial as an example. Note that this model can be applied not only to ceramics like PZT but also to metamaterial design where polymers and their composites serve as the backbone. The GFV process consists of three main modules: the generator, the filter, and the validator. The generator employs a latent diffusion model to generate initial piezoelectric metamaterial structures from specified inputs (d₃₃, ε^T₃₃, E, ρ). The filter, powered by a convolutional neural network, evaluates the performance of the generated structures and discards those that significantly deviate from the target requirements. Finally, the validator verifies the filtered structures with the most probable desired piezoelectric properties using finite element models. A detailed description of each module follows below.

2.1 Generation of the structure database of piezoelectric metamaterials

As the latent diffusion model relies on a data-driven approach, a comprehensive dataset of metamaterial structures and their corresponding properties is essential. The design space for metamaterials is nearly infinite, encompassing a wide range of options,³⁴ including trusses, 2D shell structures, and 3D topological surfaces.^35,36 This study focuses on truss structures due to their superior mechanical flexibility and enhanced piezoelectric response under dynamic loading.³⁷ This flexibility makes truss-based designs particularly suitable for advanced applications such as precision actuators and robotics, compared to other structural types, and has been demonstrated to enable unprecedented performance.⁸

To achieve a rich and diverse performance space, we employed a virtual growth program³⁸ to generate piezoelectric metamaterial structures. The virtual growth program consists of four steps: (i) constructing the underlying network topology, (ii) designing the geometry of the building blocks that can be placed in each grid, (iii) defining the adjacency rules between the building blocks, and (iv) specifying the probability parameters for the building blocks. First, a 20 × 20 underlying network topology was employed. As shown in Fig. 2a, we selected several building block geometries, including the “L”-shaped building block, “−”-shaped building block, “T”-shaped building block, and “+”-shaped building block, along with their various rotations. We also considered the building blocks with multiple angles (53°, 60°, 75°, 90°, 105°, and 120°) to create a rich variety of structure types. Fig. 2b defines the adjacency rules, permitting only node-to-node connections to prevent unconnected paths. Fig. 2c illustrates how structure growth can be regulated using the input building block probability parameters. Based on the Wave Function Collapse algorithm, at each step, the node with the lowest entropy among all unconnected nodes is selected for growth. The formula for node entropy is as follows:


	(1)

where n_i refers to the type of building block that can be connected to the node, and P_j denotes the normalized probability of the building block j∈n_i to be chosen. The type of building block is then randomly chosen based on the provided probability parameters, with blocks having higher probability values being more likely to be selected.


	Fig. 2 Generation of metamaterial structures and analysis of their structure–property relationship. (a) The database of basic building blocks. (b) An example of adjacency rules. (c) The growth process adheres to the minimum entropy principle, where each node is assigned a building block according to the input probability parameters. (d) Relationship between ρ, E, ε^T₃₃, d₃₃ and g₃₃ of different metamaterials. (e) Different metamaterials with different angles from 53° to 120° and the corresponding shortest path schematics with numerical values.

To systematically investigate these effects and provide theoretical support for subsequent inverse design, we examine the effect of rod angle orientation by dividing the building blocks in Fig. 2a into six distinct groups based on their angles. Each group includes the same “−”-shaped building block, “+”-shaped building block (without angle variation), the “L”-shaped building block and the “T”-shaped building block at angles of 53°, 60°, 75°, 90°, 105°, and 120°. To minimize the influence of inconsistencies in building block content on the properties of the piezoelectric metamaterials, 20 structures were generated for each angular group with the same probabilistic parameters, ensuring that the overall connectivity topology and building block content were consistent. The properties of these structures were then calculated using the two-step finite element method (see Methods).

Fig. 2d illustrates the relationship between ρ, E, ε^T₃₃, d₃₃, g₃₃ and angle for six groups of building blocks with similar contents (“+”-shaped 40%, “T”-shaped 40%, “L”-shaped 10%, “−”-shaped 10%). The density of the metamaterials decreases as the angle of the basic building blocks increases, although the overall decrease is modest, with the difference between the highest and lowest values being only 0.06. This difference primarily arises from variations in the ρ of the basic building blocks, which decreases as the angle increases. In application scenarios where lower density metamaterials are preferred, obtuse-angle building blocks yield better results. In contrast to ρ, E value of the metamaterials does not follow a simple decreasing trend with increasing building block angles. It first increases and then decreases, with a peak at 90°. This behavior is attributed to the fact that at 90°, the building block tends to distribute forces more uniformly along its vertical orientation, leading to a relatively small deformation. Similarly, ε^T₃₃, d₃₃, and g₃₃ also exhibit a trend of first increasing and then decreasing with an increase in angles, reaching a peak at 90°. ε^T₃₃ increases by up to 26%, while d₃₃ can increase by as much as 57%, leading to a similar trend in g₃₃. Therefore, despite the analogous variation trends exhibited by the three parameters namely ε^T₃₃, d₃₃, and g₃₃, their disparate amplitude of variation ranges afford a substantial design space for optimizing different performance parameters.

To elucidate the relationship between structure and piezoelectric properties, we introduce a structural descriptor to describe the electric-force transmission path, namely the average shortest path (see Fig. 2e). The shortest paths from all top endpoints of the metamaterials to the bottom were computed using the A* algorithm³⁹ (ESI S6†) with averaged length. This descriptor reflects the main stress transfer path within the metamaterial. It is our intention to ascertain the correlation law between the metamaterial structure and properties from the stress and electric field distribution.⁴⁰ As shown in Fig. S2,† the stress field distribution is concentrated on the shortest path of the metamaterial. It can be demonstrated that the shorter the shortest path, the higher the electric field distribution (as well as polarization distribution) along the shortest path, as proven by Gauss's theorem. The concentration of a high polarization distribution in the stress transfer path results in the piezoelectric material exhibiting a high piezoelectric response. Fig. 2e illustrates the relationship between the average shortest path and the angle. When the angle is 53°, the average shortest path reaches its maximum at 1.41. At 90°, the shortest path is minimized to 1.16. As the angle approaches 90°, the average shortest path decreases, leading to a 23% increase in g₃₃. The shorter path facilitates smoother transmission of stress and electric field, which explains why the 90° group enhances piezoelectric performance. Depending on the specific application requirements, different types of building blocks can be selected as the primary focus of investigation.

2.2 Generator

To achieve optimal piezoelectric signals, this study takes 90° building blocks as a representative example to illustrate how to achieve the multi-objective on-demand inverse design of piezoelectric metamaterials. Here, we used a 256 × 256 pixel space to fully capture the microstructural morphology of the complex metamaterial structures. To reduce training costs and time without compromising the accuracy of the generated results, our generator utilizes the latent diffusion model (LDM), which reduces the high-dimensional pixel space to a lower-dimensional latent space for training. The LDM is a generative model designed to learn the diffusion process underlying the probability distribution of a given training dataset. This process involves iterative denoising of noisy samples to produce high-quality outputs. Due to the ability of the LDM to conditionally direct and regulate its generation process, it has become the method of choice for generative applications in areas such as image generation,⁴¹ video generation⁴² and protein structure prediction.⁴³ In our model, the visual depiction of our LDM is shown in Fig. 3, which involves the collaboration of two deep learning models: the variational autoencoder (VAE) and the traditional diffusion model (DM).⁴⁴


	Fig. 3 Architecture of the latent diffusion model. The model performs noise injection and denoising operations in the latent space. In the forward diffusion process, noise is progressively added at each time step ttt to the latent representation of the metamaterial structure obtained from the VAE encoder. The reverse process begins by sampling from a Gaussian distribution, and the structure is gradually reconstructed through a U-Net-based denoising network. Multi-objective conditions are integrated into the U-Net via cross-attention mechanisms to guide the denoising process toward the desired properties. The resulting latent representation is then decoded by the VAE decoder to generate the corresponding metamaterial structure. Finally, this generated structure undergoes another round of VAE encoding and decoding to produce a clearer image of the metamaterial.

First, we create a variational autoencoder (VAE), which is composed of two main components: the encoder and the decoder. The encoder compresses the original 256 × 256 image into a 32 × 32 low-dimensional latent space, while the decoder reconstructs the low-dimensional latent space back into the original 256 × 256 image. Next, to train the diffusion model (DM), the encoder of the trained VAE encodes the metamaterial into a 32 × 32 low-dimensional latent space and then corrupts the encoded samples with varying levels of Gaussian noise. The model learns to predict the noise present in each sample, subtracting it from the input samples to effectively denoise them. The noise prediction model uses a U-Net architecture, which consists of an encoder–decoder structure with skip connections. We built it by combining a residual convolutional layer with a cross-attention layer, in which the latter allows the model to incorporate target conditions (ρ, E, ε^T₃₃, d₃₃) into the denoising process, similar to how transformers are used in natural language processing.⁴⁵ After DM denoising, the image remains in the 32 × 32 latent space, and the VAE's decoder reconstructs this latent space back into the original 256 × 256 image.

For our specific cases, as shown in Fig. 3, we make the following adjustments to the model: (i) there is no need to train additional neural networks (e.g., BERT⁴⁶) to understand the semantics of the condition when the mathematical form of the condition sufficiently conveys its meaning. Our condition consists of only four numbers, which are directly and strongly related to the structural information of the metamaterial. Therefore, using models to encode these simple numbers (e.g., mapping them to a high-dimensional space) would result in the loss or blurring of information (Fig. S3†). (ii) As shown in Fig. S4,† metamaterial structures generated by LDM still contain some ambiguous regions, where it is unclear whether they are part of the actual metamaterial structure. To address this issue, we employ a trained VAE as an image restoration tool to further enhance the quality of the generated images without compromising the accuracy of the model (Fig. S4 and S5†). Since the VAE encoder has learned from pre-existing training data how to extract key features from the original input and encode them into latent space, it can effectively reconstruct clear images, even when the LDM-generated images contain minor blurring, as the overall morphological features in the LDM output remain clear. As illustrated in Fig. S5,† this approach yields more accurate metamaterial structures compared to manual restoration methods based on intuition or simple rules.

To evaluate the performance of the VAE, the intersection over union (IoU) metric was used to calculate the overlap between the real microstructure pixels (P_n) and the reconstructed microstructure pixels (P_n^re). The formula for IoU is as follows:


	(2)

In this study, the IoU score on the test set of 2000 microstructures was 93%, and the detailed reconstruction result is shown in Fig. S6.† This demonstrates an almost perfect match between P_n and P_n^re, indicating that the microstructure can be reconstructed with high quality. The accuracy of the LDM model was measured by calculating the mean absolute percentage error (MAPE) between the resulting properties and the input conditions. The expression for MAPE is given below:


	(3)

where d^gen₃₃, ε^{T gen}₃₃, E^gen, and ρ^gen are the piezoelectric coefficient, permittivity, Young's modulus and relative density of the piezoelectric metamaterial generated by the generator, respectively, whereas d^cond₃₃, ε^{T cond}₃₃, E^cond, and ρ^cond denote the input conditions, respectively.

To evaluate the accuracy of the LDM model, we tested it on a set of 3000 samples from the test set. The results are shown in Fig. 4a as a parity plot, where the x-axis represents the target performance conditions and the y-axis corresponds to the actual performance of the generated structures. Four performance metrics (d₃₃, ε^T₃₃, E, and ρ) are displayed, with the coefficient of determination R² values of 87.5%, 89.8%, 92.2%, and 93.4%, respectively. In addition, we calculated the average MAPE across these 3000 test samples, which was 10.11%. These results indicate that the generator achieves a high degree of text-image alignment. Moreover, we also evaluated the ability of our model to generate new metamaterial structures. The metamaterial structure images were encoded into latent space using a VAE and then downscaled to 2D space using t-SNE (ESI S7 and Fig. S8†). No complete overlap was observed between the newly generated data and the original dataset in the 2D t-SNE space, suggesting that our model has great potential to generate new metamaterial structures.


	Fig. 4 Inverse design evaluation. (a) Parity plots comparing the target properties with the actual performance of the generated metamaterials (d₃₃, ε^T₃₃, E, and ρ) across 3000 test samples. The R² values for each property are 87.5%, 89.8%, 92.2%, and 93.4%, respectively. (b) The capability of the GFV model for inverse on-demand generation of piezoelectric metamaterials was evaluated using 100 sets of data from the validation set. To illustrate the results of the multi-objective co-design in detail, eight representative cases (Cond1–Cond8) are highlighted with additional color, comparing the target input properties with the simulated properties of the generated structures. The average MAPE across the 100 validation samples is 1.06%.

2.3 Filter and validator

It is important to note that not all piezoelectric metamaterials generated by the model satisfy the specified input conditions. To identify structures that are closer to the desired properties, we employ a filter (a convolutional neural network, CNN) and a validator (a two-step finite element method) to screen optimal structures. The filter predicts d₃₃, ε^T₃₃ and E and filters out the metamaterials that do not meet the required properties. The filter consists of a five-layer CNN followed by a three-layer fully connected neural network. The dataset of 17 [thin space (1/6-em)]

000 images was split into a training set (80%) and a test set (20%), with the training set used to train the CNN. By training with the CNN, a surrogate model was developed to predict d₃₃, ε^T₃₃ and E directly from the metamaterial structure images. The prediction errors of the model on the test set were evaluated. Fig. S9† shows a comparison between the effective d₃₃, ε^T₃₃, and E obtained from the FEM and the predictions obtained using the machine learning model. Within the range covered by the dataset, the predicted values closely match the FEM results, indicating the accuracy of the model. For d₃₃, the model achieved a coefficient of determination (R²) of 91.7% with a mean absolute error (MAE) of 23 pC N⁻¹. For ε^T₃₃, the model's R² was 96.8% with a MAE of 10.82, and for E, R² was 93.1% with a MAE of 0.177 GPa. These results indicate that the CNN model exhibits high accuracy and reliability in predicting the key performance indicators of piezoelectric metamaterials, enabling effective screening of structures that align with the target performance.

Following the filtering step, a validator was employed to assess the performance of the filtered samples and to identify those closest to the target values. Fig. 4b illustrates the performance alignment of the GFV model in the text-driven structure generation task, evaluated using the MAPE. We sampled 100 sets of data uniformly from the test set to validate the GFV model's ability to generate piezoelectric metamaterial structures on demand. Uniform sampling improves the model's adaptability under different data distributions. The horizontal coordinates of the scatter plot are the input targets and the vertical coordinates are the analog simulation values of the piezoelectric metamaterial properties generated by GFV. In order to further demonstrate the ability of the model in multi-objective simultaneous inverse on-demand design, eight cases of typical conditions (Cond1–Cond8) were selected and in Fig. 4b, these data points are marked with different colors, and the MAPEs of Cond1–Cond8 are 0.6%, 1.2%, 0.9%, 0.8%, 0.5%, 1.5%, 0.6%, and 0.5% respectively. The results show that the generated structures are highly consistent with the target inputs in terms of various performance metrics. The average MAPE of the 100 sets of test data in Fig. 4b is 1.06%, which is much lower than the generation error of the LDM model alone. This is because in comparison with the scheme using only the LDM model, the GFV, by introducing the filter and validator mechanism, leads to a substantial reduction in inverse design error while the computational overhead increases only slightly.

This improvement is achieved by our two-stage generate-and-filter (GFV) strategy: first, the latent diffusion model (LDM) produces a large set of candidate structures conditioned on target properties; then, a CNN-based predictor filters out any samples whose predicted performance deviates from tolerance thresholds. This approach reduces the inverse design error from 10.11% (LDM alone) to 1.06% (GFV), which confirms that the GFV strategy significantly improves the likelihood of obtaining structures closer to the target values, further underscoring the high accuracy of our inverse on-demand generation approach.

2.4 Model comparison and ablation experiments

In addition to employing the LDM as the generator, we investigated a pixel-based manual encoding strategy, where each building block is mapped to a specific pixel value, which is then used to replace the VAE-based latent representation in the LDM framework for subsequent diffusion training. For example, we encode each “+”-shaped building block as pixel value 0, each “L”-shaped building block as 240, and defects as 255 in the grayscale image. The specific pixel values are assigned based on the relative contribution of each building block type to the overall piezoelectric performance of the metamaterial. The complete encoding scheme is shown in ESI S11 and visualized in Fig. S14b.† Specifically, instead of using the original 256 × 256 structural images, we represent each metamaterial sample as a compact 20 × 20 grayscale image, corresponding to a 20 × 20 grid of building blocks. This significantly reduces the dimensionality of the input while retaining essential structural semantics, thereby replacing the VAE-compressed latent space in the LDM for downstream diffusion-based generation, which helps reduce the overall training burden of the model. This encoding approach not only provides a compact and informative representation but also mitigates the boundary blurring issue discussed in section 2.2 by enforcing discrete and interpretable structure-level semantics.

We systematically compared the performance metrics of a manually encoded diffusion model (manual coding + DM) with those of a latent diffusion model (LDM) based on a VAE encoder and the results are listed in Table 1. The results indicate that the manual coding + DM approach achieves a slightly lower generation error (MAPE) of 9.75% compared to 10.11%. In terms of training time, the manual coding approach requires only 48 hours, which is 24 hours shorter than the LDM method. This reduction is primarily attributed to the elimination of the VAE pretraining stage, highlighting the time-saving advantage of manual encoding. Although LDM offers advantages in generalization and representational capacity, this comparison suggests that manual encoding remains a viable and efficient strategy for certain task-specific applications. This strategy is feasible for modular metamaterials composed of discrete building blocks. However, its representational capacity is limited in cases involving non-modular structures, continuous media, or irregular layouts (e.g., Voronoi-based designs). The reliance on prior knowledge limits the scalability of this encoding approach, but it offers practical guidance for targeted inverse design problems.

Table 1 Performance evaluation of different inverse design strategies

Method	MAPE	Generation time	Adaptability	Training time
LDM	10.11%	1 s	High	72 h
LDM (CNN)	4.28%	1.5 min	High	72 h
LDM (FEM)	2.57%	101 min	High	72 h
LDM (CNN + FEM)	1.06%	116 min	High	72 h
GA	2.58%	360 min	Medium	5 min
Manual coding + DM	9.75%	1 s	Low	48 h

We also compared the performance of the proposed GFV strategy with the genetic algorithm (GA) across key metrics, including generation error, generation time, adaptability, and training time and the details are listed in Table 1. The GFV framework, denoted as LDM (CNN + FEM), integrates a latent diffusion model (LDM) as the generator, a convolutional neural network (CNN) as the filter, and a finite element method (FEM) as the validator. This configuration achieves a low generation error (MAPE) of 1.06% and a total generation time of approximately 116 minutes, demonstrating superior precision and robustness in generating high-performance structures. In contrast, the GA method achieves a slightly higher error (MAPE) of 2.58% (see ESI S10† for details), and due to its iterative nature, it requires significantly more computational resources, with a total design time of around 360 minutes. Furthermore, GA relies on task-specific manual encoding for each individual in the population, which inherently restricts its scalability and adaptability when applied to complex design problems. Nevertheless, GA offers an advantage in training efficiency, requiring only about 5 minutes to train a predictive model.

To systematically evaluate the contribution of each functional module within the GFV framework, we conducted ablation studies comparing four configurations: LDM only, LDM (CNN), LDM (FEM), and the full LDM(CNN + FEM) architecture. As shown in Table 1, the experimental results indicate that removing either the CNN filter or the FEM validator leads to a decrease in the accuracy, confirming the critical role of module synergy in overall performance. Notably, the FEM validator reduces the error to 2.57%, outperforming the CNN filter's 4.28%, albeit at a higher computational cost, as the validation of each structure requires approximately one minute. The full configuration achieves the lowest error (MAPE) of 1.06%.

Regarding generation time, the GFV framework requires a longer duration due to the necessity of generating a large number of candidate structures for filtering and subsequent validation. In contrast, other methods demand less time but at the expense of reduced accuracy. This ablation study demonstrates the essential roles and complementary effects of each module within the overall framework, providing strong support for the design rationale of our approach.

In our GFV model, the filter and validator are introduced after the generator, reducing the generation error from 10.11% to 1.06% with minimal computational overhead. Considering that the filter offers faster inference but lower accuracy than the validator, it is employed for large-scale preliminary screening, while the validator is applied in the final stage to ensure the reliability of the generated structures. This strategy effectively eliminates suboptimal candidates resulting from conflicts among competing objectives, retaining only those structures that simultaneously satisfy multiple target criteria. It is particularly well-suited for solving complex multi-objective optimization problems.

2.5 Multi-scenario design on demand

This study aims to achieve multi-objective inverse on-demand design of piezoelectric metamaterials based on different application scenarios. In these scenarios, the model not only supports fixed-point optimization but also handles open-ended objectives (such as minimizing or maximizing specific attributes). Below, we present the application of our model in three typical use cases to illustrate its effectiveness in real-world tasks. Fig. 5a illustrates three typical application scenarios: wearable devices,⁴⁷ robots with self-powered systems, and medical ultrasound probes.⁴⁸ These application requirements can be classified into three design categories: low E and high g₃₃, high E and high FOM₃₃, and Z matching with high g₃₃. The first category is suitable for applications requiring high flexibility and comfort^49,50 (e.g., insoles, knee pads, and smart soft grippers); the second category applies to self-powered structural components in high-strength working environments^20,51 (e.g., robot legs and feet); and the third category is for scenarios requiring efficient acoustic wave transmission,^52,53 such as ultrasound probes.


	Fig. 5 Multi-scenario design on demand. (a) Various application scenarios, including wearable devices, self-powered robots, and ultrasound probes. (b–d) Comparison of the piezoelectric performance of raw data and target generated data by the GFV strategy. (e–g) Schematic diagrams of the optimal metamaterial structures generated by GFV under the three conditions.

To meet these design requirements, we set the following conditions on the basis of meeting the mechanical or acoustic impedance requirements while maintaining as high piezoelectric response as possible. For low E applications, the input conditions are marked by the blue pentagon in Fig. 5b (E = 0.2 GPa, ). The E value in these conditions is comparable to that of PVDF-based piezoelectric materials commonly used in existing wearable devices, typically ranging from 0.1 to 3 GPa.^54,55 For high-strength self-powered structural components, the input conditions are marked by the blue pentagon in Fig. 5c (E = 7 GPa, ), where E is in the highest region of the dataset. For ultrasound probe applications, the input conditions are shown by the blue pentagon in Fig. 5d (, ), where Z is similar to that of human tissue (1.5 M Rayl). The details of the selection methods for all parameters are included in ESI S8.† When the Z or E thresholds are satisfied, the values of d₃₃ and ε^T₃₃ for all input conditions fall outside the Pareto frontier within the dataset to optimize g₃₃ and FOM₃₃. The generated MAPE values for the optimal structures closest to the target are 4.2%, 2.3%, and 4.5% for the above three conditions, respectively. The difficulty of inverse generation increases as the input design conditions exceed the performance range of the original dataset,⁵⁶ resulting in higher errors compared to previous results. To select the piezoelectric metamaterial structures that meet the requirements, we propose a weighted evaluation formula based on target features (see ESI S9†). This formula supports the handling of open-ended objectives (such as minimizing or maximizing specific attributes), thereby better meeting the requirements of practical applications. The 100 optimal structures selected are located in the yellow regions of Fig. 5b–d and the optimal piezoelectric metamaterial structures are shown as red pentagons, all of which exceed the dataset boundaries. The performance metrics of the optimized metamaterials are as follows: (b) g₃₃ = 470 × 10⁻³ V m N⁻¹, E = 0.26 GPa (c) FOM₃₃ = 84.5 × 10⁻¹² m² N⁻¹, E = 7.1 GPa and (d) g₃₃ = 258 × 10⁻³ V m N⁻¹, Z = 1.56 M Rayl. Fig. 5e–g displays the optimal structures generated in three distinct application scenarios. In the case of the low E, high g₃₃ structure, a substantial void region is incorporated into the topological network, resulting in a notable increase in porosity. This porosity effectively reduces E while significantly increasing g₃₃.⁵⁷ For high E and high FOM₃₃, a fully vertical connected rod design is adopted, achieving the maximum E and limiting the expansion of the design space,⁵⁸ making it difficult to break the dataset boundaries. In the case of Z matching and high g₃₃ structures, the material's structural characteristics do not undergo extreme changes, as both Z and g₃₃ values remain within an intermediate range.

This study demonstrates that the proposed model is capable of effectively learning the intricate relationship between the structural and functional properties of piezoelectric metamaterials. It successfully captures local features related to electrical, mechanical, and acoustical properties, such as porosity and vertically connected rods, thereby overcoming the complex coupling and trade-offs between multiple objectives (as shown in Fig. S11† and Fig. 5b–d). Based on these insights, the model generates high-performance piezoelectric metamaterial structures, fulfilling multi-objective design requirements for diverse application scenarios. Notably, the piezoelectric metamaterial structures used in this study are composed of the piezoelectric ceramic PZT and 90° building blocks. If other application requirements arise (such as negative Poisson's ratio or lower density), the application space of inverse on-demand design can be further expanded by adjusting the intrinsic properties of materials or the types of building blocks.

3. Conclusions

To address the challenge of simultaneous multi-objective optimization in piezoelectric metamaterials, this work proposes a scalable and efficient GFV (generator, filter, and validator) multi-objective inverse on-demand design method based on a generative artificial intelligence latent diffusion model. This method is capable of generating on-demand piezoelectric metamaterial structures that simultaneously satisfy electrical, mechanical, and acoustic property requirements, with an error (MAPE) of only 1.06%, without the need for specialist knowledge. Moreover, we present illustrative examples of the GFV method for multi-objective design in diverse application scenarios. These examples demonstrate that the method can generate piezoelectric metamaterial structures with high electromechanical performance, beyond the dataset, in accordance with multi-objective requirements. The complexity of the coupling and trade-off relationships between multiple objectives, where enhancing one property may significantly affect others, can result in high-dimensional, non-linear optimisation problems. This complexity represents a significant challenge to conventional optimisation design methods. The GFV model, inspired by generative image modelling, is particularly well suited to this multi-objective on-demand design strategy and overcomes many of the challenges faced by traditional methods. The model is capable of accurately capturing the structural features that affect the material properties, including those related to d₃₃, ε^T₃₃, E, and ρ. Among others, these include pore structures that effectively reduce E while enhancing g₃₃, as well as vertically connected truss structures that reinforce E and combine the emergent abilities that may exist in large models to break the strong coupling relationship between multiple objectives. While satisfying the mechanical and acoustic properties, high d₃₃, low ε^T₃₃ piezoelectric metamaterial structures beyond the Pareto boundary of the dataset are generated.

Moreover, we made several improvements to each module of the GFV model. In the generator, a VAE was introduced to enhance the image clarity of LDM outputs without compromising performance accuracy. In the filter, both fixed-target and open-ended objectives (e.g., minimizing or maximizing specific properties) were supported. In the validator, a self-developed two-step FEM method was employed to accurately evaluate the effective piezoelectric performance of complex structures (see Methods). Therefore, the current work not only reveals the influence of electro-mechanical field coupling on the multifunctional performance of piezoelectric metamaterials but also establishes a simple and efficient multi-objective inverse design method, providing valuable guidance for experimental research. In principle, the current framework can be easily extended by adjusting for multi-objective conditions, different loading scenarios, and kinds of materials. Finally, the proposed framework allows extension to related fields such as mechanical, optical, and electromagnetic applications.

4. Methods

4.1 Training the latent diffusion model

Training of the proposed model involves two stages: (1) training the VAE to learn an expressive latent space and (2) training the diffusion model in the latent space.

4.1.1 Training VAE. In the training of the VAE, the total loss consists of two components: the reconstruction loss (commonly the mean squared error, MSE) and the Kullback–Leibler (KL) divergence. The KL divergence imposes regularization on the learned latent representation, encouraging it to approximate a standard normal distribution. This is achieved through a mild KL penalty, with the KL divergence given by:


	(4)

where μ represents the mean of the distribution and σ denotes the variance of the distribution. The introduction of the KL divergence helps structure the latent space, preventing the emergence of high variance or meaningless distributions.

4.1.2 Training the diffusion model. Based on our trained VAE, we obtain an efficient low-dimensional latent space z₀ in which high-frequency, imperceptible details are abstracted away. The forward noise addition process can be described by a parameterized Markov chain, where the data at each time step depends only on the previous time step and follows a Gaussian distribution. The formula for the forward process is given below:


	(5)

where z_t denotes the sample at time step t, z_t−1 denotes the sample at the previous time step, and β_t is a decreasing sequence over time that controls the noise intensity. This implies that the level of added noise diminishes as the diffusion process progresses. Specifically, at each noise addition step, z_t is the sum of z_t−1 and a β_t fraction of noise I. As the process approaches its endpoint, there is a gradual reduction in the introduced noise.

The denoising process requires training a U-Net as a denoising autoencoder,⁴⁴ designated as ε_θ(z_t, t, y), whose objective is to predict the noise that must be subtracted at the specified moment t. This is achieved through the utilisation of input conditions y (d₃₃, ε^T₃₃, E and ρ) and the input image z_t. The cross-attention mechanism is integrated into the underlying U-Net backbone network to facilitate the integration of the condition y with the input image z_t, which is to be denoised.

The conditional LDM is learned based on the image−condition pair, according to the following equation:


L_LDM = E_{z₀, y, ε∼N(0,1),t}[\|\|ε − ε_θ(z_t, t, y)\|\|²₂],	(6)

where the symbol ε is used to denote the random Gaussian noise samples that are saved during the noise addition process at a given time point, designated as t. Once z₀ has been obtained through the inverse process of stepwise denoising, the low-dimensional image in the latent space must be decoded by the decoder of VAE into the metamaterial structure x₁ that is required in the real space.

4.2 Two-step finite element method

4.2.1 The calculation of d₃₃. The two-step finite element method is divided into two steps: in the first step an external electric field is applied to polarize the piezoelectric metamaterials, and in the second step a stress is applied to calculate the effective electric displacement, which is generated by the direct piezoelectric effect. Initially, the ceramics are subjected to a constant direct current (DC) electric field E to induce polarization, establishing the electric field distribution within the metamaterial. Based on the electric field distribution, we can obtain the distribution of the local piezoelectric coefficient d₃₃ within the piezoelectric metamaterial. The relationship between d₃₃ and the polarization electric field E_apply, as derived from our experiments, is given below and illustrated in Fig. S10a† (we take the piezoelectric ceramic PZT-5H as our research object).


	(7)

where E_i represents the local electric field. The subscript ‘3’ in the piezoelectric coefficient d₃₃ indicates the direction of polarization,^18,59 equivalent to the local electric field direction. As shown in Fig. S10b,† the local electric field distribution in the piezoelectric metamaterial consistently aligns with the struts.^8,60 Therefore, the local ‘3’ direction for d₃₃ at any given point corresponds to the direction along the strut at that location.⁶¹

The piezoelectric coefficient matrix obtained above is in the local coordinate system. To effectively calculate the overall effective piezoelectric coefficient d₃₃ for the material, we need to transform the piezoelectric coefficient matrix from the local coordinate system to the global coordinate system. The method for this transformation is as follows:


d^global_ij = Nd^local_ijT,	(8)

where d^global_ij and d^local_ij are the piezoelectric coefficient matrices in the global and local coordinate systems, respectively. The expressions for N and T are as follows:


	(9)


	(10)

where:


	(11)

as shown in Fig. S10c,† x₁, y₁ and z₁ are the coordinate axes in the local coordinate system, while x₂, y₂ and z₂ are the coordinate axes in the global coordinate system.

In the second step, we apply force σ^eff₃₃ at each end of the metamaterial to obtain the stress distribution of the piezoelectric metamaterials. We can then calculate the electric displacement contributions D⁽ⁱ⁾_n matrix generated at each point of the piezoelectric metamaterials after the stress is applied and the total electric displacement, D^eff₃:


D⁽ⁱ⁾_n = σ^global(i)_ijd^global(i)_ij,	(12)


	(13)

where σ^global(i)_ij, d^global(i)_ij, V_i are the stress matrix, piezoelectric coefficient matrix and volume of the ith point in the metamaterial structure in the global coordinate system, respectively. D⁽ⁱ⁾₃ is the electric displacement at this point in the 3-direction obtained from D⁽ⁱ⁾_n and V is the total volume of the metamaterial. Finally we can determine the effective piezoelectric coefficient d^eff₃₃ of piezoelectric metamaterials through the direct piezoelectric effect equation:


	(14)

where σ^eff₃₃ is the applied stress in the second step.

4.2.2 Calculation of ε^T₃₃. The permittivity can be obtained by applying an external electric field in the first step, with the following calculation formula:


	(15)

where E_apply is the applied electric field, and D₃ is the global average polarization obtained from the following equation:


	(16)

where ε_PZT is the intrinsic permittivity of the PZT.

4.2.3 Calculation of E. Young's modulus can be obtained by applying the stress field in the second step, which is calculated as follows:


	(17)

where σ^eff₃₃ is the applied stress in the second step, and δ₃₃ is the deformation in the direction of the stress.

4.3 Poling experiment for PZT-5H

To obtain the experimental relationship between the piezoelectric coefficient d₃₃ and the applied electric field, we conducted a series of controlled poling experiments on PZT-5H ceramics.

Commercial PZT-5H samples with dimensions of 10 mm × 10 mm × 1 mm (purchased from Shenglei, Shaoxing, China) were used in the tests. The samples were subjected to a direct current (DC) poling process in silicone oil maintained at 120 °C to ensure uniform thermal conditions and to prevent electrical breakdown. A constant DC electric field was applied across the thickness of the samples using silver paste electrodes, which were sintered at 580 °C for 15 minutes prior to poling to ensure reliable electrical contact.

The electric field was incrementally increased from 0 kV cm⁻¹ to 20 kV cm⁻¹ in steps of 1 kV cm⁻¹. For each step, the sample was poled for 5 minutes, followed by the measurement of the piezoelectric coefficient d₃₃ using a standard quasi-static d₃₃ meter. In total, 20 field response data points were collected.

These experimentally obtained values served as reference benchmarks for finite element calibration and validation of our inverse design framework.

Author contributions

Chun-Yu Lei: writing original draft, high throughput simulations and machine learning models. Zhong-Hui Shen: writing – review & editing, supervision, and funding acquisition. Jian Wang: formal analysis and data curation. Run-Lin Liu: formal analysis and data curation. Meng-Jun Zhou: writing – review & editing and supervision. All authors have given approval to the final version of the manuscript.

Conflicts of interest

The authors declare no conflicts of interest.

Data availability

All data used are available within this paper and its ESI.† Further information can be acquired from the corresponding authors upon reasonable request.

Acknowledgements

This work was supported by the NSF of China (grant no. 52422206, 52372121 and 92463306, Zhong-Hui Shen, and grant no. 51202141, Meng-Jun Zhou).

References

J. Zhang, Q. Xu, Y. Zhang, W. Guo, H. Zeng, Y. He, J. Wu, L. Guo, K. Zhou and D. Zhang, Mater. Horiz., 2025, 12, 3494–3504 RSC .
J. Zhang, C. Wang and C. Bowen, Nanoscale, 2014, 6, 13314–13327 RSC .
J.-Q. Luo, H.-F. Lu, Y.-J. Nie, Y.-H. Zhou, C.-F. Wang, Z.-X. Zhang, D.-W. Fu and Y. Zhang, Nat. Commun., 2024, 15, 8636 CrossRef CAS PubMed .
S. Egusa, Z. Wang, N. Chocat, Z. Ruff, A. Stolyarov, D. Shemuly, F. Sorin, P. Rakich, J. Joannopoulos and Y. Fink, Nat. Mater., 2010, 9, 643–648 CrossRef CAS PubMed .
X. Lu, H. Qu and M. Skorobogatiy, ACS Nano, 2017, 11, 2103–2114 CrossRef CAS PubMed .
Q. Xu, Z. Wang, J. Zhong, M. Yan, S. Zhao, J. Gong, K. Feng, J. Zhang, K. Zhou and J. Xie, Adv. Funct. Mater., 2023, 33, 2304402 CrossRef CAS .
Q. He, Y. Zeng, L. Jiang, Z. Wang, G. Lu, H. Kang, P. Li, B. Bethers, S. Feng and L. Sun, Nat. Commun., 2023, 14, 6477 CrossRef CAS PubMed .
H. Cui, D. Yao, R. Hensleigh, H. Lu, A. Calderon, Z. Xu, S. Davaria, Z. Wang, P. Mercier and P. Tarazaga, Science, 2022, 376, 1287–1293 CrossRef CAS PubMed .
S. Zhang, Y. Liu, J. Deng, X. Gao, J. Li, W. Wang, M. Xun, X. Ma, Q. Chang and J. Liu, Nat. Commun., 2023, 14, 500 CrossRef CAS .
R. Guo, C. A. Wang and A. Yang, J. Am. Ceram. Soc., 2011, 94, 1794–1799 CrossRef CAS .
D. Yao, H. Cui, R. Hensleigh, P. Smith, S. Alford, D. Bernero, S. Bush, K. Mann, H. F. Wu and M. Chin-Nieh, Adv. Funct. Mater., 2019, 29, 1903866 CrossRef CAS .
H. Xue, L. Jiang, G. Lu and J. Wu, Adv. Funct. Mater., 2023, 33, 2212110 CrossRef CAS .
M. Yan, Z. Xiao, J. Ye, X. Yuan, Z. Li, C. Bowen, Y. Zhang and D. Zhang, Energy Environ. Sci., 2021, 14, 6158–6190 RSC .
Y. Zhang, J. Roscow, M. Xie and C. Bowen, J. Eur. Ceram. Soc., 2018, 38, 4203–4211 CrossRef CAS .
J. Roscow, R. Lewis, J. Taylor and C. Bowen, Acta Mater., 2017, 128, 207–217 CrossRef CAS .
J. I. Roscow, Y. Zhang, M. J. Kraśny, R. Lewis, J. Taylor and C. R. Bowen, J. Phys. D: Appl. Phys., 2018, 51, 225301 CrossRef .
H. Cui, R. Hensleigh, D. Yao, D. Maurya, P. Kumar, M. G. Kang, S. Priya and X. Zheng, Nat. Mater., 2019, 18, 234–241 CrossRef CAS PubMed .
J. Yang, Z. Li, X. Xin, X. Gao, X. Yuan, Z. Wang, Z. Yu, X. Wang, J. Zhou and S. Dong, Sci. Adv., 2019, 5, eaax1782 CrossRef CAS .
S. Bonfanti, S. Hiemer, R. Zulkarnain, R. Guerra, M. Zaiser and S. Zapperi, Nat. Comput. Sci., 2024, 4, 574–583 CrossRef PubMed .
B. Peng, Y. Wei, Y. Qin, J. Dai, Y. Li, A. Liu, Y. Tian, L. Han, Y. Zheng and P. Wen, Nat. Commun., 2023, 14, 6630 CrossRef CAS PubMed .
K. M. Jablonka, G. M. Jothiappan, S. Wang, B. Smit and B. Yoo, Nat. Commun., 2021, 12, 2312 CrossRef CAS PubMed .
Z.-H. Shen, T.-X. Tang, J. Wang, M.-J. Zhou, H.-X. Liu, L.-Q. Chen, Y. Shen and C.-W. Nan, Nano Energy, 2023, 117, 108933 CrossRef CAS .
X. Yu, Y. Hou, Z. Yang, X. Gao, M. Zheng and M. Zhu, Nano Energy, 2022, 101, 107598 CrossRef CAS .
J. Wang, G. Liu, C. Zhou, X. Cui, W. Wang, J. Wang, Y. Huang, J. Jiang, Z. Wang and Z. Tang, Nanoscale, 2024, 16, 14213–14246 RSC .
K. A. Brown and G. X. Gu, Nat. Comput. Sci., 2024, 4, 553–555 CrossRef CAS PubMed .
J.-H. Bastek and D. M. Kochmann, Nat. Mach. Intell., 2023, 5, 1466–1475 CrossRef .
X. Zheng, X. Zhang, T. T. Chen and I. Watanabe, Adv. Mater., 2023, 35, 2302530 CrossRef CAS PubMed .
Z. Liu, M. Cai, S. Hong, J. Shi, S. Xie, C. Liu, H. Du, J. D. Morin, G. Li and L. Wang, Proc. Natl. Acad. Sci. U. S. A., 2024, 121, e2320222121 CrossRef CAS PubMed .
B. Deng, A. Zareei, X. Ding, J. C. Weaver, C. H. Rycroft and K. Bertoldi, Adv. Mater., 2022, 34, 2206238 CrossRef CAS PubMed .
X. Zheng, I. Watanabe, J. Paik, J. Li, X. Guo and M. Naito, Small, 2024, 2402685 CrossRef CAS PubMed .
P. Dhariwal and A. Nichol, Adv. Neural Inf. Process. Syst., 2021, 34, 8780–8794 Search PubMed .
R. Rombach, A. Blattmann, D. Lorenz, P. Esser and B. Ommer, 2022 IEEECVF Conf. Comput. Vis. Pattern Recognit. CVPR, IEEE, New Orleans, LA, USA, 2022.
J. Wei, Y. Tay, R. Bommasani, C. Raffel, B. Zoph, S. Borgeaud, D. Yogatama, M. Bosma, D. Zhou and D. Metzler, arXiv preprint arXiv:2206.07682, 2022.
M. Zaiser and S. Zapperi, Nat. Rev. Phys., 2023, 5, 679–688 CrossRef .
P. Jiao, J. Mueller, J. R. Raney, X. Zheng and A. H. Alavi, Nat. Commun., 2023, 14, 6004 CrossRef CAS PubMed .
C. Lu, M. Hsieh, Z. Huang, C. Zhang, Y. Lin, Q. Shen, F. Chen and L. Zhang, Engineering, 2022, 17, 44–63 CrossRef .
C. S. Ha, D. Yao, Z. Xu, C. Liu, H. Liu, D. Elkins, M. Kile, V. Deshpande, Z. Kong and M. Bauchy, Nat. Commun., 2023, 14, 5765 CrossRef CAS PubMed .
K. Liu, R. Sun and C. Daraio, Science, 2022, 377, 975–981 CrossRef CAS PubMed .
P. E. Hart, N. J. Nilsson and B. Raphael, IEEE Trans. Syst. Sci. Cybern., 1968, 4, 100–107 Search PubMed .
Z. H. Shen, J. J. Wang, Y. Lin, C. W. Nan, L. Q. Chen and Y. Shen, Adv. Mater., 2018, 30, 1704380 CrossRef PubMed .
D. Podell, Z. English, K. Lacey, A. Blattmann, T. Dockhorn, J. Müller, J. Penna and R. Rombach, arXiv preprint arXiv:2307.01952, 2023.
Y. Liu, K. Zhang, Y. Li, Z. Yan, C. Gao, R. Chen, Z. Yuan, Y. Huang, H. Sun and J. Gao, arXiv preprint arXiv:2402.17177, 2024.
A. Alakhdar, B. Poczos and N. Washburn, J. Chem. Inf. Model, 2024, 64, 7238–7256 CrossRef CAS PubMed .
J. Ho, A. Jain and P. Abbeel, Adv. Neural Inf. Process. Syst., 2020, 33, 6840–6851 Search PubMed .
A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser and I. Polosukhin, Adv. Neural Inf. Process. Syst., 2017, 30, 5998–6008 Search PubMed .
J. Devlin, arXiv preprint arXiv:1810.04805, 2018.
H. Ji, S. Zhang, K. Liu, T. Wu, S. Li, H. Shen and M. Xu, Mater. Horiz., 2022, 9, 2976–2983 RSC .
Y. Wang, P. Zang, D. Yang, R. Zhang, S. Gai and P. Yang, Mater. Horiz., 2023, 10, 1140–1184 RSC .
K. Liu, F. Hacker and C. Daraio, Sci. Rob., 2021, 6, eabf5116 CrossRef PubMed .
H. Yin, Y. Li, Z. Tian, Q. Li, C. Jiang, E. Liang and Y. Guo, Nano-Micro Lett., 2025, 17, 42 CrossRef PubMed .
W. Yang, W. Han, H. Gao, L. Zhang, S. Wang, L. Xing, Y. Zhang and X. Xue, Nanoscale, 2018, 10, 2099–2107 RSC .
S. Liu, M. Yan, Z. Hu, X. Yuan, Y. Zhang and D. Zhang, ACS Appl. Mater. Interfaces, 2024, 16, 55731–55740 CAS .
Q. Wang, Z. Li, C. Bowen, C. Courtney, M. Pan, Q. Xu, W. Chen, S. Fieldhouse and C. Wan, Nano Energy, 2024, 131, 110254 CrossRef CAS .
Y. Liu, Y. Zhou, H. Qin, T. Yang, X. Chen, L. Li, Z. Han, K. Wang, B. Zhang and W. Lu, Nat. Mater., 2023, 22, 873–879 CrossRef CAS PubMed .
T. Yang, W. Deng, G. Tian, L. Deng, W. Zeng, Y. Wu, S. Wang, J. Zhang, B. Lan and Y. Sun, Mater. Horiz., 2023, 10, 5045–5052 RSC .
T. Weiss, E. M. Yanes, S. Chakraborty, L. Cosmo, A. M. Bronstein and R. Gershoni-Poranne, Nat. Comput. Sci., 2023, 3, 873–882 CrossRef PubMed .
M. Yan, S. Liu, Z. Xiao, X. Yuan, D. Zhai, K. Zhou, D. Zhang, G. Zhang, C. Bowen and Y. Zhang, Ceram. Int., 2022, 48, 5017–5025 CrossRef CAS .
L. Zheng, K. Karapiperis, S. Kumar and D. M. Kochmann, Nat. Commun., 2023, 14, 7563 CrossRef CAS PubMed .
Z. Li, X. Yi, J. Yang, L. Bian, Z. Yu and S. Dong, Adv. Mater., 2022, 34, 2107236 CrossRef CAS PubMed .
R. Kiran, A. Kumar, R. Kumar and R. Vaish, Scr. Mater., 2018, 151, 76–81 CrossRef CAS .
T. Shimada, S. Sepideh, J. Wang and T. Kitamura, Acta Mater., 2017, 125, 202–209 CrossRef .

Footnote

† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d5nr01669j

Click here to see how this site uses Cookies. View our privacy policy here.