Design of Phononic Bandgap Metamaterials Based on Gaussian Mixture Beta Variational Autoencoder and Iterative Model Updating

Wang, Zihan; Xian, Weikang; Baccouche, M. Ridha; Lanzerath, Horst; Li, Ying; Xu, Hongyi

doi:10.1115/1.4053814

Abstract

Phononic bandgap metamaterials, which consist of periodic cellular structures, are capable of absorbing energy within a certain frequency range. Designing metamaterials that trap waves across a wide wave frequency range is still a challenging task. In this paper, we present a deep feature learning-based design framework for both unsupervised generative design and supervised learning-based exploitative optimization. The Gaussian mixture beta variational autoencoder (GM-βVAE) is used to extract latent features as design variables. Gaussian process (GP) regression models are trained to predict the relationship between latent features and properties for property-driven optimization. The optimal structural designs are reconstructed by mapping the optimized latent feature values to the original image space. Compared with the regular variational autoencoder (VAE), we demonstrate that GM-βVAE has a better learning capability and is able to generate a more diversified design set in unsupervised generative design. Furthermore, we propose an iterative GM-βVAE model updating-based design framework. In each iteration, the optimal designs found property-driven optimization is used to update the training dataset. The GM-βVAE model is re-trained with the updated dataset for the optimization search in the next iteration. The effectiveness of the iterative design framework is demonstrated by comparing the proposed designs with the designs found by the traditional single-loop design method and the topologically optimized designs reported in literatures. The caveats to designing phonic bandgap metamaterials are summarized.

1 Introduction

Controlling the energy flow attracts great attention in the design of mechanical metamaterials [1–3]. Periodic cellular phononic bandgap metamaterials have demonstrated the capability to influence the propagation of mechanical waves within a certain frequency range [4]. Vibrations within the bandgap range are attenuated by the mechanism of wave interferences within the periodic structures, so the waves are blocked by the metamaterial. Phononic bandgap metamaterials show their advantage in a variety of applications, such as vibration isolation [5], noise reduction [6,7], and stress wave protection [8–10]. It has been demonstrated that bandgaps spanning different frequency ranges can be achieved by tailoring the topological features of the cellular structures [11–16]. A rational methodology for discovering new metamaterials with a wide bandgap is highly desired. Topology optimization methods have been proposed to pursue specific bandgap properties (e.g., maximum total bandgap width, maximum bandgap width in the desired frequency range, or achieving bandgap in a specific frequency range) [17–22]. Topology optimization shows its strength in exploitative design optimization, but it is not effective for the generative design of a highly diversified design set.

Deep learning, as a subject of artificial intelligence (AI), has been successfully applied to a variety of research fields [23–28]. Here, we focus on the deep learning-driven design methods for computational material design [29–37]. Deep generative models, such as variational autoencoders (VAEs), generative adversarial networks (GANs), and their variations, have been used to compress the high-dimensional input data into a low-dimensional latent space for design representation and design generation. A GAN model converges when the discriminator and the generator reach a Nash equilibrium [38]. The generator model learns to generate new images and the discriminator learns to decide whether the newly generated images are true or false. Unlike GAN that does not work with any explicit density estimation, VAE provides an explicit likelihood for modeling training. The quality of the VAE model can be evaluated easily based on its log-likelihood loss function, while GAN is notoriously difficult to train and requires a lot of tuning [39]. Wang et al. proposed a data-driven metamaterial design framework based on VAE to learn encoded structure and property information for the design of metamaterial structural systems that achieve desired distortion patterns [40]. Tan et al. applied a deep convolutional generative adversarial network and convolutional neural network to design microstructural materials. The optimal designs achieve desired compliance tensor matrix and satisfy the geometrical constraints [41]. GAN has also been applied to design complex architectured materials for stiffness and Poisson ratio [42]. However, only elastic or elastic-related properties are considered in the works mentioned above. The deep generative modeling methods have been applied to design phononic/photonic metamaterials as well. Li et al. proposed an autoencoder-based method to design phononic crystals [43]. Ma et al. proposed a VAE-based approach to design photonic structures [44]. The VAE-based inverse design framework proposed by Liu et al. is generally applicable to the design for various material properties [45]. GAN-based design frameworks have been developed to design photonic structures with user-defined objectives [46–48]. However, these works only consider metamaterial designs with simple topological features (e.g., single-hole structure) and a single design objective. For multiobjective design, Kim et al. proposed to integrate VAE and Gaussian process (GP) regression to find optimal microstructure designs that achieve both ultimate tensile strength and toughness [49].

The objective of this study is to find high-performance designs with a good diversity, which would incur a significant computational cost with the traditional design methods. In this work, we propose an effective design framework that enables both unsupervised generative design and exploitative optimization of cellular metamaterial structures with multiple design objectives. The unsupervised generative design can be done by exploring the latent feature space by space-filling sampling techniques (e.g., Latin hypercube sampling (LHS)) [50]. The random space-filling process may not be the most effective way with limited computational resources. To resolve this issue, we propose a new approach based on the modes of the latent features learned by the Gaussian mixture beta variational autoencoder (GM-βVAE). GM-βVAE enables unsupervised clustering through deep generative models [51]. The exploitative optimization is based on the supervised learning model of the relationship between the latent features and the design performances, such as GP regression [52] and neural network [53]. The optimal latent feature values are obtained by optimization and then mapped back to the original structure design space to reconstruct the corresponding structure (decoding). In this research, we establish and compare a single-loop supervised learning-based design approach and a new iterative model updating-based design framework. The proposed methods are expected to be generally applicable to design problems where the gradient information is not available or too expensive to obtain (e.g., structure crashworthiness, phononic bandgap).

The remainders of this paper are organized as follows. In Sec. 2, fundamental physics and the simulation model of phononic bandgap properties are introduced. A convergence study is presented to determine the optimal mesh size. In Sec. 3, we present the deep generative model-based design methodology: a single-loop approach and an iterative GM-βVAE model updating-based design framework. The deep generative model is selected based on a comparative study of the learning capabilities of the VAE, βVAE, and GM-βVAE models. In Sec. 4, we first present the GM-βVAE-based unsupervised generative design method and compare it with the traditional space filling-based unsupervised generative design method. Then we showcase the application of the single-loop approach on property-driven design optimization. Finally, the iterative model updating-based framework is demonstrated on a multiobjective design problem. The optimal designs are compared with the topologically optimized designs reported in literatures. Section 5 is the conclusion.

2 Technical Background: Phonoic Bandgap Simulations and the Convergence Study

Waves are often subject to reflection, refraction, and diffraction when passing through the boundaries between one medium to another [54]. Periodic cellular metamaterials with peculiar wave propagating properties, therefore, have been designed for applications like wave filters and waveguides [55]. Bloch wave analysis, illuminated by solid-state physics originally, is one of the most used methods in terms of studying wave propagation properties in periodic structural patterns. It solves eigenvalue problem considering constraints posted by the structural periodicity. Consider an incremental wave motion in addition to equilibrium given as Eq. :

δ x (X, t) = δ k (X) e x p [- i ω t]

(1)

where X is Lagrangian description of material points in space, δk is the periodic mode in the space that is frequency-independent, and ω is the frequency of the wave. The incremental displacements

δ k = \tilde{k} e x p [i K \cdot X]

are constrained by the periodicity given by the bloch wave vector K that determines the direction at which the wave propagates,

\tilde{k}

represents the absolute value of the incremental displacements. With a given constitutive relation, corresponding stress in complex form is as given in Eq. :

δ S = \tilde{S} e x p [- i ω t]

(2)

where

\tilde{S}

is frequency-independent absolute values of the incremental stresses, respectively. Solutions to equations of motions Eq. , as a result, are eigenvalue problem that gives natural frequencies and modes

\nabla δ S + ρ_{0} ω^{2} δ k = 0

(3)

For simple structures, like a circular hole in a squared unit cell, it is possible to give an analytical solution to the eigenvalue problem of Eq. [56]. However, it is a nontrivial, if not impossible, task to derive analytical solutions to general structures. As a numerical alternative, commercial finite element analysis software abaqus is used to solve the eigenvalues problem. The finite element implementation is well-established, efficient, and fully validated by experiments [57]. In the ABAQUS, a complex periodic boundary condition, as in Eq. , is applied via multi-point constraint. In Eq. , the incremental displacements along the edges of the unit cells are related by the exponential of iK · R, where R is the directional vectors that mark the unit cell. This is demonstrated in Fig. 1. The pairs of horizontal and vertical dot-dashed lines highlight the exemplary pairs of conjugated nodes on left and right, and up and down edges, respectively. All the conjugated nodes on the edges in the finite element mesh are connected in this way

δ x (X + R) = δ x (X) e x p [i K \cdot R]

(4)

It is worth noting that Eq. writes in complex form. In order to solve it in the ABAQUS, two identical sets of mesh are implemented in the software to represent the real part and imaginary part of Eq. , respectively. Therefore, the frequency-independent periodic boundary condition can be written as

\begin{matrix} δ k {(X + R)}^{r e} = δ k {(X)}^{r e} \cos (K \cdot R) - δ k {(X)}^{i m} \sin (K \cdot R) \\ δ k {(X + R)}^{i m} = δ k {(X)}^{r e} \sin (K \cdot R) + δ k {(X)}^{i m} \cos (K \cdot R) \end{matrix}

(5)

where the upper scripts re and im are for the real and imaginary parts, respectively. More details can be found in Ref. [58].

Dispersion curves are plots that show the derived natural frequencies with respect to corresponding wave vectors. Figures 1(a) and 1(c) are two examples of cellular metamaterials. Figures 1(b) and 1(d) show corresponding dispersion curves, respectively. The results of normalized reduced frequency can be expressed as

f = ω L / (2 π C_{t})

(6)

where L represents the characteristic length of the cellular metamaterial and C_t represents the transverse wave speed. The green areas in Fig. 1(b) remark band gaps of the structure in (a), suggesting no waves within the highlighted range can propagation through the structure. For the structure in (c), there is no bandgap.

Fig. 1

View large Download slide

Two cellular metamaterial designs and their corresponding dispersion curves. Each curve represents natural frequency as a function of the wave vector K. Results are normalized by the characteristic length of the cellular metamaterial m and the transverse wave speed C_t. Several band gaps are identified for the first structure while there is no band gap for the second structure. (a) and (b) show a metamaterial design with bandgaps and its dispersion curves, (c) and (d) show a metamaterial design without bandgap and its dispersion curves.

Generally, fine meshes of the unit cell are preferred in terms of the accuracy of the solutions to the eigenproblem. The expense of the simulation, however, increases as the meshes are refined. Therefore, a convergence test was conducted in pursuit of the mesh refinement that balances accuracy and efficiency. We randomly picked two different cellular metamaterials in our database to perform the convergence test. In the test, refinements of the meshes are quantified as the size of 4-node elements along the edges of the structures. Refinements are set as 1/50, 1/125, 1/250, and 1/375, from coarse to refined. Corresponding dispersion characteristics are estimated. We compared, in Fig. 2, for two different unit cells. Only the first five curves are shown for clarity. Assuming the 1/375 as the finest possible resolution that can be achieved with available computational resources, we evaluated the accuracy of the rest resolutions. Symbols of line-circle, line-star, line-cross, and line-triangle represent reduced natural frequencies with respect to different reduced wave vector K, for 1/50, 1/125, 1/250, and 1/375, respectively. Lines are guides to the eyes. As clearly shown, the 1/50 meshes have obvious deviations from the references 1/375 results. The 1/250 meshes, on the other hand, show good consistency to the references but the computational expense is out of our expectation in terms of the generation of a database with thousands of structures. Therefore, the choice of meshes 1/125 is taken as the compromise. Sampled with 60 discrete sampling locations on wave vector K, simulations for a single cellular metamaterial take about up to 30 min, up to 90 min, up to 180 min, and up to 300 min for 1/50, 1/125, 1/250, and 1/375, respectively. There is no surprise that simulations for sparse structure (Fig. 2(a)) are faster than those for dense structure (Fig. 2(b)).

Fig. 2

View large Download slide

Convergence test. Results for refinements of 1/50, 1/125, 1/250, and 1/375, from coarse to refine are represented by line-circle, line-star, line-cross, and line-triangle, respectively. Refinement of 1/250 is chosen considering accuracy and efficiency. (a) A diamond-shape metamaterial design and its dispersion curves. (b) A more complex metamaterial design and its dispersion curves.

Based on the convergence study, we choose to use 250 × 250 pixel image (correspond to the refinements of 1/125) to represent our input data for the sake of a balance between calculation time of bandgap simulation and the accuracy. Note that the size of input data is also used in the simulation of elastic properties and the generative model for the sake of consistency. We automated the simulation-postprocessing process and implemented it on our cellular metamaterial dataset. The total bandgap of a cellular metamaterial is selected as a property of interest. The statistics of all samples in the database are shown in Fig. 3(a) and Table 1.

Fig. 3

View large Download slide

Statistics of simulated total bandgap width and longitudinal stiffness of the cellular metamaterials in the database. (a) Statistical distribution of the bandgap width and (b) statistical distribution of the longitudinal stiffness.

Table 1

Statistics of simulated total bandgap width and longitudinal stiffness of the cellular metamaterials

	Lower bound	25%	Mean	75%	Upper bound
Total bandgap width	0	11.2	96.9	385.2	6130.5
E₁ value (MPa)	422.04	13713	28749	38416	63896

Simulations of the cellular metamaterials’ stiffness are also implemented in ABAQUS. Using the unified periodical boundary conditions in literature [59], the elastic moduli and Poisson’s ratio of the cellular metamaterials can be calculated. In our study, we are only interested in the longitudinal moduli E₁. The statistics of all samples’ longitudinal stiffness E₁ are shown in Fig. 3(b) and Table 1.

3 Overview of the Deep Generative Model-Based Design Methodology

3.1 Single-Loop and Iterative GM-βVAE Model Updating-Based Design Framework.

We first present a single-loop computational design approach, which will be used as the basic building block in the proposed iterative model updating-based design framework. The single-loop approach consists of three major components (top dotted box of Fig. 4):

A deep generative model (e.g., VAE, βVAE, or GM-βVAE) to extract low-dimensional latent feature vectors from the input images. The cellular metamaterial images are mapped to the points in the continuous latent space. New cellular structure designs can be generated by conducting arithmetic operations in the latent space (e.g., increasing or decreasing the values of the latent features, which is equivalent to moving the points in the latent space).
A GP model of the latent feature–property relationship for structure optimization. The properties of training samples are obtained by high-fidelity simulations. The GP model is trained on the simulation dataset to provide an efficient emulator of the latent feature–property relationship. Metamaterial structural design is formulated as a parametric optimization problem in the latent space:
$\begin{aligned} max_{z_{l}} {G P_{1} (z_{l}), \dots, G P_{M} (z_{l})}^{T} \\ s u b j e c t t o min (z) \leq z_{l} \leq max (z) \end{aligned}$
(7)
z_l represents the latent feature vector, l is the dimension of the latent features, GP_i represents the GP model for the ith mechanical property of interest, i = 1, 2, …, M. z represents the latent feature vectors of the training points. As the computational cost of design evaluation by GP is negligible compared with the true simulations, a variety of optimization algorithms, such as gradient-based algorithms and nongradient evolutionary algorithms, can be applied to solve the optimal z_l values. In this work, we adopt the Genetic Algorithm to find the optimal designs.
A deep decoder that maps the latent feature vector to the corresponding cellular structure image in the original input space. It is used to reconstruct the optimal cellular metamaterial design based on the optimal z_l values obtained in the previous step.

Fig. 4

View large Download slide

Iterative GM-βVAE model updating-based design framework

However, the single-loop design framework has a significant shortcoming. The deep generative model is trained on a randomly generated cellular structure dataset. This initial dataset may not fully cover the critical design regions where the global optimum is located. Therefore, the lack-of-data uncertainties in the critical design regions limit (i) the accuracy of the latent feature-structural image mapping and (ii) the accuracy of the latent feature–property supervised learning model, and thus will deteriorate the optimal search performances.

To resolve this issue, we propose an iterative GM-βVAE model updating-based design framework (Fig. 4). Similar to the strategy of active learning [60–62], this framework improves the model predictability in the critical design regions during the search iterations.

In each iteration, the single-loop design method is implemented to generate high-performance designs based on the existing dataset. The newly discovered designs are validated by simulations and used to replace the low-performance designs in the training dataset. Then the GM-βVAE model is re-trained for the next iteration. This process is iterated to achieve progressive improvements in design performances.

The proposed strategy is a performance-driven model updating process, which adds the newly discovered high-performance designs to the training dataset. Another model updating strategy is to reduce the epistemic uncertainty in the supervised learning model (uncertainty-driven model updating). New sample points will be generated at locations with the largest epistemic uncertainties to improve the predictability of the model. In Sec. 4.3, the performance-driven and the uncertainty-driven modeling updating strategies will be compared in a multiobjective design optimization problem.

The proposed approach has two stages: “offline model training” and “online design optimization.” The computational costs are mainly induced by the offline modeling training stage, which includes the property simulations for the initial training dataset, and the simulations of newly added samples during the iterative model updating process. A sufficient number of training samples are needed to guarantee the quality of the deep generative model and the GP model. We recommend to conduct a convergence study to find the minimum number of samples.

3.2 Selection of Deep Generative Model: VAE, βVAE, or GM-βVAE.

A deep generative model can extract the low-dimensional latent features from the input image data and reconstruct images based on the latent feature values. Here we provide a review of three VAE-based deep generative models and present a comparative study that helps to select the best model for structure design.

3.2.1 Variational Autoencoder.

Variational autoencoder, which was first proposed by Kingma and Welling [63], is one of the most popular methods to learn complicated data distribution such as images in an unsupervised manner. With embedded Bayesian inference in the autoencoder architecture, a VAE learns the mapping relationship between the input image x and the corresponding latent space z.

Variational autoencoder consists of an encoder (recognition model) and a decoder (generative model). The encoder of the VAE performs nonlinear dimensionality reduction and compresses the high-dimensional data into a low-dimensional latent space, which can be expressed as $Q_{ϕ} (z | x)$ ⁠. $Q_{ϕ} (z | x)$ is the approximate posterior that follows a normal distribution. ϕ is the vector of the recognition model parameters.

Another deep neural network, called decoder or generative model, is used to generate high-dimensional variables based on latent variables. The generative model is expressed as

P_{θ} (x, z) = P_{θ} (x | z) P_{θ} (z)

(8)

where θ is the vector of parameters of the generative model,

P_{θ} (z)

is the prior distribution of latent variables, and

P_{θ} (x | z)

is the approximated distribution of x conditioned on z.

The likelihood function for the training can be expressed explicitly and approximated by the evidence lower bound:

L_{V A E} = E_{Q_{ϕ} (z | x)} [\log P_{θ} (x | z) - D_{K L} (Q_{ϕ} (z | x) | | P_{θ} (z))]

(9)

3.2.2 Beta Variational Autoencoder.

βVAE is a variation of VAE, it can disentangle the latent space thus obtaining a more meaningful latent space [64]. βVAE has a multiple weight β on the Kullback-Leibler (KL) divergence loss term:

L_{β V A E} = E_{Q_{ϕ} (z | x)} [l o g P_{θ} (x | z) - β D_{K L} (Q_{ϕ} (z | x) | | P_{θ} (z))]

(10)

When β equals 1, the βVAE corresponds to the typical VAE framework.

VAE/βVAE shows a great advantage in nonlinear mapping, However, the prior distribution of the latent variable is assumed to be a standard normal distribution, which may not perform well on the dataset that shows multimode characteristics.

3.2.3 Gaussian Mixture Beta Variational Autoencoder.

Different from the typical VAE and βVAE, the prior over each latent variable of GM-βVAE is modeled as a mixture of Gaussian, which can capture the complex statistical distribution with multimodal characteristics. We assume that the input data are generated from different classes, which means that each mode of the latent feature corresponds to a specific class. Each class of data obeys a Gaussian distribution. The departure of each class of data is measured by Mahalanobis distance [65].

GM-βVAE also aims to learn the relationship between the input image x and the corresponding latent space z. Moreover, a categorical variable y is introduced to identify which Gaussian distribution does each data point belongs to. The relationship between the input data and the latent space is obtained by the encoder (recognition model), which can be expressed as

Q_{ϕ} (y, z | x) = Q_{ϕ} (y | x) Q_{ϕ} (z | x, y)

(11)

where ϕ is the vector of the recognition model parameters,

Q_{ϕ} (y, z | x)

is the approximate posterior that is represented by a mixture of Gaussian distributions. Each latent feature consists of k distinct Gaussian distributions, i.e.,

Q_{ϕ} (z | x, y_{i})

that are assumed to be Gaussian, where i ∈ 0, 1, 2, …k − 1. Thus, the approximate posterior becomes a Gaussian mixture. The means and variances of the Gaussian distributions are learned by the inference model.

The decoder of GM-βVAE can be expressed as

P_{θ} (x, y, z) = P_{θ} (x | z) P_{θ} (y | z) P (y)

(12)

where θ is the network parameters of the generative model,

P_{θ} (x | z)

is the approximated distribution of x conditioned on z,

P_{θ} (y | z)

is the approximated distribution of y conditioned on z, P(y) is the initial prior on y, which is selected to be uniform multinomial distributions.

In our proposed GM-βVAE model that consists of the encoder and decoder networks, the evidence lower bound can be expressed as

L_{G M - β V A E} = E_{Q_{ϕ} (y, z | x)} [\log P (y) - \log Q_{ϕ} (y | x) + β \log \frac{P_{θ} (z | y)}{Q_{ϕ} (z | x, y)} + \log P_{θ} (x | z)]

(13)

where

E_{Q_{ϕ} (y, z | x)} [\log Q_{ϕ} (y | x)]

is the conditional entropy that reflects how informative x is on y.

\log \frac{P_{θ} (z | y)}{Q_{ϕ} (z | x, y)}

is the KL divergence loss term and

\log P_{θ} (x | z)

is the reconstruction loss, respectively. We use β = 8 to achieve a balance between the disentanglement of latent space and reconstruction accuracy.

3.2.4 Comparison of the Three Generative Models.

The learning capacity of a deep generative model is assessed by measuring the difference between the original image and the reconstructed image. Here, we demonstrate the advantage of GM-βVAE over VAE and βVAE in terms of reconstruction accuracy. For a fair comparison, the same deep learning network architecture and the same training dataset are used in all models.

The details of the generative network models are shown in Fig. 5. The dimensionality of the latent space is set to be four for both the VAEs and GM-βVAE model. Based on the results of test runs, we find that using a lower dimensionality will incur a large reconstruction loss while using a dimensionality higher than four will not lead to a significant improvement in the reconstruction accuracy. The differences between the two networks lie in the definitions of the loss function and the distribution functions of the latent features (Gaussian distribution versus Gaussian mixture distribution).

Fig. 5

View large Download slide

The detailed architecture of (a) VAE and βVAE model and (b) the GM-βVAE model

We created a 2D cellular metamaterial database with 400 samples collected from literature [66,67] and our previous research [68]. We divide our dataset into a training set with 360 samples and a testing set with 40 samples randomly. Model training is performed for 150 epochs with a batch size of 32 for all three models. We used Adam as the optimizer to train the model. A visual comparison is conducted on the reconstructed images. The VAE, βVAE, and GM-βVAE models all have a good reconstruction quality and preserve most of the geometrical features (Fig. 6). However, some of the reconstructed images have blurred local regions, such as the central region in the image and the structural boundaries. The blurred regions are common in the reconstructions by VAE because of their log-likelihood loss functions [69]. We tackle this issue by setting a threshold to binarize the reconstructed images. We set the binarization threshold as 0.9. Pixels higher than 0.9 are converted to solid, and pixels lower than 0.9 are converted to void. The binarization step induces minimal changes to the structural geometry. We use the binarized structures in our finite element simulations, which has been introduced in Sec. 3.2.3.

Fig. 6

View large Download slide

Structure images reconstruction by VAE, βVAE, and GM-βVAE. Eight data in the test set are randomly selected for visual comparison: (a) the original input images, (b) the reconstructed images by VAE, (c) the reconstructed images by βVAE, and (d) the reconstructed images by GM-βVAE.

A quantitative comparison on the reconstruction accuracy is done by calculating the average cumulative relative error in pixel values between the test images and the corresponding reconstructions:

Error = \frac{1}{N} \sum_{i = 1}^{m} \sum_{j = 1}^{n} | O_{i j} - R_{i j} |

(14)

where N represents the total amount of test images. Two sets of validations are performed. In the first set, we apply the trained models to reconstruct the images in the training set to verify the training accuracy (N = 360). In the second set, we apply the trained models to reconstruct the images in the validation set to evaluate their capabilities on new data (N = 40). For each pixelated metamaterial image, m is the number of pixels in each row and n is the number of pixels in each column, where m = n = 250. O_ij is the value of the pixel in row i and column j of the original image. R_ij is the value of pixel in row i and column j of the reconstructed image. The relative errors of VAE, βVAE, and GM-βVAE models are listed in Table 2. GM-βVAE achieves the highest accuracy in both the training dataset and the validation dataset. It indicates that GM-βVAE model has the best accuracy in design representation and reconstruction.

Table 2

Quantitative comparison of the accuracy of VAE, βVAE, and GM-βVAE. A smaller error value indicates a higher accuracy

	VAE	βVAE	GM-βVAE
Training set	20.5563	37.3033	7.4748
Testing set	33.4167	42.7666	23.0436

4 Design Cases

4.1 Unsupervised Generative Design.

The purpose of unsupervised generative design is to discover a diverse set of cellular metamaterial designs that span a wide range of properties (e.g., bandgap width), without conducting the expensive design evaluations (e.g., phononic property simulations). This design set provides a start point for the following supervised learning-based design optimization. First, we implement a traditional approach of unsupervised generative design. With a trained βVAE/GM-βVAE model, random samples are collected by sampling the latent space using space-filling methods (e.g., Design of Experiments) [60]. The assumption of this approach is that a wide range of design performances can be achieved by filling the latent space with stochastically but evenly distributed sample points.

4.1.1 Design Diversity.

Here a case study is presented to illustrate the mechanism of generating diverse structural designs using a deep generative model. The basic idea is to morph the structure design by manipulating the values of the latent feature variables.

First, we choose the cellular structure whose latent features are [0, 0, 0, 0] and vary each latent variable value along the negative and positive directions. When we vary the value of a latent feature, the other three latent variables are fixed to 0. The resultant new structures are shown in Fig. 7(a). It is obvious that different latent variables control different topological patterns in the structural image. As the latent variables are continuous, we can generate an infinite number of structural patterns theoretically.

Fig. 7

View large Download slide

(a) Generating new cellular metamaterial designs by manipulating the latent feature’s values and (b) generating new cellular metamaterial designs by linear interpolation/extrapolation based on two sample points in the latent space

Second, we investigate the interpolation and extrapolation between two existing samples in the training dataset (highlighted by squares in Fig. 7(b)). New designs are generated by conducting linear interpolation/extrapolation based on the two existing sample points in the latent space. A continuous evolution in structural morphology can be obtained along this linear path.

4.1.2 Unsupervised Generative Design.

We propose an unsupervised generative design approach by leveraging the multimode nature of the GM-βVAE latent feature distributions. The statistical distribution of each latent feature is a mixture of Gaussian distributions. The mean of each Gaussian distribution (mode center) is considered as the center of a cluster of samples with similar characteristics. Therefore, a “cluster exemplar” can be obtained by constructing a latent feature vector in which each feature takes the value of a mode center (Fig. 8(a)). There are four latent features and two distinctive modes are identified for each latent feature, so in total 2⁴= 16 combinations (designs) are obtained. The corresponding cellular metamaterial images of the sixteen designs are obtained by decoding, as shown in Fig. 8(b).

Fig. 8

View large Download slide

(a) Multimode statistical distribution of each latent feature obtained by GM-βVAE, (b) design exemplars obtained by combining the mode values of each latent feature, and (c) the range of total bandgap width of the sample sets by sampling the βVAE latent space by LHS, by sampling the GM-βVAE latent space by LHS, and by the proposed approach which is based on the latent feature mode centers learned by GM-βVAE

The effectiveness of the unsupervised generative design is evaluated by comparing the range of properties achieved by the sample set. A wider range indicates a higher diversity of the design set. We also generate 16 designs using the traditional LHS approach. The property of each design (bandgap width) is obtained by simulation. It is observed that the proposed generative design approach achieves a much wider property range (Fig. 8(c)).

4.2 Single-Loop, Single-Objective Design Optimization.

The purpose of the second case study is to showcase the single-loop approach in property-driven structure design optimization. The design goal is to find cellular metamaterial designs with the maximal bandgap width. We repeat the GP modeling-optimization process three times independently and get three different optimal candidates, as shown in Fig. 9. The total bandgap width values of the three optimal candidates are verified by phononic simulations in ABAQUS. There are two important observations.

The single-objective optimization results in unrealistic designs for engineering applications, as shown in Fig. 9(a). The structural designs consist of several solid components connected by thin beams, which will lead to low stiffness and loss of structural integrity under loadings.
We also found that the performances of the training data could affect the performances of the optimization results. We regroup our existing dataset into two groups: the first group contains 320 designs with the top 80% width of bandgap width, the second group contains 320 designs with the bottom 80% width of bandgap width. The bandgap widths of the optimal designs found by the first model (trained on the high-performance data) are one order of magnitude higher than the optimal designs found by the second model (trained on the low-performance data). Therefore, we hypothesize that improving the performance of the VAE training dataset will lead to better designs.

Fig. 9

View large Download slide

Optimal designs and their total bandgap width obtained by (a) the VAE model trained on the top 80% high-performance data subset and (b) the VAE model trained on the bottom 80% low-performance data subset

4.3 Iterative GM-βVAE Model Updating-Based Multiobjective Design Optimization: Maximizing the Total Bandgap Width and the Longitudinal Stiffness.

A multiobjective design problem is formulated to maximize the total bandgap width and to maximize the structural stiffness (longitudinal stiffness E₁). The multiobjective design formulation can resolve the issue of lacking stiffness in the single-objective optimization. Two surrogate modeling strategies are compared in this study: establishing a multi-response GP model [70] that predicts both properties simultaneously, and establishing independent GP models for each response. The metamaterial samples are divided into the training dataset (364 points) and the test dataset (36 points) randomly. Multi-response GP yields a much lower accuracy (Fig. 10), because the two desired mechanical properties (width of total bandgap and longitudinal stiffness) are weakly correlated (the correlation coefficient is less than 1e-8). Therefore, we decide to train two independent GP models for each response for the following optimization studies. Nondominated Sorting Genetic Algorithm (NSGA-II) [71] is applied to find the nondominated design set.

Fig. 10

View large Download slide

(a) Multi-response GP model prediction and validation for the bandgap width, (b) multi-response GP model prediction and validation for the longitudinal stiffness E₁, (c) Gaussian process regression model prediction and validation for the bandgap width, and (d) Gaussian process regression model prediction and validation for the longitudinal stiffness E₁. Test points are randomly selected.

In the first iteration, 20 nondominated designs are obtained and the corresponding design images are reconstructed by decoding. The top 10 designs with the largest bandgap width are used to replace the top 10 undesired designs with a small total bandgap width and a small elastic modulus in the training set. The GM-βVAE is re-trained with the updated dataset for the next iteration.

In total three iterations are conducted. The training dataset and the GM-βVAE model are updated once in each iteration. The nondominated sets of the three iterations are shown in Fig. 11(a). The design performances in Fig. 11(a) are predicted by the GP model. The true performances of the nondominated designs, which are obtained by finite element simulations, are shown in Fig. 11(b). Three representative designs, the two ends (#1 and #20) and the mid-point (#11) of the nondominated set (Figs. 11(a) and 11(d)), are selected and compared in Table 3. We find that:

The nondominated design set moves closer to the utopian point (upper right corner) during the iterative design process.
Most of the designs on the Pareto frontier (marked by the dashed line in Fig. 11(b)) are from the last iteration, which indicates the effectiveness of the iterative search process.

Fig. 11

View large Download slide

(a) Nondominated set in each iteration of the iterative designs’ multiobjective optimization, the dashed line indicates the predicted Pareto frontier for the three design circles, (b) true value of the predicted nondominated set in each iteration, the dashed line indicates the true Pareto frontier for the three design circles, (c) comparison of designs from literature, Pareto frontier by proposed property-driven model updating and Pareto frontier by uncertainty-driven model updating, (d) representative designs in the first and third iteration, and (e) designs reported from literatures

Table 3

Nondominated set value of bandgap width and E₁ value of the three selected optimal designs and their corresponding simulated values

Iterations		First			Last
Designs		#1	#11	#20	#1	#11	#20
Total bandgap width	GP prediction	21.5	1344.3	3856.5	148	2426.6	5575
Total bandgap width	True value by simulation	20.2	1075	4886.5	16.1	2204	6768
E₁ value (MPa)	GP prediction	63195	29543.9	1338.5	71413	25837	3070
E₁ value (MPa)	True value by simulation	42468	32843	4045	48010	24180	2131

Iterations		First			Last
Designs		#1	#11	#20	#1	#11	#20
Total bandgap width	GP prediction	21.5	1344.3	3856.5	148	2426.6	5575
Total bandgap width	True value by simulation	20.2	1075	4886.5	16.1	2204	6768
E₁ value (MPa)	GP prediction	63195	29543.9	1338.5	71413	25837	3070
E₁ value (MPa)	True value by simulation	42468	32843	4045	48010	24180	2131

We compared our designs with the optimal phononic metamaterial designs reported in literature [17,21,22] (Figs. 11(c) and 11(e)). These designs are obtained by topology optimization. We simulate the stiffness and total bandgap width of these designs and compare them with the nondominated designs found by our proposed design framework. As shown in Fig. 11(c), our proposed design framework successfully generates designs that excel the literature designs in both stiffness and total bandgap width.

We also investigated another iterative model updating strategy that focuses on reducing the epistemic uncertainty in the supervised learning model (uncertainty-driven model updating). In the first two iterations, the GM-βVAE and GP models are updated by adding 15 new sample points collected from design regions with the highest epistemic uncertainties to the training set. In the last iteration, the design objective is formulated as maximizing the total bandgap width and the structural stiffness (longitudinal stiffness E₁). We compare the nondominated design set found by the uncertainty-driven model updating strategy and the proposed property-driven model updating strategy. As shown in Fig. 11(c), the proposed property-driven model updating strategy performs better than the uncertainty-driven model updating strategy.

It is to be noted that there exists a large difference between the design performances predicted by GP and the true values obtained by finite element simulation (Figs. 11(a) and 11(b), Table 3). The prediction errors are induced by the following factors.

Small changes are made on the reconstructed design images in postprocessing, which includes eliminating the small disconnected solid parts in the design [72–74] and converting the blurry image to a binary image.
The accuracy of the GP model is limited due to the limited amount of data.

Although the design performances predicted by GP cannot fully match the true values obtained by finite element simulation, a strong correlation is observed between them (the R squared value is 0.88), as shown in Fig. 13 in the Appendix. It indicates that the “high-performance” design points found by the surrogate model have a high probability to possess a real high performance.

5 Discussion and Conclusion

In this paper, we present a deep feature learning-based framework to design cellular metamaterials for bandgap width and stiffness. By taking advantage of the latent space learned by GM-βVAE, this design framework enables both unsupervised generative design and property-driven optimization. Our major conclusions are summarized as follows:

The proposed deep feature learning-based framework is effective in designing metamaterials with complex topological characteristics.
The proposed iterative updating-based design framework is effective in improving the design performances.
The GM-βVAE-based unsupervised generative design method can generate a more diversified design set than the traditional space filling-based unsupervised generative design method.
As the two responses are correlated weakly, it is recommended to establish independent GP models for each response (stiffness and total bandgap width), instead of an all-in-one multi-response GP, for supervised learning and design optimization.

Two caveats to designing phononic bandgap metamaterials are identified.

When considering only the phononic bandgap as the design objective, the optimization algorithm tends to generate structure designs with weakly connected components. Stiffness is an indispensable design objective/constraint to guarantee structural integrity under mechanical loads.
Postprocessing (e.g., filtering) on the reconstructed structure image may lead to significant changes in the bandgap properties, which indicates a lack of robustness in our proposed designs.

We also identified three opportunities to further improve the proposed framework.

The proposed design framework will be further improved and extended to the design of 3D metamaterials.
To resolve the issue that small changes in the optimal design’s geometry lead to a large degradation in phononic properties, robust optimization methods will be investigated.
The diversity of the design set can be improved in two ways. First, the VAE model can be improved by incorporating a diversity-oriented loss function that encourages the generation of novel designs. The diversity-oriented loss function was proposed based on the determinantal point processes [75], which provides a measurement of the design similarity. Second, the design diversity can also be improved in the stage of multiobjective optimization. By integrating the niching method with the Evolutionary optimization algorithms (e.g., niching Genetic Algorithm [76]), multiple optima at different locations in the design space can be obtained in the property-driven optimization.

Acknowledgment

This research is supported by Ford University Research Program. Z.W. acknowledges the support from the University of Connecticut School of Engineering General Electric (GE) Graduate Fellowship for Innovation. Y.L. gratefully acknowledges financial support from the Air Force Office of Scientific Research through the Air Force's Young Investigator Research Program (FA9550-20-1-0183; Program Manager: Dr. Ming-Jen Pan) and the National Science Foundation (CMMI-1934829). Y.L. would like to thank the support from 3M’s Non-Tenured Faculty Award. This research also benefited in part from the computational resources and staff contributions provided by the Booth Engineering Center for Advanced Technology (BECAT) at UConn. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the U.S. Department of Defense.

Conflict of Interest

There are no conflicts of interest.

Data Availability Statement

The datasets generated and supporting the findings of this article are obtainable from the corresponding author upon reasonable request.

Appendix A: Selection of Modes Number in GM-βVAE.

We trained multiple GM-βVAE models with different numbers of modes (1, 2, 3, and 4) in the Gaussian mixture latent space for each design iteration. A convergence study is conducted to find the optimal number of modes that lead to the highest fitting accuracy. When the number of modes equals 1, the latent variable follows the normal distribution. We compared the cumulated distribution function (CDF) of each latent variable fitted by Gaussian mixture with the empirical CDF of the true latent variable values. Figure 12(a) shows the fitted CDF versus the true CDF of the four latent features when the number of modes equals 2. The fitting error is evaluated as the area enclosed by two CDF curves. Figure 12(b) shows the result of the convergence study. Increasing the number of modes will reduce the enclosed area between the true CDF and the fitted CDF (smaller error). We observe a significant improvement by increasing the number of modes increases 1 to 2, but a slight improvement from 2 to 3 or from 3 to 4. So we choose a Gaussian mixture with 2 modes in our GM-βVAE model.

Fig. 12

View large Download slide

(a) Cumulative distribution function (CDF) comparison of actual distribution and the simulated distribution by GM-βVAE and (b) averaged enclosed area by simulated CDF and true CDF for a different number of modes in first, second, and third iteration with a number of modes equal to 1, 2, 3, and 4

For the second and third iteration of the design process, the trends are the same. So we use Gaussian mixture with two models for all three iterations.

Appendix B: Correlation Between GP Prediction and FEM Simulation

Fig. 13

View large Download slide

Correlation between GP prediction and FEM simulation

References

1.

Frenzel

,

T.

,

Köpfler

,

J.

,

Jung

,

E.

,

Kadic

,

M.

, and

Wegener

,

M.

,

2019

, “

Ultrasound Experiments on Acoustical Activity in Chiral Mechanical Metamaterials

,”

Nat. Commun.

,

10

(

1

), pp.

1

–

6

.

Google Scholar

Crossref

PubMed

2.

Yu

,

X.

,

Zhou

,

J.

,

Liang

,

H.

,

Jiang

,

Z.

, and

Wu

,

L.

,

2018

, “

Mechanical Metamaterials Associated With Stiffness, Rigidity and Compressibility: A Brief Review

,”

Prog. Mater. Sci.

,

94

, pp.

114

–

173

.

Google Scholar

Crossref

3.

Wu

,

L.

,

Wang

,

Y.

,

Zhai

,

Z.

,

Yang

,

Y.

,

Krishnaraju

,

D.

,

Lu

,

J.

,

Wu

,

F.

,

Wang

,

Q.

, and

Jiang

,

H.

,

2020

, “

Mechanical Metamaterials for Full-Band Mechanical Wave Shielding

,”

Appl. Mater. Today

,

20

, p.

100671

.

Google Scholar

Crossref

4.

Joannopoulos

,

J. D.

,

Johnson

,

S. G.

,

Winn

,

J. N.

, and

Meade

,

R. D.

,

2011

,

Photonic Crystals

,

Princeton University Press

,

Princeton, NJ

.

Google Scholar

Crossref

5.

Deymier

,

P. A.

,

2013

,

Acoustic Metamaterials and Phononic Crystals

, Vol.

173

,

Springer Science & Business Media

,

Berlin/Heidelberg, Germany

.

Google Scholar

Crossref

6.

Cervera

,

F.

,

Sanchis

,

L.

,

Sánchez-Pérez

,

J. V.

,

Martínez-Sala

,

R.

,

Rubio

,

C.

,

Meseguer

,

F.

,

López

,

C.

,

Caballero

,

D.

, and

Sánchez-Dehesa

,

J.

,

2001

, “

Refractive Acoustic Devices for Airborne Sound

,”

Phys. Rev. Lett.

,

88

(

2

), p.

023902

.

Google Scholar

Crossref

PubMed

7.

Munday

,

J.

,

Bennett

,

C. B.

, and

Robertson

,

W.

,

2002

, “

Band Gaps and Defect Modes in Periodically Structured Waveguides

,”

J. Acoust. Soc. Am.

,

112

(

4

), pp.

1353

–

1358

.

Google Scholar

Crossref

PubMed

8.

Benchabane

,

S.

,

Khelif

,

A.

,

Rauch

,

J.-Y.

,

Robert

,

L.

, and

Laude

,

V.

,

2006

, “

Evidence for Complete Surface Wave Band Gap in a Piezoelectric Phononic Crystal

,”

Phys. Rev. E

,

73

(

6

), p.

065601

.

Google Scholar

Crossref

9.

Nemat-Nasser

,

S.

,

2015

, “

Refraction Characteristics of Phononic Crystals

,”

Acta Mech. Sin.

,

31

(

4

), pp.

481

–

493

.

Google Scholar

Crossref

10.

Mitchell

,

S. J.

,

Pandolfi

,

A.

, and

Ortiz

,

M.

,

2014

, “

Metaconcrete: Designed Aggregates to Enhance Dynamic Performance

,”

J. Mech. Phys. Solids

,

65

, pp.

69

–

81

.

Google Scholar

Crossref

11.

Wang

,

Y.-F.

,

Wang

,

Y.-S.

, and

Wang

,

L.

,

2013

, “

Two-Dimensional Ternary Locally Resonant Phononic Crystals With a Comblike Coating

,”

J. Phys. D: Appl. Phys.

,

47

(

1

), p.

015502

.

Google Scholar

Crossref

12.

Wang

,

Y.-F.

,

Wang

,

Y.-S.

, and

Su

,

X.-X.

,

2011

, “

Large Bandgaps of Two-Dimensional Phononic Crystals With Cross-Like Holes

,”

J. Appl. Phys.

,

110

(

11

), p.

113520

.

Google Scholar

Crossref

13.

Chen

,

Q.

, and

Elbanna

,

A.

,

2016

, “

Modulating Elastic Band Gap Structure in Layered Soft Composites Using Sacrificial Interfaces

,”

ASME J. Appl. Mech.

,

83

(

11

), p. 111009.

14.

Nemat-Nasser

,

S.

,

Sadeghi

,

H.

,

Amirkhizi

,

A.

, and

Srivastava

,

A.

,

2015

, “

Phononic Layered Composites for Stress-Wave Attenuation

,”

Mech. Res. Commun.

,

68

, pp.

65

–

69

.

Google Scholar

Crossref

15.

Huang

,

Y.

,

Li

,

J.

,

Chen

,

W.

, and

Bao

,

R.

,

2019

, “

Tunable Bandgaps in Soft Phononic Plates With Spring-Mass-Like Resonators

,”

Int. J. Mech. Sci.

,

151

, pp.

300

–

313

.

Google Scholar

Crossref

16.

Wang

,

P.

,

Shim

,

J.

, and

Bertoldi

,

K.

,

2013

, “

Effects of Geometric and Material Nonlinearities on Tunable Band Gaps and Low-Frequency Directionality of Phononic Crystals

,”

Phys. Rev. B

,

88

(

1

), p.

014304

.

Google Scholar

Crossref

17.

Dong

,

H.-W.

,

Su

,

X.-X.

,

Wang

,

Y.-S.

, and

Zhang

,

C.

,

2014

, “

Topological Optimization of Two-Dimensional Phononic Crystals Based on the Finite Element Method and Genetic Algorithm

,”

Struct. Multidiscipl. Optim.

,

50

(

4

), pp.

593

–

604

.

Google Scholar

Crossref

18.

Sigmund

,

O.

, and

Søndergaard Jensen

,

J.

,

2003

, “

Systematic Design of Phononic Band–Gap Materials and Structures by Topology Optimization

,”

Philos. Trans. R. Soc. London, Ser. A

,

361

(

1806

), pp.

1001

–

1019

.

Google Scholar

Crossref

19.

Hussein

,

M. I.

,

Hulbert

,

G. M.

, and

Scott

,

R. A.

,

2006

, “

Dispersive Elastodynamics of 1D Banded Materials and Structures: Analysis

,”

J. Sound Vib.

,

289

(

4–5

), pp.

779

–

806

.

Google Scholar

Crossref

20.

Hussein

,

M. I.

,

Hamza

,

K.

,

Hulbert

,

G. M.

,

Scott

,

R. A.

, and

Saitou

,

K.

,

2006

, “

Multiobjective Evolutionary Optimization of Periodic Layered Materials for Desired Wave Dispersion Characteristics

,”

Struct. Multidiscipl. Optim.

,

31

(

1

), pp.

60

–

75

.

Google Scholar

Crossref

21.

Liu

,

Z.-F.

,

Wu

,

B.

, and

He

,

C.-F.

,

2014

, “

Band-Gap Optimization of Two-Dimensional Phononic Crystals Based on Genetic Algorithm and FPWE

,”

Waves Random Complex Medium

,

24

(

3

), pp.

286

–

305

.

Google Scholar

Crossref

22.

Han

,

X.

, and

Zhang

,

Z.

,

2019

, “

Topological Optimization of Phononic Crystal Thin Plate by a Genetic Algorithm

,”

Sci. Rep.

,

9

(

1

), pp.

1

–

13

.

Google Scholar

Crossref

PubMed

23.

Faust

,

O.

,

Hagiwara

,

Y.

,

Hong

,

T. J.

,

Lih

,

O. S.

, and

Acharya

,

U. R.

,

2018

, “

Deep Learning for Healthcare Applications Based on Physiological Signals: A Review

,”

Comput. Methods Programs Biomed.

,

161

, pp.

1

–

13

.

Google Scholar

Crossref

PubMed

24.

Guo

,

Y.

,

Liu

,

Y.

,

Oerlemans

,

A.

,

Lao

,

S.

,

Wu

,

S.

, and

Lew

,

M. S.

,

2016

, “

Deep Learning for Visual Understanding: A Review

,”

Neurocomputing

,

187

, pp.

27

–

48

.

Google Scholar

Crossref

25.

Vargas

,

R.

,

Mosavi

,

A.

, and

Ruiz

,

R.

,

2017

, “

Deep Learning: A Review

,”

Adv. Intell. Syst. Comput.

[Working Paper].

26.

Zinn

,

J.

,

Vogel-Heuser

,

B.

, and

Gruber

,

M.

,

2021

, “

Fault-Tolerant Control of Programmable Logic Controller-Based Production Systems With Deep Reinforcement Learning

,”

ASME J. Mech. Des.

,

143

(

7

), p.

072004

.

Google Scholar

Crossref

27.

Lee

,

X. Y.

,

Balu

,

A.

,

Stoecklein

,

D.

,

Ganapathysubramanian

,

B.

, and

Sarkar

,

S.

,

2019

, “

A Case Study of Deep Reinforcement Learning for Engineering Design: Application to Microfluidic Devices for Flow Sculpting

,”

ASME J. Mech. Des.

,

141

(

11

), p. 111401.

28.

Oh

,

S.

,

Jung

,

Y.

,

Kim

,

S.

,

Lee

,

I.

, and

Kang

,

N.

,

2019

, “

Deep Generative Design: Integration of Topology Optimization and Generative Models

,”

ASME J. Mech. Des.

,

141

(

11

), p. 111405.

29.

Agrawal

,

A.

, and

Choudhary

,

A.

,

2019

, “

Deep Materials Informatics: Applications of Deep Learning in Materials Science

,”

MRS Commun.

,

9

(

3

), pp.

779

–

792

.

Google Scholar

Crossref

30.

Ryan

,

K.

,

Lengyel

,

J.

, and

Shatruk

,

M.

,

2018

, “

Crystal Structure Prediction via Deep Learning

,”

J. Am. Chem. Soc.

,

140

(

32

), pp.

10158

–

10168

.

Google Scholar

Crossref

PubMed

31.

Yang

,

K. V.

,

Rometsch

,

P.

,

Jarvis

,

T.

,

Rao

,

J.

,

Cao

,

S.

,

Davies

,

C.

, and

Wu

,

X.

,

2018

, “

Porosity Formation Mechanisms and Fatigue Response in Al-Si-Mg Alloys Made by Selective Laser Melting

,”

Mater. Sci. Eng. A

,

712

, pp.

166

–

174

.

Google Scholar

Crossref

32.

Cecen

,

A.

,

Dai

,

H.

,

Yabansu

,

Y. C.

,

Kalidindi

,

S. R.

, and

Song

,

L.

,

2018

, “

Material Structure-Property Linkages Using Three-Dimensional Convolutional Neural Networks

,”

Acta Mater.

,

146

, pp.

76

–

84

.

Google Scholar

Crossref

33.

Liu

,

R.

,

Agrawal

,

A.

,

Liao

,

W.-K.

,

Choudhary

,

A.

, and

De Graef

,

M.

,

2016

, “

Materials Discovery: Understanding Polycrystals From Large-Scale Electron Patterns

,”

2016 IEEE International Conference on Big Data (Big Data)

,

Washington, DC

,

Dec. 5–8

,

IEEE

, pp.

2261

–

2269

.

Google Scholar

Crossref

34.

Jha

,

D.

,

Singh

,

S.

,

Al-Bahrani

,

R.

,

Liao

,

W.-K.

,

Choudhary

,

A.

,

De Graef

,

M.

, and

Agrawal

,

A.

,

2018

, “

Extracting Grain Orientations From Ebsd Patterns of Polycrystalline Materials Using Convolutional Neural Networks

,”

Microsc. Microanal.

,

24

(

5

), pp.

497

–

502

.

Google Scholar

Crossref

PubMed

35.

DeCost

,

B. L.

,

Lei

,

B.

,

Francis

,

T.

, and

Holm

,

E. A.

,

2019

, “

High Throughput Quantitative Metallography for Complex Microstructures Using Deep Learning: A Case Study in Ultrahigh Carbon Steel

,”

Microsc. Microanal.

,

25

(

1

), pp.

21

–

29

.

Google Scholar

Crossref

PubMed

36.

Yang

,

Z.

,

Li

,

X.

,

Catherine Brinson

,

L.

,

Choudhary

,

A. N.

,

Chen

,

W.

, and

Agrawal

,

A.

,

2018

, “

Microstructural Materials Design via Deep Adversarial Learning Methodology

,”

ASME J. Mech. Des.

,

140

(

11

), p. 111416.

37.

Cang

,

R.

,

Xu

,

Y.

,

Chen

,

S.

,

Liu

,

Y.

,

Jiao

,

Y.

, and

Yi Ren

,

M.

,

2017

, “

Microstructure Representation and Reconstruction of Heterogeneous Materials via Deep Belief Network for Computational Material Design

,”

ASME J. Mech. Des.

,

139

(

7

), p. 071404.

38.

Goodfellow

,

I.

,

2016

, “

Nips 2016 Tutorial: Generative Adversarial Networks

,” arXiv preprint arXiv:1701.00160.

39.

Nie

,

W.

, and

Patel

,

A. B.

,

2020

, “

Towards a Better Understanding and Regularization of GAN Training Dynamics

,”

Uncertainty in Artificial Intelligence

,

Tel Aviv, Israel

,

July 22–25, 2019

,

PMLR

, pp.

281

–

291

.

40.

Wang

,

L.

,

Chan

,

Y.-C.

,

Ahmed

,

F.

,

Liu

,

Z.

,

Zhu

,

P.

, and

Chen

,

W.

,

2020

, “

Deep Generative Modeling for Mechanistic-Based Learning and Design of Metamaterial Systems

,”

Comput. Methods Appl. Mech. Eng.

,

372

, p.

113377

.

Google Scholar

Crossref

41.

Tan

,

R. K.

,

Zhang

,

N. L.

, and

Ye

,

W.

,

2019

, “

A Deep Learning-Based Method for the Design of Microstructural Materials

,”

Struct. Multidiscipl. Optim.

,

61

(

4

), pp.

1

–

22

.

42.

Mao

,

Y.

,

He

,

Q.

, and

Zhao

,

X.

,

2020

, “

Designing Complex Architectured Materials With Generative Adversarial Networks

,”

Sci. Adv.

,

6

(

17

), p.

eaaz4169

.

Google Scholar

Crossref

PubMed

43.

Li

,

X.

,

Ning

,

S.

,

Liu

,

Z.

,

Yan

,

Z.

,

Luo

,

C.

, and

Zhuang

,

Z.

,

2020

, “

Designing Phononic Crystal With Anticipated Band Gap Through a Deep Learning Based Data-Driven Method

,”

Comput. Methods Appl. Mech. Eng.

,

361

, p.

112737

.

Google Scholar

Crossref

44.

Ma

,

W.

,

Cheng

,

F.

,

Xu

,

Y.

,

Wen

,

Q.

, and

Liu

,

Y.

,

2019

, “

Probabilistic Representation and Inverse Design of Metamaterials Based on a Deep Generative Model With Semi-Supervised Learning Strategy

,”

Adv. Mater.

,

31

(

35

), p.

1901111

.

Google Scholar

Crossref

45.

Liu

,

Z.

,

Raju

,

L.

,

Zhu

,

D.

, and

Cai

,

W.

,

2020

, “

A Hybrid Strategy for the Discovery and Design of Photonic Structures

,”

IEEE J. Emerg. Sel. Top. Circuits Syst.

,

10

(

1

), pp.

126

–

135

.

Google Scholar

Crossref

46.

Gurbuz

,

C.

,

Kronowetter

,

F.

,

Dietz

,

C.

,

Eser

,

M.

,

Schmid

,

J.

, and

Marburg

,

S.

,

2021

, “

Generative Adversarial Networks for the Design of Acoustic Metamaterials

,”

J. Acoust. Soc. Am.

,

149

(

2

), pp.

1162

–

1174

.

Google Scholar

Crossref

PubMed

47.

So

,

S.

, and

Rho

,

J.

,

2019

, “

Designing Nanophotonic Structures Using Conditional Deep Convolutional Generative Adversarial Networks

,”

Nanophotonics

,

8

(

7

), pp.

1255

–

1261

.

Google Scholar

Crossref

48.

Yeung

,

C.

,

Tsai

,

R.

,

Pham

,

B.

,

King

,

B.

,

Kawagoe

,

Y.

,

Ho

,

D.

,

Liang

,

J.

,

Knight

,

M. W.

, and

Raman

,

A. P.

,

2020

, “

Global Inverse Design Across Multiple Photonic Structure Classes Using Generative Deep Learning

,” arXiv preprint arXiv:2012.15790.

49.

Kim

,

Y.

,

Park

,

H. K.

,

Jung

,

J.

,

Asghari-Rad

,

P.

,

Lee

,

S.

,

Kim

,

J. Y.

,

Jung

,

H. G.

, and

Kim

,

H. S.

,

2021

, “

Exploration of Optimal Microstructure and Mechanical Properties in Continuous Microstructure Space Using a Variational Autoencoder

,”

Mater. Des.

,

202

, p. 109544.

50.

Jung

,

J.

,

Yoon

,

J. I.

,

Park

,

H. K.

,

Jo

,

H.

, and

Kim

,

H. S.

,

2020

, “

Microstructure Design Using Machine Learning Generated Low Dimensional and Continuous Design Space

,”

Materialia

,

11

, p.

100690

.

Google Scholar

Crossref

51.

Dilokthanakul

,

N.

,

Mediano

,

P. A.

,

Garnelo

,

M.

,

Lee

,

M. C.

,

Salimbeni

,

H.

,

Arulkumaran

,

K.

, and

Shanahan

,

M.

,

2016

, “

Deep Unsupervised Clustering With Gaussian Mixture Variational Autoencoders

,” arXiv preprint arXiv:1611.02648.

52.

Rasmussen

,

C. E.

,

2003

, “

Gaussian Processes in Machine Learning

,”

Summer School on Machine Learning

,

Springer

,

Berlin/Heidelberg, Germany

.

53.

Baum

,

E. B.

, and

Wilczek

,

F.

,

1988

, “

Supervised Learning of Probability Distributions by Neural Networks

,”

Neural Information Processing Systems

,

New York, NY

.

54.

Born

,

M.

, and

Wolf

,

E.

,

2013

,

Principles of Optics: Electromagnetic Theory of Propagation, Interference and Diffraction of Light

,

Elsevier

,

New York

.

55.

Foehr

,

A.

,

Bilal

,

O. R.

,

Huber

,

S. D.

, and

Daraio

,

C.

,

2018

, “

Spiral-Based Phononic Plates: From Wave Beaming to Topological Insulators

,”

Phys. Rev. Lett.

,

120

(

20

), p.

205501

.

Google Scholar

Crossref

PubMed

56.

Elford

,

D. P.

,

Chalmers

,

L.

,

Kusmartsev

,

F. V.

, and

Swallowe

,

G. M.

,

2011

, “

Matryoshka Locally Resonant Sonic Crystal

,”

J. Acoust. Soc. Am.

,

130

(

5

), pp.

2746

–

2755

.

Google Scholar

Crossref

PubMed

57.

Li

,

Y.

,

Baker

,

E.

,

Reissman

,

T.

,

Sun

,

C.

, and

Liu

,

W. K.

,

2017

, “

Design of Mechanical Metamaterials for Simultaneous Vibration Isolation and Energy Harvesting

,”

Appl. Phys. Lett.

,

111

(

25

), p.

251903

.

Google Scholar

Crossref

58.

Bertoldi

,

K.

, and

Boyce

,

M. C.

,

2008

, “

Wave Propagation and Instabilities in Monolithic and Periodically Structured Elastomeric Materials Undergoing Large Deformations

,”

Phys. Rev. B

,

78

(

18

), p.

184107

.

Google Scholar

Crossref

59.

Xia

,

Z.

,

Zhang

,

Y.

, and

Ellyin

,

F.

,

2003

, “

A Unified Periodical Boundary Conditions for Representative Volume Elements of Composites and Applications

,”

Int. J. Solids Struct.

,

40

(

8

), pp.

1907

–

1921

.

Google Scholar

Crossref

60.

Pahl

,

G.

, and

Beitz

,

W.

,

2013

,

Engineering Design: A Systematic Approach

,

Springer Science & Business Media

,

Berlin/Heidelberg, Germany

.

61.

Lookman

,

T.

,

Balachandran

,

P. V.

,

Xue

,

D.

, and

Yuan

,

R.

,

2019

, “

Active Learning in Materials Science With Emphasis on Adaptive Sampling Using Uncertainties for Targeted Design

,”

npj Comput. Mater.

,

5

(

1

), pp.

1

–

17

.

Google Scholar

Crossref

62.

Kim

,

C.

,

Chandrasekaran

,

A.

,

Jha

,

A.

, and

Ramprasad

,

R.

,

2019

, “

Active-Learning and Materials Design: The Example of High Glass Transition Temperature Polymers

,”

MRS Commun.

,

9

(

3

), pp.

860

–

866

.

Google Scholar

Crossref

63.

Kingma

,

D. P.

, and

Welling

,

M.

,

2013

, “

Auto-Encoding Variational Bayes

,” arXiv preprint arXiv:1312.6114.

64.

Higgins

,

I.

,

Matthey

,

L.

,

Pal

,

A.

,

Burgess

,

C.

,

Glorot

,

X.

,

Botvinick

,

M.

,

Mohamed

,

S.

, and

Lerchner

,

A.

,

2017

, “

Beta-Vae: Learning Basic Visual Concepts With a Constrained Variational Framework

,”

International Conference on Learning Representations (ICLR)

,

Toulon, France

,

Apr. 24–26

, Vol. 2, No. 5, p.

6

.

65.

De Maesschalck

,

R.

,

Jouan-Rimbaud

,

D.

, and

Massart

,

D. L.

,

2000

, “

The Mahalanobis Distance

,”

Chemom. Intell. Lab. Syst.

,

50

(

1

), pp.

1

–

18

.

Google Scholar

Crossref

66.

Vogiatzis

,

P.

,

Chen

,

S.

,

Wang

,

X.

,

Li

,

T.

, and

Wang

,

L.

,

2017

, “

Topology Optimization of Multi-Material Negative Poisson’s Ratio Metamaterials Using a Reconciled Level Set Method

,”

Comput.-Aided Des.

,

83

, pp.

15

–

32

.

Google Scholar

Crossref

67.

Chan

,

Y.-C.

,

Ahmed

,

F.

,

Wang

,

L.

, and

Chen

,

W.

,

2021

, “

METASET: Exploring Shape and Property Spaces for Data-Driven Metamaterials Design

,”

ASME J. Mech. Des.

,

143

(

3

), p.

031707

.

Google Scholar

Crossref

68.

Vogiatzis

,

P.

,

Chen

,

S.

,

Gu

,

X. D.

,

Chuang

,

C.-H.

,

Xu

,

H.

, and

Lei

,

N.

,

2018

, “

Multi-Material Topology Optimization of Structures Infilled With Conformal Metamaterials

,”

International Design Engineering Technical Conferences and Computers and Information in Engineering Conference

,

Quebec City, Quebec, Canada

,

Aug. 26–29

,

American Society of Mechanical Engineers

, Vol. 51760, p. V02BT03A009.

Google Scholar

Crossref

69.

Goodfellow

,

I.

,

Bengio

,

Y.

,

Courville

,

A.

, and

Bengio

,

Y.

,

2016

,

Deep Learning

, Vol.

1

,

MIT Press

,

Cambridge, MA

.

70.

Park

,

J.

, and

Choi

,

S.

,

2019

, “

Multi-Response Gaussian Process for Multidisciplinary Design Optimization

,”

AIAA Aviation 2019 Forum

,

Dallas, TX

,

June 17–21

, p.

2888

.

Google Scholar

Crossref

71.

Deb

,

K.

,

Pratap

,

A.

,

Agarwal

,

S.

, and

Meyarivan

,

T.

,

2002

, “

A Fast and Elitist Multiobjective Genetic Algorithm: NSGA-II

,”

IEEE Trans. Evol. Comput.

,

6

(

2

), pp.

182

–

197

.

Google Scholar

Crossref

72.

Diaz

,

A.

, and

Sigmund

,

O.

,

1995

, “

Checkerboard Patterns in Layout Optimization

,”

Struct. Optim.

,

10

(

1

), pp.

40

–

45

.

Google Scholar

Crossref

73.

Jog

,

C. S.

, and

Haber

,

R. B.

,

1996

, “

Stability of Finite Element Models for Distributed-Parameter Optimization and Topology Design

,”

Comput. Methods Appl. Mech. Eng.

,

130

(

3–4

), pp.

203

–

226

.

Google Scholar

Crossref

74.

Sigmund

,

O.

, and

Petersson

,

J.

,

1998

, “

Numerical Instabilities in Topology Optimization: A Survey on Procedures Dealing With Checkerboards, Mesh-Dependencies and Local Minima

,”

Struct. Optim.

,

16

(

1

), pp.

68

–

75

.

Google Scholar

Crossref

75.

Kulesza

,

A.

, and

Taskar

,

B.

,

2012

, “

Determinantal Point Processes for Machine Learning

,” arXiv preprint arXiv:1207.6083.

76.

Lee

,

C.-G.

,

Cho

,

D.-H.

, and

Jung

,

H.-K.

,

1999

, “

Niching Genetic Algorithm With Restricted Competition Selection for Multimodal Function Optimization

,”

IEEE Trans. Magn.

,

35

(

3

), pp.

1722

–

1725

.

Google Scholar

Crossref

Design of Phononic Bandgap Metamaterials Based on Gaussian Mixture Beta Variational Autoencoder and Iterative Model Updating

Abstract

1 Introduction

2 Technical Background: Phonoic Bandgap Simulations and the Convergence Study

3 Overview of the Deep Generative Model-Based Design Methodology

3.1 Single-Loop and Iterative GM-βVAE Model Updating-Based Design Framework.

3.2 Selection of Deep Generative Model: VAE, βVAE, or GM-βVAE.

3.2.1 Variational Autoencoder.

3.2.2 Beta Variational Autoencoder.

3.2.3 Gaussian Mixture Beta Variational Autoencoder.

3.2.4 Comparison of the Three Generative Models.

4 Design Cases

4.1 Unsupervised Generative Design.

4.1.1 Design Diversity.

4.1.2 Unsupervised Generative Design.

4.2 Single-Loop, Single-Objective Design Optimization.

4.3 Iterative GM-βVAE Model Updating-Based Multiobjective Design Optimization: Maximizing the Total Bandgap Width and the Longitudinal Stiffness.

5 Discussion and Conclusion

Acknowledgment

Conflict of Interest

Data Availability Statement

Appendix A: Selection of Modes Number in GM-βVAE.

Appendix B: Correlation Between GP Prediction and FEM Simulation

References

Get Email Alerts

Cited By

ASME Journals

ASME Conference Proceedings

ASME eBooks

Resources

Opportunities

Design of Phononic Bandgap Metamaterials Based on Gaussian Mixture Beta Variational Autoencoder and Iterative Model Updating

Abstract

1 Introduction

2 Technical Background: Phonoic Bandgap Simulations and the Convergence Study

3 Overview of the Deep Generative Model-Based Design Methodology

3.1 Single-Loop and Iterative GM-βVAE Model Updating-Based Design Framework.

3.2 Selection of Deep Generative Model: VAE, βVAE, or GM-βVAE.

3.2.1 Variational Autoencoder.

3.2.2 Beta Variational Autoencoder.

3.2.3 Gaussian Mixture Beta Variational Autoencoder.

3.2.4 Comparison of the Three Generative Models.

4 Design Cases

4.1 Unsupervised Generative Design.

4.1.1 Design Diversity.

4.1.2 Unsupervised Generative Design.

4.2 Single-Loop, Single-Objective Design Optimization.

4.3 Iterative GM-βVAE Model Updating-Based Multiobjective Design Optimization: Maximizing the Total Bandgap Width and the Longitudinal Stiffness.

5 Discussion and Conclusion

Acknowledgment

Conflict of Interest

Data Availability Statement

Appendix A: Selection of Modes Number in GM-βVAE.

Appendix B: Correlation Between GP Prediction and FEM Simulation

References

Get Email Alerts

Cited By

Related Articles

Related Proceedings Papers

Related Chapters

ASME Journals

ASME Conference Proceedings

ASME eBooks

Resources

Opportunities

This Feature Is Available To Subscribers Only