A Generative Deep Learning Approach for Shape Recognition of Arbitrary Objects from Phaseless Acoustic Scattering Data
Item Type Article
Authors Ahmed, Waqas Waseem;Farhat, Mohamed;Chen, Pai-Yen;Zhang, Xiangliang;Wu, Ying
Citation Ahmed, W. W., Farhat, M., Chen, P.-Y., Zhang, X., & Wu, Y. (2023).
A Generative Deep Learning Approach for Shape Recognition of Arbitrary Objects from Phaseless Acoustic Scattering Data.
Advanced Intelligent Systems, 2200260. Portico. https://
doi.org/10.1002/aisy.202200260 Eprint version Publisher's Version/PDF
DOI 10.1002/aisy.202200260
Publisher Wiley
Journal Advanced Intelligent Systems
Rights Archived with thanks to Advanced Intelligent Systems under a Creative Commons license, details at: http://
creativecommons.org/licenses/by/4.0/
Download date 2024-01-01 17:19:16
Item License http://creativecommons.org/licenses/by/4.0/
Link to Item http://hdl.handle.net/10754/679785
Supplementary Information
A Generative deep learning approach for shape recognition of arbitrary objects from phaseless acoustic scattering data
Waqas W. Ahmed1, Mohamed Farhat1, Pai-Yen Chen2, Xiangliang Zhang1,3β , and Ying Wu1,4*
1Division of Computer, Electrical and Mathematical Sciences and Engineering, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
2Department of Electrical and Computer Engineering, University of Illinois at Chicago, Chicago, IL60607, United States of America
3Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556, United States of America
4Division of Physical Sciences and Engineering, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
Emails: *[email protected], β [email protected]
This supplementary information contains details on the analytic formula of acoustic wave scattering by solid objects derived from Mie scattering, which is used to validate the numerical simulations performed using COMSOL Multiphysics to generate the scattering data(Section 1)οΌ the definition of the structure similarity index measure (SSIM) (Section 2); the analysis of the effect of number of frequencies on the object detection (Section 3); the effect of a half-plane far- field data (Section 4); and more examples for reconstruction of arbitrary objects (Section 5).
1. Validation of numerical simulation of acoustic scattering by solid scatterers
Let us consider the elastic cylinder shown in Fig. S1, with radius π, density π1, and speeds of sound π1 and π2 , corresponding to pressure (longitudinal) and shear (transverse) waves, respectively.
Fig. S1. Scheme and geometry of the acoustic wave scattering from an infinitely extended (in the π§ βdirection) elastic cylinder.
1.1 General governing equations
The general (linear) equation governing the propagation of elastic waves in solids (taking into account shear waves) is given by
(π + 2π)β(π) β πβ Γ (2π) = π1π2π
ππ‘2, (1.1) with π = β β π and 2π = β Γ π. From Eq. (1.1) we can derive the equations for π and 2π as
βπ = π1 π2π
, (1.2)
and
β(2π) =π1 π
π2(2π)
ππ‘2 , (1.3)
where the velocities can be defined as
π1 = βπ + 2π
π1 ; π2 = βπ
π1, (1.4)
where π and π are the LamΓ© parameters.
In order to solve Eq. (1.1) we need to rewrite the displacement vector as
π = ββπ + β Γ π¨, (1.5)
to obtain
βπ = 1 π12
π2π
ππ‘2 and βπ¨ = 1 π22
π2π¨
ππ‘2 (1.6)
To solve Eq. (1.6) in the case of cylindrical objects (Fig. S1) we can use the following ansatz
π = β πππ½π(π1π) cos ππ
β
π=0
, (1.7)
π΄π§ = β πππ½π(π2π) sin ππ
β
π=0
, (1.8)
with the wavenumbers ππ = π/ππ. Equation (1.8) is shown to be valid as π¨ can be shown to have components in the π§-direction because of symmetry considerations. Also, only sinππ terms appear in Eq. (1.8) as the potential vector has to be anti-symmetrical about the axis π = 0, to ensure that the resulted displacement is symmetrical around π = 0.
Therefore, by combining Eq. (1.5) and Eqs. (1.7)-(1.8), we can get
π’π = β {πππ
π π½π(π2π) β ππ π
πππ½π(π1π)} cos ππ
β
π=0
, (1.9)
π’π = β {πππ
π π½π(π1π) β ππ π
πππ½π(π2π)} sin ππ
β
π=0
. (1.10)
On the other hand, the incident plane wave can be represented by ππ = π0πβππ3π₯ = π0πβππ3πcosπ = π0β ππ(βπ)ππ½π(π3π) cos ππ
β
π=0
. (1.11)
where π0 = 1 and ππβ₯1= 2. The radial component of the displacement of this incident plane wave is thus (from now on we assume, without loss of generality, that π0 = 1 to simplify the expressions)
π’π,π= 1 π3π2
πππ
ππ = 1
π3π2β ππ(βπ)π π
πππ½π(π3π) cos ππ
β
π=0
, (1.12)
and the scattered wave is
ππ = β πππ»π(1)(π3π) cos ππ
β
π=0
, (1.13)
and its associated radial component of displacement is
π’π ,π = 1
π3π2β ππ π
πππ»π(1)(π3π) cos ππ
β
π=0
, (1.14)
where ππ are the scattering coefficients, to be evaluated via the boundary conditions of the problem, along with the coefficients ππ and ππ.
1.2 Boundary conditions and numerical simulations
In the case of an elastic cylinder in an acoustical media (such as water) the boundary conditions that shall be employed on the cylinderβs boundaries are: (i) The pressure in the fluid must be equal to the normal component of stress at the solid interface; (ii) the radial component of displacement of the fluid has to be equal to the normal component of displacement of the solid interface; and (iii) the tangential components of shear stress must vanish at the interface of the solid. This means specifically
ππ+ ππ = β[ππ] at π = π, (1.15)
π’π,π + π’π ,π = π’π at π = π, (1.16)
[ππ] = [ππ§] = 0 at π = π, (1.17)
where the stress components in cylindrical coordinates are [ππ] = ππ + 2πππ’π
ππ = 2π1π22{(ππ β 2π)π +ππ’π
ππ}, (1.18)
[ππ] = π {1 π
ππ’π
ππ + π π
ππ(π’π
π )}, (1.19)
[ππ§] = π {ππ’π
ππ§ +ππ’π§
ππ}. (1.20)
By applying the boundary conditions at the interface between the elastic cylinder and the surrounding fluid, we get the linear system
(
βπ₯1π½πβ²(π₯1) ππ½π(π₯2) βπ₯3
π3π2π»π(1)β²(π₯3) π₯12[β2ππ½πβ²β²(π₯1) + ππ½π(π₯1)] 2ππ[π₯2π½πβ²(π₯2) β π½π(π₯2)] π2π»π(1)(π₯3)
2π[π₯1π½πβ²(π₯1) β π½π(π₯1)] βπ2π½π(π₯2) + π₯2π½πβ²(π₯2) β π₯22π½πβ²β²(π₯2) 0
) ( ππ ππ ππ
) =
( π₯3
π3π2π½πβ²(π₯3)
βπ2π½π(π₯3) 0
). (1.21)
where π₯π = πππ.
Fig. S2. Scattering cross-section (SCS) from an elastic cylinder made of (a) steel and (b) Aluminum embedded inside water. The dashed (red) curve gives the COMSOL simulations whereas the green solid curves give the Mie results discussed in this section.
To obtain ππ we make use of the Cramerβs rule, i.e., ππ = det ππ
det ππ, where ππ is the matrix appearing in the LHS of Eq. (1.21) and ππ is the matrix obtained from ππ by replacing its last column by the RHS of Eq. (1.21).
In order to verify the validity of the COMSOL Multiphysics elastic/acoustic modeling of scattering, we compute and give in Fig. S2 the SCS of a cylinder by using both COMSOL (red dashed lines) and the Mie formalism discussed in the previous paragraphs (green solid lines).
These results match perfectly for both a steel cylinder of radius 20 cm [Fig. S2(a)] in water and an
Aluminum cylinder [Fig. S2(b)] in water, validating thus the numerical modeling by COMSOL, that we use throughout this paper.
2. Structure Similarity Index Measure (SSIM)
SSIM is a metric that quantify the similarity between two given images by extracting key characteristic from an image named Luminance, Contrast, and Structure. Luminance is measured by averaging over all the pixel value denoted by π reads as: ππ₯= 1
πβππ=0π₯π. Contrast is measured by taking the standard deviation of all the pixel values denoted by Ο reads as: ππ₯ =
β 1
πβ1βππ=0(π₯π β ππ₯)2 and structure comparison is computed by (π₯ β ππ₯) πβ π₯. The luminance comparison function π(π₯, π¦) is a function of ππ₯ and ππ¦ defined as:
π(π₯, π¦) = 2ππ₯ππ¦+ π1 ππ₯2 + ππ¦2 + π1
The contrast comparison function π(π₯, π¦) is a function of ππ₯ and ππ₯ defined as:
π(π₯, π¦) = 2ππ₯ππ¦+ π2 ππ₯2+ ππ¦2+ π2
Structure comparison function π (π₯, π¦)is defined as:
π (π₯, π¦) =2ππ₯π¦+ π3 ππ₯ππ¦+ π3
where ππ₯π¦= β 1
πβ1βππ=0(π₯π β ππ₯)2(π¦πβ ππ¦)2, and π1, π2, π3 are constants to ensure stability when the denominator becomes 0.
And finally, luminance, contrast, and structure comparison functions are combined to determine the similarity index value given by,
SSIM(π₯, π¦) = [π(π₯, π¦)]πΌ[π(π₯, π¦)]π½[π (π₯, π¦)]πΎ
where πΌ > 0, π½ > 0, πΎ > 0 denote the relative strength of each of the metrics. To simplify the expression, we assume, πΌ = π½ = πΎ = 1 and π3 = π2/2 that results in:
SSIM(π₯, π¦) = (2ππ₯ππ¦+ π1)(2ππ₯π¦+ π2) (ππ₯2+ ππ¦2+ π2)(ππ₯2+ ππ¦2+ π2)
We used SSIM to determine the quality of the reconstructed and generated objects in the main text.
3. Object detection from Single to multifrequency far-field radiation data
In the main text, we presented the results for five frequencies radiation data to determine the shape of object accurately. By including multifrequency data in the training process, we demonstrate how the results are significantly improved by providing training and prediction of neural networks with different frequencies.
3.1 Object detection from Single frequency far-field radiation profile
We start our analysis with a single frequency (π1= 1 kHz) radiation data and designed the neural network to solve the forward and inverse problem. The five-layer forward neural network is designed containing 4096 β 500 β 500 β 500β 400 β 400 β87 nodes. The LeakyReLu function is used for activation, and Adam optimization is used with a learning rate 10β4. The training and prediction results for the designed forward network are shown in Figs. S3(a) and S3(b), respectively. The relative prediction error is computed over the testing data with mean error 0.0172 indicated by vertical red dashed line. Fig. S3(c) illustrates the comparison of computed
radiation pattens for two given arbitary objects where the COMSOL results (solid blue curve) are in perfect agreement with ML predictions (red dashed curve).
Fig. S3. Training and prediction performance of forward designed neural network for single frequency far- field radiation data. (a) Learning curve of optimized network. (b) Histogram of spectral prediction error.
(c) Representative examples for far-field prediction from given arbitary objects. The solid blue and dotted red line represent the far-field response calculated from COMSOL and ML method, respectively.
Next, we proceed to solve the inverse problem of arbitary shape detection with single frequency far-field data. The inverse designed network contains 87β 800 β 500 β 500β 500 β 400 β100 nodes. The training and prediction results are presented in Fig.S4. The relative spectral error, binary cross-entropy error (BCE) and SSIM over generated objects for the testing data are computed with mean values of 0.007, 0.271, and 0.70 respectively. Although the spectral error is
very low but large BCN error in generated geometries as compared to target objects is expected due to presence of non-unique solution space, as depicted in Fig. S4(e).
Fig. S4. Training and prediction performance of inverse designed neural network for single frequency far- field radiation data. a Learning curve of optimized network. (b) Histogram of spectral prediction error. (c) binary cross-entropy error for ML generated shapes. (d) distribution of SSIM for the ML generated geometries. (e) Representative examples for arbitrary object prediction from given one-frequency far-field data. The predicted far-field response (dashed red) from the generated objects exactly matches with (solid blue) but the generated shapes show variations from the target objects due to non-unique solution space in inverse process.
3.2 Object detection from far-field radiation profile at two different frequencies
Next, we train the network with far-field radiation data at two different frequencies π1 = 1 kHz and π2 = 1.5 kHz. The forward and inverse architectures consist of 4096 β 500 β 500 β 500β 400
β 174 and 174β 800 β 800 β 500 β 500β 500 β 400 β100 nodes, respectively. The training and prediction results for forward and inverse network are shown in Figs. S5 and S6, respectively. On Fig.S5(b), the red line indicates the mean relative spectral error for trained forward network as 0.0227. In inverse network, the relative spectral error, binary cross-entropy error (BCE) and SSIM over generated objects for the testing data are computed with mean values of 0.0135, 0.131, and 0.75 respectively, as depicted in Fig.S6.
Fig. S5. Training and prediction performance of forward designed neural network for far-field radiation data at two different frequencies. (a) Learning curve of optimized network. (b) Histogram of spectral prediction error. (c) Representative examples for far-field prediction from given arbitrary objects. The solid blue and dotted red line represent the far-field response calculated from COMSOL and ML method, respectively.
Fig. S6 Training and prediction performance of inverse designed neural network for two-frequency far- field radiation data. (a) Learning curve of optimized network. (b) Histogram of spectral prediction error.
(c) binary cross-entropy error for ML generated shapes. (d) distribution of SSIM for the ML generated geometries. (e) Representative examples of arbitrary object prediction from given two-frequency far-field data. The predicted far-field response (dashed red) from the generated objects exactly matches with (solid blue) but the generated shapes show variations from the target objects due to non-unique solution space in inverse process.
3.3 Object detection from far-field radiation profile at three different frequencies
Subsequently, we consider three different frequencies π1 = 1 kHz π2 = 1.5 kHz, and π3= 2 kHz. The results for the trained forward and inverse networks are shown in Figs. S7 and S8, respectively.
The designed forward and inverse architectures consist of 4096 β 500 β 500 β 500β 400 β 400
β261 and 174β 800 β 800 β 500 β 500β 500 β 400 β100 nodes, respectively. The red dashed line in Fig. S7(b) indicates the mean relative spectral error for the trained forward network as 0.027.
For the inverse case, relative spectral error, binary cross-entropy error (BCE), and SSIM over generated objects are computed with mean values of 0.0196, 0.0986, and 0.79, respectively [see Fig.S8].
Fig. S7. Training and prediction performance of forward designed neural network for far-field radiation data at three different frequencies. (a) Learning curve of optimized network. (b) Histogram of spectral prediction error. (c) Representative examples for far-field prediction from given arbitrary objects. The solid blue and dotted red line represent the far-field response calculated from COMSOL and ML method, respectively.
Fig. S8 Training and prediction performance of inverse designed neural network for far-field radiation data at three different frequencies. (a) Learning curve of optimized network. (b) Histogram of spectral prediction error. (c) binary cross-entropy error for ML generated shapes. (d) Distribution of SSIM for the ML generated geometries (e) Representative example arbitrary object prediction from given three-frequency far-field data. The predicted far-field response (dashed red) from the generated objects exactly matches with (solid blue) but the generated shapes show variations from the target objects due to non-unique solution space in inverse process. The results of arbitrary shape detection are significantly improved due to addition of extra information in the training process as compared to Figs.S5 and S6.
3.4 Object detection from far-field radiation profile at four different frequencies
Lastly, we train the network with far-field radiation data at four different frequencies π1= 1 kHz π2 = 1.5 kHz, π3= 2 kHz and π4= 2.5 kHz and results for trained forward and inverse networks are presented in Figs. S9 and S10, respectively. The designed forward and inverse architectures consist of β4096 β 800 β 800 β 800β 800 β 600 β600β348 and 348β 800 β 800 β 500 β 500β 500 β 400
β100 nodes, respectively. Fig. S(9) illustrates the distribution of relative spectral error over testing data with mean value 0.0313. Fig. S10 shows the inverse network's relative spectral error, binary cross-entropy error (BCE) and SSIM over the generated objects with mean values of 0.025, 0.0916, and 0.83 respectively.
Fig. S9. Training and prediction performance of forward designed neural network for far-field radiation data at four different frequencies. (a) Learning curve of optimized network. (b) Histogram of spectral
blue and dotted red line represent the far-field response calculated from COMSOL and ML method, respectively.
Fig. S10. Training and prediction performance of inverse designed neural network for far-field radiation data at four different frequencies. (a) Learning curve of optimized network. (b) Histogram of spectral prediction error. (c) binary cross-entropy error for ML generated shapes. d distribution of SSIM for the ML generated geometries.
4. Object detection from multifrequency half-plane far-field data
Here, we present the training behavior of designed neural network to predict the shape of arbitary object from multifrequency half-plane far-field data which is discussed in the main text. The designed forward and inverse architectures consist of 4096 β 1000 β 1000 β 800β 800 β 600 β600
β328 and 328β 800 β 800 β 500 β 500β 400 β100 nodes, respectively. In addition, some more examples are provided for arbitary shape detection from the given far-field patterns.
Fig. S11. Training performance of designed neural network for half- plane multifrequency far-field radiation data. (a) Learning curve of forward optimized network. (b) Learning curve of inverse optimized network. (c-f) target object and generated object from the inverse designed network.
5. Effect of sampling and angular interval on the detection efficiency
In order to analyze the effect of sampling rate on the recognition process, we perform the following two sets of studies. In the first set, we keep the angular range unchanged but reduce the number of
range. For the first three cases (87,29, and 9) the structure similarity index measure (SSIM) remains almost unchanged as shown in Fig. S12(a). The binary cross entropy (BCE) displays similar behavior (Fig. S12(b)). This shows that the recognition mechanism works well for certain number of reduced sampling points. In order to further showcase this point, we show in Figs. S12(c)-(d) the shape recognition mechanism for two random shapes, with both 29 and 9 sampling points where the generated shape reconstructs well the original one. Only when the sampling number of points is reduced to 3, deterioration of the shape is observed in the rightmost panel of Figs. S12(c) and (d). The SSIM and the BCE depart quite largely from the other scenarios.
Fig. S12. (a) structure similarity index measure and (b) binary cross entropy, for different sampling points.
(c)-(d) show the original and the generated shape for 29, 9, and 3 sampling points for two objects.
In the second set, we reduce both the sampling number of points as well as the angular range.
Specifically, we consider 360 degrees, 180 degrees, as well as 90 degrees, 45 and 25 degrees, with 87, 65, 33, 16 and 9 sampling points, respectively. The results over the testing data are shown in Fig. S13. As we reduce the angular range with sampling points, the efficacy of the model to determine the unique shape decreases due to less spectral information.
Fig. S13. (a) Structure similarity index measure and (b) binary cross entropy, for different angular intervals.
(c)-(d) Original and generated shape for 90o (33 points), 45o (16 points), and 25o (9 points) angular interval, for two different objects.
To conclude, fewer sampling points would not significantly affect the recognition performance as long as the number of sampling points and the angular coverage are above certain threshold.
6. More examples for reconstruction of arbitrary Object from AAE
Here, we present some more examples to reconstruct the arbitrary object from the designed AAE in the main text.
Fig. S14. Examples of object reconstruction from trained adversarial autoencoder discussed in main text.