• Tidak ada hasil yang ditemukan

Sound Propagation in the Concatenated Tube Model

Dalam dokumen Discrete-Time Speech Signal Processing (Halaman 158-164)

EXERCISES

4.4 A Discrete-Time Model Based on Tube Concatenation

4.4.1 Sound Propagation in the Concatenated Tube Model

Consider the concatenated tube model in Figure 4.14 in which thekth tube has cross-sectional areaAk and length lk. Because each tube is lossless and because we assume planar wave propagation, the pressure and volume velocity relations in each tube satisfy the wave equation, Equation (4.7), each quantity having a forward- and backward-traveling wave component as given in Equation (4.8). The volume velocity and presure solution for thekth tube is therefore written as

uk(x, t ) = u+k

tx c

uk

t + x c

pk(x, t ) = ρc Ak

u+k

tx

c

+ uk

t + x c

(4.27) where the kth tube has length lk, the spatial variable x falling in the range 0 ≤ xlk, as illustrated in Figure 4.14. To give a specific solution requires boundary conditions that are provided by the pressure and volume velocity at two adjacent tubes. In particular, we use the physical principle that pressure and volume velocity must be continuous both in time and in space everywhere in the system [28]. Therefore, at thekth/(k+1)st junction we have

uk(lk, t ) = uk+1(0, t ) pk(lk, t ) = pk+1(0, t ).

Applying the two boundary conditions with Equation (4.27), we then have u+k

tlk

c

uk

t + lk c

= u+k+1(t )uk+1(t ) Ak+1

Ak

u+k

tlk c

+uk

t + lk

c

= u+k+1(t )+ uk+1(t ). (4.28) Now defineτk = lck, which is the time of propagation down the length of the tube. Then the first equation in the Equation (4.28) pair can be written as

u+k(tτk) = uk(t +τk)+u+k+1(t )uk+1(t )

and substituting this expression foru+k(tτk)into the second equation of our pair, we obtain, after some algebra,

u+k+1(t ) =

2Ak+1 Ak+1 +Ak

u+k(tτk)+

Ak+1Ak Ak+1 + Ak

uk+1(t ). (4.29) Then subtracting the top from the bottom component of the above modified equation pair, we obtain, after some rearranging,

uk(t + τk) = −

Ak+1Ak Ak+1 + Ak

u+k(tτk)+

2Ak

Ak+1 +Ak

uk+1(t ). (4.30) Equations (4.29) and (4.30) illustrate the general rule that at a discontinuity alongxin the area functionA(x, t )there occurpropagation and reflection of the traveling wave, and so in each uniform tube, part of a traveling wave propagates to the next tube and part is reflected back (Figure 4.15). This is analogous to propagation and reflection due to a change in the impedance along an electrical transmission line.

From Equation (4.29), we can therefore interpretu+k+1(t ), which is the forward-traveling wave in the(k+1)st tube atx=0, as having two components:

1. A portion of the forward-traveling wave from the previous tube,u+k(tτk), propagates across the boundary.

2. A portion of the backward-traveling wave within the(k+1)st tube,uk+1(t ), gets reflected back.

Observe thatu+k(tτk)occurs at the(k+1)st junction and equals the forward waveu+k(t ), which appeared at thekth junctionτk seconds earlier. This interpretation is applied generally to the forward/backward waves at each tube junction. Likewise, from Equation (4.30), we can interpretuk(t+τk), which is the backward-traveling wave in thekth tube atx=lk, as having two components:

1. A portion of the forward-traveling wave in thekth tube,u+k(tτk), gets reflected back.

2. A portion of the backward-traveling wave within the(k+1)st tube,uk+1(t ), propagates across the boundary.

Figure 4.15 further illustrates the forward- and backward-traveling wave components at the tube boundaries.

4.4 A Discrete-Time Model Based on Tube Concatenation 139

The above interpretations allow us to see how the forward- and backward-traveling waves propagate across a junction. As a consequence, we can think ofAAk+1Ak

k+1+Akas areflection coefficient at the kth junction, i.e., the boundary between the kth and(k +1)st tube, that we denote byrk:

rk = Ak+1Ak

Ak+1 +Ak (4.31)

which is the amount of uk+

1(t ) [oru+k(tτk)] that is reflected at thekth junction. It is straightforward to show that because Ak > 0, it follows that −1 ≤ rk ≤ 1. From our definition Equation (4.31), we can write Equations (4.29) and (4.30) as

u+k+1(t ) = (1+ rk)u+k(tτk)+rkuk+1(t )

uk(t +τk) = −rku+k(tτk)+(1− rk)uk+1(t ) (4.32) which can be envisioned by way of a signal flow graph illustrated in Figure 4.16a. We will see shortly that these equationsat the junctionslead to the volume velocity relations for multiple concatenated tubes. Before doing so, however, we consider the boundary conditions at the lips and at the glottis.

k + 1 k – 1

k uk+(t)

+ +

+

uk+(t –τk) uk+1+ (t) uk+1+ (t –τk+1) uk+1 (t + τk+1) uk+1 (t)

uk(t + τk) uk(t)

lk x 0

uk(t) = uk(t – 0/c)

uk(t) = uk(t + 0/c) uk(0, t) = uk(t) – uk(t)

+ +

+

uk(t –τk) = uk(t – lk/c)

uk(t + τk) = uk(t + lk/c) uk(lk, t) = uk(t –τk) – uk(t +τk) Figure 4.15 Sound waves in the concatenated tube model consist of forward- and backward-going traveling waves that arise from reflection and transmission at a tube junction.

uk(t)

uN(t) uN(t –τN)

uN(t + τN)

uN(lN, t) (1 + rL)

uk(t –τk) uk+1(t)

uk+1(t –τk+1)

uk+1(t + τk+1) uk+1(t)

uk(t +τk)

+

+

ug(t) +

1 + rg

2 rg

u1(t)

u1(t)

u1(t + τ1) u1(t –τ1)

+

uN(t) uN(lN, t)

ug(t) u1(t)

u1(t)

uN(t –τN)

uN(t + τN)

+ +

+

uk(t)

τk

τk

τk+1

τk+1

τN

τN

–rL

kth Tube (k + 1)st Tube

(a)

(b)

(c) (1 – rk)

τ1

τ1

Tube N

Tube 1

+

+

+

(1 + rk)

–rk +rk

Figure 4.16 Signal flow graphs of (a) two concatenated tubes; (b) lip boundary condition; (c) glottal boundary condition. An open circle denotes addition.

SOURCE: L.R. Rabiner and R.W. Schafer,Digital Processing of Speech Signals[28]. ©1978, Pearson Education, Inc. Used by permission.

Suppose there are a total ofN tubes. Then the boundary conditions at the lips relate pressure pN(lN, t ) and the volume velocity uN(lN, t ) at the output of the Nth tube [28].

From continuity pN(lN, t ) = pL(t ), which is the pressure at the lips output. Likewise, uN(lN, t ) = uL(t ), which is the volume velocity at the output of the lips. Suppose now that the radiation impedance is real, corresponding to a very large frequencyand thus, from Equation (4.24), a purely resistive load, i.e., Zr() = Rr. Then, from Figure 4.11, we can write the pressure/volume velocity relation at the lips as

pN(lN, t ) = ZruN(lN, t ), (4.33) whereZr is not a function of. From our earlier expressions for pressure and volume velocity within a tube, we then have

ρc

AN[u+N(tτN)+uk(t +τN)] = Zr[u+N(tτN)uN(t +τN)].

4.4 A Discrete-Time Model Based on Tube Concatenation 141

This expression can be written as

ρc

AN +Zr

uN(t + τN) = −

ρc

ANZr

u+N(tτN) and thus

uN(t +τN) = − ρc

ANZr ρc

AN + Zr

u+N(tτN)

= −rLu+N(tτN) (4.34)

where the reflection coefficient at the lips is defined as rL =

ρc ANZr

ρc AN + Zr.

Equation (4.34) states that the forward-going wave is reflected back at the lips boundary with reflection coefficientrL. There is no backward-going wave contribution from free space to the right. The volume velocity at the lips output can then be expressed as

uN(lN, t ) = u+N(tτN)uk(t + τN)

= (1 +rL)u+N(tτN)

which is shown schematically in Figure 4.16b. A complex radiation impedanceZr()can be invoked by using a frequency-domain version of the previous derivation or by using a differential equation, the inductance resulting in a first-order ordinary differential equation.

We can also model the radiation from the lips with an additional tube of cross-section AN+1 and infinite length. The tube has no backward-propagating wave because of its infinite length, which is consistent with sound radiating out from the lips. In addition, if we selectAN+1 such thatZr = AρcN+1, then the expression forrLbecomes

rL =

ρc

ANAρcN+1

ρc

AN + AρcN+1

= AN+1AN

AN+1 +AN (4.35)

which is our original definition of reflection coefficient rN for two adjacent tubes of cross- sectional areaAN+1 andAN. Observe that whenZr()=0 thenrL=1, the pressure drop is zero, and the imaginary added tube becomes infinite in cross-section becauseAN must be finite.

The boundary condition model for the glottis is shown in Figure 4.16c where the glottal impedance is assumed approximated as a series resistance and inductance. We can think of the glottal slit, anatomically, as sitting just below (schematically, to the left of) the first tube of the

concatenated tube model, and so we denote the pressure at the output of the glottis byp1(0, t ), i.e., the pressure at the left edge of the first tube, and the volume velocity output of the glottis byu1(0, t ), i.e., the volume velocity at the left edge of the first tube. Suppose that the glottal impedance is real (purely resistive), i.e.,Zg =Rg, then using Kirchoff’s current law, we can write in the time domain [28]

u1(0, t ) = ug(t )p1(0, t ) Zg .

Then from our earlier expressions for volume velocity and pressure within a uniform tube [Equation (4.27)], we have

u+1(t )u1(t ) = ug(t )ρc A1

u+1(t ) +u1(t ) Zg

. Solving for the forward-going traveling wave

u+1(t ) = (1+ rg)

2 ug(t ) +rgu1(t ) (4.36) where

rg = ZgρcA1 Zg + ρcA1.

IfZg()is complex, then, as with a complexZr(), a differential equation realization of the glottal boundary condition is required. A flow diagram of the flow relations in Equation (4.36) is illustrated in Figure 4.16c. As with the radiation impedance, we can also model the effect of glottal impedance with an additional tube of cross-sectionA0 and infinite length. (We are looking down into the lungs.) If we makeA0 such thatZg = ρcA0, then our expression forrg becomes

rg =

ρc A0Aρc1

ρc A1 + Aρc0

= A1A0 A1 +A0.

Consider now the special case of two concatenated lossless tubes of equal length, with radiation and glottal boundary conditions, depicted in the flow diagram of Figure 4.18a. In Exercise 4.9, using this flow diagram, you are asked to show that the transfer function relating the volume velocity at the lips to the glottis is given by

Va(s) = bes2τ

1+a1es2τ +a2es4τ (4.37) whereb=(1+rg)(1+rL)(1+r1)/2,a1=r1rg+r1rL, anda2 =rLrg. In this model, as well as more general models with an arbitrary number of lossless tubes, all loss in the system

4.4 A Discrete-Time Model Based on Tube Concatenation 143

occurs at the two boundary conditions. Nevertheless, it is possible to select loss at the glottis and lips to give observed formant bandwidths. Furthermore, with this configuration Fant [7]

and Flanagan [8] have shown that with appropriate choice of section lengths and cross-sectional areas, realistic formant frequencies can be obtained for vowels. As we noted earlier, however, the resulting model is not necessarily consistent with the underlying physics. In the following section, we describe a means of converting the analog concatenated tube model to discrete time.

In this discrete-time realization, as in the analog case, all loss, and hence control of formant bandwidths, is introduced at the two boundaries.

Dalam dokumen Discrete-Time Speech Signal Processing (Halaman 158-164)