• Tidak ada hasil yang ditemukan

Acoustical and clinical assessment of voice quality with laryngeal pathology

N/A
N/A
Protected

Academic year: 2017

Membagikan "Acoustical and clinical assessment of voice quality with laryngeal pathology"

Copied!
8
0
0

Teks penuh

(1)

90

Abdoerrachnan Med J Indones

Acoustical

and

Clinical

Pathology

Hartono Abdoerrachman

Assessment

of

Voice

Quality

with Laryngeal

Abstrak

Penelitian nengenai lælcacauan gelontbang suara telah banyak dilakulan oleh para ahli dan telah dilaporkan sebelunt ini, tetapi nasalah ini nasih banyak nenarik ninat para klinisi untuk ntenelaah lebih laniut. Para klinisi sangat nengharapkan nene,nukan suatu teknik yang praktis, otonatis dan non-invasif untuk penggunaan nedis. Dalan penelitian nrengenai lækacauan suara ini, dicakup 672 data suara selana 3 tahun,

dari Januari

1990 santpai Desenber 1992, yang terdiri dari berbagai kasus dengan penyakit yang nenyebabknn perubahan suara Hasil penelitian ini nrenunjukkan bahwa beberapa tujuan penggunaan analisis suara

ini

tercapai, nteskipun target utan,a nasih jauh.

Abstract

Acoustical waveform studies have been conducted since a long tiue, and resulb have been reported by nany experts, but it is still an interesting ,,ratrcr

for

clinicians to erplore. Clinicians are seeking for a practical auto,ttatic and non-invasive technique for clinical

purposes- Our study on voice perturbation covered a

j

years period data, fron January l99O until Deceutber 1992, includitrg 672 voice data consisting ofvarious diseases, which cause vocal changes. The results revealed that sone aims of voice analysis were achieved, although the nain target is still a linle bitfar.

Keywords: voice analysis, acoustical ossessntent, acoustical wavefornt perturbation, pathological voice analysis.

INTRODUCTION

Besides the eye and the ear, speech is the effective and direct way for verbal communication, and speech dis-order

will

have negative effect to the social

life and

future career ofa person. There are many causes result-ing

in

phonation disturbances, organic

or

functional disorders, affecting one or more organs involved in

voice production,

such as

the

respiratory organ, phonatory and resonance organs, for which it is some-times

difficult to establish

the etiological diagnosis. The importance of establishing the exact diagnosis is to understand the problem, so that the proper treatment could be managed, Examination

of

the larynx by in-direct or in-direct laryngoscopy is one way to visualize the position and movement

of

the vocal cords. This method is somewhat invasive, traumatic, and not al-ways easy to perform, especially on sensitive and non-cooperative patients.

There has been a growing interest in the objective documentation and assessment of voice quality. This

interest arose because of important advances in

tech-niques and analyses

that

can be .applied

to

vocal

physiology

in

a scientific manner.' Clinicians have recently explored the usefulness

of

various acoustic factors as well as glottographics reflections

of vocal

vibration. There is a great need to develop methods for quantification

of

glottal

voice source behaviour in clinical practice for patients with voice disorders. It is well known that vocal fold vibration in modal register involves, in addition to the gross opening and closing movements, a wave-like motion of the looser mucosa relative to the denser underlying tissues. Histological sections indicate that the membranous portion

of the

vocal fold should be considered as a layeied structure. I Hirano2'3 divided the layers of the vocal cord into

5 components, and from the mechanical point of view, the five histological layers can be reclassified into 3 sections: the cover, consisting

of the

epithelium and the superficial layer of the lamina propria, the transi-tional layer, consisting

of

the intermediate and deep
(2)

layer of the lamina propria, and the body, consisting

of

the muscle.

Phonation

is

the

aerodynamic and acoustic product

of

an extremely complex vibratory system. Part of the evidence for this complexity is the fact that

the cycle-to-cycle consistency of laryngeal vibration in phonation is never completely regular (Boves, 1984, cit.*) Baken (1987) stated that the phonatory system is

not a perfect machine and every speaker's vibratory

cycle is erratic to some extent (cit.a) From the acousti-cal point of view, the characteristic of perturbation in the laryngeal waveform can be detected

in

both the time domain, called jitter, and in the frequency domain,

called shimmer. From the auditory point of view,

jitter

and shimmer contribute to the rough perceptual effect,

which is

called harshness. Physiologically

a

harsh voice can arise either functionally from hyperkinetic adjustments of the phonatory muscles, or

pathological-ly

from disorders and diseases on one or both

of the

vocal cords. Laryngeal waveform perturbation can occur across a wide range in incidence from occasional

to habitual, and in degree from slight to severe. Severe

perturbations are always signs of either pathological or

functional disorder, but slight perturbations are evident

in all

speaking voices, especially

at

the border

of

voicing phase.

Medical background and aim of the study

The

attemp to discover

objective

acoustic

and

physiological

ways

of

characterizing

laryngeal waveform

in

terms

of

perturbatioà parameters has a

history of over 30 years, beginning with the pioneering

work

of

researchers such as Moore and von Leden (1958), Lieberman (1963) and also Michel

in

t964.

Since then, the topic

of

waveform perturbation had been spreading and had attracted many experts

of

different fields, such as laryngology, signal

process-ing, speech science, and speech pathology.

It

is important to emphasize that there are many

different ways

of

defining waveform perturbations,

and that

no

wide consensus has yet existed

on the

definitions of the identity of the most information-rich parameters. When a patient complains of hoarseness or a harsh voice, an important clinical decision must be

made whether this

is

a pathological

or

a functional symptom.

This

desicion

will

influence

directly the

mode

of

treatment, and so any objective method for supporting treatment desicions,

if

they can be made

automatic, reliable, cost-efective and non-invasive, is

of obvious oto-laryngological value.

According to Laver et al.s and Wilson et a1.,6 there

are four major applications

of

automatic methods

of

waveform perturbation analysis that could identify

laryngeal pathology:

1.

Screening

of

a given population,

to

complement

existing screening programs

in

hospitals, such as

well

person

clinic,

because

the

incidence

of

pathological conditions that need urgent treatment

is substantially lower than the incidence of solely functional voice disorders.

2. Priority

âssessment

of

patients

visiting

their ieneral practitioner with complains

of

hoarseness

or harshness.

3.

Diagnostic support, where the acoustic system had demohstrated

its

ability to

provide

reliable

evidence discriminating pathological versus

func-'

tional etiology.

4.

Follow up tool, a monitor to assess voice changes after the patients received radiotherapy, chemo-therapy, surgery, speech therapy or to detect remis-sion or deterioration in progressive disease.

It

is probably reasonable to suggest that current

acoustic

techniques

of

quantifying

perturbatory

evidence

in

laryngeal waveforms could reliably and

cost-effectively support the functions

of screening,

priority assessment, and monitoring. As Laver5 stated,

it is premature to suggest that such systems can support

a discriminative diagnostic function with any

satiilac-tory degree of reliability. Principally there are promis-ing signs that such techniques are capable ofproviding diagnostic support.

A

major obstacle

is

the lack

of

adequate databases ofacoustic and clinical recordings

of speakers

with relevant laryngeal disorders, and

of

normal controls.

Techniques

According to Laver et al,5 there are two advantages

of

acoustic technique:

firstly,

the recording methods in-volved are non-invasive, non-frightening to patients,

and using fairly low cost technology, and secondly the

acoustic perturbation data techniques

of

analysis are directly executed to automate and hard-copy records that are easy to interpret and can easily be made avajl-able for the patient's file.

The

disadvantages

in using

acoustical data-analysing techniques

for

perturbation analysis are: firstly, acoustic recordings are subjected to

contamina-tion

by

both environmental factors and

equipment-derived artifacts. Thus, acquiring the acurate data

of

the acoustic signals requires a well prepared recording

environment as

well

as equipment, and the hazards

derived from those factors can be greatly reduced when

(3)

92

Abdoerrachnan

process of automating the perturbatory analysis invol-ves relatively high-cost computing equipment.

The ideal acoustic recording equipment

for

ac-quiring acoustical data for perturbation analysis should have the features of flat low frequency response, that

should be able to record as low as 40-50 Hz or even lower, and should have no signal distortion which is

mostly caused by phase distortion.

The equipment should also have high signal-to-noise ratio, because most of time-domain pitch detec-tion algorithms are not very resistant to the effecls

of

contamination by transient noises, and by continuous ambient noises. And at last, the recording equipment

must be affordable to the clinic using it.

The acoustical data collecting must be made in a quiet environment

to

achieve an adequate signal-to-noise ratio, preferably in a sound-treated room which

is

shielded

from

electrical signals and mechanical

sounds.

MATERIALS

AND METHODS

This paper

will

report the results of the analysis upon

672 voice samples

from

various diseases, recorded within a 3 years period, from 1990-1992, using DAT

tape-recorder,

with

a constant distance

from mic

to mouth

of l5

cm, following patient's practice.

Record-ings were made in a sound-treated booth. Four hundred

forty

two samples consisted

of

23 normal voice, 17 tremorous voice, 40 spasmodic dysphonia,177

vocal-cord polyp, 56 vocal

nodule,

97

recurrent nerve

paralysis and 32 polypoid

of

the vocal cord. Voice samples of sustained vowel lel were digitized through

a l6-bit A/D converter at a sampling rate of 40 kHz and stored

on

a disk controlled

by

a computer.

A

half-second segment was extracted by excluding the initial and final portions from each sample. Saturated

record-ing of sustained vowel, and also too short duration

of

lhe lel phonation, less than 0.5 second, are excluded. Using peak picking method, the method recom-mended

by

Imaizumi et

al.,'

local maximum points

which could correspond

to

vocal excitation epochs were detected successively and then cycle

by

cycle

perturbation quotient were calculated and

finally

the

pitch

perturbation quotient (PPQ-%) and amplitude perturbation quotient (APQ-%) were determined, The

additive noise level (NLvl) were also calculated.

All

the magnitude score data are stored in a

com-puter, and coded for the name of each disease: VCP for

vocal cord polyp, SPD for spasmodic dysphonia, RNP

for

recurrent nerve paralysis,

VCN

for

vocal cord nodule, POLPOD for polypoid of the cord, and N for normal voice.

Med. J Indones

RESULTS AND DISCUSSION

Within

three

years,

from

1990-1992,

672

voice

samples

from

various diseases were collected, and

grouped

in

coded names. The VCP is predominantly found, followed by RNP, VCN, SPD, and POLPOD.

From grouped and extracted data, one figure is exposed and histograms are composed to reveal and clarify the results of the investigation.

The present research of acoustical waveform per-turbation show the following results:

Figure

I

shows the automatisation

of

the 0.5

second voice sample analysis, direct implementation

of the algorithm on the computer, also the waveform and the sonagram, showing the fundamental frequency (F0), the harmonic, non-harmonic components and the

fluctuation,

all in

one. Each taken sample

is a

0.5 second segment by excluding the onset and the final portion of the whole sustained vowel /e/.

Figures

2

and3, show the real frequency (7o)

of

the PPQ and of the APQ, infering that the peak of the diseased voices are shifted to the right. These

eviden-ces assume that the PPQ and APQ of diseased voices have larger amount of scores compared to the normal

voice.

This situation

is

more evident and clear in

Figures 4 and 5.

Figures 4 and 5 show that the cummulative

dis-tribution of PPQ and APQ of the diseased voices are

shifted to the right, infering larger magnitude of scores compared to normal voice, and only about lO% of the

POLPOD has smaller score of the APQ. This differen-ces

of

scorqs are proven significant by the ANOVA statistical calculation (P=0.0001

for

the

PPQ and

0.0053 for the APQ).

As for the follow up parameter, Figures

6

andT show the cummulative frequency of PPQ and APQ

of

VCP voices before and after surgery. It is evident that the scores before treatment are larger compared to post-treatment, which

is

also seen

in

the Percentille Comparison graph. The statistical calculation

of

pre

and post-treatment voice samples is significant for the

PPQ

(P=0.0068),

but not significant

for

APQ

(P=0.113) and this might be caused by too small data. We also include the observation on the noise level

(NLvl), as we know that hoarse voice is also influenced

by this factor. Figure 8 shows the real frequency (%)

of NLvl for

normal and diseased voices. The graph

reveals higher scores in diseased voices compared to

normal, as the peak

of

diseased voices are shifted to the right. This situation is clearer shown in Figure 9,

where the cummulative frequency of diseased voices

(4)

(|tt tlDt I g ! aù tld r gf, .ab p Êlrl. Ct.rlla Orlrlq oa t r llrlat E!.ral ib (P, r a0[. rq tll- I {Lal

tlr I lm hl llhr d \ hulGtld a ht (!tt . l- ela t &b tH tld .X -b a lE.lt.l 6Fl.æ .l

tl 2r0 rr1rl! 0,Ëltlt 0.lltli2 0,2!tl(l 2. el rrZ.l9 0,æn t6lzl GZIltl

'il- I rrc-.ôt b tt (i) r Î.a.l tr- lil(arr -ta.G tll-a ùbr lE.U.t kl ,ba l.t! ùÊtÈl "" r-l.t i.tl.d Lrtt.llhr l.![(t, HltlT.. LlAt hh r 4.8 t-tlh Ald. aaa{.d thl tb t t.lf, (:, Fl e, I l.d

[image:4.595.86.492.87.370.2] [image:4.595.56.497.427.685.2]

h rlr . !!am xl Gt I :,f,l

Figure 1. Profile of the auto,natisationfor 0.5 second voice suilple analysis

t@

I

80

?E

68

w

4E

æ

æ

l@

@

4.6 4.4 4.2

E

9.2

a-4

1-2 1.4

1.6

v

PolyFoid

R

e

a I

F

r

q

z

4.6 9.8

1

PPA

À

LCN

X

RIP

tr

ltlormal

+

SPD

o

l.EP
(5)

94 Abdoerrachnan

t@

I

8@

7E

60

50 4g

30

I

1A 0

R

e

a

I

F

r

e

q

z

-o.6

4.2

4.2

4.6

1

1.4

æcl

[image:5.595.60.484.88.350.2] [image:5.595.52.495.428.691.2]

O

ltlorrnl

+

SPD

o

(EP

À tÆ{

x

RIP

v

polypoid

Figure 3. Real Frequency (%) ofAPQfor nonnal and diseased voices

vn

I

æ

7tà 6g 50

&

3g

n

1@

o

-0.ô

4.2

4.2

g.ô

ÊPO

tr

ltbrnal

+

SPD

o

t"fP

À

tl){ x

RIP

v

Pol

ypoid

c

u m m

F

r

e

q

z

1.4

Med J Indones

(6)

I

æ

7g 60

g

40

30

n

l@

@

4.6

c

u

m m

F

r

€ q

z

1.4

g-2

4.6

PPQ

[image:6.595.65.511.88.355.2] [image:6.595.67.509.432.684.2]

o lbrral +

SPD

o

l.EP

Â

IXN

x

RIP

e

Polypoid

Figure 5. Cunnulative Distribution of APQfor nornrcl and diseased voices

t@

I

8g 7E

æ

w

4A

æ

æ

1@

a

-€.6

4.2

9.2

9.6

PPA

o

PreOp

+

Po6t0p c

u

m m

F

r

e q

z

(7)

3

+

Abdoerracfunan

-€.6

4.2

a-2

0.Ê

ÊP0

[image:7.595.45.503.83.338.2] [image:7.595.65.507.397.684.2]

tr

Pre0p

+

PætOp

Figure 7. Cunnulative Distribution of APQfor VCP Voices before and after surgery

Med J Indones

c

u

m m

F

r

q

z

t@

I

80

7îà

eo 50

4E

30

æ

t@.

@,

1.4

Percentlle Comparlson Graph

for

columns:

X1Y1

1.5

@

o

+

o

ù

À

o

ct,

o

o

Log1O(PP...

-.8 -.5

-.2

0

.25 .5

.75

1

1.21.5

Log10(PPQ):Pre

(8)
[image:8.595.59.547.57.384.2]

Figure 9. Cumnulative Distribution of Noise l-evel (Nlvl)for normal and diseasedvoices

CONCLUSION

Clinicians are eager and longing

for

an automatic,

practical, easy

to

operate, inexpensive and non-in-vasive instrument to detect,

if

possible to determine diagnosis

of

the disease. This condition is also noted in the Otolaryngology field,with several diseases that

cause voice changes.

After having conducted the study and discussed the results, it seems that we are

still

far from the final target. Neverthele.ss, some

of

the aims as has been mentioned before, were achieved.

We

see that the significancy

of

the

role

of

the acoustic waveform

analysis as supportive tool and adjunctive

examina-tion, and also as instrument and measures for follow up and paramoter work up has been proven. The

sig-nificant change in waveform perturbation would also arise the suspicion

or

early change

in

the epithelial

layer of the vocal cord, that might lead to early detec-tion of a hazardous disease.

These evidences are somewhat beneficial for the clinicians

in

planning, determining and following up

the patient's history of disease.

It

should be accepted that the application of acoustical analysis method is in

the same level as other medical examinations, such as

laboratory findings

or

X-photo examination.

For

a

more intensive study,

it

is suggested to include larger amount of samples, divided in different stages of the

disease.

REFERENCES

l.

Georgiou B. Given and Ear to your Computer. BYTE The Small Systems joumals. 1978;3:5G9

l.

2.

Hirano M, Hibi S, Yoshida T, Hirade Y, Kasuya H, Kikuchi

Y. Acoustic Analysis of Pathological Voice. Acta Otolaryn-gol. 1988;105:432-8.

3.

Hirano lvl Morphological structure of the vocal fold as a vibrator and its variations. Folia Phoniat. 1974126:89-94.

4.

Titzæ IR. The Human Vocal Cords, A Mathematical model.

Part IL Phonetica. 197 4;29

:l-2t.

5.

Laver J, Hiller S, Beck IM. Acoustic Waveform Perturbation

and Voice disorder. J Voice. 1992:6:115-26.

6.

Wilson FB, Starr CD. Use of the phonation analyzer as a

Clinical

tool.

f of

Speech and Hearing Disorder.

1985;50:351-56.

7.

Imaizumi S, Gauffin f. Acoustical perceptual characteristics

of Pathological Voices: rough, creak, fry and diplophonia. Ann Bull RILP. l99l;25:109-19.

Perc_entlle

Comparlson Graph

for columns:

X2Y2

U'

o

+

o

(L

o

o,

o

1

-1

t.5

-t

o

d

o

Log1O(AP...

-2.5

-2

-1.5

-1

-.5

0 .5 1

1.5

Gambar

Figure 1. Profile of the auto,natisationfor 0.5 second voice suilple analysis
Figure 3. Real Frequency (%) ofAPQfor nonnal and diseased voices
Figure 5. Cunnulative Distribution of APQfor nornrcl and diseased voices
Figure 7. Cunnulative Distribution of APQfor VCP Voices before and after surgery
+2

Referensi

Dokumen terkait

Pada hari Rabu tanggal dua puluh bulan juli tahun dua ribu enam belas, yang bertanda tangan di bawah ini, Pokja ULP Dinas Pengairan kabupaten Gayo Lues yang ditetapkan dengan

Berdasarkan Hasil Evaluasi Dokumen Kualifikasi tanggal 02 Agustus 2013, dan Hasil Pembuktian Dokumen Kualifikasi tanggal 13 Agustus 2013, dengan ini kami mengumumkan calon

1 Dengan tingkat harga yang sama, produk yang ditawarkan oleh Laras Catering lebih baik dibandingkan Catering lain 2 Ketika memerlukan jasa catering, saya akan

DPR merupakan impian dari semua orang karena untuk menjadi seorang anggota DPR adalah hal yang sangat luar biasa karena situ dengan berpaikain yang rapi (safari) dan duduk

Apakah bapak dan ibu dalam menerapkan sikap disiplin selalu memberi contoh yang baik pada anaka. Dalam menanamkan disiplin, apakah bapak dan ibu mendikte anak

Karena manusia memiliki unsur jasmaniah atau biologis, manusia memiliki berbagai kebutuhan biologis yang harus dipenuhinya, seperti makan, minum, menghirup udara,

Cara pembuatan kue lontar coklat hampir sama seperti membuat kue lontar Original hanya pada isi Kue Lontar kita cukup menambah coklat batangan yang telah dilelehkan menggunakan mangkuk

Berdasarkan hasil uji t dalam penelitian ini maka dapat disimpulkan bahwa Variabel kualitas website memiliki pengaruh positif dan significan terhadap niat