PENERJEMAH Novita Aryani Sutejo B2A018015

(1)

TUGAS TERJEMAH JURNAL

...

Selection of the Maximum Spatial Cluster Size of the Spatial Scan

Statistic by Using the Maximum Clustering Set-Proportion Statistic

Ma, Yue; Yin, Fei; Zhang, Tao; Zhou, Xiaohua Andrew; Li,

Xiaosong.PLoS One; San Francisco Vol. 11, Iss. 1, (Jan 2016):

e0147918. DOI:10.1371/journal.pone.0147918

...

Pemilihan Ukuran Cluster Spasial Maksimum dari Statistika

Pemindaian Spasial dengan Menggunakan Statistik

Set-Proportion Clustering Maksimum

PENERJEMAH

Novita Aryani Sutejo

B2A018015

Program Studi S1 Statistika

Fakultas Matematika dan Ilmu Pengetahuan Alam

Universitas Muhammadyah Semarang

(2)

Pemilihan Ukuran Cluster Spasial Maksimum

dari Statistika Pemindaian Spasial dengan

Menggunakan Statistik Set-Proportion Clustering

Maksimum

Abstrak

Statistik pemindaian spasial banyak digunakan dalam berbagai bidang.

Kinerja statistik ini dipengaruhi oleh parameter, seperti ukuran klaster spasial

maksimum, dan dapat ditingkatkan dengan pemilihan parameter menggunakan

ukuran kinerja. Ukuran kinerja saat ini didasarkan pada keberadaan cluster dan

dengan demikian tidak dapat diterapkan ke kumpulan data tanpa cluster yang

dikenal. Dalam karya ini, kami mengusulkan ukuran kinerja keseluruhan baru

yang disebut set-set proporsi maksimum (MCS-P), yang didasarkan pada

kemungkinan penyatuan kelompok yang terdeteksi dan kumpulan data yang

diterapkan. MCS-P dibandingkan dengan ukuran kinerja yang ada dalam studi

simulasi untuk memilih ukuran cluster spasial maksimum. Hasil pengukuran

kinerja lainnya, seperti sensitivitas dan misklasifikasi, menunjukkan bahwa

statistik pemindaian spasial mencapai hasil yang akurat dalam sebagian besar

skenario dengan ukuran klaster spasial maksimum yang dipilih menggunakan

MCS-P. Mengingat bahwa gugus yang diketahui sebelumnya tidak diperlukan

dalam strategi yang diusulkan, pemilihan ukuran cluster maksimum yang optimal

dengan MCS-P dapat meningkatkan kinerja statistik pemindaian dalam aplikasi

tanpa gugus yang diidentifikasi.

Pengenalan

Statistik pemindaian spasial, yang diperkenalkan oleh Kulldorff

[1], berfokus pada mendeteksi keberadaan dan lokasi kelompok geografis

(3)

pengguna untuk menerapkan statistik ini di berbagai bidang. Daftar

penelitian yang menggunakan statistik pemindaian spasial diposting di

situs web resmi SaTcan [3].

Ukuran klaster spasial maksimum adalah satu-satunya parameter yang harus

dipilih oleh pengguna untuk menerapkan statistik pemindaian spasial sirkuler

yang biasa digunakan dengan perangkat lunak SaTScan. Parameter ini adalah

ukuran maksimum yang bisa dicapai oleh jendela pemindaian sebagai skala dalam

jarak spasial atau persentase dari total populasi yang berisiko [4]. Ribeiro dan

Costa [5] menyelidiki kinerja statistik pemindaian spasial dengan berbagai ukuran

klaster spasial maksimum, termasuk klaster sekunder; mereka menyarankan

bahwa tiga ukuran kinerja sensitif terhadap ukuran klaster spasial maksimum.

Meskipun dataset simulasi mendukung pemilihan ukuran klaster spasial

maksimum yang berbeda untuk model klaster tertentu, mengidentifikasi model

klaster yang berlaku untuk kumpulan data yang kompleks rumit. Oleh karena itu,

pedoman untuk memilih ukuran klaster spasial maksimum untuk data nyata tetap

tidak jelas.

Kulldorff [6] melaporkan bahwa sebuah jendela yang berukuran hingga 50% dari

populasi berisiko secara umum dapat mengurangi kelompok negatif. Peneliti lain

juga memilih nilai yang lebih rendah untuk alasan praktis, seperti ketersediaan

data [7], diskontinuitas lokasi [8], minat khusus pada kelompok kecil [9], mencari

kluster kecil dengan risiko relatif tinggi (RR) [10], infektivitas rendah dari

patogen spesifik [11, 12], analisis eksplorasi untuk gugus berbentuk tidak

beraturan [8, 13], atau sumber daya terbatas yang tersedia untuk intervensi [5].

Dengan demikian, pemilihan sederhana dari ukuran cluster spasial maksimum

mungkin tidak sesuai. Kinerja statistik pemindaian spasial harus diberi peringkat

dengan parameter yang berbeda dalam aplikasi karena hubungan bervariasi antara

ukuran klaster spasial maksimum dan kinerja dalam set data yang berbeda. Oleh

karena itu, ukuran kinerja yang umumnya berlaku untuk berbagai aplikasi harus

digunakan.

Berbagai ukuran kinerja umumnya digunakan dalam studi simulasi;

(4)

nyata [14] karena mereka didasarkan pada keberadaan kluster buatan yang

diberikan. Identifikasi semua kelompok yang terdeteksi sebagai benar atau

sebaliknya biasanya tidak praktis. Sebagai contoh, penelitian surveilans penyakit

biasanya memiliki sumber daya yang terbatas; Selain itu, beberapa ukuran kinerja

mewakili berbagai aspek kinerja [15]. Dengan demikian, hasil dari berbagai

tindakan dapat menjadi masalah ketika memerinci kinerja dari berbagai

implementasi statistik pemindaian spasial. Ukuran kinerja dapat dikombinasikan

menggunakan rumus spesifik dengan bobot yang sewenang-wenang, tetapi

pemilihan parameter secara tidak terelakkan berubah-ubah. Jika kinerja

keseluruhan, daripada aspek spesifik dari kinerja, adalah menarik, maka ukuran

kinerja keseluruhan yang tidak didasarkan pada gugus buatan yang diberikan akan

kurang sewenang-wenang.

Ukuran kinerja pada tingkat agregasi biasanya digunakan di atas kumpulan

data yang dihasilkan dengan model dasar yang serupa karena yang pertama dapat

mendeteksi sedikit perbedaan antara statistik pemindaian spasial dengan

parameter yang berbeda. Namun, kumpulan data ini tidak ada dalam kenyataan.

Meskipun dataset simulasi dapat dihasilkan dari data historis dengan

menggunakan model pengelompokan, pendekatan ini sulit terutama ketika tidak

ada data historis seperti itu [16]. Dalam hal ini, ukuran kinerja untuk satu set data

lebih disukai daripada ukuran kinerja berdasarkan kumpulan kumpulan data yang

dihasilkan dengan model yang sama.

Singkatnya, ukuran kinerja keseluruhan berdasarkan pada dataset yang diterapkan,

daripada keberadaan cluster yang diketahui, dapat digunakan untuk memilih

parameter spasial yang optimal untuk meningkatkan kinerja statistik pemindaian

spasial dalam aplikasi. Namun, sejauh pengetahuan kami, ukuran ini belum

dikembangkan.

Studi ini mengusulkan ukuran kinerja keseluruhan novel, yaitu, maksimum

kumpulan setel-proporsi (MCS-P), yang didasarkan pada fungsi kemungkinan dan

disesuaikan untuk semua kelompok signifikan dan dataset yang diterapkan.

Definisi lengkap MCS-P dan perincian tambahan disediakan di bagian

(5)

yang diketahui karena kehadiran kluster tidak diperlukan. Bagian 3 menjelaskan

studi simulasi untuk memilih ukuran cluster spasial maksimum untuk

membandingkan MCS-P dengan ukuran kinerja yang ada.

Metode

Statistik pemindaian spasial

Statistik pemindaian spasial digunakan untuk mengidentifikasi kelompok

kemungkinan maksimum dalam bentuk himpunan Z unit spasial, yang menolak

hipotesis nol di area studi G dan mempertimbangkan p dan q sebagai probabilitas

peristiwa yang terjadi di dalam dan di luar zona , masing-masing. Dalam aplikasi

saat ini, kami biasanya fokus pada mendeteksi zona di mana p> q.

Meskipun statistik pemindaian spasial bervariasi dalam hal bentuk jendela

pemindaian dan model probabilitas, kebanyakan dari mereka menggunakan

logaritma rasio kemungkinan (LLR) sebagai statistik uji untuk mengidentifikasi

kelompok kemungkinan maksimum [17-22]. Metode estimasi kemungkinan

maksimum juga diterapkan untuk menentukan sub-wilayah yang paling terkluster

Z. Cluster yang terdeteksi adalah estimator kemungkinan maksimum Z. Misalkan

C dan cz adalah jumlah kejadian yang diamati dalam G dan z, masing-masing,

sedangkan N dan nz adalah jumlah kejadian yang diharapkan dalam G dan z di

bawah hipotesis nol; maka, N = C. Misalkan L (z) adalah kemungkinan di bawah

hipotesis alternatif bahwa z adalah gugus dan L0 adalah kemungkinan di bawah

hipotesis nol; dalam hal ini, LLR adalah: (1) di mana L0 adalah konstanta untuk

G. yang diberikan. Koleksi z unit spasial dapat memaksimalkan LLR (z) dan L

(z).

Jendela pemindaian dengan bentuk yang telah ditentukan dan ukuran

spasial maksimum digunakan untuk mengidentifikasi solusi. Ukuran (η) dari

jendela bervariasi antara nol dan ukuran kelompok spasial maksimum (η (Z)) pada

setiap fokus yang mungkin dalam G untuk menghasilkan sekumpulan kelompok

(6)

kelompok yang paling mungkin (MLC). Selain MLC ini, kluster sekunder dengan

nilai kemungkinan tinggi dipertimbangkan.

Distribusi statistik uji yang tepat masih belum jelas; dengan demikian, simulasi

Monte Carlo digunakan untuk mendapatkan nilai kritis di bawah hipotesis nol.

LLR dari semua kelompok potensial dibandingkan dengan nilai kritis untuk

menentukan perbedaan signifikan mereka.

Ukuran performa

Meskipun kapasitas untuk mendeteksi keberadaan gugus telah dipelajari

secara luas [18, 22-27], kinerja atau apa yang disebut ketepatan spasial dari

kelompok yang terdeteksi juga harus dipertimbangkan [14, 16, 20]. Dalam

kebanyakan penelitian, langkah-langkah mengenai dua aspek masing-masing

kinerja digunakan berpasangan [21, 28-30], dengan satu ukuran yang

menunjukkan kapasitas untuk mengidentifikasi secara tepat unit spasial di dalam

kelompok yang benar dan ukuran lain yang memiliki kapasitas untuk

mengidentifikasi unit spasial dengan benar. di luar kluster yang benar. Selain itu,

pengukuran akuntansi untuk kedua aspek digunakan untuk mengukur kinerja

keseluruhan [5, 31]. Tiga ukuran kinerja yang umum digunakan termasuk

sensitivitas, nilai prediksi positif (PPV), dan kesalahan klasifikasi. Penelitian

sebelumnya menggunakan jumlah unit spasial untuk menghitung ukuran kinerja;

namun, penggunaan ukuran berbasis populasi dapat memberikan perkiraan yang

lebih kuat [32]. Baca et al. [14] menyatakan bahwa semua unit spasial di wilayah

studi dapat diklasifikasikan menjadi empat jenis untuk mengevaluasi kinerja

statistik pemindaian spasial:

1. Unit di dalam kelompok yang benar dan yang terdeteksi (s)

2. Unit di dalam cluster sebenarnya tetapi di luar cluster yang terdeteksi (s)

3. Unit di dalam cluster yang terdeteksi tetapi di luar cluster yang sebenarnya

4. Unit di luar cluster yang benar dan terdeteksi.

Biarkan populasi di masing-masing dari empat jenis unit spasial menjadi a,

(7)

Sensitivitas merepresentasikan proporsi populasi dalam gugus sejati yang

diidentifikasi secara benar sebagai kelompok (cluster). Ukuran ini digunakan

untuk menentukan kapasitas untuk menentukan kluster yang benar.

(3)

PPV, yang umumnya digunakan dengan sensitivitas, mewakili proporsi populasi

dalam cluster yang terdeteksi (s) yang sebenarnya milik cluster sejati (s). Ukuran

ini menunjukkan kapasitas untuk secara akurat mengidentifikasi unit spasial di

luar gugus sejati (s).

(4)

Kesalahan klasifikasi merepresentasikan proporsi populasi yang salah

diidentifikasi. Ukuran ini menyumbang populasi unit spasial dalam cluster

sebenarnya (s) tetapi di luar cluster yang terdeteksi (s), serta populasi unit spasial

yang terdeteksi di luar cluster sejati (s). Jika misklasifikasi sama dengan nol, maka

semua unit spasial diidentifikasi dengan benar.

Ukuran kinerja ini didasarkan pada keberadaan kluster yang benar dan tidak

berlaku untuk kumpulan data nyata dengan kluster yang tidak diketahui. Dalam

studi ini, kami mengusulkan ukuran kinerja keseluruhan novel dengan

menggunakan kumpulan data terapan.

Ukuran kinerja keseluruhan novel berdasarkan kumpulan data terapan

Biarkan MLC dengan η (Z) = i menjadi dan klaster signifikan jth adalah,

maka uk

uran klaster spasial maksimum η (Z) adalah parameter dari kumpulan

klaster potensial P =

∪

{z | η (z) ≤ η (Z )}. Untuk optimum lokal dengan η (Z) = i,

mungkin berbeda dari global optimum. Karena itu, mungkin lebih kecil dari.

Ketika hanya MLC ditemukan ata

u menarik, η (Z) optimal dipilih dengan

memberi peringkat LLR dari MLC yang berbeda. Optimal η (Z) = i

memaksimalkan.

Perbandingan LLR dari cluster yang sesuai, seperti MLC, mungkin tidak cukup

untuk memeringkat kinerja dengan nilai η (Z) yang berbeda keti

ka kelompok

sekunder menarik. Pertama, memasangkan beberapa klaster yang sesuai sangat

(8)

mungkin tidak konsisten. Misalnya, bisa lebih kecil dari, sedangkan bisa lebih

besar dari. Ketika semua kelompok penting menarik, klaster signifikan

mengklasifikasikan semua unit spasial menjadi dua set: pengelompokan set di

mana peristiwa cenderung mengelompok, serta set di mana peristiwa di seluruh

unit spasial tidak mungkin mengelompok. . Oleh karena itu, penyatuan semua

klaster yang signifikan, bukan kluster individu, dapat digunakan sebagai

kumpulan pengelompokan ketika memeringkatkan kinerja beberapa kluster.

Biarkan penyatuan kelompok signifikan yang ditemukan dengan η (Z) = i menjadi

Zi0, lalu (5)

Kumpulan kluster dengan nilai η (Z) yang berbeda dapat digunakan untuk

memberi peringkat kinerja beberapa kluster dengan cara yang mirip dengan MLC.

Dengan η (Z) memaksimalkan kemungkinan di bawah hipotesis alternatif,

peristiwa dalam kumpulan pengelompokan paling tidak mungkin untuk

mengelompokkan secara kebetulan. Selain itu, fungsi kemungkinan

dimaksimalkan ketika LLR dimaksimalkan. Oleh karena itu, perbandingan LLR

dari kumpulan pengelompokan dapat memberi peringkat kinerja beberapa gugus

dengan

nilai η (Z) yang berbeda.

(6)

The LLR dikondisikan pada Zi0 mewakili rasio kemungkinan pengelompokan

diatur dengan η (Z) = i dan kemungkinan di bawah hipotesis nol. LLR juga dapat

mengukur ketidaksamaan antara Zi0 sebagai kumpulan pengelompokan dan

hipotes

is nol. Ketika η (Z) = i memaksimalkan LLR (Zi0), L (Zi0) juga akan

dimaksimalkan. Yaitu, ketika η (Z) = i memaksimalkan LLR (Zi0), kejadian

dalam Zi0 lebih cenderung mengelompok daripada kumpulan pengelompokan lain

yang ditemukan dengan nilai η (Z) lainny

a.

Meskipun LLR (Zi0) dapat digunakan untuk memberi peringkat kinerja statistik

pemindaian spasial dengan nilai η (Z) yang berbeda, rentang LLR (Zi0) mungkin

berbeda karena distribusi spasial kejadian. Misalnya, unit spasial non-clustering

yang dikelilingi oleh kluster dapat dimasukkan dalam jendela pemindaian. Unit

spasial non-pengelompokan dengan RR relatif tinggi dekat kluster lebih mungkin

(9)

menyebabkan rentang bervariasi dan nilai optimal LLR (Zi0), bahkan di set data

yang dihasilkan dengan model yang sama. Selain itu, ukuran kinerja yang paling

ada dibangun dalam bentuk proporsi dan rahnge dari 0 hingga 1. Jadi,

penyesuaian efek distribusi spasial pada kumpulan data akan membuat ukuran

sebanding dengan ukuran kinerja yang ada. Maksimum perkiraan LLR dari G

karena itu digunakan. Statistik pemindaian spasial digunakan untuk mendeteksi

unit spasial pengelompokan dengan p> q; karenanya, penyatuan unit spasial

dengan RR lebih tinggi dari 1 dipilih sebagai kumpulan pengelompokan (MCS)

untuk mendapatkan perkiraan LLR maksimum di G.

(7)

Selanjutnya, kami menyesuaikan LLR (Zi0) dengan LLR (ZMCS),

sehingga ukuran kinerja MCS-P adalah: (8)

MCS-P mewakili rasio antara LLR dari kumpulan pengelompokan

dengan η (Z) =

i dan perkiraan LLR maksimum dalam G. LLR menggambarkan dukungan relatif

dari hipotesis alternatif terhadap hipotesis nol. MCS-P menyajikan kedekatan

dukungan relatif dari pengelompokan yang diatur ke dukungan maksimum yang

diperoleh dari kumpulan data. Dengan penyesuaian ini, MCS-P berkisar dari 0

hingga 1, yang serupa dengan pengukuran kinerja lainnya. Penyebut LLRMCS

adalah LLR maksimum yang diperoleh dari G. Dalam kasus ekstrim, LLR dari

kumpulan pengelompokan mungkin lebih tinggi daripada MCS. Meskipun tidak

ada kasus seperti itu ditemukan dalam penelitian ini, kita harus mencatat bahwa

MCS-P adalah ukuran kinerja relatif perkiraan.

Studi Simulasi

Data simulasi

Set data patokan simulasi berdasarkan kumpulan data nyata dari mortalitas

kanker payudara [33] digunakan dalam penelitian ini. Populasi berisiko dalam

analisis data simulasi adalah penduduk perempuan dari sensus 1990, yang berisi

29.535.210 individu di Northeastern USA. Wilayah studi terdiri dari 245

kabupaten di Northeastern USA [23].

Lima puluh skenario dibangun dengan 50 model cluster sirkular yang berbeda.

(10)

ukuran cluster yang berbeda dari 1, 2, 4, 8, atau 16 kabupaten; dan lima pola

distribusi spasial kluster yang mengandung satu klaster yang terletak di daerah

pedesaan, campuran, atau perkotaan. Dua kelompok berlokasi di pedesaan dan ur

Perbandingan rata-rata MCS-P dan ukuran kinerja lainnya untuk setiap ukuran

klaster spasial maksimum

Dalam setiap skenario, ukuran cluster spasial maksimum dekat nilai

optimal dipilih dengan MCS-P. Nilai ukuran kinerja lainnya dengan ukuran

klaster spasial maksimum yang dipilih dibandingkan dengan ukuran pengukuran

dengan ukuran klaster spasial maksimum lainnya untuk menentukan apakah

pemilihan dapat meningkatkan kinerja statistik pemindaian spasial. Nilai rata-rata

ukuran kinerja atas replika dalam skenario yang sama dilaporkan untuk setiap

ukuran klaster spasial maksimum untuk memberikan informasi rinci mengenai

hubungan antara MCS-P dan ukuran kinerja lainnya.

Umumnya, dengan ukuran klaster spasial maksimum yang dipilih dalam

kebanyakan skenario, nilai tinggi MCS-P sesuai dengan nilai sensitivitas dan PPV

yang tinggi dan nilai kesalahan klasifikasi yang rendah. Pemilihan ukuran

kelompok spasial maksimum menggunakan MCS-P, sensitivitas, PPV, dan

kesalahan klasifikasi menunjukkan bahwa statistik pemindaian spasial mencapai

hasil yang akurat untuk sebagian besar model klaster.

Perbandingan rata-rata MCS-P dan ukuran kinerja lainnya dalam berbagai

skenario menunjukkan hubungan rinci mereka. Rata-rata MCS-P berhubungan

positif dengan sensitivitas rata-rata dan PPV tetapi berhubungan negatif dengan

kesalahan klasifikasi rata-rata dalam kebanyakan skenario. Mirip dengan

perjanjian MCS-P dengan ukuran kinerja lainnya, lima skenario mengandung

gugus yang sangat heterogen, yaitu, 600-dua-1, 600-dua-2, 6000-dua-1,

600-tiga-1, dan 6000 -tiga-600-tiga-1, menunjukkan hubungan yang berbeda antara MCS-P dan

ukuran kinerja lainnya.

Hubungan antara rata-rata MCS-P dan ukuran kinerja lainnya disajikan

(11)

menunjukkan hubungan antara rata-rata MCS-P dan ukuran kinerja lainnya. Hasil

optimal dari masing-masing ukuran ditandai dengan huruf tebal, sedangkan nilai

yang berbeda kurang dari 0,01 (1%) dari hasil optimal digarisbawahi. Untuk nilai

yang digarisbawahi dari MCS-P, sensitivitas dan kesalahan klasifikasi sebagian

besar tumpang tindih, yang menyiratkan bahwa statistik pemindaian spasial

dengan ukuran klaster spasial maksimum yang dipilih oleh MCS-P mencapai nilai

yang mendekati hasil optimal dari pengukuran ini. Oleh karena itu, ukuran klaster

spasial maksimum dapat dipilih dengan MCS-P untuk mendapatkan hasil yang

akurat untuk ukuran kinerja lainnya. Oleh karena itu, pemilihan ukuran kelompok

spasial maksimum dengan MCS-P dapat meningkatkan kinerja statistik

pemindaian spasial. Hubungan rinci antara MCS-P dan ukuran kinerja lainnya

disajikan pada Gambar 1. Hasil pengukuran kinerja lainnya menjadi lebih dekat

dengan hasil optimal dengan meningkatkan MCS-P. Hubungan serupa antara

MCS-P dan ukuran kinerja lainnya dapat ditemukan dalam 45 skenario yang

tersisa.

[Gambar dihilangkan. Lihat PDF.]

Tabel 3. Pengukuran kinerja rata-rata untuk ukuran cluster spasial maksimum

yang berbeda di 6000-tiga-8.

https://doi.org/10.1371/journal.pone.0147918.t003

[Gambar dihilangkan. Lihat PDF.]

Gambar 1. Rata-rata MCS-P dan ukuran lain di 6000-tiga-8.

https://doi.org/10.1371/journal.pone.0147918.g001

Kelima skenario menunjukkan hubungan tidak teratur, mirip dengan perjanjian

MCS-P dan ukuran kinerja lainnya. Mengingat pengkondisiannya pada

kemungkinan semua kelompok signifikan sebagai kumpulan pengelompokan

homogen, MCS-P tidak dapat secara tepat mengukur kinerja beberapa contoh dari

gugus yang sangat heterogen. Tren ini khususnya khas di 600-dua-1, seperti yang

ditunjukkan pada Tabel 4. Satu cluster di daerah pedesaan menyajikan nilai RR

tinggi 192,89 dan populasi yang sangat kecil dari 2675, sedangkan cluster lain di

daerah perkotaan memiliki populasi besar 786178 tetapi nilai RR rendah 2,73.

(12)

nilai MCS-P yang lebih tinggi untuk ukuran cluster spasial maksimum 1% dan 2%

dari populasi berisiko (Tabel 5). Ketika bagian atau seluruh klaster terakhir

disertakan dengan ukuran klaster spasial maksimum yang besar lebih dari 3%,

MCS-P menurun tajam.

[Gambar dihilangkan. Lihat PDF.]

Tabel 4. Pengukuran kinerja rata-rata untuk ukuran klaster spasial maksimum

yang berbeda dalam 600-dua-1.

https://doi.org/10.1371/journal.pone.0147918.t004

[Gambar dihilangkan. Lihat PDF.]

Tabel 5. Pengukuran kinerja rata-rata untuk ukuran klaster spasial maksimum

yang berbeda dalam 6000-dua-16.

https://doi.org/10.1371/journal.pone.0147918.t005

Keterbatasan ini menghilang dengan meningkatnya ukuran cluster karena

berkurangnya heterogenitas klaster. Jika kelompok yang sangat kecil dengan

heterogenitas besar dilaporkan, maka MCS-P tidak dapat digunakan sebagai

ukuran kinerja yang tepat.

Menariknya, statistik menunjukkan rincian hubungan antara MCS-P dan ukuran

kinerja lainnya. Hasil 6000-dua-16 menggambarkan dengan jelas rincian fitur ini

pada Tabel 5 dan Gambar 2. Hubungan antara MCS-P dan tiga ukuran dapat

dibagi menjadi dua tahap. Titik cut-off adalah nilai penutupan pertama

Data Insiden Campak di Henan, Cina

Kami menerapkan MCS-P untuk data kasus campak di tingkat kabupaten

di provinsi Henan, China pada Mei 2009; data diekstraksi dari sistem pelaporan

penyakit CDC Cina. Sebanyak 1.371 kasus campak di antara populasi 91.669.661

dilaporkan, dan tingkat kejadian tahunan adalah 17,6 per 100.000. Data dianalisis

menggunakan 50 ukuran klaster spasial maksimum setelah studi simulasi. MCS-P

digunakan untuk evaluasi, dan hasil yang menunjukkan nilai MCS-P maksimum

dipilih dan dibandingkan dengan hasil yang diperoleh menggunakan ukuran

(13)

Maksimum MCS-P tercapai ketika ukuran cluster spasial maksimum ditetapkan

menjadi 2% dari total populasi. Sebanyak 649 kasus dari 14.369.140 individu di

21 kabupaten terdeteksi menggunakan ukuran klaster spasial maksimum 2% (Z2),

sedangkan 886 kasus dari 28,859,679 di 41 kabupaten terdeteksi menggunakan

ukuran klaster spasial maksimum 50% (Z50). Risiko relatif Z2 dan Z50

masing-masing 3.0200 dan 2.0527, dan keduanya terletak di bagian yang sama dari

wilayah studi. Variasi dalam tepi ditemukan dalam kelompok yang terletak di

barat daya dan tenggara (Gambar 3). Untuk memberikan detail tambahan, kami

membandingkan kabupaten di Z2 dan Z50. Dua puluh dari 21 kabupaten di Z2

juga ditemukan di Z50, dan 22 kabupaten tersisa yang berbeda antara Z2 dan Z50

ditunjukkan pada Tabel 7.

[Gambar dihilangkan. Lihat PDF.]

Gambar 3. Insidensi campak di Henan pada Mei 2009 dan kluster terdeteksi

dengan default (a) dan ukuran klaster spasial maksimum yang dipilih

menggunakan MCS-P (b).

Kode administratif dari 42 kabupaten pengelompokan diberi label. RR

masing-masing kabupaten disajikan sebagai bar.

https://doi.org/10.1371/journal.pone.0147918.g003

[Gambar dihilangkan. Lihat PDF.]

Tabel 7. Kabupaten yang berbeda dalam kelompok yang terdeteksi menggunakan

ukuran klaster spasial maksimum 2% dan 50%.

https://doi.org/10.1371/journal.pone.0147918.t007

Sebanyak 12 dari 22 kabupaten menunjukkan tingkat insiden yang lebih

rendah daripada nilai rata-rata. Berdasarkan tujuan dari statistik pemindaian,

kabupaten-kabupaten ini mungkin salah diidentifikasi sebagai daerah

pengelompokan ketika mencari unit spasial berisiko tinggi secara signifikan.

Seperti ditunjukkan pada Gambar 3, kabupaten-kabupaten ini, yang terletak di tepi

kelompok 1 dan 2 di Z50, dekat dengan kabupaten berisiko tinggi. Oleh karena

itu, daerah-daerah ini disertakan dengan dekat beberapa daerah berisiko tinggi di

Z50. Misalnya, meskipun tidak ada kasus di 411224 dan 411330, negara-negara

(14)

pemilihan ukuran kelompok spasial maksimum mengecualikan kabupaten dengan

tingkat insiden rendah.

Pengelompokan daerah yang tidak signifikan dengan RR lebih tinggi dari 1 dalam

Z50 juga dikecualikan dalam Z2. Kabupaten-kabupaten ini dikelilingi oleh

kabupaten-kabupaten berisiko rendah dan tidak signifikan sebagai satu kelompok

potensial. Secara khusus, 411723 menunjukkan RR yang lebih tinggi (1,8574)

dibandingkan dengan daerah sekitarnya dan berfungsi sebagai gugus potensial.

Hasilnya menunjukkan bahwa RR tinggi kemungkinan terjadi secara kebetulan.

Kabupaten-kabupaten ini dikeluarkan dalam Z2 ketika MCS-P digunakan dalam

pemilihan ukuran klaster spasial maksimum.

Meskipun county ini tidak signifikan di Z50, 411623 dianggap sebagai

satu-satunya clustering county di Z2 karena nilai kritis LLR terkait dengan ukuran

cluster spasial maksimum. Pada tingkat signifikansi 0,05, nilai kritis untuk ukuran

klaster spasial maksimum 2% dan 50% adalah 6,0221 dan 7,5736,

masing-masing. Oleh karena itu, 411623 (LLR = 6.0830) signifikan dalam Z2 tetapi tidak

dalam Z50. Ketika menggunakan MCS-P untuk pemilihan ukuran cluster spasial

maksimum, statistik pemindaian lebih sensitif terhadap kelompok kecil, yang

ntest statistiknya mendekati nilai kritis.

Selain itu, 411426 adalah satu-satunya daerah berisiko tinggi yang tidak ada yang

berdekatan dengan kelompok signifikan di Z2. Tiga kabupaten yang berdekatan di

wilayah timur, seperti 411426, 411425, dan 411402, diuji untuk menjadi

kelompok signifikan di Z50. Di Z2, jendela pemindaian pada ukuran klaster

spasial maksimum 2% terlalu kecil untuk mencakup tiga daerah berisiko tinggi di

wilayah timur. Menjadi salah satu kabupaten yang menunjukkan RR terendah dan

paling mungkin mengelompok dari tiga kabupaten, 411426 diuji sebagai tidak

signifikan.

Singkatnya, penggunaan MCS-P dalam pemilihan ukuran kelompok

spasial maksimum mengecualikan kabupaten dengan tingkat insiden rendah dan

tingkat tinggi yang tidak signifikan; kabupaten-kabupaten ini salah dimasukkan

dalam kelompok besar ketika menggunakan parameter default. Selain itu,

(15)

kelompok-kelompok kecil yang menunjukkan tingkat insiden yang relatif tinggi,

yang dapat terlewatkan ketika menggunakan nilai statistik uji kritis yang besar

dengan ukuran cluster spasial maksimum standar. Meskipun ukuran klaster spasial

maksimum yang lebih kecil mungkin mengecualikan bagian dari daerah

pengelompokan, fenomena ini hanya terjadi pada bagian cluster yang paling tidak

mungkin mengelompok.

Diskusi dan kesimpulan

Statistik pemindaian spasial banyak digunakan di berbagai bidang untuk

mengidentifikasi kejadian pengelompokan yang tidak biasa di seluruh wilayah

studi. Ukuran klaster spasial maksimum adalah satu-satunya parameter statistik

pemindaian melingkar yang mempengaruhi kinerjanya. Konsisten dengan studi

sebelumnya, penelitian simulasi ini menunjukkan bahwa ukuran klaster spasial

maksimum yang optimal bervariasi dalam skenario yang berbeda [5]. Dengan

demikian, pemilihan ukuran kelompok spasial maksimum yang tepat untuk setiap

set data dapat meningkatkan kinerja statistik karena model klaster dari kumpulan

data praktis biasanya tidak diketahui. Namun, ukuran kinerja yang ada tidak dapat

diterapkan di sebagian besar aplikasi praktis tanpa cluster yang dikenal.

Keterbatasan ini diatasi menggunakan ukuran kinerja MCS-P yang diusulkan

(Bagian 2.3). MCS-P didasarkan pada kemungkinan kluster yang dilaporkan dan

kemungkinan maksimum perkiraan dari kumpulan data yang digunakan; Oleh

karena itu, ukuran ini dapat dihitung tanpa menggunakan keberadaan kluster yang

diketahui. Studi simulasi juga menunjukkan bahwa hasil MCS-P mirip dengan

pengukuran kinerja lainnya, yaitu, sensitivitas, PPV, dan kesalahan klasifikasi,

dalam banyak situasi. Meskipun MCS-P tidak berlaku untuk kondisi dengan

banyak kelompok yang sangat heterogen, statistik ini dapat digunakan di sebagian

besar bidang di mana kelompok kepentingan menunjukkan karakteristik yang

serupa. Misalnya, dalam studi epidemiologi, wabah penyakit yang sama biasanya

(16)

pada risiko. Selain itu, menyesuaikan MCS-P untuk beberapa kelompok yang

sangat heterogen bisa menjadi arah untuk pekerjaan masa depan kita.

Hasil studi simulasi dikondisikan ke set data. Meskipun model klaster bervariasi

dalam hal jumlah klaster, lokasi, tingkat RR, dan ukuran klaster, penelitian ini

menyajikan beberapa keterbatasan. Sebagai contoh, dalam data benchmark,

cluster yang dihasilkan sangat jauh dari satu sama lain sedemikian rupa sehingga

tidak ada cluster yang terdeteksi yang dapat mencakup bagian-bagian dari cluster

yang berbeda bahkan ketika ukuran cluster spasial maksimum diatur ke 50%

populasi yang berisiko. Fenomena ini menjelaskan mengapa hasil tidak berbeda

dalam kasus dengan ukuran cluster spasial maksimum yang besar. Namun,

kelompok-kelompok itu mungkin terletak berdekatan satu sama lain dalam praktik

yang sebenarnya seperti yang ditunjukkan dalam data kasus campak. Cluster yang

terdeteksi mungkin termasuk area berisiko, yang tidak berdekatan, dan salah

termasuk area non-clustering di sekitarnya. Untuk mengatasi masalah ini, peneliti

harus memilih parameter dengan MCS-P dan mengubah bentuk cluster, yang akan

diselidiki dalam pekerjaan kita di masa depan.

Dua aspek kinerja dianggap untuk mengevaluasi statistik pemindaian

spasial. Namun, langkah-langkah untuk mendeteksi daerah-daerah di dalam

kelompok dapat memberikan evaluasi terbalik terhadap langkah-langkah yang

mewakili kapasitas untuk mengidentifikasi area di luar gugus. Hasil ini umum

dalam studi simulasi. Meskipun tindakan dikondisikan ke set data, pengukuran

kinerja akuntansi untuk satu aspek spesifik dari kinerja, seperti sensitivitas dan

PPV, cenderung memilih ukuran klaster spasial terbesar atau terkecil. Jendela

pemindaian besar yang cukup mencakup lebih banyak bagian dari wilayah studi

lebih mungkin untuk memuat kluster yang benar, sedangkan jendela pemindaian

kecil cenderung tidak memuat area di luar kluster yang benar. Oleh karena itu,

ketika berfokus pada sensitivitas, ukuran cluster spasial maksimum yang besar

harus dipilih. Sebaliknya, ukuran klaster spasial maksimum yang lebih kecil harus

dipilih ketika mempertimbangkan kapasitas untuk mengidentifikasi area di luar

klaster. Selain itu, ukuran cluster spasial maksimum yang besar harus dipilih

(17)

keseluruhan lebih menarik untuk sebagian besar kasus; dengan demikian, ukuran

klaster spasial maksimum yang disukai bervariasi antara model klaster yang

berbeda, dan statistik pemindaian spasial dengan ukuran klaster spasial

maksimum yang berbeda harus diterapkan. Ukuran kinerja keseluruhan, seperti

misklasifikasi dan MCS-P, juga harus digunakan untuk memilih ukuran kelompok

spasial maksimum untuk meningkatkan kinerja. Untuk kumpulan data tanpa

kluster yang benar, MCS-P mungkin satu-satunya ukuran kinerja yang dapat

digunakan. Sebelum menggunakan MCS-P, kluster yang dilaporkan masih perlu

diperiksa. Seperti yang ditunjukkan dalam studi simulasi, MCS-P mungkin tidak

berfungsi sebagai ukuran kinerja keseluruhan yang tepat dalam gugus yang sangat

heterogen yang dilaporkan. Meskipun pemilihan ukuran cluster spasial maksimum

dengan MCS-P dapat meningkatkan kinerja statistik, pendekatan ini akan

mengkonsumsi lebih banyak waktu daripada hanya menggunakan pengaturan

default. Misalnya, MCS-P harus dihitung untuk semua 50 ukuran klaster spasial

maksimum untuk memilih ukuran klaster spasial maksimum yang optimal.

Langkah ini akan mengkonsumsi sebanyak 50 kali waktu komputasi asli. Waktu

perhitungan menjadi lebih lama karena kumpulan data yang diterapkan menjadi

lebih rumit. Oleh karena itu, pemilihan jumlah yang tepat dari ukuran cluster

spasial maksimum yang potensial.

Sumber kutipan

Ma, Y., Yin, F., Zhang, T., Zhou, X. A., & Li, X. (2016). Selection of the

maximum spatial cluster size of the spatial scan statistic by using the maximum

(18)

Naskah Asli

Selection of the Maximum Spatial Cluster Size

of the Spatial Scan Statistic by Using the

Maximum Clustering Set-Proportion Statistic

Ma, Yue; Yin, Fei; Zhang, Tao; Zhou, Xiaohua Andrew; Li, Xiaosong.PLoS One; San

Francisco Vol. 11, Iss. 1, (Jan 2016):

e0147918. DOI:10.1371/journal.pone.0147918

Hide highlighting

Abstrac

Spatial scan statistics are widely used in various fields. The performance of

these statistics is influenced by parameters, such as maximum spatial cluster size,

and can be improved by parameter selection using performance measures. Current

performance measures are based on the presence of clusters and are thus

inapplicable to data sets without known clusters. In this work, we propose a novel

overall performance measure called maximum clustering set–proportion (MCS-P),

which is based on the likelihood of the union of detected clusters and the applied

dataset. MCS-P was compared with existing performance measures in a simulation

study to select the maximum spatial cluster size. Results of other performance

measures, such as sensitivity and misclassification, suggest that the spatial

scan statistic achieves accurate results in most scenarios with the maximum spatial

cluster sizes selected using MCS-P. Given that previously known clusters are not

required in the proposed strategy, selection of the optimal maximum cluster size with

MCS-P can improve the performance of the scan statistic in applications without

identified clusters.

Introduction

Spatial scan statistic, which was introduced by Kulldorff[1], focuses on detecting the

presence and locations of geographic clusters within spatial datasets. The free

software SaTScan[2] allows users to apply this statistic in different fields. A list of

studies that utilized spatial scan statistic is posted in the SaTScan official website[3].

The maximum spatial cluster size is the only parameter that must be selected by

(19)

This parameter is the maximum size that the scanning window can reach as scaled in

terms of spatial distance or percentage of the total population at risk[4]. Ribeiro and

Costa[5] investigated the performance of spatial scan statistics with different

maximum spatial cluster sizes, including secondary clusters; they suggested that

three performance measures are sensitive to the maximum spatial cluster size.

Although simulation datasets support the selection of different maximum spatial

cluster sizes for a specific cluster model, identifying a cluster model applicable for

complex real datasets is complicated. Therefore, the guidelines for selecting the

maximum spatial cluster sizes for real data remain unclear.

Kulldorff[6] reported that a window sized up to 50% of the population at risk can

generally reduce negative clusters. Other researchers also selected lower values for

practical reasons, such as data availability[7], location discontinuity[8], specific

interest on small clusters[9], search for small clusters with high relative risk (RR)[10],

low infectivity of a specific pathogen[11, 12], exploratory analysis for irregular-shaped

clusters[8, 13], or limited available resources for intervention[5]. As such, simple

selection of the maximum spatial cluster size may not be appropriate. The

performance of the spatial scan statistic must be ranked with different parameters in

an application because of varied relationship between the maximum spatial cluster

size and the performance in different data sets. Therefore, a performance measure

that is generally applicable for various applications must be used.

Numerous performance measures are commonly used in simulation studies;

however, few of these measures can be easily applied in real data [14] because they

are based on the presence of given artificial clusters. Identification of all the detected

clusters as true or otherwise is usually impractical. For example, disease surveillance

studies usually have limited available resources; moreover, several performance

measures represent different aspects of performance[15]. As such, outcomes from

multiple measures can be problematic when ranking the performance of different

implementations of spatial scan statistics. Performance measures can be combined

using specific formulas with arbitrary weights, but parameter selection is inevitably

arbitrary. If the overall performance, rather than a specific aspect of performance, is

of interest, then the overall performance measure that is not based on the given

(20)

Performance measures at the aggregation level are commonly used over data sets

generated with a similar underlying model because the former can detect slight

differences among spatial scan statistics with different parameters. However, these

datasets do not exist in reality. Although simulation datasets can be generated from

historical data by using clustering models, this approach is difficult especially when

no such historical data exist[16]. In this regard, a performance measure for a single

data set is preferred than a performance measure based on a batch of data sets

generated with the same model.

In summary, an overall performance measure based on applied dataset, rather than

the known presence of true clusters, can be used to select the optimal spatial

parameters for improving the performance of spatial scan statistics in applications.

However, to the best of our knowledge, this measure has not been developed yet.

This study proposes a novel overall performance measure, namely, maximum

clustering set–proportion (MCS-P), which is based on the likelihood function and is

customized for all significant clusters and applied dataset. A full definition of MCS-P

and additional details are provided in the next section. This new performance

measure is applicable to data sets without known clusters because the presence of

clusters is unnecessary. Section 3 describes the simulation study for selecting the

maximum spatial cluster size to compare MCS-P with existing performance

measures. Section 4 presents the application of MCS-P in case data of measles in

Henan, China, and Section 5 provides the discussion and conclusions.

Methods

Spatial scan statistic

Spatial scan statistics are used to identify the maximum likelihood clusters in the form

of a set Z of spatial units, which reject the null hypothesis in the study area G and

consider p and q as the probability of an event that occurs inside and outside a zone,

respectively. In current applications, we usually focus on detecting zones where p>q.

Although spatial scan statistics vary in terms of the shape of scanning window and

the probability model, most of them employ the logarithm of the likelihood ratio (LLR)

as the test statistic to identify maximum likelihood clusters[17–22]. A maximum

(21)

sub-region Z. The detected cluster is the maximum likelihood estimator of Z.

Let C and cz be the observed number of events in G and z, respectively,

whereas N and nz are the expected number of events in G and z under the null

hypothesis; hence, N = C. Let L(z) be the likelihood under the alternative hypothesis

that z is a cluster and L0 be the likelihood under the null hypothesis; in this case, LLR

is:(1)where L0 is a constant for a given G. The collection z of spatial units can

maximize LLR(z) and L(z).

A scanning window with a pre-defined shape and maximum spatial size is employed

to identify the solution . The size (η) of the window varies between zero and the

maximum spatial cluster size (η(Z)) on each possible focus in G to generate a set of

potential clusters: P = ∪{z|η(z) ≤η(Z)}. The potential cluster in P that maximizes the

likelihood is the estimator of Z and is also called the most likely cluster (MLC). In

addition to this MLC, secondary clusters with high likelihood values are considered.

The precise distribution of the test statistic remains unclear; thus, a Monte Carlo

simulation is employed to obtain the critical value under the null hypothesis. The

LLRs of all potential clusters are compared with the critical value to determine their

significant differences.

Performance measures

Although the capacity to detect the presence of clusters has been widely studied[18,

22–27], the performance or the so-called spatial accuracy of the detected clusters

should also be considered[14, 16, 20]. In most studies, measures concerning two

respective aspects of performance are used in pairs[21, 28–30], with one measure

exhibiting the capacity to correctly identify spatial units inside the true clusters and

the other measure possessing the capacity to correctly identify spatial units outside

the true clusters. Moreover, measures accounting for both aspects are used to

measure the overall performance[5, 31]. The three commonly used performance

measures include sensitivity, positive predictive value (PPV), and misclassification.

Previous studies used the number of spatial units to calculate performance

measures; however, the use of a population-based measure can provide more robust

estimates[32]. Read et al.[14] stated that all spatial units in a study region can be

classified into four types to evaluate the performance of the spatial scan statistic:

(22)

2. Units inside the true cluster(s) but outside the detected cluster(s)

3. Units inside the detected cluster(s) but outside the true cluster(s)

4. Units outside both true and detected cluster(s).

Let the population in each of the four types of spatial units be a, b, c, and d. The

three common performance measures are described as follows:(2)

Sensitivity represents the proportion of the population in the true cluster(s) that is

correctly identified as cluster(s). This measure is used to determine the capacity to

determine true cluster(s).

(3)

PPV, which is commonly used with sensitivity, represents the proportion of the

population in the detected cluster(s) which actually belongs to the true cluster(s). This

measure indicates the capacity to accurately identify spatial units outside the true

cluster(s).

(4)

Misclassification represents the proportion of mistakenly identified populations. This

measure accounts for the population of spatial units within the true cluster(s) but

outside the detected cluster(s), as well as the population of detected spatial units

outside the true cluster(s). If the misclassification is equal to zero, then all spatial

units are correctly identified.

These performance measures are based on the given presence of true clusters and

are not applicable for real data sets with unknown clusters. In this study, we propose

a novel overall performance measure by using the applied data set.

Novel overall performance measure based on applied data sets

Let MLC with η(Z) = i be and the jth significant cluster be , then the maximum spatial

cluster size η(Z) is a parameter of the collection of potential clusters P = ∪{z|η(z)

≤η(Z)}. For a local optimum with η(Z) = i, may differ from the global optimum .

(23)

optimal η(Z) is selected by ranking the LLR of different MLCs. The optimal η(Z)

= i maximizes .

Comparison of the LLR of the corresponding clusters, such as MLC, may be

insufficient for ranking the performance with different η(Z) values when secondary

clusters are of interest. First, pairing of the corresponding multiple clusters is very

complicated. Second, multiple outcomes from comparisons of different paired

clusters may not be consistent. For instance, can be smaller than , whereas can be

larger than . When all the significant clusters are of interest, the significant clusters

classify all spatial units into two sets: clustering set in which events are likely to

cluster, as well as the set in which events in the rest of the spatial units are not likely

to cluster. Therefore, the union of all significant clusters, instead of individual clusters,

can be used as the clustering set when ranking the performance of multiple clusters.

Let the union of significant clusters found with η(Z) = i be Zi0, then(5)

Clustering sets with different η(Z) values can be used to rank the performance of

multiple clusters in a manner similar to that of MLC. With η(Z) maximizing the

likelihood under the alternative hypothesis, events in the clustering set are least likely

to cluster by chance. Moreover, the likelihood function is maximized when LLR is

maximized. Therefore, comparison of the LLR of the clustering set can rank the

performance of multiple clusters with different η(Z) values.

(6)

The LLR conditioned on Zi0 represents the ratio of the likelihood of the clustering set

with η(Z) = i and the likelihood under the null hypothesis. LLR can also measure the

dissimilarity between Zi0 as the clustering set and the null hypothesis. When η(Z)

= i maximizes the LLR(Zi0), L(Zi0) will also be maximized. That is, when η(Z)

= i maximizes LLR(Zi0), the events in Zi0 are more likely to cluster than any other

clustering sets found with other η(Z) values.

Although LLR(Zi0) can be used to rank the performance of the spatial

scan statistic with different η(Z) values, the range of LLR(Zi0) may differ because of

the spatial distribution of events. For instance, a non-clustering spatial unit

surrounded by clusters may be included in the scanning window. The non-clustering

spatial units with relatively high RR near a cluster are more likely to be included in

(24)

values of LLR(Zi0), even in data sets generated with the same model. In addition,

most existing performance measures are built in the form of proportions and rahnge

from 0 to 1. Thus, the adjustment of the effect of spatial distribution on the data set

would render the measure comparable with existing performance measures. An

approximate maximum of LLR from G is therefore used. The spatial scan statistic is

employed to detect clustering spatial units with p>q; hence, the union of spatial units

with RR higher than 1 is selected as the most clustering set (MCS) to obtain the

approximate maximum LLR in G.

(7)

Subsequently, we adjust LLR(Zi0) with LLR(ZMCS), such that the performance measure

MCS-P is:(8)

MCS-P represents the ratio between the LLR of the clustering set with η(Z) = i and

the approximate maximum LLR in G. LLR describes the relative support of the

alternative hypothesis against the null hypothesis. MCS-P presents the closeness of

the relative support of the clustering set to the maximum support obtained from the

dataset. With this adjustment, MCS-P ranges from 0 to 1, which is similar to that of

other performance measures. The denominator LLRMCS is the approximate maximum

LLR obtained from G. In extreme cases, the LLR of the clustering set may be higher

than that of MCS. Although no such case was found in the present study, we should

note that MCS-P is an approximate relative performance measure.

Simulation Study

Simulation data

Simulated benchmark data sets based on a real data set of breast cancer mortality

[33] were used in this study. The population at risk in the simulated data analysis is

the female population from the 1990 census, which contains 29,535,210 individuals

in Northeastern USA. The study region consists of 245 counties in Northeastern

USA[23].

Fifty scenarios were built with 50 different circular cluster models. The models

contain two different total simulated case numbers of 600 or 6000; five different

(25)

containing one cluster located in rural, mixed, or urban area. Two clusters were

located in rural and urban areas, whereas three clusters were located in all the three

areas. These benchmark datasets are available at the SatScan website[34] and

commonly used to evaluate different clustering tests[23] or spatial scan statistics with

scanning windows of different shapes and parameters[24, 32, 35]. Details of the

cluster models are given in Table 1.

[Figure omitted. See PDF.]

Table 1. Simulated cluster models.

https://doi.org/10.1371/journal.pone.0147918.t001

In the latter part of the paper, scenarios are mentioned in the form of “total case

numbers-cluster location-cluster size.” For instance, 600-two-1 refers to the data sets

in the scenario with 600 total simulated cases and two clusters located in urban and

rural areas, with each cluster covering only one county.

Spatial scan parameters

Each of the 50 different maximum spatial cluster sizes were set to increase from 1%

to 50% by increments of 1% for the total population at risk for each data set. Only

clusters with P-values less than 0.05 were considered significant. In cases when no

significant clusters were detected, the detected population was set to zero. Based on

previous study, the inclusion of secondary clusters that overlap with more likely

clusters does not improve the performance[5]. As the default reporting criteria for

secondary clusters, only the secondary clusters unrelated to any likely clusters were

reported. With each maximum spatial cluster size, the performance of the circular

scan statistic was evaluated by MCS-P with three existing performance measures,

such as sensitivity, PPV, and misclassification.

Agreements of MCS-P with other performance measures in different scenarios

To validate MCS-P for different cluster models, we defined the values of each

performance measure that differs by less than 0.01 (1%) for each data set as the

values close to the optimal result[5]. The 50 different maximum spatial cluster sizes

(26)

1. Results of MCS-P and the other performance measure are close to the optimal

result

2. Only the result of the other performance measures is close to the optimal result

3. Only the result of MCS-P is close to the optimal result

4. None of the results of MCS-P and the other performance measure are close to the

optimal result.

For types 1 and 4, MCS-P provided results similar to those of other performance

measures. The agreement of MCS-P with each existing performance measure was

reported for each cluster model. Agreement represents the similarity of MCS-P and

other performance measures for identifying whether a maximum spatial cluster size is

close to the optimal value. An agreement of 100% indicates that all 50 maximum

spatial cluster sizes are accurately identified and similar between that derived from

MCS-P and from another existing performance measure.

The average agreement values of MCS-P with sensitivity, PPV, and misclassification

are 86.6257%, 66.2698%, and 81.3829%, respectively. Most MCS-P results are

similar to those of sensitivity and misclassification. Although relatively low at more

than 66%, the results obtained using MCS-P are similar to those of PPV. Generally,

when the values of MCS-P are close to the optimal results, other performance

measures also achieve values close to the optimal results. Moreover, MCS-P works

better with sensitivity than that with misclassification and PPV. Despite the arbitrary

cut-off points for determining whether the values of the performance measures are

close to their optimal results, the results show the high agreement of MCS-P and the

other performance measures.

The agreements of MCS-P in different scenarios are shown in Table 2. For 45

scenarios, MCS-P exhibits high agreement values with other performance measures

similar to the average agreement. For five scenarios including 600-two-1, 600-two-2,

6000-two-1, 600-three-1, and 6000-three-1, MCS-P exhibits low agreements with the

other performance measures.

(27)

Table 2. Agreements of MCS-P with the other performance measures in different

scenarios.

For measuring the capacity to accurately detect true clusters, the results of MCS-P

are generally similar to sensitivity in all scenarios, except for the abovementioned five

cases. For measuring the capacity for correct identification of spatial units outside the

clusters, the agreement of MCS-P with PPV varies in different scenarios. In

600-rural-1, which generated the highest RR and the smallest population, MCS-P exhibits

high agreement with PPV. For large clusters with low RR, the agreement of MCS-P

with PPV decreases. As an overall performance measure, MCS-P manifests similar

results to those of misclassification for most scenarios. That is, with PPV and

misclassification, MCS-P is highly accurate in scenarios containing clusters with high

RR and small populations. Moreover, MCS-P with sensitivity always exhibits high

agreement in most scenarios.

The result of MCS-P with the other performance measures is less accurate in the five

scenarios than that in the other scenarios; as such, multiple clusters are generated

by different cluster models. One cluster is constructed with very high RR and a small

population in a rural area, whereas other cluster(s) possess low RR with a large

population in urban areas (and mixed areas), as shown in Table 1. Hence, the

clusters are highly heterogeneous. Based on the likelihood of all clustering zones,

MCS-P treats clustering zones from different clusters as a homogeneous clustering

set. Therefore, in cases with multiple instances of highly heterogeneous clusters,

larger values of MCS-P are more likely to be achieved when only the cluster with high

RR and a small population is included compared with that when all the clusters are

included. The clusters become less heterogeneous with increasing cluster sizes;

therefore, the agreements of MCS-P with the other performance measures increase.

Additional details are provided in the comparison of the average MCS-P and other

measures for different cluster models.

Comparison of average MCS-P and the other performance measures for each maximum spatial cluster size

In each scenario, the maximum spatial cluster sizes near the optimal value were

(28)

maximum spatial cluster sizes were compared with those of measures with other

maximum spatial cluster sizes to determine whether the selection can improve the

performance of the spatial scan statistic. The mean values of the performance

measures over replicas in the same scenarios were reported for each maximum

spatial cluster size to provide detailed information regarding the relationship between

MCS-P and the other performance measures.

Generally, with the selected maximum spatial cluster sizes in most scenarios, high

values of MCS-P correspond to high values of sensitivity and PPV and low values of

misclassification. Selection of the maximum spatial cluster sizes using MCS-P,

sensitivity, PPV, and misclassification suggests that the spatial

scan statistics achieve accurate results for most cluster models.

The comparison of averaged MCS-P and other performance measures in different

scenarios shows their detailed relationships. The average MCS-P is positively related

to average sensitivity and PPV but negatively associated with average

misclassification in most scenarios. Similar to the agreements of MCS-P with the

other performance measures, the five scenarios containing highly heterogeneous

clusters, namely, 600-two-1, 600-two-2, 6000-two-1, 600-three-1, and 6000-three-1,

exhibit different relationships between MCS-P and the other performance measures.

The relationship between average MCS-P and the other performance measures are

presented for several typical scenarios. The summary of 6000-three-8 (Table 3)

shows the relationship between the average MCS-P and the other performance

measures. The optimal results of each measure are marked in boldface, whereas

values that differ by less than 0.01 (1%) from the optimal results are underlined. For

the underlined values of MCS-P, the sensitivity and misclassification mostly overlap,

which implies that the spatial scan statistic with the maximum spatial cluster sizes

selected by MCS-P achieve values close to the optimal results of these measures.

Therefore, maximum spatial cluster sizes can be selected with MCS-P to obtain

accurate results for other performance measures. Hence, selection of the maximum

spatial cluster sizes with MCS-P can improve the performance of spatial

scan statistics. Detailed relationships between MCS-P and other performance

measures are presented in Fig 1. The results of the other performance measures

(29)

between MCS-P and other performance measures can be found in the remaining 45

scenarios.

Table 3. Average performance measures for different maximum spatial cluster sizes

in 6000-three-8.

Fig 1. Average MCS-P and other measures in 6000-three-8.

https://doi.org/10.1371/journal.pone.0147918.g001

The five scenarios show the irregular relationship, similar to the agreements of

MCS-P and the other performance measures. Given its conditioning on the likelihood of all

significant clusters as a homogeneous clustering set, MCS-P cannot correctly

measure the performance of multiple instances of highly heterogeneous clusters.

This trend is particularly typical in 600-two-1, as shown in Table 4. One cluster in the

rural area presents a high RR value of 192.89 and a very small population of 2675,

whereas another cluster in the urban area has a large population of 786178 but a low

RR value of 2.73. In the 600-two-1 scenario, the exclusive inclusion of the former

cluster provides higher MCS-P values for the maximum spatial cluster sizes of 1%

and 2% of the population at risk (Table 5). When parts or the entire latter cluster is

included with a large maximum spatial cluster size of over 3%, MCS-P sharply

decreases.

in 600-two-1.

(30)

in 6000-two-16.

This limitation disappears with increasing cluster sizes because of reduced

heterogeneity of the clusters. If very small clusters with great heterogeneity are

reported, then MCS-P may not be used as an appropriate performance measure.

Interestingly, the statistic shows details of the relationships between MCS-P and the

other performance measures. Results of 6000-two-16 describe clearly the details of

this feature in Table 5 and Fig 2. The relationships between MCS-P and the three

measures can be divided into two stages. The cut-off point is the very first value

close to the optimal results of MCS-P, where MCS-P = 0.2701. During the first stage

before MCS-P reaches the values close to the optimal results, sensitivity, PPV, and

misclassification indicate that improved performance is achieved with increasing

MCS-P. Hence, MCS-P works well with the three other performance measures. After

MCS-P reaches the values close to the optimal results, other performance measures

begin to fluctuate slightly around the optimal results. During this stage, MCS-P

increases very slightly (no more than 0.01) when areas with relatively low RR and

small populations are included. Under this condition, the values of the other

measures may slightly decrease but are still close to the optimal results.

Fig 2. Average MCS-P and other measures in 6000-two-16.

This figure shows two stages of the relationship between average MCS-P and the

other performance measures. The vertical line shows the cut-off point where the first

value close to the optimal results of MCS-P is achieved.

The fluctuation of existing performance measures around the optimal value of MCS-P

is due to the fact that marginally expanding or reducing the cluster size will not

significantly alter LLR[4]. Thus, MCS-P in such cases will slightly increase when

including a new area with a relatively low RR but not a very large population. In these

(31)

included. The inclusion of areas with plain RR and large population can decrease

MCS-P. This trend can be found in 600-rural-1. In Table 6, the sensitivity remains

equal to 1 when the true cluster is all included. However, large maximum spatial

cluster sizes, including areas outside the true cluster, lead to a steep decrease in

MCS-P. Therefore, the maximum spatial cluster sizes with MCS-P values close to the

optimal results can be selected to improve the performance of the statistic.

in 600-rural-1.

Measles Incidence Data in Henan, China

We applied MCS-P to case data of measles on the county level in Henan province,

China in May 2009; data were extracted from the disease reporting system of China

CDC. A total of 1,371 cases of measles among a population of 91,669,661 were

reported, and the annual incidence rate was 17.6 per 100,000. The data were

analyzed using 50 maximum spatial cluster sizes following the simulation study.

MCS-P was used for evaluation, and the result demonstrating the maximum MCS-P

value was selected and compared with the result obtained using the default

maximum spatial cluster size of 50% population at risk.

Maximum MCS-P was achieved when the maximum spatial cluster size was set to

2% of the total population. A total of 649 cases of 14,369,140 individuals in 21

counties were detected using a maximum spatial cluster size of 2% (Z2), whereas

886 cases of 28,859,679 in 41 counties were detected using a maximum spatial

cluster size of 50% (Z50). The relative risk of Z2 and Z50 were 3.0200 and 2.0527,

respectively, and both were located in the same parts of the study region. Variations

in edges were found in clusters located southwest and southeast (Fig 3). To provide

additional details, we compared the counties in Z2 and Z50. Twenty of 21 counties in

Z2 were also found in Z50, and the remaining 22 counties that differed between Z2

and Z50 are shown in Table 7.

(32)

Fig 3. Incidence of measles in Henan in May 2009 and clusters detected with default

(a) and selected maximum spatial cluster size using MCS-P (b).

Administrative codes of 42 clustering counties are labeled. RR of each counties are

presented as bar.

Table 7. Different counties in clusters detected using maximum spatial cluster sizes

of 2% and 50%.

A total of 12 of 22 counties showed lower incidence rates than the average value.

Based on the purpose of scan statistics, these counties were probably incorrectly

identified as clustering counties when searching for significantly high-risk spatial

units. As shown in Fig 3, these counties, which were located at the edges of clusters

1 and 2 in Z50, are close to the high-risk counties. Therefore, these counties were

included with near multiple high-risk areas in Z50. For instance, despite the absence

of cases in 411224 and 411330, these counties were misclassified into Z50. In Z2,

the use of MCS-P in maximum spatial cluster size selection excluded counties with

low incidence rates.

Insignificant clustering counties with RR higher than 1 in Z50 were also excluded in

Z2. These counties were surrounded by low-risk counties and were insignificant as a

single potential cluster. In particular, 411723 showed higher RR (1.8574) than that of

the surrounding counties and functioned as a potential cluster. The result suggested

that high RR possibly occurred by chance. These counties were excluded in Z2 when

MCS-P was used in the maximum spatial cluster size selection.

Although this county was insignificant in Z50, 411623 was regarded as the only

clustering county in Z2 because the critical value of LLR is related to the maximum

spatial cluster size. At a significance level of 0.05, the critical values for maximum

spatial cluster size of 2% and 50% were 6.0221 and 7.5736, respectively. Therefore,