• Tidak ada hasil yang ditemukan

sj.ctu.edu.vn

N/A
N/A
Protected

Academic year: 2024

Membagikan "sj.ctu.edu.vn"

Copied!
13
0
0

Teks penuh

(1)

Tgp chi Khoa hgc Trudng Dgt hgc Cdn Tha Phdn A: Khoa hgc Tif nhiin, Cdng nghi vd Moi trudng: 33 (2014): 9-21 #

Tap chi Khoa hpc Tn/dng Dai hgc Can Thd website: sj.ctu.edu.vn

XAY DlTNG HE THONG HO TRO KHUYIiN N 6 N G TREN CAY LUA QUA MANG T H 6 N G TIN DI DQNG

Luang Thd Anh', Nguydn Thai Nghe^ va Nguyin Chi Ngdn^

' Trung Tdm NN- TH, Trudng Dgi hgc Xdy dung Miin Tdy

^ Khoa Cong nghi Thong tin & Truyin thong, Trudng Dgi hgc Cdn Tho

^ Khoa Cong nghe, Trudng Dgi hgc Cdn Tha

Thdng tin chung:

Ngdy nhdn: 08/04/2014 Ngdy chdp nhdn: 28/08/2014 Title:

Development of an mobile communication network- based agricultural extension support system Tirkhda:

Hi thong ho trg khuyen nong, tdch tit tiing Viit, phdn logi vdn bdn

Keywords:

Agricultural extension support system, Vietnamese word segmentation, text classification

ABSTRACT

The objective of this research is to build a link system between farmers and agricultural experts to support the agricultural extension via mobile communication network and to collect real data used for developing automatic agricultural extension systems in the future. This system can be considered as "24/7 farmers link program ". To build this system, at first, we need to build modules for sending and receiving SMS and MMS messages. These modules are important for farmers to send data of rice status to agricultural experts for receiving consultations. Next, a message classification module is built based on a combination of machine learning methods (e.g. SVM) with image and text processing technologies. To make it more convenient for experts and system users, a website is build to integrate these modules into the whole system. Preliminary results show that the construction of this system is feasible. This is also the foundation for building an online automatic agricultural extension support system through mobile communication network.

TOM TAT

Nghien cdu nay dugc thifc hiin vol muc tiiu xdy dung mgt hi thong nhip cdu giiia nhd nong vd cdc chuyin gia nong nghiep nhdm ho trg cong tdc khuyen nong (trudc mdt Id tren cdy lua) qua mgng thong tin di dgng, dong thai thu thgp dir liiu thuc te dung de phdt trien cdc hi thong khuyen nong lu dgng sau ndy. Hi thong ndy co thi dugc xem Id "nhip cdu nhd nong 24/7". Di xdy dung dugc hi thong, trudc hit ta cdn xdy d\fng mo-dun gin vd nhgn tin nhdn SMS/MMS. Cac mo-dun quan trgng ndy ho trg cho nong ddn giri du liiu vi tinh trgng cua cdy lua di dugc tit vdn bdi cdc chuyen gia nong nghiep. Tiep din, mgt md-dun phdn logi tin nhdn dugc thiit I4p dua trin su kit hgp cdc phitang phdp mdy hgc vdi cong nghi x^f ly <itih vd xit ly vdn bdn. De thudn l(n cho cdc chuyin gia vd ngudi dimg hi thong, mgt website dugc xdy dung nhdm tich hgp cdc md-dun trin lgi vdi nhau.

Ket qud nghien cuu budc ddu cho thdy viec xdy dung hi thdng ndy Id rdt khd thi. Do cung la nen tdng de xdy dung hi thong ho trg khuyin nong Ut dgng true tuyen qua mgng thong tin di dgng.

(2)

Tgp chi Khoa hgc Trudng Dgi hgc CSn Tha 1 GlCfl THIEU

Viet Nam hien li mgt nude ndng n ^ e p , phin Idn ngudi din sdng chii ylu dya vio trdng trot vi chin nudi (Tdng cue thong ke, 2014a). Trong do, ciy Ilia ddng vai Ud quan Upng, li ngudn an ninh lugng thyc chu ylu (Tdng cue thdng ke, 2014b,c), Nganh trdng lua di dgt dugc nhimg thinh tyu dang kd dua Vidt Nam trd thinh nude cd sin luong gao xuit khiu tdn hang diu thi ^di.

Ngiy nay, vide Udng lua Ud ndn khd khan, phuc tap hon do thudng xuyen phit sinh nhidu loai siu benh Ig vi mdi trudng, khi hiu bi d nbidm. Vi viy, viec Udng lua ngiy nay doi hdi phii cd su ti'ch lily nhimg kinh nghidm, tich hgp cic tti thdc va thdng tin td nhieu ngudn khic nhau. De duy tri kha ning canh Uanh, ning cao nang suit, chit lugng hgt ggo, ngudi ndng dan hi$n dgi thudng dya vao cic chuyen gia vi cic cd vin ndng nghidp 6^ cung cip kien thirc, thdng tin cho vide ra quylt dinh.

Khd khan d ch5 la cae chuyen gia khdng phii liic nio ciing ludn cd sin khi nha ndng cin din vi chi phi ma ndng din bo ra dd duge hd ttg li khi cao.

Vdi sy phit Uien cda mgng thdng tin vi cic thiet bi di dpng, khi ning tilp cin vdi tri thuc thdng qua mang thdng tin di dpng ngay cing Ud nen don giin va phd biln vdi mpi thanh phin xa hdi, ching hgn nhu djch vu Ua ciiu diem thi tuyen sinh qua SMS, djch vy tin nhin SMS phuc vu bgn dpc sd dyng thu vidn, dich vy Ua cdu thdng tin chiing khoan, ^ gii, gii vang, SMS Marketing, SMS Banking,... Ben canh dd, cdng nghe thong tin v i eic giii phip miy hpc di phit triln kha mgnh md ttong nhihig nim gin diy, ttong khi dd ngudn d\i lieu ndng nghidp dinh cho khai phi hien cdn rit khan hiem. Hi^n tgi, d Ddng bing sdng Cuu Long chua CO hi thdng tin hgc nao dugc xdy dung di ho trcf cong tdc khuyen nong va thyc hien thu thgp da lieu qua mgng thdng tin di d$ng.

Tir thyc tiln do, vide xiy dyng mdt he thdng nhim hd vtq vl m|t thdng tm, 1^ thuit cho nha ndng ddng thdi dd thu thip dii: lidu thyc tl qua mgng thdng tin di d^ng l i rat cin thiet, cip bach va hgply.

Phdn A: Khoa hgc Tu nhien, Cdng nghi vd Mdi trudng: 33 (2014): 9-21 Trong bai viet niy, chung toi di xudt mgt gidi phdp mdi nhdm ho trg cho cong tdc khuyen nong ctt the Id khuyen nong qua mgng thong tin di dgng bang tin nhdn SMS/MMS. Bai vidt tap tnmg nghidn ciiu cac cdng nghe cung nhu cic phuang phip de xay dyng nhung md-dun thiet yeu cho he thdng nbu md-dun gdi va nhin tin nhan SMS/MMS, md- dun phan log! tin nhin ty ddng bai^ ky thuit phin logi van ban diing gjM thuit SVM vi sau ciing Ii xay duDg mdt website hoin chinh dd tich hgp cic module tren, quin Iy v i ciu hinh be thdng.

Vdi sy ra ddi ciia he tbdng niy se khic phuc dugc mpt phin nbOng khd khan cua ngudi ndng dan ttong qui tiinh sin xuat lua gao, vdi khi nang u i ^ dung rdng rii trong rfnh vye ndng nghiep nd duge xem li mdt cdng cu him ich vdi tiem nang rdng ldn de hd ttg IQ* thuit cho ndng dan mpt cich kip thm, tiet kiem chi phi v i la mpt cdng cu hieu qui de thu thip dir lieu thyc tiln lam co sd cho vide phit trien cic he thdng hd ttg hoin toan ty ddng sau niy.

2 XAY DlTNG H $ THONG

2.1 Md hinb boat dpng tong the ciia h£ thdng Md hinh tdng thd cua he thdng dupe bieu dien nhu Uong Hinh 1, 6 do, khi nhi ndng cd van dd/eiu hdi (ching hgn nhu liSn quan din benh hai tten ciy liia, cin tu vin cich dieu Uj,...) thi hp cd thi dgt ciu hdi bing tm nhin SMS hoic chyp lgi hinh inh hien trgng (MMS) (ed thi kem theo ciu hdi) dl glii din he thdng bing dien thogi di ddng.

Yeu cau nay sd dugc he thdng chuyen den cac chuyen gia thich hgp ttong tiing linh vyc dd dupe giii dip. Ngay sau khi nhin dugc phan hdi tu phia chuyen gia, he tiidng se phan hdi lgi kdt qui cho nhi ndng. Nbu viy, vdi y tudng niy, he thdng cung cd thd xem nhu "Nhgt cau nhd nong trfrc tuyen 24/7".

Ddi vdi tin nhan hinh inh, hien tgi he thdng chua hd ttg phan logi ty ddng, khi he thdng nhin dugc hinh anh mi nhi ndng gin ldn, dieu phdi vien se xem xet v i phin logi hinh anh do rdi gdi cho chuydn gia giai dap, sau khi nhan dugc ciu tii Idi eua chuydn gia thi he thdng se ty ddng giii ndi dung tri ldi cho nhi ndng.

(3)

Tap chi Khoa hgc Trudng Dai hgc Cdn Tho Phdn A • Khoa hgc Tif nhien. Cong nghi vd Mdi trudng: 33 (2014). 9-21 (^

^ OJtapbig

{SUS, MMSi..' "t,-|*.*m i S M J l l U S i

Hinh 1: Md hinh boat dpng eua h | tb6ng Ddi vdi tin nhan van ban, khi nhin dugc cau

hdi ciia nha ndng qua md-dun gdi nhan tin nhan SMS, md-dun phan logi tin nhin SMS ty ddng se ty ddng thyc hipn mdt sd budc tiln xu \^ co ban nhu tich tir, chgn tir khoa hay lorn bo tir dimg- Do vin bin li tin nhin, nSn sd lugng td khoa khdng nhilu vi it khi lip lai, nghien ciiu niy dua ra hai phuong an chpn tir khda la phuang an thii cdng va phuang in ty dpng.

Vdi phuang in thu cdng thi he thdng se giii lgi nliQng tu cd ttong danh sach td khda (tgp dac trung vfin b ^ ) di dupe xay dyng thu cdng bdi cac chuyen gia tir trudc, sau dd vec-ta hda cac td dupe gi(J lai do vi dua vao md hinh cua phuong an nay dl phin logi.

- Vdi phuang in ty dpng thi he thdng se chgn ttr khda bang cich logi bd cie tir dung (stopwords) la nhfhig tir thudng xuit hien ttong van ban nhung khdng c6 gia Uj phan logi chang hgn nhu tii "va",

"nhung", "ed", "khdng", sau do vee-to hda tit c i eic tii cdn lai vi dua vio md hinh cua phuang an nay de phin logi.

Trong giai dogn ban diu niy, bd tir khda vi tip dO' li^u do nhdm tic gii thu th|p va xay dyng chua du ldn va da dang nen vide phin logi ty ddng chi nhim muc dich minh hpa cho md-dun phan lo^ ty dpng vi kilm tta vide van hanh cua he thdng, cdn vi?c phan logi vin li mpt h? thdng ban ty ddng, cd sy kilm tta, giam sit cua didu phdi vien. Do he thdng hipn la ban ty dpng nen dieu phdi vien can kilm tra kSt qui phin logi cua md-dun ty dpng v i thyc hien phan loai lai dl lam co sd cho viec xay

dyng va huin luyen lgi md hinh, sau khi didu phdi vien phan logi, ciu hdi mdi dugc chuyin din chuyen gia thich hgp d l tra ldi. Khi n h ^ duge eiu tra ldi tii chuySn gia, he thdng ty d^ng gdi nOi dung Ua ldi cho nha ndng.

Khi hp thdng nhin du sd lugng tin nhin SMS mdi din (theo ciu hinh ciia h^ thdng), hd thdng s2 ty ddng xiy dyng lai bp tu khda v i huin luy^n lai md hinh vdi bp td khda va dil lieu mdi, sau khi huin luyen xong hd thdng sd su dyng md hinh mdi huin luyfin vao phin logi tin nhin mdi din h|

thdng. Hd thing se lap di lap di l§p lgi vi?c xiy dyng lai bp td khda va huan luy^n lgi md hinh cho din khi lugng dG' lidu thu thip dii ldn vi d^ ehinh xic phin loai la chip nhin dugc (do qudin trj vien cua he thong quyet djnh).

Todn bo quy trinh xdy dung hi thong dugc (dm tdt qua cdc budc sau:

Xay dyiig md-dun giii/nliin tm nhSn SMS/MMS

- Xiy dyng md-dun phan logi ngi dung tin nhan van bin bang ky tiiuit SVM.

Tich tu bing cdng cy VnTokenizer.

- Chpn tu khda hoic logi bd tu dimg (tiiy theo cau hinh he tbdng).

- Xiy dyng bp dii Hdu huin luyen cho md hinh SVM.

Huan luy?n md hinh.

Danh gii ket qui.

(4)

Tgp chi Khoa hgc Trudng Dgi hgc Cdn Tho Phdn A: Khoa hgc Tu nhiin, Cdng nghi va Moi Irudng: 33 (2014): 9-21

Xiy dyng website hoin chinh dd tich hpp toin bp cic md-dun, quan ly v i ciu hinb he thong,

Sau diy chiing tdi se ttmh bay chi tilt eic budc dd xiy dyng cic md-dun nay.

2.2 Xay dyng md-dun gui v i nhin tin nhin 2.2.1 Tin nhdn vdn bdn ^MS)

Gdi tin nhdn: Vd tdng till, ed 2 each dl gui tin n h ^ SMS td didn thoai di dpng din miy tinh (Developer's Home, 2014).

Cdch 1: Kdt ndi difin thoai di ddng hoac modem GSM/GPRS/3G vao miy tinb, Sau do diing tap lenh AT d^ chi thi cho dien thogi hoic modem gui tm nhin SMS. Bk gdi tin nhin, trudc hit cin lip SIM dugc nha mang cung cip vio didn thogi hoac modem, sau do ket ndi modem vio miy vi tinh bing diy cap, hdng ngogi ha;/ bluetooth. Sau khi ket ndi thinh cdng, ta cd thd didu khiln modem bing cich gui chi thi den nd. Chi thj dugc sii dyng dl dilu khiln modem dupe gpi li tip l^nh AT (vilt tit ciia ATtention). Tap Ipnh AT li nhihig ehi thj dugc sd dyng de dieu khien modem hay didn thoai di d^ng, danh sich eic ldnh thdng dyng dugc md ti Uong Bing 1,

Bang 1: Mdt sh lenh AT diing de giri tin nhan Lgnh AT Cdng dung

AT + CMGS Gdi tm nhin AT + CMSS Gui tin nhin tir bp luu Uii AT + CMGW Ghi tin nhin vao bp nhd AT + CMGD Xoa tin nhin AT + CMMS Gui tiidm tin nhin

Cdch 2- Ket ndi miy tinh vdi Trung tam SMS (SMSC) hoac SMS Gateway ciia mgng khdng diy hoic nhi cung cip diclL-_vy SMS. Sau dd gdi tin nhan SMS bang cich su dyng cac giao tiiiie/giao di?n dugc ho trp bcri SMSC hoic SMS Gateway,

Cich giri tin nhin thdng qua modem hay didn thogi di ddng ket ndi tryc tidp vdi miy tinh ed ban che l i tdc do giil tin nhin SMS rit thip. Nlu cin toe dp glii cao hon thi can thidt phii thilt lip kit n6i tryc tilp din Trung tim SMS hoic SMS Gateway cua mgng khdng diy. Kit nil nay co the dupe thyc hidn qua mgng Internet hoac kdt ndi quay so. Neu khdi^ thd kdt n6i tryc tiep din Tmng tim SMS hogc SMS Gateway ciia mgng khdng diy thi ta cd the kit noi ddn SMS Gateway eua m0t nha cung cip dich vy SMS nao dd, luc do SMS Gateway niy sd ehuyin tilp tin nhin SMS din mdt Trung tim SMS tiiich hpp, Sau khi dang k^ v i thilt Igp tii khoin vdi nhi mgng khdng day hoic nha cung eip dich vu SMS, ta di cd thi bit dau gui tin

nhin SMS bang each su dung eic giao thuc/giao dien dugc hd Up bdi SMSC hogc SMS Gateway.

Nhgn tin nhin.* Tuong ty nhu vide gui tin SMS, ciing cd 2 each de nhin tin nlian SMS Udn miy tinh.

Cdch 1: Ket ndi di?n tiiogi di dgng hoic modem GSM/GPRS/3G vao miy tinh. Sau do dung miy tinh vi tip lenh AT dd doc tin nhan nhan dugc tu dien thoai di ddng hoac modem. Viec nhgn tin nhin SMS thdng qua mot modem cd m0t lgi till la nhi mgng khdng diy thudng khdng tinh phi nhin tin nhan khi dung vdi mgt Md-dun Nhdn dien Thui bao (the SIM). Bit lgi cua viec nhan tin nhin theo cich niy la modem khdng tfal xd ly mdt so Iugng ldn luu lugng tm nhan SMS truy cip. Cd mdt each dk giii quylt vin dl nay do la sd dyng nhilu modem de can bang tii luu lugng SMS tmy cip.

M5i mpt modem se co mdt the SIM vi sd thud bao ridng.

Bang 2: Mpt sd Ilnh AT diing d l nhin tin nhin SMS

L^nh AT Cdng dung AT + CNMI Dd xic dinh tin nhin mdi AT + CMGL Liet ke tit c i tm nhin AT + CMGR Dpc tin nhin

Cdch 2: Tmy cap den Trung tim tin nhan (SMSC) hogc SMS Gateway cua mgng khdng diy.

Mgi tin nhan SMS nhin dugc sd duge chuyen tiep den may tinh thdng qua giao thuc/giao dien dupe h5 ttg bdi SMSC hogc SMS Gateway.

Ciong gidng nhu viec gui tin nhan, viec nhgn tin nhan tiidng qua didn tho^ hoic modem GSM/GPRS cd mdt sd ban che, dd la tdc dd tmyen tii SMS qua thap, Neu can tdc dp cao hon thi can thiet phai thiet lap ket ndi tryc tiep ddn Trung tim SMS hogc SMS Gateway ciia mgng khdng diy.

Sau khi thiet lip mpt tai khoin vdi nhi mang khdng day hoac nha cung cap dich vy SMS, SMSC hogc SMS Gateway se bat diu chuyen tidp cic tin nhin den iing dung SMS bang cich su dung mpt sd cac giao thiic/giao dien. Cung gidng nhu viec gui tin, dl kit ndi din SMSC, bit budc phai cd cac giao thuc SMSC. ViSc nh|n tin nhin theo each nay cung de nhu vi?c gdi.

2.2.2 Tin nhdn da phuang tiin (MMS) Cdc giao thdc cda md hinh MMS: Cic thiet hi di ddng (MMS Clients) va cac Trung tim tin nhin da phuang tidn mudn lien lac dugc vdi nhau phii thdng qua cae giao thuc. Cd hai tieu chuin quan Upng de dinh nghia cdng nghe MMS, mpt dugc

(5)

Tgp chi Khoa hgc Trudng Dgi hgc Cdn Tho Phdn A: Khoa hgc Tyr nhiin, Cdng nghi vd Mdi Irudng- 33 (2014). 9-21 , xuit hin bdi 3GPP, mdt dugc xuit bin bdi Open

Mobile Alliance (OMA). Hai ca quan tieu chuan nay hgp tic dl dinh nghia cic giao tiiuc MMS. Khi ndi din MMS la noi din cac giao tiidc lien quan.

Cd tat c i mudi mpt logi giao tiidc uong md hinh kiln tnic eia MMS (NowSMS, 2014).

a. MM 1 li giao tiiuc dugc su dung giiia thilt bi di ddng vdi Trung tim tin nhan MMS (MMSC).

Nd dinh nghTa each tiiuc ma mdt diSn thogi di ddng glii va nhan tin nhan thdng qua MMSC.

b. MM2 li giao thue nim gi&a MMS server v i MMS relay.

c. MM3 la giao thuc dupe su dung giiia trung tim MMS va cae he thdng tm nhin khae. Giao thdc nay thdng qua mdi trudng Internet dl kit ndi vdi server ben ngoii. Tren thyc te, giao tiiirc nay chu ylu dugc thyc hi?n thdng qua giao thirc email SMTP.

d. MM4 la giao thdc dugc sd dung d6 ket ndi hai trung tim MMS lai vdi nhau. Giao thuc nay cin thilt cho vide ttao ddi tin nhin da phucmg tien giiia eic mdi trudng MMS rieng bidt (nhu giiia hai mang di dpng khac nhau).

e. MM5 la giao thde cho phep tic ddng qua lai giiia Uung tim MMS va cic thanh phan mgng khic nhu bp ghi djnh vi thudng tni HLR hoac mgt DNS.

f. MM6 la giao thue cho phep tuang tic giua trung tam MMS vi co sd dd !ipu ngudi diing.

g. MM7 la giao thdc dupe sd dung de cho phdp cac ling dung ciia nha cung cip dich vu gia tti gia tang (VASP) gui va nhin tin nhin MMS thdng qua mdt MMSC. Giao thuc MM7 dugc dinh nghia hoan chinh bdi 3GPP, vi li mpt giao thirc dya ttdn SOAP.

h. MMS la giao tiiuc duge su dung giUa trung tam MMS va mot he thdng thanh toan Ua sau.

i. MM9 la giao thuc "dupe su dung gida trung tim MMS va he thdng tta trudc true tuyln.

j . MM 10 la giao thiic cho phep tuang tic giira trung tim MMS vi mpt ca quan kiem soat dich vy tin nhin (MSCF).

k. EAIF li giao tiidc ddc quyen dupe dinh nghia bdi NOKIA, li giao thuc md rdng eiia giao thiic MMl vi thi nd c6 thi duge su dung cho cac nlia cung cip dich vy gia tri gia ting.

Trong khudn khd bii vilt, chiing tdi khdng di siu nghien ciiu cac giao thuc MMS ciing nhu cac

kiln tnic chi tilt ben ttong he thdng MMS, chi tilt cd thi tham khao tgi (NowSMS, 2014).

Cdch nhan tin nhdn MMS: Vd co ban, viec nhin tin nhin tir miy tinh cd thi dupe thyc hi?n bing hai phuang thdc kit noi khac nhau.

Cdch 1: Tin nhin MMS dugc nhSn qua kit noi tryc tiep tdi Trung tim tin nhan MMS cua nhi mang bing each sd dung mdt Uong nhiing giao thiic dugc h§ trg, bao gdm MM4, MM7, ho$c EAIF. Khi sd dung bit ki giao thuc nao ttong cac giao thdc niy, tmng tim tin nhan cua nha mgng sS ty ddng kit ndi din MMS Gateway dl liy tin nhin.

Cdch 2: Tin nhin MMS cd thi dugc liy ve bing each sd dyng cdng nghe SMS kit hgp vdi cong nghf WAP. Dl nhan duge mot tin nhin MMS cin trii qua hai giai dogn, Giai doan mdt, modem n h ^ tin nliin SMS, edn gpi li tin nhin thdng bio MMS. Tin nhin nay chiia URL ciia tin nhin MMS u£n Trung tam tin nhin da phuang tien (MMSC).

Giai doan hai, khi modem da nhin dugc tin nhiin thdng bao MMS, modem md kit ndi GPRS din Wap Gateway dl tii ve n^i dung tin nhin MMS ve tir tmng tim tin nhin da phuang tien.

Hinh 2: Mo hinh nhin tin nhin MMS tir dng dgng Cdch gdi tin nhan MMS: DS gdi tin nhSn MMS thi dng dyng khdi tao mdt kit n6i GPRS din Wap Gateway cua nhi mang vi thyc hidn giti tin nhin MMS din Trung tam tin nhin MMS (MMSC) thdng qua kit noi WAP va GPRS.

Hinh 3: Mo hinh giri tin nhin MMS tir ung dung B^ thuin lgi va khdng mit nhilu thdi gian cho vide xiy dyng v i phit triln be thdng, n^idn ciiu . sir dung tiiu vidn SMSLIB (SMSLib, 2014) dk gdi .|

va nhin tin nhin SMS va tin nhin thdng bio MMS, f tiiu vidn jWAP (jWAP, 2014) dl kit noi ddn Wap Gateway nhi mang v i JMMSLIB (jMrnsLib, 2014) de giai ma tin nhan MMS. Vide gui va nhin tm

(6)

Tgp chi Khoa hgc Trudng Dgi hgc Cdn Tha Phdn A: Khoa hgc Tu nhiin. Cdng nghi vd Moi Irudng: 33 (2014): 9-21 nhin dugc tiiyc hien tiidng qua mdt modem 3G

dugc ket ndi tryc tilp vdi miy tinh.

2 3 Xay dung mo-dun phan logi tin nhin SMS 2.3.1 Bdi todn phdn lap (classification) Li qui ttmh phin Idp mpt ddi tugng du lieu vao mgt hay nhilu ldp cho trudc nhd mdt md hinh phan ldp mi md hinh niy dugc xiy dung dya tten mdt tip hgp cic ddi tugng da lieu di duge gan nhin tu trudc gpi li tip dH lieu hpc (ttaining data).

Qui trinh phin ldp cdn dugc gpi li qui trinb gan nhin cho cac ddi tupng dii lidu. Nhu viy, nhiem vu cua bai toin phin ldp dir lieu li cin xay dyng md hinh (bp) phin ldp dl khi cd mpt du lieu mdi vio thi md hinh phin ldp sd cho biet dd lieu do thudc ldp nio. Cd nhidu bai toan phin ldp dii lieu nhu phan ldp nhi phin, phin ldp da ldp, phin ldp dattj,...

Qui trinh phin ldp dii lieu thudng gdm hai budc: budc "jroy dfeng mo hinh" va budc "sd difng md hinh ".

Budc 1: Mpt md hinh sS duge xiy dyng dya tren vipc phin tich cac ddi tugng dit lidu di dugc gin nhan tii trudc. Tip cic miu dii lidu niy cdn dugc gpi li tgp die liiu hudn luyin (training data set). Cic nliin ldp cua tip dii lidu huin luydn dugc xic dinh bdi eon ngudi trudc khi xiy dyng md hinh. Trong budc nay, chiing ta con phii tinh do chinb xic ciia md hinh bing each su dyng mot tap dii lipu khic de kiem Ua, l^p dd lidu nay gpi la tap da liiu kiem tra (test data set) hoic dimg phuong phip kilm Ua ehdo tten tip dd lieu huin luyen, NIU d$ cbinh xic la chip nhin dugc, md hinh se dugc sd dyng de xic djnh nhan ldp cho cac dii lidu khic mdi ttong tuong lai.

Budc 2: sd dyng md hinh da dugc xay dyng d budc 1 dl phin ldp dir lieu mdi.

Nhu viy, thuit toin phin ldp li mdt anh xg tu midn du* li^u di cd sang mpt mien gia tti cu the ciia thudc tinh ldp, dua vio gii tri cic thudc tinh ciia dii lieu.

2.3.2 Mdy hgc Vic ta hd trg (SVM) Phuong phip SVM ra ddi tii ly thuylt hpc th6ng kd do Vapnik (1995) xiy dyng, SVM cd nhilu tilm ning phit triln vl mit IJ thuylt ciing nhu lii^ dyng ttong thyc tiln, SVM dugc danh gii li mdt trong mudi giii thu|t quan Ugng eua khai md dii lieu. Cic iing dyng thyc tl cho thiy phuang phap SVM cd khi ning phan logi khi tdt ddi vdi bii toin pban logi vin bin cung nhu ttong nhidu ung dyng khic (nhu nhgn dang chii viet tay, phit

hien mat ngudi Uong cie inh, udc lupng bdi quy,...).

Bai toan co bin ciia SVM li bM toan phin loai hai ldp: Cho tmdc n didm ttong khdng gian d chilu (mdi diem thudc vao mdt ldp ki hieu l i +1 hogc —1, muc dich cua giii thuit SVM la tim mdt sidu phing (hyperplane) phan hoach tdi uu cho phep chia n diem niy thinh hai phan sao cho cac didm ciing mpt ldp nam ve mdt phia vdi sidu phing niy.

w x + b = l

,'' wx + b = -1

Hinb 4: Phin Idp tuydn ti'nh vdi SVM Xet tip dir lieu miu co thd tich rdi myen tinh {(xi,yi),(x2,y2),,-.,(xn,yn)} vdi Xj e R"" va yi e {±1}.

Sieu phing tdi uu phin tip dd lieu nay thinh hai ldp la sieu phang cd the tach rdi du lieu thanh hai ldp rieng biet vdi le (margin) ldn nhat. Tdc la, can tim sidu phang H: y = w.x + b = 0 va hai sidu phang Hi, Ha hd trg song song vdi H va cd cimg khoang each den H, Vdi didu kien khdng ed phin tu nio eiia tip miu n ^ giira Hi vi H2, khi dd:

W.X + b >= +1 vdi y = +1 W.X + b>=—1 vdiy = —1 Ket hpp hai dieu kidn ttdn ta cd: - -- -.- y(w.x + b) >= 1

Khoang each ciia sieu phang Hi va Hi ddn H l i

||w||, Ta can tim sieu phang H vdi le ldn nhit, tire Ii

min|[m

giii bii toan tdi uu tim "^ vdi ring budc y(w,x + b) >= 1. Ngudi ta ed the chuyen bii toin sang bai toan tuang duang nhung de giai hon li rain-|w|pvdi rang budc y(w,x + b) >= 1. Ldi giii cho bai toan tdi uu nay li cyc tieu hoa ham Lagrange:

Ii?<ikdi--l^-l^a^(y,<<y^^ >^A)-l (1)

(7)

Tgp chi Khoa hgc Trudng Dgi hgc Cdn Tha Phdn A: Khoa hgc Tg- nhiin, Cong nghi vd Mdi Irudng: 33 (2014): 9-21 Trong dd a Ii cac hg sd Lagrange, oSO. Sau do

ngudi ta chuyin thanh bai toin ddi ngiu la cyc dgi hda ham W(a);

maxa W(a) = maxa (mmw.bL(w,b,a)) (2) Tir do giii dl tim dugc cac gia tti tdi uu cho w,b va a. Vl sau, viec phan logi mdt miu mdi chi li viec kiem tra ham diu sign(wx +b).

Giii thuit SVM co ban gi^ quylt dupe bai toan phin ldp tuyln tinh, tuy nhidn ndu ta kit hgp SVM vdi phuang phap ham nhin (kemel-based method), se cho ph^p giai quyet mpt sd bii toan phi tuyen bing each anh xa dii lieu vao mdt khdng gian ed sd chieu ldn hon. Khdng cd bit ky thay ddi can thiet nio ve mit giii thuit, vide duy nhit c§n lim li thay the cac tich vd hudng eiia hai v^e-ta u.v bdi ham nhan K(u,v).

Bang 3: Mpt si him nhin thirdng dirge diing

Kilu ham Cdng thuc

Tuyen tinh K(u,v) = u.v Da tiuic hie d K(u,v) = (u.v +cf Radial Basis Function K(u,v) = exp(-Y[|u-v|P) Trong nghien ciiu nay, chiing tdi thyc hidn xay dyng va huan luypn md hinh phan logi tin nhin SMS nhd vao sy ho ttg cua cdng cu LibSVM (Chang, C.C, Lin, C.J.,2011).

2.3.3 Phdn logi tin nhdn vdn bdn (SMS) Phan ldp van bin dupe dinh nghia la viec gan tdn cac chii dd (ten Idp/nhin ldp) cho trudc vao cac vin bin dya tten npi dung ciia nd. Phan ldp van ban la cdng vide dupe su dung de hd ttg cho qua trinh tim kilm thdng tin (Information ReUieval), chilt lpc tiidng tin (Information ExUaction), lpc vin ban hojc ty dpng dSn dudng cho cie vin ban tdi nhiing chii de xac djnh trudc dd. Bk xiy dyng bp phan ldp van ban ty ddng, ngudi ta sd dung cae thugt toin miy hpc (machine learning) ed giim sit.

Xay dang md hinh phan logi tin nhSn Chgn lap (target class): Trong nghien ciiu niy chiing tdi chia chuydn i ^ ^ Ida thanh siu chuyin nginh nhd hon (Nguyin Ngpc De, 2008) duge md t i nhu ttong Bang 4.

Bing 4: Cac chuydn nganh nhd tren ciy lua

STT Ten "

Benh hai lua Sau hai Ida Cd hgi lua Gidng Ilia Ky thu^t canh tic Sau thu hogch

Dya vio dd hfi thdng phin lam sau ldp (nhin) tuong dng vdi siu chuydn nganli nhd nay. Cic chuyen gia tham gia vao h^ thdng thuOc mpt ho§c nhieu chuyen nginh Uong siu chuyen nginh nay.

Nhu vay, ta se xay dyng l^p dii lidu huin luyen vdi sau nhan tuong iing. Ngoii ra, ta eiing thdm m§t ldp li ldp tin nhin rie nlu nd khdng thudc m$t Uong sau ldp tten, diy i i mpt dang bai toan phSn ldp da ldp.

Tdch tii tieng viet

DS xiy dyng he thdng phin loai vin ban, vi?c tach vin bin thinh tir ddc lap cd nghia li viec b3t huge. Ddi vdi vin ban Tilng Anh, vi?c tach tir dugc thyc hien kha don giin vi moi tu Tilng Anh phin biet nhau bdi mpt khoing tting. Ngugc lgi, viec tach tir Tilng Viet lgi khi phdc tgp vi mdt tir tieng vidt cd thi cd hogc khdng cd khoing tting.

Co nhilu cich tilp can xu iy tach tir Tilng Vi?t ching hgn nhu phuong phip dimg tir diln, phucmg phap thdng kd... Ky tiiuEit tich tii Tilng Viet cOng da duge nhilu nhdm tic gii nghidn cdu vi xiy dyng cho kit qui vdi dp chinh xic khi cao.

Nghien cuu su dung phin mim tich tir VnTokenizer' dk thyc hien vide tach tin nhin tii^inh cie tir ddc Igp, cdng cy niy dugc phit Uidn dya tten phuong phap so khdp tdi da (Maximum matching) vdi t^p dii lieu sd dung li bang im tilt tieng Vi|t v i tii diln tu vung tilng Vi^t. Cdng cu dupe xiy dyng tten ngdn ngd Java, mi ngudn md.

Co thi dl dang tfch hgp vio cic he thong phin tich tieng Vidt khic.

Quy trinh tiiyc hien tach tii theo phuang phip so khdp tii da dugc minh hpa nhu trong Hinh 6.

Hinh 5: Md hinh phan lop tin nhin van ban vdi

SVM ' http://mim.hus.vnu.edu.vii/phuonglh/node/33

(8)

Tgp chi Khoa hgc Truong Dgi hgc CSn Tha Phdn A: Khga hgc Tu nhiin, Cong nghi vd Mdi Irudng: 33 (2014): 9-21

Hinh 6: Tach td theo pbuvng phap so khdp tii da Cic don vi tii dugc sinh ra tu cdng cu niy bao VI dy: vin ban gdc

gdm cac tu u o i ^ tu diln, cic chudi sd, chudi ky ty nude ngoai, dau ciu, cic lo? ty hdn tgp khic ttong vin ban, cac td mdi hogc cic tu dugc smh ty do theo mgt quy tie nao dd (nhu phuong thuc thdm phu to hay phuang thde lay) hoac cie chudi ki hieu khdng dupe liet kd ttong tii diln. Cdng cu niy tich tir cho dp chmh xac li 96% - 98% theo (Phuong et al, 2010). Vi dy sau minh hpa mdt doan vin ban duoc tich tii bdi VnTokenizer.

Binh gay hgi chit yiu giai dogn ma - de nhdnh. Luc ddu vit binh chi la nhirng chdm nho, mdu xanh xam, sau ldn len co dang hinh thoi (mdt en) dgc tnmg.

Nhiem vu ciia bdo tit ndy Id hid cdc chdt dinh duang co trong cdy lua vd ngoai ra cdn tiet ra dgc to Pyricularin gay dgc cho cdy.

Bdo tu ndm Pyricularia oryzae hay P. grisea phdt trien tdt trong diiu kien nhiet dg mdt tir 24 — 28dgC, dm dg cao tren 80%.

Vi dy: Vin b ^ sau khi tach td

Binh gay hai chu_yiu giai_dogn ma - de_nhdnh . Luc_ddu vit binh chljd nhdng chdm nho , mdujcanh xam , sau Idnjen co dgng hinhjhoi (mdt en) dgcjrimg.

Nhiim_yti ciia bdo_tir ndy Id hdt cdc chdt dinhdudng co trong cdy liia vd ngodi_ra con tiet ra dgc Jo Pyricidarin gay dgc cho cdy.

Baojit ndm Pyricularia_oryzae hay P. grisea phdtjriin tot trong dieukien nhiet_dg mdt tir 24 - 28 dg C, dm dg cao tren 80% .

Nhu ta da biet m0t tCr tidng Vidt trong vin ban thdng thudng cd thi eo hoic khdng c6 khoing uing (mdt hogc nhidu im tiet), do vay vdi vin bin gdc ban diu thi khdng thd phin bidt tii nio la tu diy dii ttong td dien, tir nio ehi la mdt im tiet (mdt phin ciia tu), Sau khi tich tii, cac tir bao gom nhilu im tiet se dugc ndi lgi vdi nhau bang cich su dung ky ty "_" thay thi cho ky ty khoang tring. Vdi cich lam nhu viy thi sau khi thyc hidn tach tir, cac tir (cd nghia tiong tir diln) sS dugc phan bi^t nhau nhd khoing tring giing nhu van ban tieng Anh. Tu dd, ta se dd dang xiy dyng bd td khda cung nhu xay dyng md hinh phin logi vdi van bin di tich.

Xay difng b§ tit khda (dgc trung) Bp tii khda dgc trung la mdt danh sich eic tir khda die ttirag cho siu chuySn nganh liia nhu ttong Bing 4 Uen. Budc chpn tii khda li mOt budc quan Upng quydt djnh nhieu ddn ket qui phin logi ciia he thdng. Trong ngliien cuu nay, chung tdi dd xuat hai phuong in xiy dyng bp td khda:

- Xdy difng b^ lit khoa thii cong: Vdi phuang in niy thi cin ed sy h5 ttg ciia cae chuyen gia vl Ilia, dO tin ciy cua cic tir khoa dugc chpn lgc cao ban, so lugng tir khda it nhung chit lugng hon, dilm ydu cua phuang an niy li mit nhilu thdi gian v i cdng sue de xay dyng vi chpn lgc td khda.

- Xdy dung bg tie khoa tu dgng: Tir tap dd lieu thu thgp duge h^ thong sg thyc hien vice tich td va logi bd tir dimg, do vin bin la tin nhan ndn sd lugng td khoa khdng nhieu vi it khi lap lgi nen he tbdng khdng thyc hien vide giam sd chieu (sd die tnmg) mi diing tit ca cac td cdn lgi sau khi loai bd tu dimg dl xay dyng bd tii khda, vdi phuang in ty dpng thi vide xiy dung bd tir khoa duge thyc hidn nhanh hon vi khdng mat nhieu cdng sdc cung nhu khdng cin sy Up giup eiia cac chuyen gia nhung chit lugng cic tir khda thi khdng cao vi khdng phai td nao duge giir lai cung c6 y nghia phin lorn, sd lupng tir khda se nhidu hon rit nhilu so vdi phuong in thii cdng.

Danh sach td khda dugc luu vio mot tap tin, mdi dong ciia tgp tin mdt li tir khda nhu vi dy sau:

Ddt Giong Nude Sdu

Bilu dien tin nhan bing vec-ta dgc tnmg Trong nghien cdu nay, chiing tdi sd dyng SVM dl phin loai tin nhan van ban do SVM cd nhieu uu

(9)

T^ chi Khoa hgc Truong Dgi hgc Cdn Tha Phdn A: Khoa hgc Tu nhien. Cdng nghi vd Mdi trudng: 33 (2014): 9-21 diem khi su dut^ cho phan loai van bin (Do Thanh

Nghi, 2011; Trin Cao De vi Pham Ngiq^en Khang , 2012). Dinh dgng timg dong ciia tip tin huan luyen nhu sau:

<label><indexl> .<valuel><index2>:<value2>...

Vdi <label> la nhin phan ldp ciia van ban (6 chuyen nganh vl lia nhu Uong Bang 4), <index>

la chi so eiia tir khda, chi sd nay tuong ung vdi sd thd ty ciia tir khda trong tgp tin tu khoa, <value> la gii tri Upng so cua tir khda. Mic dii, cd nhieu each xic dinh gia tri Upng sd ciia tir khda, ttong nghien ciiu nay, do v5n bin li tin nhin SMS nen so lupng tu khda khdng nhieu va it khi lip lgi nen khi vec-ta hda ta khdng quan tim tii khda dd xuit hien bao nhiSu lin mi chi can quan tim nd cd xuit hidn hay khdng, nlu ed xuit hidn thi phin gia ui trpng sd dugc gin li 1, nlu khdng xuit hien thi khdng ein phai luu - dinh dang nay edn dugc ^pi la dinh dang thua. Vl du ve djnb dgng tip tin huin luydn vdi dd lieu ta cic tin nhin.

Tin nhin sau khi tich tir va loai bd tu dimg:

cho_biet phong tri binh vdng lun lua co binh bge Id liia nguyinjihdn phongjrdnh

Dinh dang tip tin huin luy^n cua cac tin nhin tten nhu sau:

; 63:1 95:1 167:1 419:1 420:1 629:1 858:1 948:1 1 56:1 63:1 414:1 420:1 524:1 630:1

Nhu ta da thiy, tin nhin fliu nhit va tiiti hai thupc ldp sau bpnh hgi liia, nhu viy nhan (ldp) cua cic tm nhin nay Uong tgp tin dinh dang ia 1. Vdi tir khda "chobiif li tu khda nim d vi tn thii 63 ttong tgp tin tu khda, nhu v^y dl thi hien mpt hi cd Uong bd td khda va vj \x[ cua nd li thir 63 ttong bp td khda ta viet 63:1, thyc hi^n tuong ty vdi eae tir cdn lai Uong tin nhin v i vdi tit ci tin nhin nhin dupe. Chd y ring thii ty cae tu Uong tm nhin khdng quan ttgng, khi xac djnh Ugng sd cic tu, ta viet theo thd ty td nho din ldn.

Nhu di ttinh bay, vipc xiy dyng bp tir khda die ttung ttong nghien ciiu nay sd dyng hai phuong an, phuong in thu cdng va phuong an ty dpng. Nhu vgy, de xiy dyng tip huin luyen ta ciing ein phii ap dyng hai phuang in nay. Ndu nhu h^ tiidng dugc ngudi sd dyng eiu hinh la su dung phuang in thii cdng till khi xiy dyng tgp huin luyen, he thing se chpn tu khda ttong tin nhin sau khi tich tii bing cich ehi giii lgi nhOng tir ed Uong danh sich tir

khda da dugc chpn thu cdng bdi cic chuyen gia va thyc hien vec-ta cac tu do va luu vio tip tin huin luyen theo phucmg in niy. Neu nhu he thdng dugc ngudi su dyng ciu hinh li sii dyng phuong in tu ddng thi khi xiy dyng t ^ huin luyen, he tiidng se lam viec logi bd cic td ddng (ngugc Igi so vdi phuong in thit cdng la chgn td khda trong bd tii khoa) sau khi thyc hien tich td, sau do vec-ta hda toin bd cac td eon lgi va luu vio tip tin huin luyfn theo phuang in nay.

Mpt vin dl quan ttpng cin quan tim khi xay dyng tap dd lieu la thdi quen nhin tin tidng Viet khdng bd diu cua ngudi dung, do vgy trong qua ttinh xiy dyng t|p dii lieu v i bp td khda neu ta chi sii dyng tieng Vift cd diu thi se lam cho ket qui phan logi tid nen khdng cbinh xac mic dii ndi dung tm nhin ed chua tir khoa cin thilt cho phin logi, chi cd khic la tDf khda dd khdng co diu tieng VidL

D I giim sai sdt ttong phan logi tin nhin, chiing tdi de xuit mpt giii phip de xiy dyng bp tii khoa v i tgp tin huin luyfn ttong md hinh phin logi tin nhin. Do bd tii khda dugc xiy dyng dya ttdn hai each nhu ttdn da de cap, vdi phuong an xiy dung bd tir khda thii cdng thi khi c^p nhgt thdng tin cho timg chuyen nginh ttong siu chuydn nganh liia tren ta ciing ddng thdi cgp nhat cic tir khda dgc trung cho tirng chuyen nganh, cac tir khda dugc sir dung bao gim ci cic tu cd diu tilng Viet, khdng CO diu tilng Vift, cic tir gin diing vdi td khda nhung khdng c6 nghia khae (phong trudng hgp ngudi diing vilt sai chinh ti,...), cac tii khic nhau nhung cung nghia, cae tu dja phuong... Viec xay dyng tgp dd lifu dugc thyc hien bing cich vdi mdt tin nhan tieng Vift cd diu d i dupe phin logi, hf thdng sg ty ddng tao ra them mdt tin nhin tilng Viet khdng ed dau v i luu vio ca sd dii Ufu dl sd dyng cho viec xay dyng lgi md'binhV Vdi each chpn tu khda theo phucmg an ty dpng dya vio tip du lifu ndn chi cin xiy dyng tap dii lifu nhu d tten da dl xuit la ta sd cd mgt bd td khda gdm ca tii cd diu lan tir khdng diu tilng Vift

Tuang ty viec xiy dyn^ tgp tm huin luyfn, ta xay dyng tip tin kilm tra de kidm tra dO chi'nh xac eiia md hinh.

Sd d^ng md hinh

Sau khi kilm tra dg ehinh xic phin ldp cua md hinh, nlu dO chinh xac chap n h ^ dugc, ta dua md hinh vao su dung dk phan logi eic tm nhin mdi.

Tucmg ty nhu qui trinh xiy dyng tip huin luyfn, tin nhin mdi din he thdng sd dugc tidn xii ly, tich tu va vdc-ta hda theo djnh dang gidng nhu dinh dgng cua tip tin huin luyfn vdi phuang in m i

(10)

T^ chi Khoa hgc Trudng Dgi hgc Can Tho Phdn A: Khoa hgc Tu nhiin, Cdng nghi vd Mdi Irudng' 33 (2014): 9-21 ngudi sd dung ciu hinh hf thdng da chpn. Chi cd

mdt dilm khac li nhin phin ldp ciia tin nhin mdi niy la nhin tam cho tin nhin mdi, sau khi dua vio md hinh phin logi, nhin tgm niy sd dugc ty ddng thay thi bing nhin chinh thuc, dinh dgng ciia tip tin cin phan logi nhu sau:

<templabel> <intkxl>:<valuel> <wdex2>:<value2> . 2.4 Xay dung website tich hgp va quin 1^

ciu hinb h | thdng

Website he tiidng dugc xiy dyng theo md hinh MVC tren nln ngdn ngit Java (JSP va Servlet) vdi he quin trj co sd dii lieu MySQL. Hf tiidng quin ly hai nhdm ddi tupng ngudi diing li chuyen gia vi quin ttj/dilu phdi vien.

Chiic nang cua nhdm chuyen gia:

Ding ley, xem, chinh sua thong tin ci nhin.

Tri ldi cic cau hoi cua nha ndng lien quan ddn chuyen mdn.

Phan loai lai tin nhan neu cd sai sot.

Chdc nang ciia nhdm quan tti/ dilu phdi vien:

C ^ nhit cic ldp (chuyen nganh) cin phan loai C^p nhgt danh sach chuydn gia Cip nhit, phin loai tin nhin Cap nhit thdng tin Gateway - Cau hinh hf thdng Thdng ke/bao cio

Md hinh thuc thi kit hgp (ERD)

Cic thyc thi chinh ciia hf th6ng bao gdm;

Chuyen gia, chuyen mdn va tin nhin. Mdt chuyen gia cd thi CO nhieu chuyen mdn, mpt chuyen mdn cd the cdlnbieu chuyen gia dang ky. Mpt chuyen gia cd thi tti ldi nhilu tin nhin, mft tin nhan chi dugc tti ldi bdi mgt chuydn gia. Hinh 7 minh hga cac thyc the ciia he thong dugc ket hgp lgi theo md hinh tliyc till kit hgp (ERD).

3 KET QUA T H V C NGHIEM Sd lugng tii khda budc diu chiing tdi xiy dyng dd minh hga cho phucmg an chpn tii khda dugc xiy dyng thu cdng l i 243 tii, cae tii khda thudc 06 chuyen nginh nhu di d& cap, l i cic tii die trung nhat, cd y nghia ttong phan loai dugc chiing tdi cung vdi sy ttg giiip ciia mdt sd chuydn gia chgn lya va xiy dyng mpt each thii cdng. Tip dir lieu ban diu chiing tdi ty xiy dyng gdm 200 ciu hdi (cd diu tieng Vift) vi 200 ciu hdi khdng cd dau tilng Viet dupe he thdng ty dgng sinh ra tir 200 cau hdi ed diu. D I xac dinh nhan (phan ldp chuydn nginh) cho cic tin nlian nay thi chiing tdi dung md-dun phan logi ban ty ddng dl gin thu cdng. Vdi phuong in chpn td khda ty dpng thi sau khi thyc hien cac budc e3n thiet de xiy dyng bd tu khda tten tap dtt lieu gdm 400 tin nhin ciu hdi nliu tten, chiir^ tdi thu dugc bp tu khda gdm 1044 tii, li nhung tii duge giii lgi sau khi logi bd tii diing, eic ky ty dgc biet vi ky ty sd khdng cd y nghia ttong phin loai.

De phin logi tin nhin bang SVM, chiing tdi sir dyng bd tiiu vifn LibSVM (Chang et al, 2011).

Bing nghi thue kiem tta cheo (10-folds) ttgn tap hpc, md-dun phin logi tin nhan cho dp chinh xic dgt 69,94%. Vdi phuang in ty ddng cho kit qua chinh xic dgt 71,9%.

Do dang ttong giai dogn nghien cuu va thu thip du lieu, nhdm tic gii ehi mdi thyc hifn kiem tta dp chinb xic Uen bd dii lifu thu thip vi xiy dyng dugc, nhdm tie gii ehua thyc hifn kilm tra dp chinh xac vdi tin nhin ngoii thyc tl. Sau khi tin nhin chiia eiu hdi dugc ph&i loai, no dugc chuyin den chuyen gia thich hgp dd Ua ldi, Khi nhgn dugc eiu Ui ldi td chuyen gia, he thdng ty dgng giii ndi dung tti ldi cho nha ndng.

3.1.1 Trang chii hi thong

Chuyen gia v i quin tti dang nh|p vao he thdng thdng qua trang chu. Sau khi dang nhip thi tiiy logi ngudi dung se ed nhiing danh myc chiic nang rieng nhu tten di de cap.

Hinh 7: Md hinh thyc thi kit hpp (ERD)

Hinh 8: Trang chu website hf thing

(11)

Tgp chi Khoa hgc Trudng Dgi hgc Cdn Tha Phdn A: Khoa hgc IV nhiin, Congnghi vd Mdi irudng: 33 (2014): 9-214

3 1.2 Trangphdn logi ngi dung tin nhdn (bdn ty: dgng)

Cd chuc nang hiln tiii cic tin nhin di dugc phan logi bdi md-dun phan loai ty ddng nhung chua dugc phan logi bdi quin Uj/dilu phdi vien.

Dilu phdi vidn dang nhap vao O^g nay dl thyc hifn phin logi (gan nhan) tin nhin.

Hinh 9: Trang phan loai npi dung tin nhan 3.1.3 Trang cdu hinh hi thong

Cho phep thay doi cic thdng sd cau hinh hf tiidng nhu thai gian he thing lip lgi vifc truy van vi huin luyfn lai md hinh, sd l u ^ g tin nhan mdi dl thyc hien huin luyfn lai,...

CAlHlNHHiTliAtG

4 ni«(luUtli«|llpWvlkinsiiikiisi!niill^ia

Hinh 10: Trang cau hinh hf thong 3.1.4 Mdn hinh dien thogi chuyin gia Sau khi hf thdng nhan duge cau hdi cua nha ndng, hf thdng tidn hanh phin loai ciu hdi va gui cau hdi den cho chuyen gia Ui ldi, Ciu hdi di kem mft mi sd dd xic dinh tin nhin Uong co sd du lifu tin nhin, Hinh 11 minh hga ndi dung tin nhin SMS dugc hf thdng giri den cho chuyen gia tra Idi.

3.1.5 Mdn hinh dien thogi nhd nong Sau khi he thing nhin dugc ciu tti ldi ciia chuydn gia, hf tiidng sd ty ddng gdi ciu tti ldi din dien thoai nhi ndng. Hmh 12 minh hga npi duos tin nhin SMS dugc he thing giii cho nhi ndng di giai dip thie mic ciia nhi ndng.

Hinh 11: Tin nhSn dugc hf thing gui den cho chuyen gia tra Idi

Hinh 12: Tin nhin durgc hf thSng giH cho nfa^

ndng dl giai dap thac mac ciia nhi ndng 4 KET LU^N VA D £ XUAT

Trong nghien cuu niy chiing tdi di tim hieu vi ttinh biy khii quit mdt sd kien thdc ve hf thing thdng tin di ddng, cich thuc dl ciu hinh hf thing, gdi va nliin tin nhin tir may vi tinh din difn \ho^

di dfng, eic kj! thuat phan logi van bin vi tin nhSn vin ban bing SVM.

Chiing tdi di tham khao v i tim hllu cich sii dyng mgt sd thu vien, phan mem iing dyng trong hf thing nhu SMSLib, jWAP, jMmsLib, LibSVM.

Chung tdi ciing xiy dyng thu cdng dugc bd tir khda gdm 243 td thugc cae chuydn nganh liia de minh hpa cho vifc phin loai van ban ty dpng bing SVM theo phuong in thu cdng, v i mdt tap dii lifu huan luyfn budc diu cua he thong gdm 200 tin nhSn SMS tham khao tgi (Thu vifn KHCN Quing Tri, 2014; Cty CP Phin bon Binli Diln, 2014; Ban nhi ndng, 2014; Sd KHCN VTnh Phiie, 2014). Ngoii ra, chiing tdi da de xuat cac phuong an de xiy dyng bp tir klida vi tip dd lieu dl ning cao dp chinh xdc cho md hinh phin logi tm nhin tilng Viet khdn^

diu do nhin tin tidng Vidt khdng diu thi dl d^n^

va thuin Ipi hon cho nhi ndn^, nhin dugc nhilu 1^

ty ban ftilng Viet khdng diu la 160 !^ ty/I tiffi nhin, tilng Vift cd diu l i 70 ky ty/1 tm nhin) va nhi ndng khdng can phii cd dien thogi ciu hinh cao mdi cd thi dgt cau hdi bing SMS.

Trdn nen ting do, chiing tdi di phat ttiln dugC mpt hi thSng hd trge khuyin ndng trin cdy liia qua mgng thdng tindi dpng vdi cac chdc ning eo bin nhim hd Up vl thdng tin kj thu^t cho nh^

(12)

Tgp chi Khoa hgc Trudng Dgi hgc CSn Tha Phdn A: Khoa hgc Tu nhiin. Cdng nghi vd Mdi trudng: 33 (2014): 9-21 ndng mgt cich kip thdi, vdi chi phi hpp ly nhit,

giii quydt dugc tuong ddi diy du nhiing thac mic, nhu ciu ciia nhi ndng. Sau mpt thdi gian van hinh va khai thie mgt cich bin ty dgng tten thyc tl, hf thdng sd thu thip dugc mot lugng dii lieu thyc cin thiet, kit hgp vdi cdng nghf khai phi dii hdu, xu ly vin bin y i xu ly inh dl xiy dung mdt hf thdng hd ttg khuydn ndng tten cay liia qua mang thdng tm di dgng mdt each hoan toin ty dgng. Vdi kit qui budc diu nhu di trinh biy Uong phin kit qui cho thiy vide xay dyng v i triln kh^ he thdng niy l i hoan toin khi thi va cd tiem nang phat trien rdng rai.

Hien tai, chung tdi xiy dung md-dun gOri nhin tin nhin da phuang tifn gidi hgn chi vdi hai nha mgng di ddng Ii Mobifone va Vinaphone. Vifc phat tridn he thong de hd Mg nhieu nha mang han se dugc phit trien Uong cac giai dogn sau. Gidi han ve sd lupng k^ ty ttong mdt tin nhin vi viec ngudi dimg cd thdi quen nhin tin tieng Viet khdng cd dau lam cho viec phin loai tin nhin vin ban ttd nen khd khin hon.

N ^ i e n cuu niy lim tien de cho vifc hoan thien he thdng ty dpng hd trg khuyen ndng tten ciy liia qua mang thdng tin di dgng. Nhdng cdng vifc cin tidp tuc thyc hien gdm:

- Phit ttiln hf thdng dd hd trg giii va nhgn tin nhin da phuong tien (MMS) ciia tat c i cic nha mang hien hanh d Viet Nam.

- Phat Uiln md-dun phin logi tin nhin hinh inh ty ddng bing cac kj thugt xu ly inh vi miy hpc.

Xiy dyng mdt he thong hoin toan ty ddng dya vio sy kit hgp hai kj thuit phan logi, phin loai tin nhin vin ban va phin logi tin nhin hinh inh.

- Hoin tfiifn tiiem bd tii khda cho phan loai vin bin, xay dyng md binh phin loai vdi dp ehinh xic cao ban nhd vio ngudn dii lifu thyc te mi he thdng thu thip dugc.

- Hoan thien tbdm lip dd lidu dl cd thi phin logi tin nhin SMS kbdng go diu tieng Viet dugc hifu qui hon.

- Phil ttiln md hinh sang nhieu Imh vyc, nhilu chuydn nganh, chuyen mdn khic ngoii ciy lua.

TAI LIEU THAM KHAO

1. Vapnik, V. 1995. The Nature of Statistical Learning Theory. Springer. New York.

2. Chang, C.C., Lin, C.J.,2011, LIBSVM-a library for support vector machines, http://www.csie.nm.edu.tw/~cjHn/libsvm (Tmy cap: 3/2014).

3. Nguyin Ngpc De, 2008. Giio tiinh ciy liia, Vifn Nghien cdu Phit tridn Ddng bing Sdng Cihi Long, Dai hpc Cin Tho, 4. L. H. Phuang, N . T . M H u y l n v i TV. Ho,

2010. A hybrid approach to word segmentation of Viemamese texts, http://niim.hus.vnu.edu.vn/phuonglli/node/33 (Tmy cap: 01/2014).

5. DdThanhNgbi, 2011, Khai mddu lifu- minh hga bing ngdn ngii R, NXB Dai hpc cin Tha.

6. Trin Cao Df v i Pham Nguyen Khang, 2012. Phan loai van ban vdi May hpc vector ho ttg v i Cay quyet dinh", Tgp chi khoa hpc (21a), U. 5 2 - 6 3 .

7. Tdngcucthdngke,2014a. Thdngke lao dOng tir 15 mdi ttd ldn dang lam viec tgi thdi diem 1/7 hang nim phan theo nginh kinh te,

http://www.gso.gov ,vn/default,aspx?tabid=3 87&idmid=3&ItemID=14227 (Truy cap:

04/2014).

8. Tdng eye tiidng ke, 2014b. Thdng ke difn u'ch gieo ttdng mpt sd cay hang nam, http://www.gso.gov. vn/default.aspx?tabid=3 90&idmid=3&ItemID=14110 (Tmy cap:

04/2014).

9. Tdng cue tiidng ke, 2014e, Thdng kd san lupng mpt sd ciy hing nam,

http://www.gso.gov .vn/defauh,aspx?tabid=3 ---- 90&idmid=3&ItemID=14108 (Tmy cap:

'" 04/2014).

10. Developer's Home, 2014. How to send SMS MessE^es from a Computer/PC?, http://www.developershome.eom/sms/howT oSendSMSFromPC.asp (Tmy cap: 01/2014).

11. NowSMS, 2014. Documentation, http://www.nowsras.com/doe (Tmy cap:

01/2014).

12. SMSLib, 2014, A universal API for sms messagmg, http://smslib,org/ (Tmy cap:

01/2014)

13.JWAP, 2014, http://jwap.soiuceforge,net/

(Truy cap: 02/2014).

U.jMmsLib, 2014. http://jmmsiib.sourceforge.net (Tmy cap: 02/2014).

(13)

Tgp chi Khoa hgc TruangDgi hgc Can Thft Phan A: Khoa hgc Tu nhien. Cong nghf va Moi Inmng: 33 (2014): 9-21 15. Thu vien KHCN Quang Tri, 2014. Cac cau 17. Ban nlia n6ng, 2014. Hoi va dap,

hoi thuong gap, http://www.bannhanong.vietoetnam.net/ho http://elib.dostquangtri.gov.vn/thuvien/IncIu me.php?kh=&catjd=29 (Truy c$p:

de/TVDT.asp?option=4&noidimgcantim=l% 01/2014).

C3%BAa&CSDL=6 (Tray cap:03/2014). 18.S6KHCN VTnhPhlic,2014.hoi dapkhoa 16. Cty CP PhSn bon Binh Dijn, 2014. Cac cau hoc ky thujt,

hoi vS lua, http://123.25.71.107:82/hoidap/index.php?la http://www.biiihdien.com/articlebd.plip?id= nguage=vi&nv=news&op=search&q=c%C3

158&cid=l (Truy cap: 03/2014). %A2y+l%C3%BAa&mod=all (Tmy c|p:

01/2014).

Referensi

Dokumen terkait