0 •
66 ' TAP CHi KHOA HOC TRUONG DAI HQC M d TRHCM - S6 2 (30) 2013
Ht THONG TlT VAN TRirC TUYEN NGHE NGHIEP CHO HOC SINH/SINH VIEN TREN M 6 I TRU'dNG DI DONG
AH^^I
PGS TS. Triin Thdnh Trai' Nguyin Tdn Triiu^Trdn Thi Thanh Thdo^
TOM TAT
vien I
Bdi bdo gi&i thiiu hi thong lu van iruc tuyen nghi nghiep cho hoc sinh sinh 7 Irin moi tru&ng di dgng. Nhu cdu hu&ng nghiep cUa hgc sinh, sinh vien ngdy cdng tdng trong cdc ndm gdn ddy do vdy nhliu web site tu vdn true tuyen dugrc thdnh lgp song song v&l cdc to chuc tu vdn true tiep. Vai suphdt Iriin cong ngh4 di dgng mgnh me, nhiiu phuang ti?n di dgng gid cd phdi chdng tgo diiu kiin gidi quyit nhu cdu dinh hu&ng nghi nghiep cho hgc sinh sinh viin bang nhirng hi thong true tuyen trin moi tru&ng di dgng duac thiil ki iheo nguyen ly "tiep cgn tim kiim Ihdng tin tiiy biin (Personalized Information Retrieval Approach)" tgo nhieu ihudn l^i cho cdc cd nhdn co nhu cdu tuvan chgn nganh nghe phii hop v&i ddc diem tdm sinh lyvd ndng luc cua minh a bdt cir ddu va bdt ky luc ndo. Ngu&i cd nhu cdu tu vdn khong mdt cong lu tim kiem nhieu logi thong tin phdn bo rai rdc & nhiiu ca sa, chinh h^ thong v&i cong cu lim kiem t{r dgng (search engine) ldm cong viec ngng nhoc nay. Bdi bdo gi&i thi4u kiin true, ca s& dir lieu, giao dien cua hi thong kiiu ndy. Mgt phiin bdn 1 dd duoc hien thtrc.
Tir khda. Cdng nghe thdng tin truydn thdng. tu van hudng nghiep true tuydn, cdng cu tim kidm tu dpng ca nhan hda, mdi trudng di ddng.
ABSTRACT
A new mobile-platform career-oriented consultaion system for students is presented.
Due lo inceasingly increasing demand of career-orientation by students, many online consultaion websites and face-to-face consullalion organisations are formed. As mobile technology has been developed largly at reasonable cost, easily approached by any mobile device owner, demand of career-orientaion can be .satisfied on mobile platform with Personalized Information Retrieval Approach.
The Syslem helps inquirers to obtain career-orietatlon inforniation anywhere, at anytime with a mobile device he/her owns.
The work presents the achitechuire, database, user-interface ofthe system. The first version has been realized.
Keywords: Infonnation Communication Technologies, online career-oriented consultation, personalized search engine. Mobile environment.
' Tnrang Dai hoc Mo Tp.HCM
Chuyen viin IT. cong ty Phdn mem dt dgng Greengar,
^ H<?c vien cao hpe Tru&ng Dgi hqc Bdch khoa Tp HCM.
KHOA HQC KY THUAT 87 I. B^T VAN BE
Tu vin hudng nghidp cho hoc sinh/' sinh xien la mpt chu dd rdng, can xu ly nhieu thdng lin vd tim sinh ly. nhu cau nhan luc cua dia phuang vi xa hdi dd tu van cho hgc sinh cac thdng tin lidn quan ddn sd thich ca nhan, dinh hudng va hd trg cho vide chpn nganh nghd phii hgp dd phat tridn.
Vdi su phit uien, phd bidn cua Internet cich tidp can thdng tin hudng nghiep cua hoc sinh ngiy cang da dang.
Do nhu cin trdn. cac md hinh website trac nghi?nb'tu vin hudng nghiep nhu: http://
tuvanhuongnghiep.vn ngay cang phii tridn. Tuy nhien. cic md hinh hidn nav dang gap phai nh&ng han che nhu sau:
Cac md hinh nay phd bidn cic thdng tin chung cho tit ci cic nhdm ddi tugng. mdi ci nhan cd mdl nhu ciu thdng lin khic nhau. Vi du nhu mpt hoc sinh sau khi da Iam bai test vi biet minh phii hgp vdi mdt nhdm nganh gi dd, budc tidp theo li cin biet cac thdng tin vk nganh hgc, nhu ciu tuydn dung cdng vide d hidn tai va cic dinh hudng phat tridn, chinh sich hd trg sau khi tdt nghidp. Vide tdng hpp thdng tin tu nhidu ngudn theo nhu ciu thdng tin nhit djnh ddi hdi hoc sinh can tu minh tim kiem rit nhieu lir nhidu kdnh khic nhau.
Hinb 1. Bieu do cac truing hgp sir dung (User case diagrams) thd hien moi lien h$
giii'a nhu cau thong tin giira hoc sinh/sinh vien vdi cac nguon thong tin tren Internet
Tlifing tin v6 cac nhu cau ciia hoc sinh MOI triiona Inlemei
Cac website va cic trang thdng tin truyen thdng su dung cic giao didn web tuong tie ngudi dimg mgc chidu.
hoic hai chidu d muc hgn chd. Ngudi diing thudng phai chii ddng doc va tim kidm phu hgp tha>' vi dupc tu vin mdl cich chii ddng theo nhu cau ca nhin (Personalized Information
Retrieval Approach ). Su phat tridn ciia mobile inlemet. mobile web vi trang web mgng xa hdi ngiy trd nen phd bidn trong nhung nim gan diy.
theo dd each thuc luang tac gitra ngudi su dung (hgc sinh/sinh vidn) vicic thiet bi cam tay nhu didn thogi thdng minh (smartphone). miy tinh
TAP CHI KHOA HOC TRgpNG OAI HQC M d TP.HCM - s 6 2 (30) 2013 bang (table) t da thay ddi ding kd.
Tit ca nhihig dieu tren dan ddn mdt md hinh xu ly thdng tin mdi tren nen tang didn loin di dpng. Cac dich vu weh nhu httpsy/www.scoopinion.com
hay http://curate.me la nhirng vi du dien hinh ciia md hinh chu ddng dua thdng tin lidn quan va ca nhin hda den vdi ngudi dune.
Hinh 2. Md hiah tim kiem chii dpng (Active Search) cac tbong tin phii hgp voi nhu cau cua ngudi dung mobile voi may chu phan tich (Analytics Server), miy chu de giri
thdng tin (Push Message Server) va thiet bj dau cuoi nguoi diing (mobile devices)
App Server Anatytiu Engna
Personalized Infonnation
I-
GooglefAppJc
Push Message Server
Personalized Information
User Mobile Device
Dd giai quj'dt nhung han chd tren \ a phit trien mpt md hinh dua tren nhiing xu hudng mdi tren. chung tdi de xuat kien true xu ly thdng tin chu dpng v i lien quan ddn ddi tugng hpe sinh/sinh vien. Cac thdng lin d diy bao gdm:
" Cac thdng tin chung ve tinh each dua tren ket qua bai test linh each.
Danh sich c i c cdng vide, nghd nghidp phil h c ^ ddi vol hoc sinh/sinh vidn.
Danh sich c i c trudng tham khao phu hgp cho hoc sinh chuin bi thi vao Dai hpe.
KHOA HQC KY THUAT
Hinh 3 . Bieu do cac tnrdng h o p sv dung lh£ hien tinmg tac giua ngDof diing he thdng vi h | (taong
Hoc sinh •' Sinh vien
Kien trtic xiir ly t d n g q u i t ciia h^ t h d n g t u v i n h u d n g n g h i p p c h o h p e s i n h / s i n h v i c n trdn m o i t r u d n g di d p n g .
Hinh 4. Bieu dd tbanh ph^n (Component diagrams) trinh bay c i c th^nh phan (components) chinh ctia h^ tbdng
S e h o o M J n w m i l r
PaaonMatM Soarcn E n ^ n * ( A p K h e LIK<KM>
& • * * « » •
c
90 TAP CHI KHOA HOC T R U O N G DAI HQC M6 TPHCM - SO 2 (30) 2013 Cic thanb phan quan 1^ dtr h ^ gam:
Dcftt thd tilu su sinh vien (Student Profile Module): Quin ly d& lieu hd so hpe sinh/sinh vidn, thdng tin co ban. cac kdt qui tir cic bai trie nghiem, du lieu tuong tac tir cac mang xa hdi-
Don the cac trirdng dai hpe (Schoo/
University Module): Du lieu vd irudng (cao ding, dgi hoc, nghe), du lidu nay dugc bd sung lu focused crawler.
ikm. the cdng viec/k^ ning (Job/
Skill Module): Dir lieu vd vi^c lim. nhu cau v i nghe nghidp, dir lieu nay dupc bo sung tir focused crawler.
Cac thanh p h i n xir ly v i phan tfch diVlifu eiJa he tbdng:
Thinh phin xu ly dd anh xg tir du lidu tinh each tu kdt qui test thanh cic kdl qua phil hgp, hao gdm' trudng hpe. nganh nghe, cic kJ ning cin luj-dn lip.
Personalized Search Engine; la miy tim kiem dupc c i nhan hda dc tim kidm cic trudng, nganh nghd phii hgp, diu vio la cic tham sd du lieu dugc chuin hda dd dua ra ket qua phu hgp.
Fucused Web Crawler Ii don thd (module) thuc hien vide phan lich, tim kidm v i luu trir cac thdng tin theo mdt lieu chi dugc dinh nghia sin (predefined criteria).
Md hinh trien khai cac cdng nghe t r o n g hd th6ng
Hlnb 5. Mo hinh trien khai he Ihong tim kicm v i gioi thi|u danh sach trimng va c5ng viec phii b i ^ vdi hgc sinh / sinh vien
KHOA H p C KYTH'JAT Hinh 6. Bieu do boat dpng (activity diagram) -
Mo hinh xir ly chinh cua Focused Crawler
M^jnoa VHchMISq
^ Idiu CBU h u n I3i
>i<diu CBU [uong fai
R^FieFEfte«ce_4
•(t_nginh M <ftii
MKfurQO] "On*
™ ^ " ^ 1 FKLREFEREI*CE_5 HUit bkiU ±fi!£ •
a Mnflhlip vtittrnpO)
cNiMkitvMiMI AacdmrtMUcajtaU nreiwiQOy
TAP CHI KHOA HpC TRUONG DAI HOC M 6 TPHCM • S6 2 (30) 2013
Hinh 7. Mo hinh logic dgng quan h | ciia ca sor dir Uf u he thong
O j e c t - O r i e n t e d Model
Model: Focused C r a w l i n g A c l i v i n Di gram Package;
Diagram: FocusedCrawliii Auttior: Trieu
Diatiram Date 8 1.7012 Version: 1 0
tempbteis
• XPATH or DOM Selectors, it's defined and stores in a -"K-V^ database
• Topic models Tor comparision
[ Get next URL )
f Get content J [ Rctric\'c predefined tciiq)iacc J
INO]
This step is optional and is used when implementing fnrcused crawling. Here, the retrieved content is checked against a model lo see if the content is relevant lo what we're searching for. If it Is, then the content is saved locally.
There are 2 implemeiUhns:
* Cliccking by LSA model against the predefined interested topics
• Checking by DOM Seclector
[Extract matched content J
[ Save content J
C Extract URLs )
[ Inject Delay j
Injecl delay - Jf the crawling process is too Fast or if multiple threads are being used, il can sometimes overwhelm the site. Sites protect themselves by blocking the IP address of misbehaving crawlers, Wt may wanl Io optionally injecl a delay between subsequent hits lo
KHOA HOC KY THUAT
2. G U O DIEN
2.1. Giao dien cac cau hot ve tinh each
Hinh 8. Giao di^n cac can hoi ve tinh each
1. Neu mo ta ve minh. bjnja nguoi:
'Sf' a. Noi nhieu hon la nghe ngvo-i khac tioi.
b. Lang nghe ngipoi khac nhi§u hoii la noi.
c Diiiy cacd^uti^t.
d. Chi'i y birc tranh toan earth va nhutig nee c6 tb^ xay ra.
e. Oiiyet dinh moi viec rat khach quan.
f. Quyet diiih moi viec theo gia Iri rieng ctia diiiiig vci cam nlidii cud ban.
C[._ i|. Thtrc hi6ii diintj ke hoadt dat ra, khong muon thay (Toi.
h. Linh hoal khi thtrc hien cac ke hoach.
Mi-l::- -iSSiM,
2. Trong nhQng buoi hop mat hay tranh lu|n cung ban b«,
m
ban...
a. TliJdi la tam diem ciia nr chii y.
h. Cam thay Ihoai mdi khi v mot minh.
c. Thich nhuii<i giai phap tiiuc te.
d. Thich nhtmg y tii>otig sang tao.
e. Thimig tranh loan cho vui.
1. Co g^ng ttanh t^t ca tranh luan va doi dau.
'-•" n. R^l rJii'i liniifl (Ten thivi diau v.i luon rtiniii nio'.
TAP CHf KHOA HQC TRI/ONG DAI HQC Md TP.HCM - SO 2 (30) 2013 1.2. Giao dien tir van ket qua trirong
Hinh 9. Giao dien tir van ket qua tnrcmg
I Knot U o Ditm trung tmh To6it:
Bthn trung binh V6n:
D^ trung tnnh Anh:
Ban IS nguai d l cam thong va doc dao- Ban thich lam viec trong m6i ta/ong ngan nSp. Ban rSt co trSch nhi0m. Khi lam bk. cCr viec gi, ban thirong d6n het tam tri cua minh vao 66
"Ban CO the tro thanh mgt Chuyen vien quang cao, Bien tap tap chi. Nha san xuSt ztc chirong trinh TV, NhSn vien maiketing, Nha vSn/Nha bao
Keywords Bao chi
Tim trirong
Tn/ong E'ai hoc Kien tnic TP.HCM Score 0 0535Q5
TfLrong Bai hoc Khoa hoc xi hoi va Nhan van -'BHQG TPHCIV1
Score: 0 038772
Tnjcrna Bai hoc Khoa hoc xa hi5i va Nhan van - BHQG
3. KET LD.ikN VA HL'ONG PHAT nghiep va traong cho hgc sinh trung lipc TRIEN va thanh nien.
Tnmg bai bao nay. sau khi khao sat Tren ca so do, bai bao ciJng huong cSc xu huong phan tich va xu ly dO lieu dSn cdc chu dS nghien cijru Ion khac, dac thong minli, cung vcri do su phat trien va hie! Ia sir dung cac mo hinh tim kiim ngay cang ph6 bien cua cac thiSt bj di thong tin cliu dong (focused crawling and dpng thong rainh (sm.irtphonc). chiing toi active recommdation) cho viec xii ly tim d3 phan tich. dira ra cac mo hinh xir ly phu ki6m cdc thong tin phu hop va hien tlii tfli hop va hicn thuc cac chCrc nSng co ban uu cho cac thiit bj di dpng.
ciia mpt hp thong true Tuyen tu van huong
KHOA HQC KY THUAT TAI U E U THAM KHAO
1. TS. H6 Thieu Hung. TS. Le Thi Thanh Mai. Xay dimg website dinh hirang dion nganh trudng dai hoc, cao dang du tht phii hpp vai scr thich va nang luc.
2. Bay Ioai hinh thong minh cua Thomas Armstrong, ban dich cua Manh Hai va Thu Hien, Alphabook phat hanh 2007.
3. Frames Of Mind: The Theory Of Multiple Intelligences cua Howard E. Gardner, http i//\vww.epubbud.com.Tead.php?g=NEXHBG 79.
4. Trac nghiem hudmg nghiep cua John Holland hllp://advice.vietnamworks.com/
vi/career/huong-nghiep/irac-nghiem-kham-pha-nghe-nghiep-phu-bop-qua-tinh- cach-cua-ban.html.
5. hup://advice.vietnam\vorks.com'vi/career/huong-nghiep/nghe-nghlep-nao-phu- hop-voi-tri-thong-minh-cua-ban.htm].
6. http://www.lhongtintuyensinh.va'.
7. http://iuvaiihuongnghiep.\n
8. http://www.carcerke>.org/asp/your_persona]ity/holIands_theory_of_career_
choice.html.
9. Tai lieu IrSc nghiem ve tinh each MBTI (Myers-Briggs Type Indicator) http://
en.wikipedia.org/w iki/M\ ers-BriggsType^lndicator.
10. Mobile Web Development http://www.packipub.com.''mobile-web-development/book.
11. Lucene in Action cua Michael McCandless, Erik Hatcher, and Otis Gospodnetic.
Manning Publications, 2010 http://www.manning.com/hatcher3/.
12. Algorithms ofthe Intelligent Web cua Haralambos Marmanis and Dmitry Babenko, Manning Publications, 2009 http://ww\\.manning.com/marmanis/.
13. htlp://en.wikipedia.oJg.''wiki/Focused_crawier.
14. bttp://en.wikipedia.org/wiki/Focused_crawler.
15. http://www.readwriteweb.com;'mobile/2011/03/besl-practices-for-using-push- noiificaiions.php.
16. http://www.codeprojecl.com/Articles/339162/Android-push-notification- implementation-using-A SP.
17. http://developer.android.com/guide/google/gcm''index.hlml.
18. http://jquerymobile.com/.
19. Focused crawling: a new approach to lopic-specific Web resource discover)' http:.v www.almaden. ibm. coni-'almaden; feaL'ww\v8/.
20. LDA Model http:/'vvww.quora.com/What-is-a-good-explanation-of-Latent- Dirichlel-Allocation.
21. http:/.^code.google.com/p/i2lree/ php framework.
22. http://code.google.coin/'p.m>-second-brain/ analytics engine.
23. http://matlet.cs.uinass.edn''index.php Mallet Machine Learning Toolkit in Java.
24. http;//tml-)ava.source forge, net/; Text Mining Library for LSA (Latent Semantic Analysis).
(Ngaynhan bdi: 05/10/2012; Ngdy chdp nhdn ddng: 19/02/2013).