model-based approach to collaborative filtering by using unsupervised self-organising ART2 neural networks which deploys two of the main advantages.
$0RGHO%DVHG$SSURDFKWR&ROODERUDWLYH )LOWHULQJE\1HXUDO1HWZRUNV Anatoli Nachev 'HSWRI$FFRXQWDQF\ )LQDQFH 1DWLRQDO8QLYHUVLW\RI,UHODQG *DOZD\,UHODQG
Ivan Ganchev 'HSWRI(OHFWURQLF &RPSXWHU (QJLQHHULQJ 8QLYHUVLW\RI/LPHULFN,UHODQG
Jacqueline Rowland 'HSWRI$FFRXQWDQF\ )LQDQFH 1DWLRQDO8QLYHUVLW\RI,UHODQG *DOZD\,UHODQG Abstract - Most recommender systems use LWHPVRILQWHUHVWWRRWKHUXVHUV$YDULHW\RI&) collaborative filtering to predict new items of PHWKRGV KDYH EHHQ SURSRVHG WR PDNH interest for a user. In this paper we present a UHFRPPHQGDWLRQV DERXW VXFK LWHPV DV ZHE model-based approach to collaborative filtering by SDJHVPRYLHVDQGERRNV using unsupervised self-organising ART2 neural 7KH ILUVW &) DOJRULWKPV ZHUH EDVHG RQ WKH networks which deploys two of the main advantages REVHUYDWLRQ WKDW SHRSOH XVXDOO\ WUXVW WKH of the ART model - stability and plasticity when the UHFRPPHQGDWLRQV RI OLNHPLQGHG XVHUV 7KH system works in poorly defined domains and *URXSOHQV >@ DQG 5LQJR >@ V\VWHPV DSSOLHG planning of network resources is difficult or even PHPRU\EDVHG DOJRULWKPV WR KHOS XVHUV impossible We report empirical results that show the DXWRPDWLFDOO\ ILQG XVHUV ZLWK VLPLODU impact of ART2 NN parameters on recognition stability, appropriate category granularity, SUHIHUHQFHVDQGFRPELQHWKHLURSLQLRQVWRPDNH classification accuracy, and response time. SUHGLFWLRQV 'LIIHUHQW PHWULFV WR PHDVXUH WKH SUHIHUHQFH VLPLODULW\ EHWZHHQ XVHUV KDYH EHHQ Keywords: collaborative filtering, neural VXJJHVWHG LQFOXGLQJ 3HDUVRQ FRHIILFLHQWV >@ networks, adaptive resonance theory, ART2. FRQVWUDLQW 3HDUVRQ FRHIILFLHQWV >@ YHFWRU VLPLODULW\>@DQGSHUVRQDOLW\W\SH>@ 0RGHOEDVHG &) LQ FRQWUDVW XVHV D XVHU V SUHIHUHQFH GDWDEDVH WROHDUQ D PRGHO ZKLFKLV 1.0 Introduction WKHQ XVHG WR PDNH SUHGLFWLRQV 7KH UHVXOWLQJ PRGHOVDUHW\SLFDOO\VPDOOIDVWDQGHVVHQWLDOO\ 1.1 Collaborative Filtering DUH DV DFFXUDWH DV PHPRU\EDVHG PHWKRGV ([DPSOHV LQFOXGH %D\HVLDQ QHWZRUNV >@ &ROODERUDWLYH ILOWHULQJ &) KDV EHHQ DFWLYHO\ FOXVWHULQJ WHFKQLTXHV >@ >@ 0/3 QHXUDO UHVHDUFKHG LQ UHFHQW \HDUV DQG SURYHG QHWZRUNV >@ LQGXFWLRQ UXOH OHDUQLQJ >@ DQG VXFFHVVIXO LQ SUDFWLFH 7KH JURZWK RI ,QWHUQHW OLQHDUFODVVLILHUV>@ FRPPHUFH KDV VWLPXODWHG WKH XVH RI &) 7KLVSDSHUSURSRVHVDPRGHOEDVHGDSSURDFKWR DOJRULWKPV DV UHFRPPHQGHU V\VWHPV 6XFK FROODERUDWLYH ILOWHULQJ E\ XVLQJ XQVXSHUYLVHG V\VWHPV XWLOL]H NQRZOHGJH DERXW WKH NQRZQ $57 QHWZRUNV ZKLFK GHSOR\ WZR PDLQ SUHIHUHQFHV RI PXOWLSOH XVHUV WR UHFRPPHQG DGYDQWDJHV RI WKH $57 PRGHO VWDELOLW\ DQG
SODVWLFLW\6WDELOLW\LQGLFDWHVDSDWWHUQWKDWGRHV QRW RVFLOODWH DPRQJ GLIIHUHQW FOXVWHU XQLWV DW GLIIHUHQW VWDJHV RI WUDLQLQJ DQG SODVWLFLW\ LQGLFDWHV WKH DELOLW\ RI D QHXUDO QHW WR OHDUQ D QHZ SDWWHUQ HTXDOO\ ZHOO DW DOO VWDJHV RI OHDUQLQJ>@
1.2 ART Neural Networks
1HXUDO QHWZRUNV 11 DUH RIWHQ JLYHQ VRPH LQLWLDO ZHOOGHILQHG VDPSOH RI LQVWDQFHV WKDW UHSUHVHQWWKHGRPDLQWKHQHWZRUNLVDSSOLHGWR $ W\SLFDO 0/3 11 LV H[SHFWHG WR OHDUQ WKLV WUDLQLQJVHWXVLQJDEDFNSURSDJDWLRQDOJRULWKP 7KHVHDUHKRZHYHUSRRUO\GHILQHGGRPDLQVIRU ZKLFKSODQQLQJRIQHWZRUNUHVRXUFHVLVGLIILFXOW RUHYHQLPSRVVLEOHEHFDXVHWKHQHWZRUNQHHGV WR OHDUQ DERXW D FKDQJLQJ RU HYHQ XQNQRZQ LQSXW VSDFH 2IWHQ VXFK DSSOLFDWLRQV SUHIHU WR XVH XQVXSHUYLVHG VHOIRUJDQL]LQJ OHDUQLQJ SDUDGLJPVOLNHWKRVHSURSRVHGE\WKH$GDSWLYH 5HVRQDQFH7KHRU\$57 >@$57LVDIDPLO\ RI 11V WKDW GHYHORS VWDEOH UHFRJQLWLRQ FDWHJRULHV RU FOXVWHUV E\ VHOIRUJDQLVDWLRQ LQ UHVSRQVH WR DUELWUDU\ LQSXW SDWWHUQV 7KHUH DUH YDULRXV XQVXSHUYLVHG $57 DOJRULWKPV VXFK DV $57 $57 $57 DQG )X]]\ $57 DQG YDULRXV VXSHUYLVHG RQHV VXFK DV $570$3 )X]]\$570$3DQG*DXVVLDQ$570$3>@ 7KHIROORZLQJVHFWLRQSURSRVHVDQ$5711± EDVHG DSSURDFK WR &) WKDW SURYLGHV WKH DELOLW\ WRFOXVWHUDWUDLQLQJVHWRILQSXWSDWWHUQVZLWKRXW VXSHUYLVLRQDQGWRFODVVLI\WKHSDWWHUQVDSSOLHG DIWHUDOHDUQLQJSKDVH
2.0 A Model-Based Approach to CF
7KH SURSRVHG PRGHOEDVHG DSSURDFK WR &) LV EDVHGRQWKHIRUPDWLRQRI$5711UHIHUHQFH YHFWRUV ZKLFK SURYLGHV D &) V\VWHP ZLWK WKH DELOLW\ WR FODVVLI\ XVHU SURILOH SDWWHUQV LQWR D FODVVRIVLPLODUSURILOHVWKDWIRUPVWKHEDVLVRID UHFRPPHQGDWLRQV\VWHP
2.1 ART2 Architecture $ W\SLFDO $57 11 DUFKLWHFWXUH >@>@ FRQVLVWV RI WKUHH OD\HUV LQSXW OD\HU )
FRPSDULVRQ OD\HU ) DQG WKH UHFRJQLWLRQ OD\HU )LOOXVWUDWHGLQ)LJXUH 7KHLQSXWOD\HU)UHFHLYHVDQGVWRUHVLQSXW SDWWHUQV Attentional Subsystem F2
F1
-
ȡ
Orienting Subsystem
+ « F0 Input Figure 1. $57DUFKLWHFWXUH 1HXURQV LQ WKH LQSXW OD\HU ) DQG FRPSDULVRQ OD\HU ) DUH RQHWRRQH FRQQHFWHG ZLWK KDUG FRGHG OLQNV ZKLFK FRUUHVSRQGV WR D QRUPDOL]DWLRQ SUHSURFHVVLQJ WR SUHYHQW FDWHJRU\SUROLIHUDWLRQ7KHFRPSDULVRQOD\HU) VWRUHVWKHFXUUHQWLQSXWSDWWHUQDQGSOD\VUROHRI VKRUWWHUP PHPRU\ 670 ZKLOH WKH UHFRJQLWLRQ OD\HU ) VWRUHV WKH SURWRW\SHV RI UHFRJQLWLRQ FDWHJRULHV FOXVWHUV DV WKH ORQJ WHUPPHPRU\/70 )DQG)OD\HUVDUHIXOO\ FRQQHFWHG E\ DGDSWDEOH ZHLJKWHG OLQNV WKDW IRUP WKH WRSGRZQ DQG ERWWRPXS DGDSWLYH ILOWHUV UHSUHVHQWHG E\ EODFN VHPLFLUFOHV RQ )LJXUH ) LV WKH FRPSHWLWLYH OD\HU WKDW FRQVLVWVRIDVHWRIRUGHUHGQHXURQVWKDWGRQRW FRQVWLWXWH DQ\ QHLJKERXUKRRG UHODWLRQV )RU HDFK LQSXW SDWWHUQ D ZLQQHU QHXURQ LV GHWHUPLQHG LQ WKDW OD\HU 7KH ) OD\HU FRQWUROV WKH FODVVLILFDWLRQ ,I D SDWWHUQ LV QRW VLPLODU HQRXJK WR WKH ZHLJKW YHFWRU RI WKH ZLQQHU QHXURQ WKH ) OD\HU EORFNV WKH ZLQQHUIRUWKDW SDWWHUQ7KH)OD\HULQLWLDOO\FRQWDLQVDQXPEHU RI XQFRPPLWWHG QRGHV ZKLFK RQH E\ RQH ZLOO FRQGLWLRQDOO\ JHW FRPPLWWHG XSRQ LQSXW SUHVHQWDWLRQ
2.2 Classification
,Q $57 11V FODVVLILFDWLRQ LV EDVHG RQ WKH UHODWLYH VLPLODULW\ RI DQ LQSXW SDWWHUQ WR WKH
ZHLJKWYHFWRUIRUDFOXVWHUXQLWUDWKHUWKDQWKH DEVROXWHGLIIHUHQFHEHWZHHQWKHYHFWRUVWKDWLV XVHGIRUH[DPSOHLQ62011V 7KHZHLJKWV RQ D FOXVWHU XQLW PD\ EH FRQVLGHUHG WR EH DQ H[HPSODU RU FRGH YHFWRU IRU WKH SDWWHUQV SODFHGRQWKDWFOXVWHU 7KHQXPEHURIFOXVWHUVLQDQ$5711LVQRW UHTXLUHG WR EH GHWHUPLQHG LQ DGYDQFH VR D YLJLODQFHSDUDPHWHUȡFDQEHXVHGWRGHWHUPLQH WKH SURSHU QXPEHU RI FOXVWHUV LQ RUGHU WR GHFUHDVH WKH SUREDELOLW\ RI PHUJLQJ GLIIHUHQW W\SHVRIFOXVWHUVLQWRWKHVDPHFOXVWHU ,Q $57 11V LQSXW SDWWHUQV PD\ EH DSSOLHG VHYHUDOWLPHVDQGLQDQ\RUGHU:KHQDSDWWHUQ LVDSSOLHGVHTXHQWLDOO\WRWKH)DQG)OD\HUVLW JRHV WKURXJK QRUPDOL]DWLRQ DQG IHDWXUH HQKDQFHPHQW7KURXJKWKHERWWRPXSZHLJKWHG OLQNVEHWZHHQ)DQG)WKH)QHXURQZLWKWKH VWURQJHVW DFWLYDWLRQ GHWHUPLQHG E\ FRPSXWLQJ WKH VFDODU ZLQV WKH FRPSHWLWLRQ EHWZHHQ DOO ) QHXURQV %\ DSSO\LQJ WKH WRSGRZQ ZHLJKW YHFWRU WKH RULHQWLQJ VXEV\VWHP FKHFNV LI WKH SDWWHUQOLHVZLWKLQWKHDWWHQWLYHQHVVFRQHRIWKH ZLQQHU QHXURQ NQRZQ DOVR DV FRQH RI VLPLODULW\)HGE\WKH)OD\HUZLWKDYHFWRU r WKH RULHQWLQJ V\VWHP FKHFNV LI DQ LQSXW SDWWHUQ YHFWRU u OLHV ZLWKLQ WKH DWWHQWLYHQHVV FRQH E\ VDWLVI\LQJ
ρ r
> r ≡ rrM
$IWHU HTXLYDOHQW WUDQVIRUPDWLRQV >@ OHQJWK RI WKH YHFWRU r FDQ EH UHSUHVHQWHG E\ DV D IXQFWLRQRIWKHLQSXWYHFWRU u FDWHJRU\YHFWRU z J DQGQHWZRUNSDUDPHWHUV c DQG d
+ c + + c cdz J FRVu z J + cdz J r = + c + c cdz J FRVu z J + cdz J
(TXDWLRQV DQG LPSO\ WKDW JLYHQ IL[HG YDOXHV RI c DQG d YDU\LQJ WKH YLJLODQFH SDUDPHWHU ρ FKDQJHV WKH VL]H RI WKH DWWHQWLYHQHVV FRQH DV VKRZQ LQ )LJXUH WKXV FKDQJLQJWKHJUDQXODULW\RIWKHFOXVWHUV ,I WKH VLPLODULW\ EHWZHHQ WKH LQSXW SDWWHUQ YHFWRU u DQGWKHFDWHJRU\YHFWRU z J LVWRRORZ DQG u OLHV RXW RI WKH DWWHQWLYHQHVV FRQH WKH RULHQWLQJ VXEV\VWHP UHMHFWV WKH ZLQQHU QHXURQ EORFNV LW GXULQJ WKH IXUWKHU SURFHVVLQJ RI WKH
SDWWHUQ DQG WULHV WR ORFDWH DQRWKHU ZLQQHU QHXURQ,IQRQHRIWKHH[LVWLQJQHXURQVILWWKHQ DQHZQHXURQWKDWPDWFKHVWKHSDWWHUQLVDGGHG 7KH $57 FODVVLILFDWLRQ JXDUDQWHHV WKDW WKH 11 DOZD\V FRQYHUJHV WR D VWDEOH VWDWH PDLQWDLQLQJSODVWLFLW\
X Iȡ =-
Figure 2. $57FRQHRIVLPLODULW\
3.0 Learning Mechanism
)RU WKH OHDUQLQJ SURFHVV LQ DQ $57 11 WZR GLIIHUHQWOHDUQLQJPHWKRGVFDQEHDSSOLHGVORZ OHDUQLQJRUIDVWOHDUQLQJEXWRQO\WKHVORZRQH LV DSSOLFDEOH WR WKH PRGHOEDVHG DSSURDFK WR &) ,W SURGXFHV ZHLJKW YHFWRUV RI WKH QHXURQV LQ WKH UHFRJQLWLRQ OD\HU DOVR FDOOHG UHIHUHQFH YHFWRUV (DFK WLPH DQ LQSXW SDWWHUQ LV DSSOLHG DQDSSURSULDWHUHIHUHQFHYHFWRULHFOXVWHUXQLW LV FKRVHQ DQG UHODWHG FOXVWHU ZHLJKWV DUH DGMXVWHGWROHWWKHFOXVWHUXQLWOHDUQWKHSDWWHUQ $GMXVWPHQW RI WKH UHIHUHQFH YHFWRU LV WRZDUGV WKHDSSOLHGSDWWHUQ7KH$57QHWZRUNIROORZV D µZLQQHUWDNHVDOO¶ FRPSHWLWLYH OHDUQLQJ SURFHVVZKLFKLQYROYHVWKHPRGLILFDWLRQRIWKH ZHLJKWHG ERWWRPXS IHHGIRUZDUG DQG WRS GRZQ IHHGEDFNZDUG FRQQHFWLRQV EHWZHHQ ) DQG)DFFRUGLQJGLIIHUHQWLDOHTXDWLRQV DQG >@ F → F F → F
d z ji dt d z ij dt
= g y j pi − z ji
= g y j pi − z ij
,Q WKH FDVH RI FRPSHWLWLYH OHDUQLQJ D ZHLJKW DGMXVWPHQW LV RQO\ SHUIRUPHG LI D VLPLODU HQRXJK SDWWHUQ LV XVHG 7KXV WKH UHIHUHQFH
YHFWRU FRQYHUJHV WRZDUGV WKH DYHUDJH RI DOO OHDUQHG SDWWHUQV 7KLV PHDQV QHZ QHXURQV FDQ EH DGGHG VWHSE\VWHS LI D SDWWHUQ GRHV QRW PDWFKDQ\H[LVWLQJQHXURQFORVHO\HQRXJK 7KURXJK G\QDPLF FUHDWLRQ RI UHFRJQLWLRQ FDWHJRULHV IRU HQFRGLQJ GLVWLQFW LQSXW VDPSOHV DQ $57 11 LV FDSDEOH RI VHOIDGMXVWLQJ WKH VFDOH RI LWV UHFRJQLWLRQ OD\HU LQ WHUPV RI WKH QXPEHURIFRPPLWWHGQRGHVZLWKUHVSHFWWRWKH FRPSOH[LW\ RI WKH SUREOHP GRPDLQ ,WV IDVW FRPPLWPHQW PHFKDQLVP DQG FDSDELOLW\ RI OHDUQLQJ DW PRGHUDWH VSHHG JXDUDQWLHV D KLJK HIILFLHQF\
4.0 A Memory-Based Approach to CF )RU H[SHULPHQWDO SXUSRVHV ZH GHYHORSHG D PHPRU\EDVHG&)FRPSRQHQWWKDWLQFRUSRUDWHV WKH SRSXODU QHLJKERUKRRGEDVHG DOJRULWKP LQWURGXFHG E\ +HUORFNHU HW DO >@ ,W XVHV D VXEVHW RI XVHUV JURXSHG E\ WKHLU VLPLODULW\ WR WKHDFWLYHXVHUDQGDZHLJKWHGFRPELQDWLRQRI WKHLU UDWLQJV LV XVHG WR SURGXFH SUHGLFWLRQV IRU WKHDFWLYHXVHU7KHPHWKRGFDQEHVXPPDUL]HG DVIROORZV /HW Q EH WKH QXPEHURIXVHUV T WKHVHW RI DOO WLWOHV DQG m = T WKH WRWDO QXPEHU RI WLWOHV
7KH QHLJKERUKRRGEDVHG DOJRULWKP FDQ EH GHVFULEHGLQWKHIROORZLQJVWHSV :HLJKWDOOXVHUVZLWKUHVSHFWWRVLPLODULW\ZLWK WKHDFWLYHXVHU6LPLODULW\EHWZHHQWZRXVHUVLV PHDVXUHG DV WKH 3HDUVRQ FRUUHODWLRQ EHWZHHQ WKHLUUDWLQJVYHFWRUVGHILQHGEHORZ m
Pa u =
¦ r
a i
i =
m
¦ r
a i
− ra × rui − ru
m
− ra × ¦ ru i − ru
i = ZKHUH ra i LVWKHUDWLQJJLYHQWRLWHP i E\XVHU i =
a ra LVWKHPHDQUDWLQJJLYHQE\XVHU a DQG m LVWKHWRWDOQXPEHURILWHPV
6HOHFW Q XVHUV WKDW KDYH WKH KLJKHVW VLPLODULW\ ZLWK WKH DFWLYH XVHU 7KHVH XVHUV IRUP WKH QHLJKERUKRRG &RPSXWH D SUHGLFWLRQ IURP D ZHLJKWHG FRPELQDWLRQ RI WKH VHOHFWHG QHLJKERUV¶ UDWLQJV DVWKHZHLJKWHGDYHUDJHRIGHYLDWLRQVIURPWKH QHLJKERU¶VPHDQ
n
pa i = ra +
¦ r u =
u i
− ru × Pa u n
¦P
'HQRWH WKH n × m PDWUL[ RI DOO XVHUV¶ UDWLQJV IRUDOOWLWOHVDV R 0RUHVSHFLILFDOO\WKHUDWLQJ RI XVHU i IRU WLWOH j LV Ri j ZKHUH HDFK
WKHV\PERO IRU µQR UDWLQJ¶ 1H[W ZH XVH Ri WR
a IRU LWHP i Pa u LV WKH VLPLODULW\ EHWZHHQ XVHUV a DQG u DQG n LVWKHQXPEHURIXVHUV
Ri j ∈ ℜ ∪ {⊥} LV HLWKHU D UHDO QXPEHU RU ⊥
N R ∈ T WREHWKHVXEVHWRIWLWOHVWKDWWKHDFWLYH
VHW N R LI DQG RQO\ LI Raj =⊥ ,Q JHQHUDO WHUPVDFROODERUDWLYHILOWHULVDIXQFWLRQ f WKDW WDNHV DV LQSXW DOO UDWLQJV IRU DOO XVHUV DQG UHSODFHVVRPHRUDOORIWKHµQRUDWLQJ¶V\PEROV ZLWKSUHGLFWHGUDWLQJV&DOOWKLVQHZPDWUL[3
° Raj if Raj ≠⊥ Paj = ® °¯ f a R if Raj =⊥
a u
ZKHUH p a i LV WKH SUHGLFWLRQ IRU WKH DFWLYH XVHU
GHQRWH WKH LWK URZ RI R RU WKH YHFWRU RI DOO XVHU L¶V UDWLQJV :H GLVWLQJXLVK RQH XVHU a ∈^ n` DV WKH DFWLYH XVHU :H GHILQH XVHUKDVQRWUDWHGDQGWKXVIRUZKLFKZHZRXOG OLNHWRSURYLGHSUHGLFWLRQ7KDWLVWLWOH j LQWKH
u =
LQWKHQHLJKERUKRRG
5.0 Experiments
:H UDQ H[SHULPHQWV XVLQJ GDWD IURP WKH (DFK0RYLH GDWDVHW SXEOLFO\ DYDLODEOH IURP 'LJLWDO (TXLSPHQW &RUSRUDWLRQ >@ 7KH GDWDEDVH FRQWDLQV QXPHULF UDWLQJV IRU GLIIHUHQWPRYLHVE\XVHUV$]HUR WRILYH VWDU UDWLQJ XVHG H[WHUQDOO\ RQ (DFK0RYLH LV PDSSHG OLQHDUO\ WR WKH LQWHUYDO >@$OWKRXJKGDWDIURPDOOXVHUVLVDYDLODEOH DVLQ>@ZHUHVWULFWHGRXUDQDO\VLVWRWKHXVHUV ZKR KDYH WKH PLQLPXP RI WKH DYHUDJH IRU WKH GDWDEDVH UDWLQJ DFWLYLW\ HQWULHV LQ WKHLU SURILOH DQG H[WUDFWHG YRWH UHFRUGV RI WKHILUVWRIWKRVHXVHUVIURPWKHGDWDEDVH
7KH UHVWULFWHG QXPEHU RI XVHUV UHYHDOV WKH SHUIRUPDQFH RI WKH PRGHOEDVHG &) DSSURDFK XQGHU FRQGLWLRQV ZKHUH WKH UDWLR RI XVHUV WR LWHPVLVORZ7KLVLVDFRQGLWLRQWKDWHYHU\&) VHUYLFH KDV WR JR WKURXJK LQ LWV ILUVW SKDVH ,Q RUGHU WR H[SORUH WKH OHDUQLQJ FXUYHV ZH UDQGRPO\ VHOHFWHG UDWHG LWHPV WR XVH DV D WUDLQLQJVHWDQGDVDWHVWVHW 7KH ILUVW VWDJH RI WKH H[SHULPHQWV DLPHG WR DGMXVWWKH$5711SDUDPHWHUVWKDWSURYLGHD VWDEOHQHWZRUNSHUIRUPDQFHVORZOHDUQLQJUDWH DQG DSSURSULDWH FDWHJRU\ JUDQXODULW\ 7R LGHQWLI\ WKH QHWZRUN SDUDPHWHUV ZH XVHG WKH IXOO WUDLQLQJ VHW GHVFULEHGDERYH DQG DQ $57 VLPXODWRUZLWK)QHXURQV 8VLQJ WKH WUDLQLQJ VHW ZH YDULHG WKH YLJLODQFH SDUDPHWHU ρ WRDFKLHYHDQDSSURSULDWHOHYHORI GHWDLO DQG JUDQXODULW\ RI WKH FOXVWHUV ,Q D VLWXDWLRQ ZKHUH D UHFRPPHQGDWLRQ UHTXLUHV PDQ\ FOXVWHUV HDFK RI ZKLFK FRQWDLQV IHZ LQSXW SDWWHUQV WKH SDUDPHWHU RI YLJLODQFH VKRXOGEHKLJK$VWKHYDOXHVRIWKHSDUDPHWHU DUH ORZHU DV WKH FOXVWHUV DUH IHZHU 7KHRUHWLFDOO\ ρ LV GHILQHG ZLWKLQ WKH LQWHUYDO ≤ ρ ≤ EXW IRU RXU SXUSRVHV RQO\ YDOXHV FORVH WR PDNH VHQVH 7R DFKLHYH QXPEHU RI FOXVWHUV EHWZHHQ DQG ZH XVHG YDOXHV ZLWKLQWKHLQWHUYDO ≤ ρ ≤ 1HWZRUN SDUDPHWHUV a DQG b LQIOXHQFH WKH H[SHFWDWLRQ RI ) SURSDJDWHG WR ) DQG ERWK PXVW KDYH SRVLWLYH YDOXHV 7KH H[SHULPHQWV
FRQGXFWHG OHG WR WKH FRQFOXVLRQ WKDW VWDEOH SHUIRUPDQFH FDQ EH DFKLHYHG E\ YDOXHV a = b = 1HWZRUN SDUDPHWHUV c DQG d LQIOXHQFH WKH RULHQWLQJ VXEV\VWHP 7KH\ DUH PXWXDOO\ GHSHQGHQW DQG KDYH WR VDWLVI\ WKH LQHTXDOLWLHV
@ )LYH VHULHV RI H[SHULPHQWV ZHUH FRQGXFWHG ZLWK D JURZLQJ QXPEHU RI XVHUV (DFK WLPH
UHFRPPHQGDWLRQVZHUHFRPSXWHGWKHUHVSRQVH WLPH ZDV PHDVXUHG 5HVXOWV VXPPDUL]HG LQ )LJXUH VKRZ WKDW WKH SURSRVHG PRGHOEDVHG DOJRULWKP UHVSRQGV IDVWHU WKDQ WKH PHPRU\ EDVHGRQHZKHQWKHQXPEHURIXVHUVLQFUHDVHV
3.5 3.0 2.5 2.0 1.5 1.0 0.5 0.0
ART2 NN memory-based
0
500
1000
1500
Number of Users
2000
2500
Figure 4.5HVSRQVHWLPH IRUWKHPRGHODUHJHQHUDOO\OHVVIRUVWRULQJWKH 6.0 Conclusion IXOO GDWDEDVH DQG SUHGLFWLRQV FDQ EH FDOFXODWHG TXLFNO\RQFHWKHPRGHOLVJHQHUDWHG 5HFRPPHQGHUV\VWHPVDUHQRZZLGHO\XVHGLQ :H SURSRVH D PRGHOEDVHG DSSURDFK WR HFRPPHUFHDSSOLFDWLRQVWRDVVLVWFXVWRPHUVLQ FROODERUDWLYH ILOWHULQJ E\ XVLQJ XQVXSHUYLVHG ILQGLQJ UHOHYDQW SURGXFWV IURP WKH PDQ\ WKDW $57117KHVWUHQJWKRIWKHPRGHOLVWKDWLW DUHIUHTXHQWO\DYDLODEOH&ROODERUDWLYHILOWHULQJ LV DSSOLFDEOH LQ SRRUO\ GHILQHG GRPDLQV IRU LV D NH\ FRPSRQHQW RI PDQ\ RI WKHVH V\VWHPV ZKLFK WKH SODQQLQJ RI QHWZRUN UHVRXUFHV LV LQ ZKLFK UHFRPPHQGDWLRQV DUH PDGH WR XVHUV GLIILFXOWRUHYHQLPSRVVLEOH EDVHG RQ WKH RSLQLRQV RI VLPLODU XVHUV LQ D 7KH SURSRVHG DOJRULWKP LV EDVHG RQ IRUPDWLRQ V\VWHP RI$5711UHIHUHQFHYHFWRUVZKLFKSURYLGHV $YDULHW\RI&)ILOWHUVRUUHFRPPHQGHUV\VWHPV D &) V\VWHP ZLWK WKH DELOLW\ WR FODVVLI\ XVHU KDYH EHHQ GHVLJQHG PRVW RI ZKLFK FDQ EH SURILOH SDWWHUQV LQWR VLPLODU SURILOHV JURXSHGLQWRWZRPDMRUFODVVHVPHPRU\EDVHG FODVVLILFDWLRQV ZKLFK WKHQ IRUP WKH EDVLV RI D DQGPRGHOEDVHG UHFRPPHQGDWLRQ V\VWHP 7KURXJK G\QDPLF 0HPRU\EDVHG DOJRULWKPV PDLQWDLQ D GDWDEDVH FUHDWLRQ RIUHFRJQLWLRQFDWHJRULHV IRUHQFRGLQJ RI DOO XVHUV¶ NQRZQ SUHIHUHQFHV IRU DOO LWHPV GLVWLQFWLQSXWVDPSOHVWKH$5711LVFDSDEOH DQG IRU HDFK SUHGLFWLRQ SHUIRUP VRPH RIVHOIDGMXVWLQJLWVUHFRJQLWLRQOD\HU FRPSXWDWLRQ DFURVV WKH HQWLUH GDWDEDVH 7KLV :H SUHVHQW H[SHULPHQWDO UHVXOWV XVLQJ WKH DSSURDFK LV VLPSOHU DQG VHHPV WR ZRUN (DFK0RYLH GDWDVHW 7KH ILUVW JURXS RI UHDVRQDEO\ ZHOO LQ SUDFWLFH DQG QHZ GDWD FDQ H[SHULPHQWV VKRZV WKH LPSDFW RI $57 11 EH DGGHG HDVLO\ DQGLQFUHPHQWDOO\ KRZHYHU LW SDUDPHWHUV RQ WKH UHFRJQLWLRQ VWDELOLW\ VORZ FDQEHFRPHFRPSXWDWLRQDOO\H[SHQVLYHLQWHUPV OHDUQLQJ UDWH DQG DSSURSULDWH FDWHJRU\ RI ERWKWLPH DQGVSDFH FRPSOH[LW\ DVWKH VL]H JUDQXODULW\ 7KH VHFRQG JURXS RI H[SHULPHQWV RIWKHGDWDEDVHJURZV UHSRUWV FODVVLILFDWLRQ DFFXUDF\ LQ WKH 2QWKHRWKHUKDQGPRGHOEDVHG&)DOJRULWKPV FLUFXPVWDQFH RI D JURZLQJ GDWDEDVH RI YRWHV XVH WKH XVHUV¶ SUHIHUHQFHV WR OHDUQ D PRGHO 7KH WKLUG JURXS RI H[SHULPHQWV VKRZV WKH ZKLFK LV WKHQ XVHG IRU SUHGLFWLRQV 7KH\ DUH DGYDQWDJH RI WKH SURSRVHG PRGHO RYHU WKH VPDOO IDVW DQG HVVHQWLDOO\ DV DFFXUDWH DV WKH SRSXODUPHPRU\EDVHGQHLJKERUKRRGDOJRULWKP PHPRU\EDVHGPHWKRGV0HPRU\UHTXLUHPHQWV
E\FRPSDULQJUHVSRQVHWLPHVRIERWKZLWKLQWKH VLWXDWLRQRIDJURZLQJQXPEHURIXVHUV
7.0 References
>@ * &DUSHQWHU 6 *URVVEHUJ $57 6HOI 2UJDQL]DWLRQ RI 6WDEOH &DWHJRU\ 5HFRJQLWLRQ &RGHV IRU $QDORJ ,QSXW 3DWWHUQV $SSOLHG 2SWLFVS >@ +H-7DQ $7DQ & 0RGLILHG $57$ *URZLQJ 1HWZRUN &DSDEOH RI *HQHUDWLQJ D )L[HG1XPEHURI1RGHV,(((7UDQVDFWLRQVRQ 1HXUDO1HWZRUNV9RO1R0D\ >@ +HUORFNHU - -.RQVWDQ $%RUFKHUV DQG -5LHGO $Q DOJRULWKPLF IUDPHZRUN IRU SHUIRUPLQJ FROODERUDWLYH ILOWHULQJ ,Q 3URFHHGLQJV RI WKH QG $QQXDO ,QWHUQDWLRQDO $&0 6,*,5 &RQIHUHQFH RQ 5HVHDUFK DQG 'HYHORSPHQW LQ ,QIRUPDWLRQ 5HWULHYDO %HUNHOH\&$86$S$XJ >@ %LOOVXV ' DQG 0 - 3D]]DQL /HDUQLQJ &ROODERUDWLYH ,QIRUPDWLRQ )LOWHUV ,Q 3URFHHGLQJVRIWKH,QWHUQDWLRQDO&RQIHUHQFHRQ 0DFKLQH/HDUQLQJ >@ 5HVQLFN 3 1 ,DFRYRX 0 6XVKDN 3 %HUJVWURP DQG- 5LHGO *URXS/HQV $Q 2SHQ $UFKLWHFWXUH IRU &ROODERUDWLYH )LOWHULQJ RI 1HWQHZV ,Q 3URFHHGLQJV RI &RPSXWHU 6XSSRUWHG &ROODERUDWLYH :RUN &RQIHUHQFH >@ 6KDUGDQDQG 8 3 0DHV 6RFLDO ,QIRUPDWLRQ ILOWHULQJ $OJRULWKPV IRU $XWRPDWLQJ:RUGRI0RXWK,Q3URFHHGLQJVRI &+, >@ %UHHVH - ' +HFNHUPDQ DQG & .DGLH (PSLULFDO $QDO\VLV RI 3UHGLFWLYH $OJRULWKPV IRU &ROODERUDWLYH )LOWHULQJ ,Q 3URFHHGLQJV RI WKH WK &RQIHUHQFH RQ 8QFHUWDLQW\ LQ $UWLILFLDO,QWHOOLJHQFH >@1DFKHY$,*DQFKHY$GDSWLYH&OXVWHULQJ E\$571HXUDOQHWZRUNV ,Q3URFRIWKH,QW &RQI (QJLQHHULQJ RI ,QWHOOLJHQW 6\VWHPV (,6 0DGHLUD)HE >@ 3HQQRFN ' ( +RUYLW] 6 /DZUHQFH 6 *LOHV &ROODERUDWLYH )LOWHULQJ E\ 3HUVRQDOLW\ 'LDJQRVLV $ +\EULG 0HPRU\ DQG 0RGHO %DVHG $SSURDFK ,Q 3URF RI WKH WK &RQIHUHQFH RQ 8QFHUWDLQW\ LQ $UWLILFLDO ,QWHOOLJHQFHS
>@ =KDQJ - 6HOHFWLQJ 7\SLFDO ,QVWDQFHV LQ ,QVWDQFH%DVHG/HDUQLQJLQ3URFHHGLQJVRIWKH 1LQWK ,QWHUQDWLRQDO &RQIHUHQFH RQ 0DFKLQH /HDUQLQJ $EHUGHHQ 6FRWODQG 0RUJDQ .DXIPDQQSS >@ 8QJDU / DQG ' )RVWHU &OXVWHULQJ 0HWKRGV IRU &ROODERUDWLYH )LOWHULQJ ,Q :RUNVKRSRQ5HFRPPHQGDWLRQ6\VWHPVDWWKH )LIWHHQWK 1DWLRQDO &RQIHUHQFH RQ $UWLILFLDO ,QWHOOLJHQFH-XO\ >@*URVVEHUJ6/LQNLQJ0LQGWR%UDLQ7KH 0DWKHPDWLFVRI%LRORJLFDO,QWHOOLJHQFH1RWLFHV RIWKH$PHULFDQ0DWKHPDWLFDO6RFLHW\YRO SS >@ 3 0&-21(6 (DFK0RYLH FROODERUDWLYH ILOWHULQJ GDWD VHW '(& 6\VWHPV 5HVHDUFK &HQWHU KWWSZZZUHVHDUFKFRPSDTFRP65&HDFKPR YLH