A Hybrid Data Clustering Using Firefly Algorithm Based. Improved Genetic Algorithm. Maheshwarak, Keshav Kaushikb, Vikram Arorac a, b, cDepartment of CSE, ...
Available online at www.sciencedirect.com
ScienceDirect Procedia Computer Science 58 (2015) 249 – 256
6HFRQG,QWHUQDWLRQDO6\PSRVLXPRQ&RPSXWHU9LVLRQDQGWKH,QWHUQHW9LVLRQ1HW¶
$+\EULG'DWD&OXVWHULQJ8VLQJ)LUHIO\$OJRULWKP%DVHG ,PSURYHG*HQHWLF$OJRULWKP 0DKHVKZDUD .HVKDY.DXVKLNE9LNUDP$URUDF DEF
'HSDUWPHQWRI&6(0918QLYHUVLW\3DOZDO,QGLD
$EVWUDFW &OXVWHULQJLVDPRQJWKHGDWDPLQLQJWHFKQLTXHVWRJURXSWKHGDWDLQWRVXEVHWVWRUHWULHYHXVHIXOLQIRUPDWLRQIURPWKHGDWDVHW &OXVWHULQJLQYROYHVVHOHFWLQJWKHNFOXVWHUFHQWUHVUDQGRPO\DQGJURXSLQJWKDWGDWDDURXQGWKRVHFHQWUHV*HQHWLFDOJRULWKPVDUH KHXULVWLF DOJRULWKPV WKDW KDYH EHHQ DSSOLHG WR FOXVWHULQJ SUREOHP IRU RSWLPL]DWLRQ *HQHWLF DOJRULWKPV IROORZ WKH SURFHVV RI QDWXUDOVHOHFWLRQDQGZRUNLQLWHUDWLYHPDQQHUJHQHUDWLQJQHZSRSXODWLRQIURPWKHROGRQH7KHLQLWLDOSRSXODWLRQLVUDQGRPO\ LQLWLDOL]HG 7KH ZKROH LWHUDWLYH SURFHVV LV LQIOXHQFHG E\ WKH LQLWLDO YDOXHV VHOHFWHG DW VWDUW 6R WKH SURSHU VHOHFWLRQ DOVR DIIHFW RSWLPL]DWLRQSUREOHP ,QWKLVSDSHUZHKDYHSURSRVHGDILUHIO\EDVHGJHQHWLFDOJRULWKP)$* ZKHUHWKHLQLWLDOSRSXODWLRQLVVHOHFWHGIURPDSRRO RI SRSXODWLRQ RQ WKH EDVLV RI ILUHIO\ DOJRULWKPV )LUHIO\ DOJRULWKPV DUH DOVR ELRORJLFDOO\ LQVSLUHG DOJRULWKP DQG DUH XVHG WR RSWLPL]DWLRQ SUREOHP )$* DOJRULWKP LV WKHQ DSSOLHG WR WKH SXEOLFDOO\ DYDLODEOH GDWDVHWV IURP 8&, UHSRVLWRU\ 7KH UHVXOWV REWDLQHGDUHYHU\PXFKVDWLVIDFWRU\DQGFRPSHWLWLYHDVFRPSDUHWRWKHEDVLFJHQHWLFDQGILUHIO\DOJRULWKP © 2015 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license 7KH$XWKRUV3XEOLVKHGE\(OVHYLHU%9 (http://creativecommons.org/licenses/by-nc-nd/4.0/). 3HHUUHYLHZXQGHUUHVSRQVLELOLW\RIRUJDQL]LQJFRPPLWWHHRIWKH6HFRQG,QWHUQDWLRQDO6\PSRVLXPRQ&RPSXWHU9LVLRQDQGWKH Peer-review under responsibility of organizing committee of the Second International Symposium on Computer Vision and the Internet ,QWHUQHW9LVLRQ1HW¶ (VisionNet’15) .H\ZRUGV.H\ZRUGV&OXVWHULQJ*HQHWLFDOJRULWKPVILUHIO\DOJRULWKPV)$*DOJRULWKP
0DKHVKZDU (PDLODGGUHVVPDKHVKZDU#JPDLOFRP
1877-0509 © 2015 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). Peer-review under responsibility of organizing committee of the Second International Symposium on Computer Vision and the Internet (VisionNet’15) doi:10.1016/j.procs.2015.08.018
250
Maheshwar et al. / Procedia Computer Science 58 (2015) 249 – 256
,QWURGXFWLRQ &OXVWHULQJLVDQXQVXSHUYLVHGOHDUQLQJWHFKQLTXHXVHGWRFODVVLI\WKHGDWD0DQ\SUREOHPVLQGLIIHUHQWDUHDVOLNHGDWD PLQLQJ >@ GDWD FRPSUHVVLRQ >@ SDWWHUQ UHFRJQLWLRQ >@ KDYH EHHQ VROYHG XVLQJ FOXVWHULQJ WHFKQLTXH &OXVWHULQJ EDVLFDOO\LQYROYHVFKRRVLQJWKHNFOXVWHUFHQWUHVDURXQGZKLFKWKHGDWDLVFOXVWHUHG7KHFOXVWHUFHQWUHVDUHFKRVHQ UDQGRPO\ IURP WKH GDWD VHW 6HOHFWLQJ WKH FHQWUHV FDUHIXOO\ FDQ DIIHFW RXU UHVXOW *HQHWLF DOJRULWKPV DUH WKH FRPSXWDWLRQDOPRGHOVWKDWDUHEDVHGRQELRORJLFDOHYROXWLRQ>@>@*HQHWLFDOJRULWKPVFRPHXQGHUWKHFDWHJRULHV RI HYROXWLRQDU\ DOJRULWKPV DQG DUH JHQHULF SRSXODWLRQ EDVHG PHWDKHXULVWLF RSWLPL]DWLRQ DOJRULWKPV >@ *HQHWLF DOJRULWKP >@ ZDV ILUVW SURSRVHG DQG GHYHORSHG E\ -RKQ +ROODQG LQ *HQHWLF DOJRULWKP EDVHG DSSURDFK KDV EHHQXVHGIRUFODVVLILFDWLRQWDVNLQGDWDPLQLQJ>@,WKDVEHHQXVHGWRH[WUDFWIHDWXUHV>@>@,WLVDOVRXVHGWR GLVFRYHUNQRZOHGJHIURPGDWDEDVHE\FRPELQLQJZLWKQHXUDOQHWZRUN>@)X]]\FODVVLILFDWLRQEDVHGPRGHOVKDYH EHHQSURSRVHGLQSDSHU>@>@ZKLFKDUH EDVHGRQ JHQHWLFDOJRULWKPV*HQHWLFDOJRULWKPVDUHXVHGWRGHYHORS IDFHUHFRJQLWLRQV\VWHPVDQGGLVFRYHULQJUXOHVIURPELRORJLFDOGDWD>@>@DQG>@*HQHWLFDOJRULWKPVDUHDOVR XVHGIRUFOXVWHULQJWDVNRIGDWDPLQLQJ>@7KHFKURPRVRPHVDUHVHOHFWHGUDQGRPO\DVWKHFOXVWHUFHQWUH *HQHWLF DOJRULWKP EDVLFDOO\ ZRUNV LQ LWHUDWLYH PDQQHU DQG JHQHUDWHV WKH QHZ SRSXODWLRQ IURP WKH ROG SRSXODWLRQ (DFK VWULQJ LQ WKH SRSXODWLRQ LV UHSUHVHQWHG LQ ELQDU\ IRUP *HQHWLF DOJRULWKP LQYROYHV WKUHH JHQHWLF RSHUDWRUV QDPHGVHOHFWLRQFURVVRYHUDQGPXWDWLRQDQGDSSOLHVWKHVHRSHUDWRUVRQWKHLQLWLDOSRSXODWLRQVWULQJVWRSURGXFHD QHZJHQHUDWLRQRIVWULQJV7KLVLWHUDWLYHSURFHVVLPSURYHVWKHTXDOLW\RIWKHVROXWLRQVXFFHVVLYHO\7KHSURFHVVHQGV ZKHQDQRSWLPDOVROXWLRQLVIRXQG)LJXUHVKRZVKRZJHQHWLFDOJRULWKPLWHUDWLYHO\ILQGVWKHVROXWLRQWRDSUREOHP 7KHWKUHHEDVLFRSHUDWRUVSHUIRUPWKHIROORZLQJIXQFWLRQDOLWLHV x 6HOHFWLRQ 6HOHFWLRQ RSHUDWRU VHOHFWV D SURSRUWLRQ RI WKH H[LVWLQJ SRSXODWLRQ WR SURGXFH D QHZ SRSXODWLRQ 6HOHFWLRQFULWHULDLVEDVHGRQWKHYDOXHRIWKHILWQHVVIXQFWLRQLHSRSXODWLRQZLWKJRRGILWQHVVYDOXHLVVHOHFWHG IRUFURVVRYHUDWQH[WVWHS)LWQHVVIXQFWLRQLVWKHSUREOHPGHSHQGHQWKHXULVWLFIXQFWLRQDQGPHDVXUHVWKHTXDOLW\ RIWKHVROXWLRQ )LJ*HQHWLF$OJRULWKP
Maheshwar et al. / Procedia Computer Science 58 (2015) 249 – 256
x
x
251
&URVVRYHU 7KLV RSHUDWRU LV XVHG WR JHQHUDWH WKH QHZ SRSXODWLRQ ZKHQ WKH SDUHQW SRSXODWLRQ LV FURVVRYHU ,Q FURVVRYHUDUDQGRPSRLQWDORQJWKHOHQJWKRIFKURPRVRPHVLVVHOHFWHGDQGJHQHVRIRQHFKURPRVRPHDIWHUWKLV SRLQW DUH VZDSSHG ZLWK WKH JHQHV RI DQRWKHU FKURPRVRPH 'LIIHUHQW PHWKRGV DUH XVHG IRU VHOHFWLRQ RI FKURPRVRPHV EHIRUH DSSO\LQJ FURVVRYHU RSHUDWRU OLNH URXOHWWH ZKHHO VHOHFWLRQ UDQN VHOHFWLRQ WRXUQDPHQW VHOHFWLRQ HWF 0DQ\ FURVVRYHU WHFKQLTXHV DUH XVHG GHSHQGLQJ RQ WKH VHOHFWLRQ RI SRLQW DORQJ WKH OHQJWK RI FKURPRVRPHHJVLQJOHSRLQWFURVVRYHUWZRSRLQWFURVVRYHUFXWDQGVOLFHHWF>@ 0XWDWLRQ 0XWDWLRQ RSHUDWRU LV XVHG WR SURYLGH VWRFKDVWLFLW\ WR WKH VROXWLRQ DQG PDLQWDLQ JHQHWLF GLYHUVLW\ EHWZHHQWKH WZR JHQHUDWLRQV,Q PXWDWLRQDQDUELWUDU\ELWLVFKDQJHGIURPLWVLQLWLDOVWDWH'LIIHUHQW PXWDWLRQ W\SHV LQYROYH ELW VWULQJ PXWDWLRQ *DXVVLDQ PXWDWLRQ QRQXQLIRUP PXWDWLRQ HWF >@ 0XWDWLRQ RSHUDWRU SUHYHQWVWKHORFDOPLQLPDE\DYRLGLQJWKHFKURPRVRPHSRSXODWLRQIURPEHLQJVLPLODU
7KHVHWKUHHRSHUDWRUVDUHDSSOLHGLWHUDWLYHO\XQWLOWKHRSWLPL]HGVROXWLRQLVIRXQGVWDUWLQJIURPWKHLQLWLDOSRSXODWLRQ ZKLFKLVVHOHFWHGDWUDQGRP:HKDYHSURSRVHGDQDSSURDFKWRVHOHFWWKHLQLWLDOSRSXODWLRQXVLQJILUHIO\DOJRULWKP UDWKHUWKDQVHOHFWLQJLWUDQGRPO\,QWKLVDSSURDFKVRPHVHWVRIUHSUHVHQWDWLYHDUHFKRVHQWKDWUHSUHVHQWWKHZKROH SRSXODWLRQDQGSURYLGHEHWWHUVROXWLRQ $IWHUDEULHILQWURGXFWLRQWRWKHFOXVWHULQJDQGJHQHWLFDOJRULWKPLQVHFWLRQ,ZHKDYHGLVFXVVHGWKHZRUNLQJRIILUH IO\DOJRULWKPLQVHFWLRQ,,6HFWLRQ,,,GHVFULEHVWKHILUHIO\EDVHGLPSURYHGJHQHWLFDOJRULWKP)$*DOJRULWKP ,Q VHFWLRQ,9WKHSHUIRUPDQFHRIWKH)$*DOJRULWKPLVHYDOXDWHGDQGWKHUHVXOWVDUHFRPSDUHGZLWKJHQHWLFDQGILUHIO\ DOJRULWKPV6HFWLRQ9FRQFOXGHVWKHSDSHUDQGGLVFXVVHVGLUHFWLRQVIRUIXWXUHZRUNV )LUHIO\$OJRULWKP )LUHIO\DOJRULWKP)$ SURSRVHGE\;LQ6KH@7KLVRSWLPL]DWLRQWHFKQLTXHLVEDVHGRQWKHIDFWWKDWWKHHDFK ILUHIO\DWWUDFWVWRRWKHUILUHIO\RQWKHEDVLVRIWKHEULJKWQHVVLHILUHIO\ZLWKORZEULJKWQHVVLVDWWUDFWHGWRZDUGILUHIO\ ZLWK PRUH EULJKWQHVV DQG KHQFH VHDUFK VSDFH LV H[SORUHG HIILFLHQWO\ @ DQG UDQG LV UDQGRP QXPEHU JHQHUDWRU ZLWK QXPEHUV XQLIRUPO\GLVWULEXWHGLQUDQJH>@3DUDPHWHUȖFRQWUROVWKHYDULDWLRQLQDWWUDFWLYHQHVVDQGGHILQHFRQYHUJHQFH,Q PRVWRIFDVHVLWVYDOXHVOLHLQUDQJH>@ )LUHIO\DOJRULWKP 3DUDPHWHUV,QLWLDOL]DWLRQ W 0D[B*HQHUDWLRQWKHPD[LPXPQXPEHURIJHQHUDWLRQV 2EMHFWLYHIXQFWLRQI[ ZKHUH[ [[G ,QLWLDOSRSXODWLRQRIILUHIOLHVRU[LL Q /LJKWLQWHQVLW\,LIRUHDFKILUHIO\DW[LYLDI[L :KLOHW0D[B*HQHUDWLRQ )RUL WRQ )RUM WRQ ,I,M!,L PRYHILUHIO\LWRZDUGVMHQGLI (YDOXDWHQHZVROXWLRQVDQGXSGDWHOLJKWLQWHQVLW\ (QGIRUM (QGIRUL 5DQNWKHILUHIOLHVDQGILQGWKHFXUUHQWEHVW (QGZKLOH 3VHXGRFRGH)LUHIO\DOJRULWKP
)LUHIO\%DVHG*HQHWLF$OJRULWKP)$*$OJRULWKP $VGLVFXVVHGLQLQWURGXFWLRQVHFWLRQJHQHWLFDOJRULWKPLWHUDWLYHO\ILQGVWKHRSWLPXP VROXWLRQWRDSUREOHPVWDUWLQJ XVLQJ D UDQGRP SRSXODWLRQ RI FKURPRVRPH 7KLV UDQGRP SRSXODWLRQ VHOHFWLRQ VWHS LV SHUIRUPHG XVLQJ ILUHIO\ DOJRULWKPZKHUHWKHUHSUHVHQWDWLYHFKURPRVRPHVIURPWKHLQLWLDOO\VHOHFWHGUDQGRPSRSXODWLRQDUHFKRVHQDVYHU\ ILUVWSRSXODWLRQWRVWDUWWKHJHQHWLFDOJRULWKPSURFHVV7KHVHUHSUHVHQWDWLYHFKURPRVRPHVDUHVHOHFWHGJOREDOO\XVLQJ ILUHIO\DOJRULWKP)LUHIOLHVZLWKJOREDOEHVWSRVLWLRQFRQVWLWXWHWKHLQLWLDOSRSXODWLRQRIFKURPRVRPHV 7KH LQLWLDO SRSXODWLRQ LV SDUWLWLRQHG LQWR GLIIHUHQW VHWV RU FOXVWHU DQG WKH ILUHIO\ DOJRULWKP LV XVHG WR FRPSXWH WKH FHQWHU RI HDFK FOXVWHU 7KLV FHQWUH ZLOO UHSUHVHQW WKH ZKROH FOXVWHU RU VHWRI SRSXODWLRQ DQG ZLOO SDUWLFLSDWH LQ WKH JHQHWLFDOJRULWKPSURFHVV3VHXGRFRGHVKRZVKRZWKHZKROHSURFHVVLVSHUIRUPHG 3VHXGRFRGHJLYHVDQDEVWUDFWLGHDRIWKHSURSRVHGDSSURDFK7KH)$*DOJRULWKPLVGLYLGHGLQWRWZRVWDJHV,Q ILUVWVWDJH)LUHIO\DOJRULWKP)$ LVDSSOLHGWRGLIIHUHQWVHWVRILQLWLDOO\VHOHFWHGUDQGRPSRSXODWLRQ7KHSDUDPHWHUV VHOHFWLRQIRU)$LV 3RSXODWLRQ6L]H ,QQHUORRSLWHUDWLRQV Ȗ ȕ Į 7R FDOFXODWH WKH JOREDO EHVW SRVLWLRQ WKH DWWUDFWLYHQHVV RI RQH ILUHIO\ WRZDUG DQRWKHU LV GHILQHG XVLQJ HTXDWLRQ JLYHQEHORZ>@ [L [LȕHȖLM[M±[L ȕHȖLJEHVW[EHVW[L ĮUDQG±
Maheshwar et al. / Procedia Computer Science 58 (2015) 249 – 256
253
:KHUHJEHVWLVWKHJOREDORSWLPDODQG[EHVWLVWKHJOREDOEHVWSRVLWLRQRIWKHILUHIO\ 6WDUW 3DUDPHWHULQLWLDOL]DWLRQ 'HILQHVVHWVRILQLWLDOUDQGRPVHOHFWHGSRSXODWLRQ W 0D[B*HQHUDWLRQWKHPD[LPXPQXPEHURIJHQHUDWLRQV 2EMHFWLYHIXQFWLRQI[ ZKHUH[ [[G ,QLWLDOSRSXODWLRQRIILUHIOLHVRU[LL Q /LJKWLQWHQVLW\,LIRUHDFKILUHIO\DW[LYLDI[L )RUHDFKVHWWRV :KLOHW0D[B*HQHUDWLRQ )RUL WRQ )RUM WRQ ,I,M!,L PRYHILUHIO\LWRZDUGVMHQGLI (YDOXDWHQHZVROXWLRQVDQGXSGDWHOLJKWLQWHQVLW\ (QGIRUM (QGIRUL 5DQNWKHILUHIOLHVDQGILQGWKHJOREDOEHVWDQGILQGWKH SRVLWLRQRIWKHILUHIO\ZLWKJOREDOEHVW (QGZKLOH (QGIRUHDFK $SSO\JHQHWLFDOJRULWKPZLWKWKHFDOFXODWHGQHZ UHSUHVHQWDWLYHFKURPRVRPHV (QG 3VHXGRFRGH,PSURYHG*HQHWLF$OJRULWKP
7KHILUVWVWDJHUHVXOWVLQWRWKHJHQHUDWLRQRIFKURPRVRPHWKDWUHSUHVHQWVZKROHVHW7KHVHFKURPRVRPHVDUHQRWKLQJ EXW WKH ILUHIO\ ZLWK JOREDO EHVW SRVLWLRQ FDOFXODWHG XVLQJ )$ DOJRULWKP 7KHVH FKURPRVRPHV QRZ SODFHG LQ WKH PDWLQJSRROIURPZKHUHWKHVHWDNHSDUWLQFURVVRYHUDQGPXWDWLRQSURFHVVRIWKHJHQHWLFDOJRULWKP ([SHULPHQW$QG5HVXOW ([SHULPHQWVKDYHEHHQFRQGXFWHGRQIRXUSXEOLFO\DYDLODEOHGDWDVHWVQDPHO\,ULVJODVVEUHDVWFDQFHUDQGZLQHIURP 8&,PDFKLQHUHSRVLWRU\>@7KHWDEOHEHOORZVXPPDUL]HVWKHFKDUDFWHULVWLFVRIWKHVHGDWDVHWV &ROXPQVVSHFLI\WKHQRRIVDPSOHWKHQRRIDWWULEXWHVDQGFODVVHVRIHDFKGDWDVHW
254
Maheshwar et al. / Procedia Computer Science 58 (2015) 249 – 256
7DEOH'DWDVHWV
'DWD6HW
,QVWDQFHV
$WWULEXWHV
&ODVVHV
,ULV
*ODVV
%UHDVW&DQFHU
:LQH
7KH UHVXOWV RI )$* DOJRULWKPV DUH FRPSDUHG WR JHQHWLF DQG ILUHIO\ DOJRULWKP &RPSDULVRQ VWXG\ VKRZV WKDW WKH UHVXOWVDUHYHU\VDWLVIDFWRU\ 7DEOH DQG WDEOH VXPPDUL]H WKH UHVXOW REWDLQHG ZKLOH FDOFXODWLQJ WKH LQWHU FOXVWHU GLVWDQFH DQG LQWUD FOXVWHU GLVWDQFHIRUHDFKRIWKHDOJRULWKPRYHUWKHGLIIHUHQWGDWDVHWV,WLVFOHDUIURPWDEOHWKDWWKHUHVXOWVDUHPXFKEHWWHULQ FDVHRI)$*DOJRULWKPWKDQRWKHUWUDGLWLRQDODOJRULWKPV 7DEOH&RPSDULVRQRILQWHUFOXVWHUGLVWDQFH
$OJRULWKP ,ULV
)LUH)O\ $OJRULWKP
*HQHWLF DOJRULWKP
)$* $OJRULWKP
*ODVV
%UHDVW &DQFHU :LQH
7DEOH&RPSDULVRQRILQWUDFOXVWHUGLVWDQFH
$OJRULWKP ,ULV
)LUH)O\ $OJRULWKP
*HQHWLF DOJRULWKP
)$* $OJRULWKP
*ODVV
%UHDVW&DQFHU
:LQH
)LJXUHD DQGE VKRZWKHLQWHUFOXVWHUGLVWDQFHDQGLQWUDFOXVWHUGLVWDQFHFRPSDULVRQJUDSKRI)$*DOJRULWKP DQGUHYHDOVWKDWWKHLQWHUFOXVWHUGLVWDQFHLQ)$*DOJRULWKPLVPRUHWKDQWKHILUHIO\DOJRULWKPDQGJHQHWLFDOJRULWKP
255
Maheshwar et al. / Procedia Computer Science 58 (2015) 249 – 256
0.035 0.03 0.025 0.02 0.015 0.01 0.005 0
Wine
FAG Algorithm
Breast Cancer
Genetic algorithm
Fire Fly Algorithm
Glass
Fire Fly Algorithm
Iris
0.1 0.08 0.06 0.04 0.02 0
Genetic algorithm FAG Algorithm
)LJD ,QWHUFOXVWHUGLVWDQFHE ,QWUDFOXVWHUGLVWDQFH
&RQFOXVLRQ$QG)XWXUH:RUN ,Q WKLV SDSHU ZH KDYH SURSRVHG DQG LPSOHPHQWHG )$* DOJRULWKP IRU VHOHFWLQJ WKH LQLWLDO SRSXODWLRQ WDNLQJ SDUWLFLSDWLRQLQJHQHWLFDOJRULWKPDQGGHWDLOHGKRZILUHIO\DOJRULWKPFDQEHXVHGWRLPSURYHWKHJHQHWLFDOJRULWKP E\XVLQJLWWRVHOHFWLQLWLDOUDQGRPSRSXODWLRQRIFKURPRVRPH7KLVDOJRULWKPDOVRUHVXOWVLQJOREDORSWLPL]DWLRQDW LQLWLDOVWDWHRIJHQHWLFDOJRULWKPDQGSUHYHQWIURPJHWWUDSSHGLQORFDOPLQLPD7KHH[SHULPHQWVDUHFRQGXFWHGRQ GDWDVHWVWDNHQIURP8&,PDFKLQHUHSRVLWRU\DQGFRPSDULVRQVUHYHDOWKDWWKHUHVXOWVDUHPXFKVDWLVIDFWRU\ZLWK)$* DOJRULWKP )XWXUHZRUN7KHUHDUHDQXPEHURIGLUHFWLRQVWRIXWXUHZRUN)LUVWLWZRXOGEHLQWHUHVWLQJWRDSSO\WKHDOJRULWKP RQRWKHUSXEOLFDOO\DYDLODEOHGDWDVHWIURPXFLOHDUQLQJUHSRVLWRU\>@DQGFKHFN WKHXQLYHUVDOLW\RIWKHDOJRULWKP 6HFRQG WKH SURSRVHG DSSURDFK FDQ EH XVHG WR VROYH GLIIHUHQW RSWLPL]DWLRQ SUREOHPV DQG GLIIHUHQW WDVN RI GDWD PLQLQJOLNHFODVVLILFDWLRQ7KLUGPHWDOHDUQLQJWHFKQLTXHV>@FDQEHDQLQWHUHVWLQJIXWXUHGLUHFWLRQ 5HIHUHQFHV
&3L]]XWLDQG'7DOLDµµ3$XWR&ODVVVFDODEOHSDUDOOHOFOXVWHULQJIRUPLQLQJODUJHGDWDVHWV¶¶LQ,(((WUDQVDFWLRQRQ.QRZOHGJHDQGGDWD HQJLQHHULQJ9ROSS0D\ - 0DUU ³&RPSDULVRQ 2I 6HYHUDO &OXVWHULQJ $OJRULWKPV IRU 'DWD 5DWH &RPSUHVVLRQ RI /3& 3DUDPHWHUV¶¶ LQ ,((( ,QWHUQDWLRQDO &RQIHUHQFHRQ$FRXVWLFV6SHHFKDQG6LJQDO3URFHVVLQJ9ROSS-DQXDU\ .&:RQJDQG*&//L³6LPXOWDQHRXV3DWWHUQDQG'DWD&OXVWHULQJIRU3DWWHUQ&OXVWHU$QDO\VLV¶¶LQ,(((7UDQVDFWLRQRQ.QRZOHGJH DQG'DWD(QJLQHHULQJ9ROSS/RV$QJHOHV86$-XQH :LNLSHGLDRUJHYROXWLRQDU\DOJRULWKP 6WXDUW-5XVVHOO3HWHU1RUYLJ $UWLILFLDO,QWHOOLJHQFH$0RGHUQ$SSURDFK '(*ROGEHUJ*HQHWLFDOJRULWKPVLQVHDUFKRSWLPLD]DWLRQDQGPDFKLQHOHDUQLQJ$GGLVRQ:HVOH\5HDGLQJ0$ 3HL 0 *RRGPDQ (' 3XQFK ) )HDWXUH([WUDFWLRQ XVLQJJHQHWLFDOJRULWKP&DVH &HQWHU IRU &RPSXWHU$LGHG (QJLQHHULQJDQG 0DQXIDFWXULQJ:'HSDUWPHQWRI&RPSXWHU6FLHQFH &ODVVLILFDWLRQWDVNXVLQJJHQHWLFDOJRULWKP .HUPDQL%*:KLWH0:1DJOH+7)HDWXUHH[WUDFWLRQE\JHQHWLFDOJRULWKPVIRUQHXUDOQHWZRUNVLQEUHDVWFDQFHUFODVVLILFDWLRQ (QJLQHHULQJLQ0HGLFLQHDQG%LRORJ\6RFLHW\,(((WK$QQXDO&RQIHUHQFHYROQRSSYRO6HS .DQQDQ $ 0DJXLUH *4 6KDUPD $ 6FKRR 3 *HQHWLF $OJRULWKP %DVHG )HDWXUH 6HOHFWLRQ $OJRULWKP IRU (IIHFWLYH ,QWUXVLRQ 'HWHFWLRQLQ&ORXG1HWZRUNV 'DWD0LQLQJ:RUNVKRSV,&'0: ,(((WK,QWHUQDWLRQDO&RQIHUHQFHRQYROQRSS 'HF =KRX