Gabor Feature Selection Based on Information Gain

Available online at www.sciencedirect.com

ScienceDirect Procedia Engineering 181 (2017) 892 – 898

10th International Conference Interdisciplinarity in Engineering, INTER-ENG 2016

Szidónia Lefkovits a, László Lefkovits b

a Petru Maior University, Nicolae Iorga Street 1, Tîrgu-Mureș 540088, Romania
b Sapientia University, Corunca 1C, Tîrgu-Mureș 540485, Romania

Abstract

In the field of machine vision, object detection has become a popular area over the past several years. It is applied on a large scale in scientific research, such as bioinformatics, machine learning and computer vision, in everyday life, for example in traffic supervision, access control, identification and authentication systems, and also in industry and robotics. Every application has its own particularities and works only under some well-defined conditions. The main difficulty of general object detection comes from the extreme diversity in which objects appear: they vary widely in aspect, form, dimension, color, position, rotation angle, illumination, shadow and occlusion.

In this paper we use numerous Gabor filters for feature extraction, specially tuned for global face and local eye detection. Because of the high dimensionality of the data, the obtained features are hardly manageable. We therefore propose to apply feature selection in the training and test phases. Feature selection is an important step in almost every data mining problem. The selection of the most representative feature descriptors is done by measuring the pairwise entropy of the filter responses. The final classification result is given by the most informative filter responses, obtained from the information gain of the weak classifiers computed from the corresponding filter responses on the training set.

In addition, this paper compares the learning methods used in our previous works with the currently proposed approach, examining the role of measuring the information gain and the mutual information between the selected filters.

© 2017 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
Peer-review under responsibility of the organizing committee of INTER-ENG 2016.

Keywords: feature selection; Gabor filters; information gain; mutual information





Corresponding author. E-mail address: szidonia.lefkovits@science.upm.ro

ISSN 1877-7058

doi:10.1016/j.proeng.2017.02.482


Szidónia Lefkovits and László Lefkovits / Procedia Engineering 181 (2017) 892 – 898

1. Introduction and Related Work

The feature selection task is an important part of every data mining problem. In image processing and object detection from 2D or 3D images it is even more essential, because of the large amount of information that can be obtained from these types of input data. In order to extract the useful information to be applied in classification, several discriminative features have to be computed [ ].

In general, feature selection methods are divided into two categories: filter methods and wrapper methods. Wrapper methods evaluate the performance of each feature by means of a classifier. Filter methods evaluate features by the useful information they may carry from the point of view of the proposed task. Wrapper methods obtain better performance, whereas filter methods are much faster.

In computer vision a wide range of standard features has been proposed and used, such as Scale Invariant Feature Transform (SIFT) [ ], PCA-SIFT (Principal Component Analysis SIFT) [ ], HOG (Histogram of Oriented Gradients) [ ], Bag of Words [ ] and similarity measure [ ].

Surely, none of these features applied on its own can lead to the desired detection performance. This is the reason why they have to be included in complex detection systems. Selecting the most adequate set of features is a crucial part of every classification problem.

In our system we have dealt with Gabor wavelets [ ], used as interest point detectors and feature descriptors. As is known, an infinite number of Gabor features can be defined. Not all of these features can be considered relevant. This paper studies the information gain and the redundancy of the selected features.

Several feature ranking methods have been proposed in the literature: correlation-based feature selection [ ], information gain attribute evaluation [ ], gain ratio attribute evaluation [ ], feature selection based on mutual information with the criteria of max-dependency, max-relevance and min-redundancy [ ], double input symmetrical relevance (DISR), relying on a measure of variable complementarity [ ], and maximization of global information [ ]. In this article the selection of the most adequate features for the object classification task is based on the information theoretical concepts of information gain and mutual information.

The organization of this paper is as follows. The first section, the introduction, presents the most important arguments for feature selection and ranking, and briefly refers to the most important features and information based selection methods applied in the domain. Section 2 presents our system and describes the features used, combined with redundancy elimination and information gain ranking. Section 3 presents our results, comparing the methods without and with the information theoretical selection. Finally, conclusions are drawn and future improvements are described.

2. Our Detection System

The created system is a local aspect based detection approach consisting of three parts: detection of interest points, the local descriptor, and the object model. The selection of interest points and the local patch descriptor are implemented by using a multitude of two-dimensional Gabor wavelets. The next step is the classification of the obtained feature vectors which describe the object part best. The choice of Gabor filters for object detection is supported by several authors [ ], who have proven the validity of Gabor wavelets, which model the response of the cells of human visual cortical fields.

The Gabor wavelet is a sinusoidal plane wave modulated by a Gaussian surface in 2D space. Every image can be decomposed over a selected basis of Gabor wavelets. In other words, every Gabor wavelet extracts a certain frequency domain of an image. The goal is to select those Gabor filters which extract different frequencies from the same image, in such a way that the obtained responses distinguish the object from the background as well as possible.

The disadvantage of Gabor filters comes from their analytical form (see equation 1), which depends on 9 parameters:

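$$g(x, y) = k \, e^{-\pi \left[ \frac{(x - x_0)_r^2}{\alpha^2} + \frac{(y - y_0)_r^2}{\beta^2} \right]} \, e^{\, i \left[ \xi_0 (x - x_0) + \nu_0 (y - y_0) + P \right]} \tag{1}$$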
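As a rough numerical illustration of equation (1), the sketch below evaluates the complex Gabor function at a point and samples it on a small grid, using only the Python standard library. The function names `gabor_value` and `gabor_kernel` are hypothetical, and the rotation convention used for the subscript r (rotating the centered coordinates by θ) is an assumption, since the text does not spell it out.

```python
import cmath
import math


def gabor_value(x, y, k, theta, alpha, beta, x0, y0, xi0, nu0, phase):
    """Evaluate the complex Gabor function of equation (1) at the point (x, y)."""
    dx, dy = x - x0, y - y0
    # Assumed convention for the subscript r: rotate the centered point by theta.
    xr = dx * math.cos(theta) + dy * math.sin(theta)
    yr = -dx * math.sin(theta) + dy * math.cos(theta)
    envelope = k * math.exp(-math.pi * (xr ** 2 / alpha ** 2 + yr ** 2 / beta ** 2))
    carrier = cmath.exp(1j * (xi0 * dx + nu0 * dy + phase))
    return envelope * carrier


def gabor_kernel(size, **params):
    """Sample equation (1) on a size x size grid centered on the origin (size odd)."""
    half = size // 2
    return [
        [gabor_value(x, y, x0=0.0, y0=0.0, **params) for x in range(-half, half + 1)]
        for y in range(-half, half + 1)
    ]


# Illustrative parameter values only; they are not taken from the paper.
kernel = gabor_kernel(5, k=1.0, theta=0.0, alpha=3.0, beta=3.0,
                      xi0=math.pi / 2, nu0=0.0, phase=0.0)
```

A filter response at a patch center could then be formed as the sum of the pixel values weighted by the kernel entries. Whether the real part, the imaginary part or the magnitude of that sum is used as the feature is not stated in the text.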



 


In equation (1), k is the amplitude of the Gaussian envelope, θ the rotation angle of the Gaussian and the plane wave, (α, β) the standard deviations of the Gaussian in 2D, (x0, y0) the center of the Gaussian, (ξ0, ν0) the spatial frequency of the sinusoidal wave and P the phase of the wave. These 9 parameters can be reduced to only 4, namely (λ, bw, α, θ), by applying some constraints described in [ ], where λ is the wavelength, bw the bandwidth, α the attenuation of the Gaussian in the horizontal direction, and θ the angle enclosed by the horizontal axis of the Gaussian and the propagation direction of the wave. Considering these parameters, we have defined in our previous works [ ] the most adequate Gabor filters, which are able to distinguish the target object from other objects or from the background.

Based on these, the system computes the filter responses centered on the image patch. In order to choose only the most representative filters, and the weight of each one in the final decision, a learning algorithm has to be applied. We have applied the GentleBoost [ ] algorithm, combining it with information gain feature selection and mutual information evaluation. The idea of measuring the redundancy of the selected classifiers came from [ ].

The classification performance of every filter is measured on the training set. Taking into account the error rate of every such weak classifier computed from the corresponding filter, a descending order of classification is set.

The novel part of this paper is the feature selection based on the information gain of the classifiers. The next step is the measurement of the mutual information between each already selected classifier and the currently evaluated classifier. If the redundancy of a pair of classifiers is high, then the second, later selected classifier is neglected. That second classifier has a lower information gain than the previously selected one, while carrying similar information, as indicated by the mutual information.

The basic concepts of information theory are the entropy and the mutual information. The entropy H(X) is the uncertainty of a discrete random variable X with probability distribution p(x), having the form [ ]:

$$H(X) = -\sum_{x \in X} p(x) \log p(x) \tag{2}$$

Mutual information measures the mutual dependence of two random variables X and Y. It can be computed from the marginal probabilities p(x), p(y) of the two variables and their joint probability p(x, y):

$$I(X, Y) = \sum_{x \in X} \sum_{y \in Y} p(x, y) \log \frac{p(x, y)}{p(x) \, p(y)} \tag{3}$$
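Equation (3) can likewise be estimated from paired observations. The sketch below is one possible implementation, assuming that the marginal and joint probabilities are estimated by counting and that base-2 logarithms are used; the name `mutual_information` is hypothetical.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Empirical I(X, Y) in bits from two equally long sequences of discrete values."""
    n = len(xs)
    if n == 0 or n != len(ys):
        raise ValueError("xs and ys must be non-empty and equally long")
    count_x, count_y = Counter(xs), Counter(ys)
    count_xy = Counter(zip(xs, ys))
    total = 0.0
    # Only pairs that actually occur contribute; unseen pairs have p(x, y) = 0.
    for (x, y), c in count_xy.items():
        p_xy = c / n
        total += p_xy * math.log2(p_xy / ((count_x[x] / n) * (count_y[y] / n)))
    return total
```

Two identical balanced binary sequences give 1 bit, while two independent ones give 0, which is the behaviour the redundancy check in the text relies on.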



 

In theory, information gain is an entropy-based measure; it is defined as the amount of information provided by the feature items for the considered class. Information gain quantifies how much a term contributes to the classification of information, and thus measures the importance of features for the classification [ ]. The information gain can be calculated in the following way. The information content of the whole training set is [ ]:

$$H(C) = -\sum_{i=1}^{c} p(C_i) \log p(C_i) \tag{4}$$

where C is the whole training set, the number of classes is c = 2, and p(Ci) is the probability of samples from class Ci, computed from the entities in the training set as p(Ci) = ni/n, where ni is the number of instances in class i and n is the total number of instances.

The information gain (IG) is the difference between the entropy of the class and the conditional entropy of the class given the selected feature f:


$$IG = H(C) - p(f) \, H(C \mid f) - p(\bar{f}) \, H(C \mid \bar{f}) \tag{5}$$

 

where p(f) and p(f̄) are the probabilities of the presence and the absence of feature f, and H(C|f) and H(C|f̄) are the entropies of the class distributions conditioned on the presence and on the absence of feature f, respectively. The explicit formula of IG is [ ]:

$$IG = -\sum_{i=1}^{c} p(C_i) \log p(C_i) + p(f) \sum_{i=1}^{c} p(C_i \mid f) \log p(C_i \mid f) + p(\bar{f}) \sum_{i=1}^{c} p(C_i \mid \bar{f}) \log p(C_i \mid \bar{f}) \tag{6}$$

This formula measures the usefulness of feature f in the classification. If IG is greater than the previous value obtained without the feature f, then the currently selected feature f is more useful for classification.

These theoretical considerations have been applied in our system, and the results and experiments are presented in the following section.
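The information gain of equations (5) and (6) can be sketched in a few lines once the presence or absence of a feature is available per training sample. In the sketch below, the binary flag `present` is assumed to be the thresholded output of a weak classifier; that mapping, the function names and the base-2 logarithm are our assumptions, not details confirmed by the text.

```python
import math
from collections import Counter


def entropy(values):
    """Empirical entropy (base 2) of a sequence of discrete values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())


def information_gain(labels, present):
    """IG = H(C) - p(f) H(C|f) - p(not f) H(C|not f), following equation (5).

    labels  : class label of each training sample
    present : truthy where the feature f is present for that sample
    """
    n = len(labels)
    if n == 0 or n != len(present):
        raise ValueError("labels and present must be non-empty and equally long")
    gain = entropy(labels)
    for flag in (True, False):
        subset = [c for c, p in zip(labels, present) if bool(p) == flag]
        if subset:  # an empty branch has probability 0 and contributes nothing
            gain -= len(subset) / n * entropy(subset)
    return gain
```

A feature that splits the two classes perfectly reaches the full class entropy, which is 1 bit for a balanced two-class set, while a feature that is independent of the class yields a gain of 0.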

3. Results and Experiments

Based on the theoretical concepts described above, we have created our descriptor and detector. First, we have extracted the interest points by computing two-dimensional Gabor filter responses. The set of image features which had to be classified was defined in a 4-dimensional space (λ, bw, α, θ). In this space we have considered filters with different parameters, which were selected as described in our previous articles [ ]. The training consists of classifying the object by the Gabor filter feature vectors obtained.

The next phase was the application of the following algorithm, which considers the information gain of each feature based weak classifier and the mutual information between every pair of feature based classifiers.

The first step of the proposed algorithm is the computation of the information gain of the determined features on the training set. The information gain value obtained from the training set determines a ranking of the features. Based on this evaluation, the least important features were eliminated. The next step is the application of the GentleBoost classification algorithm only on the remaining set of important features. The original GentleBoost algorithm [ ] is modified by considering not only the best regression stump, in other words the best feature based weak classifier having the lowest error on the training set, but also by computing the mutual information between the current weak classifier and the already selected classifiers. If this mutual information is greater than a threshold value, then the current weak classifier is neglected, and the previously selected similar classifier with higher information gain remains selected in the final classifier.

Info gain and mutual information based classification algorithm
Input: set of Gabor features S with feature responses x_k ∈ X and two target classes y_k
Output: classifier F
 1: compute the information gain for all the selected feature-based weak classifiers
 2: sort the features by the info gain rank obtained at step 1
 3: select the best N features and apply the GentleBoost algorithm
 4: foreach f_i in S
 5:   compute the weak regression stump f_i = a · (x_k > θ) + b, for all k
 6:   select the best regression stump with lowest error rate, f_min(a, b, θ)
 7:   compute the mutual information H_j between f_min and every f_j ∈ F
 8:   if H_j > threshold then goto step 4 endif
 9:   update the final classifier F = F + f_min
10: endfor
11: return the final classifier F
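The algorithm listed above can be sketched in plain Python as follows. This is a simplified illustration and not the authors' implementation. It assumes labels in {-1, +1}, takes the binary decision `x > theta` of each stump as the quantity on which information gain and mutual information are measured, and, when the best stump is redundant, falls back to the next best candidate of the same round. The names `select_and_boost`, `fit_stump` and `predict` are hypothetical.

```python
import math
from collections import Counter


def entropy(values):
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())


def mutual_info(us, vs):
    # I(U, V) = H(U) + H(V) - H(U, V), algebraically equal to equation (3).
    return entropy(us) + entropy(vs) - entropy(list(zip(us, vs)))


def info_gain(labels, bits):
    # Equation (5), with "stump fires" standing in for "feature present".
    gain = entropy(labels)
    for flag in (True, False):
        subset = [c for c, b in zip(labels, bits) if b == flag]
        if subset:
            gain -= len(subset) / len(labels) * entropy(subset)
    return gain


def fit_stump(column, y, w):
    """Weighted least-squares stump a * (x > theta) + b -> (error, a, b, theta)."""
    best = None
    for theta in sorted(set(column))[:-1]:  # both sides of the split stay non-empty
        hi = [i for i, x in enumerate(column) if x > theta]
        lo = [i for i, x in enumerate(column) if x <= theta]
        b = sum(w[i] * y[i] for i in lo) / sum(w[i] for i in lo)
        a = sum(w[i] * y[i] for i in hi) / sum(w[i] for i in hi) - b
        err = sum(w[i] * (y[i] - (a * (column[i] > theta) + b)) ** 2
                  for i in range(len(y)))
        if best is None or err < best[0]:
            best = (err, a, b, theta)
    return best


def select_and_boost(columns, y, n_keep, rounds, mi_threshold):
    """columns[j] holds the responses of filter j; y holds labels in {-1, +1}."""
    n = len(y)
    w = [1.0 / n] * n
    # Steps 1-3: rank filters by the information gain of their stump, keep n_keep.
    ranked = []
    for j, col in enumerate(columns):
        stump = fit_stump(col, y, w)
        if stump is not None:  # constant responses cannot be split
            ranked.append((info_gain(y, [x > stump[3] for x in col]), j))
    kept = [j for _, j in sorted(ranked, reverse=True)[:n_keep]]
    chosen = []  # entries are (filter index, a, b, theta)
    for _ in range(rounds):
        # Steps 4-6: refit every kept stump under the current weights, best first.
        candidates = sorted(fit_stump(columns[j], y, w) + (j,) for j in kept)
        pick = None
        for err, a, b, theta, j in candidates:
            bits = [x > theta for x in columns[j]]
            # Steps 7-8: skip a stump that is redundant with an earlier choice.
            if all(mutual_info(bits, [x > t for x in columns[k]]) <= mi_threshold
                   for k, _, _, t in chosen):
                pick = (j, a, b, theta)
                break
        if pick is None:
            break  # every remaining candidate is redundant
        chosen.append(pick)
        j, a, b, theta = pick
        # GentleBoost weight update followed by renormalization.
        w = [wi * math.exp(-yi * (a * (xi > theta) + b))
             for wi, yi, xi in zip(w, y, columns[j])]
        total = sum(w)
        w = [wi / total for wi in w]
    return chosen


def predict(chosen, sample):
    """Sign of the additive classifier F for one sample (a list of responses)."""
    score = sum(a * (sample[j] > theta) + b for j, a, b, theta in chosen)
    return 1 if score >= 0 else -1
```

On a toy set with two identical filters and one different filter, a low `mi_threshold` makes the second round skip the duplicate and pick the different filter, whereas a very high threshold lets the same filter be chosen again. The candidate thresholds are simply the observed response values, which keeps the sketch short but would be slow for large training sets.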


The information based ranking of the first … selected filter based classifiers is given in Table 1; the columns give the rank, the information gain in descending order, and the parameter set of the features.

Table 1. Information gain of weak classifiers based on Gabor features
No | Info gain | λ | θ° | S | bw
The most irrelevant features, those with an information gain of less than …, were not included in the table above. After selecting the most informative filters, we have also measured the mutual information of every pair of selected filter based classifiers. The highest mutual information values measured between them are given in Table 2.

Table 2. Mutual information between pairs of filters
No | Mutual info | λ1 | θ1° | S1 | bw1 | λ2 | θ2° | S2 | bw2
It can be observed that the feature pairs that carry the same information have a maximum mutual information value of …; the pairs having mutual information of … and … carry different information and can be considered irredundant. The whole table of redundant Gabor filters can be studied in our previous article [ ].

If we analyze the selected filters, we may observe that the filters with the highest mutual information have similar parameters. In Table 2 the columns (λ1, θ1, S1, bw1) refer to the parameters of the first filter, and the columns (λ2, θ2, S2, bw2) refer to the filter which is redundant compared to the first. This fact suggested that we eliminate the second filters, which have lower information gain, from the final classification process.

Based on these measurements, out of the total of … feature vectors we have considered the best …, and we measured the classification performance on both the training and the test set.

For the training we have used the FERET [ ] database combined with our own set of images. The training set consists of … positive and … negative examples, and the test set of … and … patches. The image patch used in the training phase is centred on the eye, and the negative images have been extracted randomly from the face, but not from the eye.

During our experiments we have compared the classification results of the GentleBoost algorithm [ ] in three settings: first, without including the information gain observations; second, including the info gain based ranking of the filters; and finally, eliminating the redundant filters from the classification process. These results can be seen in Table 3.

The difference between the three classification methods is negligible. However, an important advantage of information gain evaluation and redundancy elimination comes from the speed-up of computation time.

Table 3. Comparison of detection rate for test images
Columns: GentleBoost [ ] [ ] | GB with Info gain [ ] | GB with irredundancy [ ]
Rows: Detection Rate | False Positive Error Rate | False Negative Error Rate
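For readers who want to reproduce the kind of figures reported in Table 3, the three quantities can be computed from true and predicted labels as sketched below. The exact definitions used in the paper are not stated, so this sketch assumes the common ones: detection rate as the fraction of positives found, false negative error rate as the fraction of positives missed, and false positive error rate as the fraction of negatives accepted. The function name `detection_metrics` is hypothetical.

```python
def detection_metrics(y_true, y_pred):
    """Rates for labels coded as +1 (eye patch) and -1 (non-eye patch)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == -1)
    fp = sum(1 for t, p in pairs if t == -1 and p == 1)
    tn = sum(1 for t, p in pairs if t == -1 and p == -1)
    if tp + fn == 0 or fp + tn == 0:
        raise ValueError("both classes must be present in y_true")
    return {
        "detection_rate": tp / (tp + fn),        # assumed: TP / all positives
        "false_positive_rate": fp / (fp + tn),   # assumed: FP / all negatives
        "false_negative_rate": fn / (tp + fn),   # assumed: FN / all positives
    }
```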







Compared to the GentleBoost algorithm, the features with the highest information gain can be obtained with simple probability calculus. The mutual information requires only a pairwise evaluation, and based on the measured results, redundant features can be directly eliminated. The figure below illustrates the eye detection without and with the information theoretical computation combined with GentleBoost classification. As can be seen, the differences are hardly observable.

Fig. 1. Detection example from the FERET dataset: GentleBoost vs. info gain and mutual information classification.

4. Conclusion and Future Work

This paper presents a local part-based descriptor and detector for facial feature detection. The features are determined by specially defined, fine-tuned Gabor filter responses, presented in detail in our previous papers [ ]. The classification is done in three steps. First, the information gain is computed on the training set for every feature considered. Second, the GentleBoost algorithm is applied, but only on the most informative features remaining after the info gain based feature selection step. The last step, included in the GentleBoost algorithm, is the computation of the mutual information between the selected classifiers. If the mutual information between pairwise considered weak classifiers is too high, then the later selected classifier is dropped. This slight modification of the algorithm brings little change in the overall classification performance of the classifier, but its advantage is the preliminary elimination of informatively useless features.

As future work we propose to combine the presented methods for other facial features, in order to obtain a higher detection accuracy even if the face is partially occluded.

References

[1] S. Appavu, R. Rajaram, M. Nagammai, N. Priyanga, S. Priyanka, Bayes theorem and information gain based feature selection for maximizing the performance of classifiers, in: Advances in Computer Science and Information Technology, Springer, 2011.
[2] B. Azhagusundari, A.S. Thanamani, Feature selection based on information gain, International Journal of Innovative Technology and Exploring Engineering (IJITEE).
[3] T. Berg, P.N. Belhumeur, POOF: Part-Based One-vs-One Features for Fine-Grained Categorization, Face Verification, and Attribute Estimation, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[4] N. Dalal, B. Triggs, Histograms of oriented gradients for human detection, Computer Vision and Pattern Recognition (CVPR).


[5] W. Duch, J. Biesiada, T. Winiarski, K. Grudziński, K. Grąbczewski, Feature selection based on information theory filters, in: Neural Networks and Soft Computing, Physica-Verlag HD.
[6] FERET Database, http://www.itl.nist.gov/iad/humanid/feret/feret_master.html
[7] J. Friedman, T. Hastie, R. Tibshirani, Additive logistic regression: a statistical view of boosting (with discussion and a rejoinder by the authors), The Annals of Statistics.
[8] J. Han, M. Kamber, J. Pei, Data mining: concepts and techniques, Elsevier.
[9] T. Hastie, R. Tibshirani, J.H. Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Springer Series in Statistics.
[10] T.S. Lee, Image representation using 2D Gabor wavelets, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[11] Sz. Lefkovits, Novel Gabor Filter-Based Patch Descriptor, Jubilee International Symposium on Intelligent Systems and Informatics (SIS…
[12] Sz. Lefkovits, L. Lefkovits, Distance based k-NN Classification of Gabor Jet Local Descriptors, International Conference Interdisciplinarity in Engineering, INTER-ENG, Procedia Technology.
[13] Sz. Lefkovits, Improvements on Gabor Descriptor Retrieval for Patch Detection, Journal of Computing and Informatics.
[14] S.A. Lei, Feature Selection Method Based on Information Gain and Genetic Algorithm, International Conference on Computer Science and Electronics Engineering (ICCSEE), Hangzhou.
[15] B. Leibe, Interleaved Object Categorization and Segmentation, PhD thesis.
[16] D.G. Lowe, Distinctive image features from scale-invariant keypoints, International Journal of Computer Vision.
[17] P.E. Meyer, C. Schretter, G. Bontempi, Information-theoretic feature selection in microarray data using variable complementarity, IEEE Journal of Selected Topics in Signal Processing.
[18] H. Peng, F. Long, C. Ding, Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[19] C. Shang, M. Li, S. Feng, Q. Jiang, J. Fan, Feature selection via maximizing global information gain for text classification, Knowledge-Based Systems.
[20] L. Shen, L. Bai, D. Bardsley, …
[21] X. Shen, Z. Lin, J. Brandt, …