Face Recognition using Fourier Descriptor and

4 downloads 0 Views 188KB Size Report
cutting edge technologies in commercial, law enforcement ... iv) Neurons in third layer=350 .... [2] Gonzalez and Woods, “Digital Image Processing”,2nd edition.
Face Recognition using Fourier Descriptor and FFNN Sangeeta Kakarwal1, Mahananda Malkauthekar2, Shubhangi Sapkal3, Dr. Ratnadeep Deshmukh4

4.

1. P.E.S. College of Engineering, Aurangabad. 2. M.I.T. College of Engineering, Aurangabad 3. Govt. College of Engg., Karad. Dr. Babasaheb Ambedkar Marathwada University, Aurangabad.

Abstract-We present in this paper, Fourier descriptor and feedforward neural network for face recognition. Analysis is done for various numbers of iterations. Comparison shows that faces are recognized with FFNN more accurately with 50000 iterations. For experiment, FERET database is used.

Keywords: Fouurier Descriptor, FFNN. I. INTRODUCTION Face recognition involves computer recognition of personal identity based on geometric or statistical features derived from face images. Even though humans can detect and identify faces with little or no effort, building an automated system that accomplishes such objective is, however very challenging. The challenges are even more profound when one considers the large variations in the visual stimulus due to illumination conditions, viewing directions or poses, facial expression, aging, and disguises such as facial hair, glasses or cosmetics. Face recognition research provides the cutting edge technologies in commercial, law enforcement, and military applications. An automated vision system that performs the functions of face detection, verification and recognition will find countless unobtrusive applications, such as airport security, and access control, building surveillance and monitoring, human-computer intelligent interaction and perceptual interfaces[1].

II. FOURIER DESCRIPTOR[2] Location and description of substantial variations in the first derivative of object boundaries often yield suitable information. A typical mathematical technique is shape description based on the Fourier transform[4]. We are dealing with two-dimensional images, but our world is three-dimensional and the same objects, if seen from different angles, may form very different 2D projections. If x(k)=xk andy(k)=yk are the coordinates expressed notation for the boundary of an image can be represented as the sequence of coordinates

s(k)=[x(k),y(k)], for k=0,1,2….,k-1. Each coordinates pair can be treated as a complex number so that s(k)=x(k)+jy(k). The discrete fourier transform (DFT) of s(k) is a(u)=∑s(k)e-j2∏uk/K for k= 0 to k-1 and u=0,1,2,…..,k-1. The complex coefficients a(u) are called the Fourier descriptors of the boundary. The inverse fourier transform of these coefficients restores s(k) sk(u)=∑a(u)ej2∏uk/K Suppose, instead of all the fourier coefficients, only the first P coefficients are used. This is equivalent to setting a(u)=0 for u>p-1.the result is the following approximation to s(k): si(k)=∑ a(u)e j2∏uk/k for k=0,1,,k-1 and for u=0 to p-1. Although only P terms are used to obtain each component of s(k), k still ranges for 0 to k-1.

III. FEEDFORWARD NEURAL NETWORK A feedforward artificial neural network consists of layers of processing units, each layer feeding input to the next layer in a Feed forward manner through a set of connection weights or strengths. The weights are adjusted using the back propagation learning law. The patterns have to be applied for several training cycles to obtain the output error to an acceptable low value. Once the network is trained, it can be used to recall the appropriate pattern for a new input pattern. The computation for recall is straightforward, in the sense that the weights and the output functions of the units in different layers are used to compute the activation values and the output signals. The signals from the output layer correspond to the output. Back propagation learning emerged as the most significant result in the field of neural networks. The back propagation learning involves propagation of the error backwards from the input training pattern, is determined by computing the outputs of units for each hidden layer in the forward pass of the input data. The error in the output is propagated backwards only to determine the weight updates [3].

In Fourier descriptor method, images are first converted to binary images. Using labeling, all regions in the image are found and take boundaries of each region in the image. These boundary values are used to find fourier descriptors, which are imaginary values in the form of (x + iy). Inverse Fourier descriptors using different values of Fourier descriptors such as 100%, 50%, 30% and 20% are obtained and analyzed the recognition results with different values. Instead of using FD images, mean values of each row of the image are used for classification. p_mean = mean(vi) where, vi is the ith vector in the image. V. RESULT ANALYSIS 1. FFNN is trained for front face images with 100%, 50%. 30%, 20%, 10% FD (350 images of 70 persons, 70*5=350) and following network architecture is used: i) Number of layers=3 ii) Neurons in first layer=48 iii) Neurons in second layer=192 iv) Neurons in third layer=350 v) Transfer function for all layers is ‘tansig’. Test results for different face images and different percentages of reconstructed images with Epochs=50000 are shown in Table1 and Graph1. Table 1: Recognition rate by 3-layer (48-192-350) FFNN (50000 epochs) with all (100%, 50%, 30%, 20%, 10%) FD values Quar Quar Half No. of Fron Alte % of right ter rnate ter images for t FD right left expr face used Tra Te essio s for ini sti n testing ng ng 100 350 70 98.5 62.8 32.8 27.1 15.7 50 350 70 100 62.8 25.7 24.2 12.8 30 350 70 100 65.7 28.5 25.7 11.4 20 350 70 100 64.2 21.4 28.5 10.0 10 350 70 100 58.5 24.2 21.4 7.14

Classification by FFNN with all (100%, 50%, 30%, 20%, 10%) FD

2. FFNN is trained front face images with 100%, 50%. 30%, 20%, 10% FD (350 images of 70 persons, 70*5=350) and following network architecture is used: i) Number of layers=3, ii) Neurons in first layer=48 iii) Neurons in second layer=240 iv) Neurons in third layer=350 v) Transfer function for all layers is ‘tansig’ Test results for different face images and different percentages of reconstructed images with Epochs=60000 are shown in Graph2 and Table2. Table 2: Recognition rate by 3-layer (48-240-350) FFNN (60000 epochs) with all (100%, 50%, 30%, 20%, 10%) FD values % of No. of Fron Alterna Quar Quar Half right ter FD images for t te ter right used express left Trai Testi face for s ion ning ng testing 100 350 70 100 64.2 31.4 21.4 8.5 50 350 70 100 64.2 27.1 20.0 11.4 30 350 70 100 61.4 22.8 21.4 11.4 20 350 70 100 61.4 27.1 11.4 7.1 10 350 70 100 52.8 27.1 12.8 7.1

Classification by FFNN w ith all (100%, 50%, 30%, 20%, 10%) FD 120 100 Recognition rate

IV. ANALYSIS OF IMAGE RECOGNITION WITH FD

Front faces

80

Alternate expression

60

Quarter left Quarter right

40

Half right

20 0 100

50

30

20

10

% of FD

Graph2: Recognition rate by 3-layer (48-240-350) FFNN (60000 epochs) with all (100%, 50%, 30%, 20%, 10%) FD values

3) Comparison between recognition rate with two architectures given below of FFNN and using all (100%, 50%, 30%, 20%, 10%) FDs are shown in Table3 and Graph3: i) 3-layer (48-192-350) FFNN, epochs=50000 ii) 3-layer (48-240-350) FFNN, epochs=60000

120

Recognition rate

100 Front faces

80

Alternate expression 60

Quarter left Quarter right

40

Half right

20 0 100

50

30

20

10

% of FD

Graph1: Recognition rate by 3-layer (48-192-350) FFNN (5000 epochs) with all (100% 50% 30% 20% 10%) FD values.

Table 3: Comparison between recognition rate by two FFNNs Layers of Front Alternate Quarter Quarter FFNN faces expression left right 48-192-350 99.71 62.85 26.56 25.42 (50000 epochs) 48-240-350 100 60.85 21.71 17.42 (60000 epochs)

Half right 11.42 9.13

Graph3 shows the difference between recognition rate of two network architectures. When number of neurons in the hidden layer are increased above some limit, the result is inverse. Generally, up

to some limit, increasing number of neurons in hidden layers and/or number of iterations improves recognition rate.

Recognition rate

120

[11]

[12]

100 48 - 19 2 -3 5 0 ( 50 00 0 ep o c h s )

80 60

48 - 24 0 -3 5 0 ( 60 00 0 ep o c h s )

40

[13]

20

rig ht al f H

Q ua rte rl ef t Q ua rte rr ig ht

Fr on Al tf te ac rn at es e ex pr es si on

0

[14] [15] [16]

Ty p es o f f a c e s

Graph 3: Recognition rate by two FFNNs

VI.

CONCLUSIONS

FFNN is a multilayer Neural Network, which uses back propagation for learning. From the obtained results, it can be said that, three layers are sufficient for classification. As number of layers increased more than three, it does not improve the recognition rate, but instead of that, if number of epochs and neurons in hidden layers are increased upto some limit, it improves the recognition rate. Reconstructed images are obtained which reduces dimensions of database. Instead of training network for one image of each person with some reduced dimensions, images with different dimensionality reduction (i.e. more than one feature vectors of each person) gives recognition rate more than others. Here, 5 feature vectors for each person with different dimensions i.e.100%, 50%, 30% ,20%, 10% dimensions, (70*5=350 feature vectors for 70 persons) are used.

REFERENCES [1]

Kevin Bowyer, “Face Recognition Technology: Security versus Privacy”, IEEE Technology And Society Magazine | SPRING 2004 [2] Gonzalez and Woods, “Digital Image Processing”,2nd edition [3] B. Yegnanarayana, “Artificial Neural Networks” [4] Milan Sonka, Vacloc Hlavac-Ronger,Boyle, “Image Processing, analysis and Machine Vision” [5] Steve, C. Giles, Ah Tsoi, Andrew, “Face recognition: A hybrid neural network approach” [6] Gonzalez, Woods and Eddins, “Digital Image Processing using MATLAB” [7] Gonzalez and Woods, “Digital Image Processing”,2nd edition [8] Raphael Feraud, “A Fast and Accurate Face Detector Based on Neural Networks”, IEEE on Pattern Analysis and Machine Intelligence, Vol. 23, No. 1, Jan 2001. [9] Elzbieta Pekalska, “Classifiers for dissimilarity-based pattern recognition”, International Conference on Pattern Recognition (ICPR’00), 2000 IEEE [10] Terrillon, McReynolds, Sadek, “Invariant NeuralNetwork based Face Detection with Orthogonal FourierMellin Moments”, Proceedings of the International

[17]

Conference on Pattern Recognition (ICPR’00), 2000 IEEE Blanz and Vetter, “Face Recognition Based on Fitting a 3D Morphable Model”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 25, No. 9, September 2003 Haiyuan WU, Ryutaro KIDO and Shioyama, “Improvement of Continuous Dynamic Programming for Human Gesture Recognition”, Proceedings of the International Conference on Pattern Recognition (ICPR’00), 2000 IEEE O. Sukmarg, “Combined Shape Descriptors of Face Region and Contour of Human Object in MPEG Compressed Domain” Website: www.icgst.com Website: http://www.statsoft.com/textbook/glosn.html Website: http://www.cs.ut.ee/~toomas_1/linalg/lin1/node16.html Website: http://rkb.home.cern.ch/rkb/AN16pp/node215.html