Deep Neural Networks Based Recognition of Plant

0 downloads 0 Views 4MB Size Report
Jun 23, 2016 - cation was developed in C++ using the OpenCV library [36], with possibility of .... in Graphics Processing Unit (GPU) mode. Every training.
See discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/304308800

Deep Neural Networks Based Recognition of Plant Diseases by Leaf Image Classification Article in Computational Intelligence and Neuroscience · June 2016 DOI: 10.1155/2016/3289801

CITATIONS

READS

30

623

5 authors, including: Srdjan Sladojevic

Andras Anderla

University of Novi Sad

University of Novi Sad

20 PUBLICATIONS 36 CITATIONS

14 PUBLICATIONS 42 CITATIONS

SEE PROFILE

SEE PROFILE

Darko Stefanović University of Novi Sad 16 PUBLICATIONS 50 CITATIONS SEE PROFILE

All content following this page was uploaded by Andras Anderla on 23 June 2016. The user has requested enhancement of the downloaded file.

Hindawi Publishing Corporation Computational Intelligence and Neuroscience Volume 2016, Article ID 3289801, 11 pages http://dx.doi.org/10.1155/2016/3289801

Research Article Deep Neural Networks Based Recognition of Plant Diseases by Leaf Image Classification Srdjan Sladojevic,1 Marko Arsenovic,1 Andras Anderla,1 Dubravko Culibrk,2 and Darko Stefanovic1 1

Department of Industrial Engineering and Management, Faculty of Technical Sciences, University of Novi Sad, Trg Dositeja Obradovica 6, 21000 Novi Sad, Serbia 2 Department of Information Engineering and Computer Science, University of Trento, Via Sommarive 9, Povo, 38123 Trento, Italy Correspondence should be addressed to Andras Anderla; [email protected] Received 9 February 2016; Revised 12 May 2016; Accepted 29 May 2016 Academic Editor: Marc Van Hulle Copyright © 2016 Srdjan Sladojevic et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The latest generation of convolutional neural networks (CNNs) has achieved impressive results in the field of image classification. This paper is concerned with a new approach to the development of plant disease recognition model, based on leaf image classification, by the use of deep convolutional networks. Novel way of training and the methodology used facilitate a quick and easy system implementation in practice. The developed model is able to recognize 13 different types of plant diseases out of healthy leaves, with the ability to distinguish plant leaves from their surroundings. According to our knowledge, this method for plant disease recognition has been proposed for the first time. All essential steps required for implementing this disease recognition model are fully described throughout the paper, starting from gathering images in order to create a database, assessed by agricultural experts. Caffe, a deep learning framework developed by Berkley Vision and Learning Centre, was used to perform the deep CNN training. The experimental results on the developed model achieved precision between 91% and 98%, for separate class tests, on average 96.3%.

1. Introduction The problem of efficient plant disease protection is closely related to the problems of sustainable agriculture and climate change [1]. Research results indicate that climate change can alter stages and rates of pathogen development; it can also modify host resistance, which leads to physiological changes of host-pathogen interactions [2, 3]. The situation is further complicated by the fact that, today, diseases are transferred globally more easily than ever before. New diseases can occur in places where they were previously unidentified and, inherently, where there is no local expertise to combat them [4–6]. Inexperienced pesticide usage can cause the development of long-term resistance of the pathogens, severely reducing the ability to fight back. Timely and accurate diagnosis of plant diseases is one of the pillars of precision agriculture [7]. It is crucial to prevent unnecessary waste of financial and

other resources, thus achieving healthier production, by addressing the long-term pathogen resistance development problem and mitigating the negative effects of climate change. In this changing environment, appropriate and timely disease identification including early prevention has never been more important. There are several ways to detect plant pathologies. Some diseases do not have any visible symptoms, or the effect becomes noticeable too late to act, and in those situations, a sophisticated analysis is obligatory. However, most diseases generate some kind of manifestation in the visible spectrum, so the naked eye examination of a trained professional is the prime technique adopted in practice for plant disease detection. In order to achieve accurate plant disease diagnostics a plant pathologist should possess good observation skills so that one can identify characteristic symptoms [8]. Variations in symptoms indicated by diseased plants may lead to an improper diagnosis since amateur gardeners and hobbyists could have more difficulties determining it

2

Computational Intelligence and Neuroscience

than a professional plant pathologist. An automated system designed to help identify plant diseases by the plant’s appearance and visual symptoms could be of great help to amateurs in the gardening process and also trained professionals as a verification system in disease diagnostics. Advances in computer vision present an opportunity to expand and enhance the practice of precise plant protection and extend the market of computer vision applications in the field of precision agriculture. Exploiting common digital image processing techniques such as colour analysis and thresholding [9] were used with the aim of detection and classification of plant diseases. Various different approaches are currently used for detecting plant diseases and most common are artificial neural networks (ANNs) [10] and Support Vector Machines (SVMs) [11]. They are combined with different methods of image preprocessing in favour of better feature extraction. In machine learning and cognitive science, ANN is an information-processing paradigm that was inspired by the way biological nervous systems, such as the brain, process information. The brain is composed of a large number of highly interconnected neurons working together to solve specific problems. An artificial neuron is a processing element with many inputs and one output. Although artificial neurons can have many outputs, only those with exactly one output will be considered. Their inputs can also take on any value between 0 and 1. Also, the neuron has weights for each input and an overall bias. The weights are real numbers expressing importance of the respective inputs to the output. The bias is used for controlling how easy the neuron is getting to output 1. For a neuron with really big bias it is easy to output 1, but when the bias is very negative then it is difficult to output 1. The output of the neuron is not 0 or 1. Instead, it is 𝛼 ⋅ (𝑤 ⋅ 𝑥 + 𝑏), where 𝛼 is called the transfer function. There are different types of transfer function: step, linear, sigmoid, and so forth. The smoothness of 𝛼 means that small changes Δ𝑤𝑗 in the weights and Δ𝑏 in the bias will produce small change Δoutput in the output from the neuron. Small output change is approximated by Δoutput ≈ ∑

𝜕output 𝜕output Δ𝑤𝑗 + ∑ Δ𝑏. 𝜕𝑤𝑗 𝜕𝑏

(1)

Basically, the small change in weight or bias causes the small corresponding change in the network output (Figure 1). Neural networks, with their outstanding ability to derive meaning from complex or imperfect data, can be applied for extracting patterns and detecting trends that are too difficult to notice by humans or computer techniques. Other advantages of ANNs are adaptive learning, self-organization, real time operations, and so forth. There are two main categories of ANNs when speaking about architecture: feed-forward ANNs where the output of any layer is unlikely to influence that same layer and feedback ANNs where signals travel in both directions by involving loops in the network.

w + Δw

Output + Δoutput Output

Input

Hidden

Figure 1: Simple model of ANN.

The method described in this paper is a new approach in detecting plant diseases using the deep convolutional neural network trained and fine-tuned to fit accurately to the database of a plant’s leaves that was gathered independently for diverse plant diseases. The advance and novelty of the developed model lie in its simplicity; healthy leaves and background images are in line with other classes, enabling the model to distinguish between diseased leaves and healthy ones or from the environment by using deep CNN. The rest of the paper is organized as follows: Section 2 presents related work, Section 3 presents methodology, Section 4 presents achieved results and related discussion, and finally, Section 5 holds our conclusions.

2. Related Work Implementing the appropriate management strategies like fungicide applications, disease-specific chemical applications, and vector control through pesticide applications could lead to early information on crop health and disease detection. This could facilitate the control of diseases and improve productivity. In [12], authors present, review, and recognize the demand for developing a rapid, cost-effective, and reliable health-monitoring sensor that facilitates advancements in agriculture. They described the currently used technologies that include spectroscopic and imaging-based and volatile profiling-based plant disease detection methods for the purpose of developing ground-based sensor system to assist in monitoring health and diseases in plants under field conditions. After analysis of their work and analysis presented by the authors of [13–16], it was decided to use image processing disease recognition approach among other approaches commonly used for plant disease diagnostics, for instance, double-stranded ribonucleic acid (RNA) analysis, nucleic acid probes, and microscopy. Numerous procedures are currently in use for plant disease detection applying computer vision. One of them is disease detection by extracting colour feature as authors in [17] have presented. In this paper YcbCr, HSI, and CIELB colour models were used in the study; as a result, disease spots

Computational Intelligence and Neuroscience were successfully detected and remained unaffected by the noise from different sources, such as camera flash. In addition, plant disease detection could be achieved by extracting shape features method. Patil and Bodhe applied this technique for disease detection in sugarcane leaves where they have used threshold segmentation to determine leaf area and triangle threshold for lesioning area, getting the average accuracy of 98.60% at the final experiments [18]. Furthermore, extracting texture feature could be used in detecting plant diseases. Patil and Kumar proposed a model for plant disease detection using texture features such as inertia, homogeneity, and correlation obtained by calculating the gray level cooccurrence matrix on image [19]. Combined with colour extraction, they experimented on detecting diseases on maize leaves. Combination of all these features provides a robust feature set for image improvement and better classification. In [20], the authors have presented a survey of well-known conventional methods of feature extraction. Due to the rapid progress of Artificial Intelligence (AI) science, work in this paper is mainly focused on applying these methodologies and techniques. There are some approaches which apply the feed-forward back propagation of neural networks consisting of one input, one output, and one hidden layer for the needs of identifying the species of leaf, pest, or disease; this model was proposed by the authors in [21]. They developed a software model, to suggest remedial measures for pest or disease management in agricultural crops. Another technique proposed by the authors in [22] incorporates the features extracted by Particle Swarm Optimization (PSO) [23] and forward neural network in direction of determining the injured leaf spot of cotton and improving the accuracy of the system with the final overall accuracy of 95%. Also, detection and differentiation of plant diseases can be achieved using Support Vector Machine algorithms. This technique was implemented for sugar beet diseases and presented in [24], where, depending on the type and stage of disease, the classification accuracy was between 65% and 90%. Likewise, there are methods that combine the feature extraction and Neural Network Ensemble (NNE) for plant disease recognition. Through training a definite number of neural networks and combining their results after that, NNE offers a better generalization of learning ability [25]. Such method was implemented only for recognizing tea leaf diseases with final testing accuracy of 91% [26]. Another approach based on leaf images and using ANNs as a technique for an automatic detection and classification of plant diseases was used in conjunction with 𝐾-means as a clustering procedure proposed by the authors in [27]. ANN consisted of 10 hidden layers. The number of outputs was 6 which was the number of classes representing five diseases along with the case of a healthy leaf. On average, the accuracy of classification using this approach was 94.67%. The authors in [28–31] presented the deep learning methods for solving most complex tasks in different areas of

3 research in biology, bioinformatics, biomedicine, robotics, and 3D technologies. In our study, we exploit the deep learning method for plant disease recognition, driven by evolvement of deep learning techniques and their application in practice. Extensive search of the state-of-the-art literature yielded no evidence that researchers explored deep learning approach for plant diseases recognition from the leaf images. Our method of recognition by applying deep CNN is presented in the sections below.

3. Materials and Methods The entire procedure of developing the model for plant disease recognition using deep CNN is described further in detail. The complete process is divided into several necessary stages in subsections below, starting with gathering images for classification process using deep neural networks. 3.1. Dataset. Appropriate datasets are required at all stages of object recognition research, starting from training phase to evaluating the performance of recognition algorithms. All the images collected for the dataset were downloaded from the Internet, searched by disease and plant name on various sources in different languages, such as Latin, English, German, Serbian, and Hungarian. Images in the dataset were grouped into fifteen different classes. Thirteen classes represented plant diseases which could be visually determined from leaves. In order to distinguish healthy leaves from diseased ones, one more class was added in the dataset. It contains only images of healthy leaves. An extra class in the dataset with background images was beneficial to get more accurate classification. Thus, deep neural network could be trained to differentiate the leaves from the surrounding. The background images were taken from the Stanford background dataset [32]. In this stage, all duplicated images taken from different sources were removed by developed python script applying the comparing procedure. The script removed the duplicates by comparing the images’ metadata: name, size, and the date. After the automated removal, images were assessed by human experts in much iteration. Next step was to enrich the dataset with augmented images. The main goal of the presented study is to train the network to learn the features that distinguish one class from the others. Therefore, when using more augmented images, the chance for the network to learn the appropriate features has been increased. Finally, a database containing 30880 images for training and 2589 images for validation has been created. The augmentation process is described in Section 3.3. Table 1 shows all supported diseases together with the number of original images and number of augmented images for every class used as training and validation dataset for the disease classification model. 3.2. Image Preprocessing and Labelling. Images downloaded from the Internet were in various formats along with different

4

Computational Intelligence and Neuroscience Table 1: Dataset for image classification of leaf disease.

Class (1) Healthy leaf (2) Pear, cherry, and peach, porosity (3) Peach, powdery mildew (4) Peach, Taphrina deformans (5) Apple, pear, Erwinia amylovora (6) Apple, pear, Venturia (7) Apple, powdery mildew (8) Apple, Rust (9) Pair, Gymnosporangium sabinae (10) Pair, gray leaf spot (11) Grapevine, wilt (12) Grapevine, mites (13) Grapevine, powdery mildew (14) Grapevine, downy mildew (15) Background images

Number of original images

Total number of images: original and augmented

Number of images from the dataset used for validation

565 265 108 152 232 183 120 163 267 122 287 250 237 297 1235

4523 2124 1296 1552 2368 2200 1440 1960 2142 1464 2300 2000 1900 2376 1235

331 152 90 156 205 151 118 163 185 198 114 230 183 201 112

4483

30880

2589

resolutions and quality. In order to get better feature extraction, final images intended to be used as dataset for deep neural network classifier were preprocessed in order to gain consistency. Furthermore, procedure of image preprocessing involved cropping of all the images manually, making the square around the leaves, in order to highlight the region of interest (plant leaves). During the phase of collecting the images for the dataset, images with smaller resolution and dimension less than 500 px were not considered as valid images for the dataset. In addition, only the images where the region of interest was in higher resolution were marked as eligible candidates for the dataset. In that way, it was ensured that images contain all the needed information for feature learning. Images used for the dataset were image resized to 256 × 256 to reduce the time of training, which was automatically computed by written script in Python, using the OpenCV framework [33]. Many resources can be found by searching across the Internet, but their relevance is often unreliable. In the interest of confirming the accuracy of classes in the dataset, initially grouped by a keywords search, agricultural experts examined leaf images and labelled all the images with appropriate disease acronym. As it is known, it is important to use accurately classified images for the training and validation dataset. Only in that way may an appropriate and reliable detecting model be developed. In this stage, duplicated images that were left after the initial iteration of gathering and grouping images into classes described in Section 3.1 were removed from the dataset. 3.3. Augmentation Process. The main purpose of applying augmentation is to increase the dataset and introduce slight distortion to the images which helps in reducing overfitting during the training stage. In machine learning, as well as

in statistics, overfitting appears when a statistical model describes random noise or error rather than underlying relationship [34]. The image augmentation contained one of several transformation techniques including affine transformation, perspective transformation, and simple image rotations. Affine transformations were applied to express translations and rotations (linear transformations and vector addition, resp.) [35] where all parallel lines in the original image are still parallel in the output image. To find a transformation matrix, three points from the original image were needed as well as their corresponding locations in the output image. For perspective transformation, a 3×3 transformation matrix was required. Straight lines would remain straight even after the transformation. For the augmentation process, simple image rotations were applied, as well as rotations on the different axis by various degrees. Transformations applied in augmentation process are illustrated in Figure 2, where the first row represents resulting images obtained by applying affine transformation on the single image; the second row represents images obtained from perspective transformation against the input image and the last row visualizes the simple rotation of the input image. The process of augmentation was chosen to fit the needs; the leaves in a natural environment could vary in visual perspective. For this stage, in order to automate the augmentation process for numerous images from the dataset, particular application was developed in C++ using the OpenCV library [36], with possibility of changing the parameters of transformation during the run-time, which improves flexibility. 3.4. Neural Network Training. Training the deep convolutional neural network for making an image classification model from a dataset described in Section 3.1 was proposed.

Computational Intelligence and Neuroscience

5

Affine transformation

(a)

Perspective transformation

(b)

Rotation

(c)

Figure 2: Image transformations used for augmentation: (a) affine transformations; (b) perspective transformations; (c) rotations.

There are several well-known state-of-the-art deep learning frameworks, such as Python library Theano [37] and machine learning library that extends Lua, Torch7 [38]. In addition, there is Caffe, an open source deep learning framework developed by the BVLC [39] containing reference pretrained CaffeNet model. For the purpose of this research, this framework was used, along with the set of weights learned on a very large dataset, ImageNet [40]. Caffe framework is suitable for both research experiments and industry deployment. The core of framework is developed in C++ and provides command line, Python, and MATLAB interfaces. Caffe’s integration with cuDNN library

accelerates Caffe models [41, 42]. CaffeNet is a deep CNN which has multiple layers that progressively compute features from input images [43]. Specifically, the network contains eight learning layers and five convolutional and three fully connected layers [44]. CaffeNet architecture is considered a starting point, but modified and adjusted to support our 15 categories (classes). Last layer was altered and the output of the softmax layer was parameterized to the requirements of presented study. The convolutional layer is the essential building block of the convolutional neural network. The layer’s parameters are comprised of a set of learnable kernels which possess a small

6

Computational Intelligence and Neuroscience

receptive field but extend through the full depth of the input volume [45]. Each convolutional layer has 𝑀 maps of equal size, 𝑀𝑥 and 𝑀𝑦 , and a kernel of size 𝐾𝑥 , and 𝐾𝑦 is shifted over the certain region of the input image. The skipping factors 𝑆𝑥 and 𝑆𝑦 define how many pixels the filter/kernel skips in 𝑥- and 𝑦direction between subsequent convolutions [46]. The size of the output map could be defined as 𝑀𝑥𝑛 = 𝑀𝑦𝑛

=

𝑀𝑥𝑛−1 − 𝐾𝑥𝑛 + 1, 𝑆𝑥𝑛 + 1 𝑀𝑦𝑛−1 − 𝐾𝑦𝑛 𝑆𝑦𝑛 + 1

(2)

architecture [50]. Bayesian optimization experiments also proved that ReLUs and dropout have synergy effects, which means that it is advantageous when they are used together [51]. The advance of CNNs refer to their ability to learn rich mid-level image representations as opposed to handdesigned low-level features used in other image classification methods [52]. Figure 4 illustrates the filtered output images after every convolutional and pooling layer of the deep network. Output images are labelled with the name of corresponding layer at the bottom right corner of every image.

+ 1, 𝑛

where 𝑛 indicates the layer. Each map in layer 𝐿 is connected to most 𝑀𝑛−1 maps in layer 𝐿𝑛−1 . Rectified Linear Units (ReLU) are used as substitute for saturating nonlinearities. This activation function adaptively learns the parameters of rectifiers and improves accuracy at negligible extra computational cost [47]. It is defined as 𝑓 (𝑧𝑖 ) = max (0, 𝑧𝑖 ) ,

(3)

where 𝑧𝑖 represents the input of the nonlinear activation function 𝑓 on the 𝑖th channel. Deep CNN with ReLUs trains several times faster. This method is applied to the output of every convolutional and fully connected layer. Despite the output, the input normalization is not required; it is applied after ReLU nonlinearity after the first and second convolutional layer because it reduces top-1 and top-5 error rates. In CNN, neurons within a hidden layer are segmented into “feature maps.” The neurons within a feature map share the same weight and bias. The neurons within the feature map search for the same feature. These neurons are unique since they are connected to different neurons in the lower layer. So for the first hidden layer, neurons within a feature map will be connected to different regions of the input image. The hidden layer is segmented into feature maps where each neuron in a feature map looks for the same feature but at different positions of the input image. Basically, the feature map is the result of applying convolution across an image. Each layer’s features are displayed in a different block, where visualization represents the strongest activation for the provided feature map, starting from the first convolutional layer, where features go from individual pixels to simple lines, to the fifth convolutional layer where learned features like shapes and certain parts of leaves are displayed (Figure 3). Another important layer of CNNs is the pooling layer, which is a form of nonlinear downsampling. Pooling operation gives the form of translation invariance [48]; it operates independently on every depth slice of the input and resizes it spatially. Overlapping pooling is beneficially applied to lessen overfitting. Also in favour of reducing overfitting, a dropout layer [49] is used in the first two fully connected layers. But the shortcoming of dropout is that it increases training time 2-3 times comparing to a standard neural network of the exact

3.5. Performed Tests. The common approach in measuring performance of artificial neural networks is splitting data into the training set and the test set and then training a neural network on the training set and using the test set for prediction. Thus, since the original outcomes for the testing set and our model predicted outcomes are known, the accuracy of our prediction can be calculated. Different tests were performed with 2589 original images, when trained with 30880 images from database. For the accuracy test, 10-fold cross validation technique was used to evaluate a predictive model. The cross validation procedure was repeated after every thousand training iteration. Overall estimated result of the test is graphically represented as top-1, to test if the top class (the one having the highest probability) is the same as the target label. The top-5 error rate is there to test if the target label is one of the top 5 predictions, the ones with 5 of the highest probabilities. The number of images used for the validation test from each labelled class is given in Table 1. Test results are presented in Section 4, for both complete dataset and each class separately. 3.6. Fine-Tuning. Fine-tuning seeks to increase the effectiveness or efficiency of a process or function by making small modifications to improve or optimize the outcome. The classification function in the original CaffeNet model is softmax classifier that computes probability of 1,000 classes of the ImageNet dataset. Fine-tuned learning experiments require a bit of learning, but they are still much faster than learning from scratch [53]. To start the fine-tuning procedure, this softmax classifier was removed, as mentioned and illustrated in Section 3.4 and the new one was initialized with random values. The new softmax classifier was trained from scratch using the back-propagation algorithm with data from the dataset described in Section 3.1. This dataset has 15 different categories [43]. Due to the smaller size of the dataset used for this research when compared to ImageNet, ILSVRC-2012 [54], overfitting was constrained by using lower initial learning rates for the fine-tuned hidden layers [55]. The learning rate of the top layer was set to 10, while the learning rate of all the other seven learning layers was 0.1. The back-propagation algorithm ran for 100,000 iterations. The process of finetuning was repeated changing parameters of hidden layers and hyperparameters. The best suited model for plant disease detection was achieved through the process of experimental

Computational Intelligence and Neuroscience

7

0

0 50

0

0

20

50

50

40

100

100

60

150

150

80

0

50

100

150

20

40

60

80 100

0

50 100 150 200 250 300

(b)

0 50 100 150 50

250

300

(a)

0

200

250 0

200

150

200

100

200

100

100 (e)

150

0

0

50

50

100

100

150

150

200

200

250

250 0

0

50 100 150 200 250

(c)

50 100 150 200 250

(d)

0 50 100 150 200 0

0

50 100 150 200 250

(f)

(g)

50

100

150

200

(h)

Figure 3: Visualization of features in trained classification model: (a) original image; (b) the first layer filters, Conv1; (c) the first layer output, Conv1 rectified responses of the filters, first 36 only; (d) the second layer filters, Conv2; (e) the second layer output, Conv2 (rectified, only the first 36 of 256 channels); (f) the third layer output, Conv3 (rectified, all 384 channels); (g) the fourth layer output, Conv4 (rectified, all 384 channels); (h) the fifth layer output, Conv5 (rectified, all 256 channels).

Conv1

Pool1

Norm1

Conv2

Pool2

Norm1

Conv3

Conv4

Conv5

Pool5

Figure 4: Output layers images.

adjustment of the parameters. The results of the model finetuning are presented and explained further in Section 4. 3.7. Equipment. A single PC was used for the entire process of training and testing the plant disease detection model described in this paper. Training of the CNN was performed in Graphics Processing Unit (GPU) mode. Every training iteration took approximately eight hours on this specified machine whose basic characteristics are presented in Table 2.

4. Results and Discussion The results presented in this section are related to training with the whole database containing both original and augmented images. As it is known that convolutional networks are able to learn features when trained on larger datasets, results achieved when trained with only original images will not be explored. After fine-tuning the parameters of the network, an overall accuracy of 96.3% was achieved, after the 100th training

8

Computational Intelligence and Neuroscience Table 2: Basic machine characteristics.

Accuracy and loss through the iterations

1.0

3.0

0.8

Loss (%)

2.5 2.0

0.6

1.5

0.4

1.0

0.2

0.5 0.0

0

20000

40000 60000 Iterations

80000

0.8 0.6 0.4 0.2 0.0

0.0 100000

Training log loss Validation accuracy

Figure 5: Accuracy of the fine-tuned CNN.

iteration (95.8% without fine-tuning). Even after the 30th training iteration high accuracy results were achieved with exceedingly reduced loss, but after the 60th iteration, the balance in accuracy and loss was carried out in high accuracy. The green line in the graph in Figure 5 shows the network’s success on the validation test set, through training iterations. After every 10 thousand training iterations, the snapshot of the model was obtained. The blue line in the graph represents the loss during the training stage. Through training iterations, loss was rapidly reduced. Top-1 success was 96.3% and top-5 success was 99.99% after 100,000 iterations which are shown in Figures 6 and 7, respectively. Furthermore, the trained model was tested on each class individually. Test was performed on every image from the validation set. The results are displayed to emphasize how many images from total of each class are accurately predicted. Figure 8 illustrates trained model’s prediction results separated for every class. The class numbers follow enumeration from Table 1. From the results displayed in Figure 8, it is notable that the trained model’s accuracy was slightly less for classes with lower number of images in the training dataset, more specifically classes peach, powdery mildew, apple, powdery mildew, and grapevine, wilt. Achieved accuracy was in range from 91.11% for peach, powdery mildew, up to 98.21% for background images. High accuracy of model’s prediction of background images allows good separation of plants leaves and the surroundings. As suggested by good practice principles, achieved results should be compared with some other results. Taking into account the fact that during this research our own image

0

20000

40000 60000 Iterations

80000

100000

Figure 6: Top-1 accuracy success. Top-5 through the iterations 1.0 Top-5 (%) of success

3.5

Top-1 (%) of success

Characteristics 16 Gb Intel Core i7-4790 CPU @ 3.60 GHz ×8 GeForce GTX TITAN X 12 Gb Linux Ubuntu 14.04 64 bits

Validation accuracy (%)

Hardware and software (1) Memory (2) Processor (3) Graphics (4) Operating system

Top-1 through the iterations

1.0

0.9 0.8 0.7 0.6 0.5 0

20000

40000 60000 Iterations

80000

100000

Figure 7: Top-5 accuracy success.

database was developed, no one has used it up to now. In addition, since no one has used deep learning to identify plant diseases in scientific literature, it is impossible to compare it with other examples. Nonetheless, as a result of extensive review, deep learning techniques have showed better results in pattern recognition, in the image segmentation and object detection. This is also proven in practice by numerous competitions won by convolutional neural networks [56]. Presently, there is a commercial solution, Leafsnap [57], which uses visual recognition in order to identify tree species from their leaves’ images but as the network presented in this paper is classifying the plant diseases instead of types of plant, Leafsnap was not used for comparison of the achieved results. Finally, comparing our results with other methods of detecting diseases from leaves images, it can be said that our method provides better results [23, 24, 26, 27].

5. Conclusions There are many methods in automated or computer vision plant disease detection and classification process, but still, this research field is lacking. In addition, there are still no commercial solutions on the market, except those dealing with plant species recognition based on the leaves images. In this paper, a new approach of using deep learning method was explored in order to automatically classify and detect plant diseases from leaf images. The developed model was able to detect leaf presence and distinguish between healthy leaves and 13 different diseases, which can be visually

Computational Intelligence and Neuroscience

Number of images

350 300

9 the authors hope to achieve a valuable impact on sustainable development, affecting crop quality for future generations.

Testing accuracy 96.37%

250

96.09%

96.59%

200

96.05% 94.87%

150

97.3% 95.09%

94.7%

91.53%

100

91.92%

96.49%

91.11%

96.17%

97.01%

98.21%

50 0

1

2

3

4

5

6

7

8

9 10 11 12 13 14 15

Classes All images Accurately predicted

Figure 8: Prediction accuracy for each class separately.

diagnosed. The complete procedure was described, respectively, from collecting the images used for training and validation to image preprocessing and augmentation and finally the procedure of training the deep CNN and fine-tuning. Different tests were performed in order to check the performance of newly created model. New plant disease image database was created, containing more than 3,000 original images taken from the available Internet sources and extended to more than 30,000 using appropriate transformations. The experimental results achieved precision between 91% and 98%, for separate class tests. The final overall accuracy of the trained model was 96.3%. Fine-tuning has not shown significant changes in the overall accuracy, but augmentation process had greater influence to achieve respectable results. As the presented method has not been exploited, as far as we know, in the field of plant disease recognition, there was no comparison with related results, using the exact technique. In comparison with other techniques used and presented in Section 2, comparable or even better results were achieved, especially when taking into account the wider number of classes in the presented study. An extension of this study will be on gathering images for enriching the database and improving accuracy of the model using different techniques of fine-tuning and augmentation. The main goal for the future work will be developing a complete system consisting of server side components containing a trained model and an application for smart mobile devices with features such as displaying recognized diseases in fruits, vegetables, and other plants, based on leaf images captured by the mobile phone camera. This application will serve as an aid to farmers (regardless of the level of experience), enabling fast and efficient recognition of plant diseases and facilitating the decision-making process when it comes to the use of chemical pesticides. Furthermore, future work will involve spreading the usage of the model by training it for plant disease recognition on wider land areas, combining aerial photos of orchards and vineyards captured by drones and convolution neural networks for object detection. By extending this research,

Competing Interests The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments The research presented in this paper was supported by FP7 IRSES Project QoSTREAM.

References [1] K. A. Garrett, S. P. Dendy, E. E. Frank, M. N. Rouse, and S. E. Travers, “Climate change effects on plant disease: genomes to ecosystems,” Annual Review of Phytopathology, vol. 44, pp. 489– 509, 2006. [2] S. M. Coakley, H. Scherm, and S. Chakraborty, “Climate change and plant disease management,” Annual Review of Phytopathology, vol. 37, no. 1, pp. 399–426, 1999. [3] S. Chakraborty, A. V. Tiedemann, and P. S. Teng, “Climate change: potential impact on plant diseases,” Environmental Pollution, vol. 108, no. 3, pp. 317–326, 2000. [4] A. J. Tatem, D. J. Rogers, and S. I. Hay, “Global transport networks and infectious disease spread,” Advances in Parasitology, vol. 62, pp. 293–343, 2006. [5] J. R. Rohr, T. R. Raffel, J. M. Romansic, H. McCallum, and P. J. Hudson, “Evaluating the links between climate, disease spread, and amphibian declines,” Proceedings of the National Academy of Sciences of the United States of America, vol. 105, no. 45, pp. 17436–17441, 2008. [6] T. Van der Zwet, “Present worldwide distribution of fire blight,” in Proceedings of the 9th International Workshop on Fire Blight, vol. 590, Napier, New Zealand, October 2001. [7] S. A. Miller, F. D. Beed, and C. L. Harmon, “Plant disease diagnostic capabilities and networks,” Annual Review of Phytopathology, vol. 47, pp. 15–38, 2009. [8] M. B. Riley, M. R. Williamson, and O. Maloy, “Plant disease diagnosis. The Plant Health Instructor,” 2002. [9] J. G. Arnal Barbedo, “Digital image processing techniques for detecting, quantifying and classifying plant diseases,” SpringerPlus, vol. 2, article 660, pp. 1–12, 2013. [10] H. Cartwright, Ed., Artificial Neural Networks, Humana Press, 2015. [11] I. Steinwart and A. Christmann, Support Vector Machines, Springer Science & Business Media, New York, NY, USA, 2008. [12] S. Sankaran, A. Mishra, R. Ehsani, and C. Davis, “A review of advanced techniques for detecting plant diseases,” Computers and Electronics in Agriculture, vol. 72, no. 1, pp. 1–13, 2010. [13] P. R. Reddy, S. N. Divya, and R. Vijayalakshmi, “Plant disease detection techniquetool—a theoretical approach,” International Journal of Innovative Technology and Research, pp. 91–93, 2015. [14] A.-K. Mahlein, T. Rumpf, P. Welke et al., “Development of spectral indices for detecting and identifying plant diseases,” Remote Sensing of Environment, vol. 128, pp. 21–30, 2013. [15] W. Xiuqing, W. Haiyan, and Y. Shifeng, “Plant disease detection based on near-field acoustic holography,” Transactions of the Chinese Society for Agricultural Machinery, vol. 2, article 43, 2014.

10 [16] A.-K. Mahlein, E.-C. Oerke, U. Steiner, and H.-W. Dehne, “Recent advances in sensing plant diseases for precision crop protection,” European Journal of Plant Pathology, vol. 133, no. 1, pp. 197–209, 2012. [17] P. Chaudhary, A. K. Chaudhari, A. N. Cheeran, and S. Godara, “Color transform based approach for disease spot detection on plant leaf,” International Journal of Computer Science and Telecommunications, vol. 3, no. 6, pp. 65–69, 2012. [18] S. B. Patil and S. K. Bodhe, “Leaf disease severity measurement using image processing,” International Journal of Engineering and Technology, vol. 3, no. 5, pp. 297–301, 2011. [19] J. K. Patil and R. Kumar, “Feature extraction of diseased leaf images,” Journal of Signal & Image Processing, vol. 3, no. 1, p. 60, 2012. [20] T. R. Reed and J. M. H. Dubuf, “A review of recent texture segmentation and feature extraction techniques,” CVGIP: Image Understanding, vol. 57, no. 3, pp. 359–372, 1993. [21] M. S. P. Babu and B. Srinivasa Rao, “Leaves recognition using back propagation neural network-advice for pest and disease control on crops,” IndiaKisan. Net: Expert Advisory System, 2007. [22] P. Revathi and M. Hemalatha, “Identification of cotton diseases based on cross information gain deep forward neural network classifier with PSO feature selection,” International Journal of Engineering and Technology, vol. 5, no. 6, pp. 4637–4642, 2014. [23] C. Zhou, H. B. Gao, L. Gao, and W. G. Zhang, “Particle swarm optimization (PSO) algorithm,” Application Research of Computers, vol. 12, pp. 7–11, 2003. [24] T. Rumpf, A.-K. Mahlein, U. Steiner, E.-C. Oerke, H.-W. Dehne, and L. Pl¨umer, “Early detection and classification of plant diseases with Support Vector Machines based on hyperspectral reflectance,” Computers and Electronics in Agriculture, vol. 74, no. 1, pp. 91–99, 2010. [25] Z. H. Zhou and S. F. Chen, “Neural network ensemble,” Chinese Journal of Computers, vol. 25, no. 1, pp. 1–8, 2002. [26] B. C. Karmokar, M. S. Ullah, Md. K. Siddiquee, and K. Md. R. Alam, “Tea leaf diseases recognition using neural network ensemble,” International Journal of Computer Applications, vol. 114, no. 17, pp. 27–30, 2015. [27] H. Al-Hiary, S. Bani-Ahmad, M. Reyalat, M. Braik, and Z. ALRahamneh, “Fast and accurate detection and classification of plant diseases,” Machine Learning, vol. 14, p. 5, 2011. [28] I. Lenz, H. Lee, and A. Saxena, “Deep learning for detecting robotic grasps,” The International Journal of Robotics Research, vol. 34, no. 4-5, pp. 705–724, 2015. [29] B. Alipanahi, A. Delong, M. T. Weirauch, and B. J. Frey, “Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning,” Nature Biotechnology, vol. 33, no. 8, pp. 831–838, 2015. [30] L. Zhang, G.-S. Xia, T. Wu, L. Lin, and X. C. Tai, “Deep learning for remote sensing image understanding,” Journal of Sensors, vol. 2016, Article ID 7954154, 2 pages, 2016. [31] J. Arevalo, F. A. Gonzalez, R. Ramos-Pollan, J. L. Oliveira, and M. A. G. Lopez, “Convolutional neural networks for mammography mass lesion classification,” in Proceedings of the 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC ’15), pp. 797–800, August 2015. [32] S. Gould, R. Fulton, and D. Koller, “Decomposing a scene into geometric and semantically consistent regions,” in Proceedings of the 12th International Conference on Computer Vision (ICCV ’09), pp. 1–8, Kyoto, Japan, October 2009.

Computational Intelligence and Neuroscience [33] J. Howse, OpenCV Computer Vision with Python, Packt Publishing, Birmingham, UK, 2013. [34] D. M. Hawkins, “The problem of over-fitting,” Journal of Chemical Information and Computer Sciences, vol. 44, no. 1, pp. 1–12, 2004. [35] C. C. Stearns and K. Kannappan, “Method for 2-D affine transformation of images,” US Patent No. 5,475,803, 1995. [36] S. Brahmbhatt, Practical OpenCV, Apress, 2013. [37] J. Bergstra, F. Bastien, O. Breuleux et al., “Theano: deep learning on gpus with python,” in Proceedings of the NIPS 2011, Big Learning Workshop, Granada, Spain, December 2011. [38] R. Collobert, K. Kavukcuoglu, and C. Farabet, “Torch7: a matlab-like environment for machine learning,” BigLearn, NIPS Workshop EPFL-CONF-192376, 2011. [39] Y. Jia, E. Shelhamer, J. Donahue et al., “Caffe: convolutional architecture for fast feature embedding,” in Proceedings of the ACM Conference on Multimedia (MM ’14), pp. 675–678, ACM, Orlando, Fla, USA, November 2014. [40] D. Jia, W. Dong, R. Socher et al., “ImageNet: a large-scale hierarchical image database,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR ’09), pp. 248– 255, Miami, Fla, USA, June 2009. [41] Tips, CUDA Pro, and C. U. D. A. Spotlights, “Deep Learning for Computer Vision with Caffe and cuDNN,” February 2016, https://devblogs.nvidia.com/parallelforall/deep-learning-computer-vision-caffe-cudnn/. [42] S. Bahrampour, N. Ramakrishnan, L. Schott, and M. Shah, “Comparative study of caffe, neon, theano, and torch for deep 15 learning,” http://arxiv.org/abs/1511.06435v1. [43] A. K. Reyes, J. C. Caicedo, and J. E. Camargo, “Fine-tuning deep convolutional networks for plant recognition,” in Proceedings of the Working Notes of CLEF 2015 Conference, 2015, http://ceur-ws .org/Vol-1391/121-CR.pdf. [44] A. Krizhevsky, I. Sutskever, and G. E. Hinton, Imagenet Classification with Deep Convolutional Neural Networks, Advances in Neural Information Processing Systems, 2012. [45] G. Montavon, M. L. Braun, and K.-R. M¨uller, “Kernel analysis of deep networks,” The Journal of Machine Learning Research, vol. 12, pp. 2563–2581, 2011. [46] D. C. Cires¸an, U. Meier, J. Masci, L. M. Gambardella, and J. Schmidhuber, “Flexible, high performance convolutional neural networks for image classification,” in Proceedings of the 22nd International Joint Conference on Artificial Intelligence, vol. 2, pp. 1237–1242, 2011. [47] C. Ciresan Dan, U. Meier, J. Masci, L. M. Gambardella, and J. Schmidhuber, “Flexible, high performance convolutional neural networks for image classification,” in Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI ’11), vol. 22, no. 1, pp. 1237–1242, 2011. [48] A. Romero, Assisting the training of deep neural networks with applications to computer vision [Ph.D. thesis], Universitat de Barcelona, Barcelona, Spain, 2015. [49] S. Han, J. Pool, J. Tran, and W. J. Dally, “Learning both weights and connections for efficient neural network,” in Proceedings of the Advances in Neural Information Processing Systems (NIPS ’05), NIPS Proceedings, pp. 1135–1143, 2015. [50] N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, “Dropout: a simple way to prevent neural networks from overfitting,” The Journal of Machine Learning Research, vol. 15, no. 1, pp. 1929–1958, 2014.

Computational Intelligence and Neuroscience [51] G. E. Dahl, T. N. Sainath, and G. E. Hinton, “Improving deep neural networks for LVCSR using rectified linear units and dropout,” in Proceedings of the 38th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP ’13), pp. 8609–8613, Vancouver, Canada, May 2013. [52] M. Oquab, L. Bottou, I. Laptev, and J. Sivic, “Learning and transferring mid-level image representations using convolutional neural networks,” in Proceedings of the 27th IEEE Conference on Computer Vision and Pattern Recognition (CVPR ’14), pp. 1717– 1724, IEEE, Columbus, Ohio, USA, June 2014. [53] A. Sethi, “Experiments with Fine-tuning Caffe Models”. [54] O. Russakovsky, J. Deng, H. Su et al., “Imagenet large scale visual recognition challenge,” International Journal of Computer Vision, vol. 115, no. 3, pp. 211–252, 2015. [55] K. Chatfield, K. Simonyan, A. Vedaldi, and A. Zisserman, “Return of the devil in the details: delving deep into convolutional nets,” http://arxiv.org/abs/1405.3531. [56] J. Schmidhuber, “Deep Learning in neural networks: an overview,” Neural Networks, vol. 61, pp. 85–117, 2015. [57] “Leafsnap: An Electronic Field Guide,” 2016, http://leafsnap .com//.

11

Journal of

Advances in

Industrial Engineering

Multimedia

Hindawi Publishing Corporation http://www.hindawi.com

The Scientific World Journal Volume 2014

Hindawi Publishing Corporation http://www.hindawi.com

Volume 2014

Applied Computational Intelligence and Soft Computing

International Journal of

Distributed Sensor Networks Hindawi Publishing Corporation http://www.hindawi.com

Volume 2014

Hindawi Publishing Corporation http://www.hindawi.com

Volume 2014

Hindawi Publishing Corporation http://www.hindawi.com

Volume 2014

Advances in

Fuzzy Systems Modelling & Simulation in Engineering Hindawi Publishing Corporation http://www.hindawi.com

Hindawi Publishing Corporation http://www.hindawi.com

Volume 2014

Volume 2014

Submit your manuscripts at http://www.hindawi.com

Journal of

Computer Networks and Communications

 Advances in 

Artificial Intelligence Hindawi Publishing Corporation http://www.hindawi.com

Hindawi Publishing Corporation http://www.hindawi.com

Volume 2014

International Journal of

Biomedical Imaging

Volume 2014

Advances in

Artificial Neural Systems

International Journal of

Computer Engineering

Computer Games Technology

Hindawi Publishing Corporation http://www.hindawi.com

Hindawi Publishing Corporation http://www.hindawi.com

Advances in

Volume 2014

Advances in

Software Engineering Volume 2014

Hindawi Publishing Corporation http://www.hindawi.com

Volume 2014

Hindawi Publishing Corporation http://www.hindawi.com

Volume 2014

Hindawi Publishing Corporation http://www.hindawi.com

Volume 2014

International Journal of

Reconfigurable Computing

Robotics Hindawi Publishing Corporation http://www.hindawi.com

View publication stats

Computational Intelligence and Neuroscience

Advances in

Human-Computer Interaction

Journal of

Volume 2014

Hindawi Publishing Corporation http://www.hindawi.com

Volume 2014

Hindawi Publishing Corporation http://www.hindawi.com

Journal of

Electrical and Computer Engineering Volume 2014

Hindawi Publishing Corporation http://www.hindawi.com

Volume 2014

Hindawi Publishing Corporation http://www.hindawi.com

Volume 2014

Suggest Documents