TECHNICAL RESEARCH REPORT

A Better Activation Function for Artificial Neural Networks

by D.L. Elliott

T.R. 93-8

ISR INSTITUTE FOR SYSTEMS RESEARCH

Sponsored by the National Science Foundation Engineering Research Center Program, the University of Maryland, Harvard University, and Industry

A Better Activation Function for Artificial Neural Networks

David L. Elliott
Institute for Systems Research
University of Maryland
College Park, MD 20742

ISR Technical Report TR 93-8

January 29, 1993

Research supported in part by Washington University and by NeuroDyne, Inc.


Abstract

An activation function, possibly new, is proposed for use in the digital simulation of artificial neural networks, on the grounds that its computational operation count is much smaller than for activation functions employing exponentials and that it satisfies a simple differential equation generalizing the logistic equation.

Introduction: activation functions

In the digital simulation of neural networks a feedforward memoryless neuron is represented by the input-output relation $y = \sigma(\sum_{k=1}^{n} w_k x_k)$, where $\sigma(\cdot)$, the sigmoidal activation or "squash" function, should have the property that it is positive monotone between the values $-1$ and $1$ (or between $0$ and $1$) for $u \in (-\infty, \infty)$. For use in finding optimal weights $w_k$ to minimize $\|y - y_{\rm desired}\|^2$ by backpropagation (gradient) algorithms, it is also a requirement that $\sigma(\cdot)$ be differentiable and that it satisfy a simple differential equation, thus permitting the evaluation of increments of the weights via the chain rule for partial derivatives. In the seminal paper [1] on functional approximation with neural networks, the above properties are the only ones invoked. For these reasons, $\sigma(\cdot)$ has almost always been one of the two closely related functions

    $\sigma_1(u) = \frac{1}{1 + \exp(-u)}$,    (1)

    $\sigma_2(u) = \tanh(u)$,    (2)

although other optimization schemes (not using gradients) may permit the use of the signum function or the saturation function

    $\sigma_3(u) = u$ if $|u| < 1$,  $\sigma_3(u) = {\rm sgn}(u)$ if $|u| > 1$.

Noting that the derivatives are given by

    $\sigma_1'(u) = \frac{\exp(-u)}{(1 + \exp(-u))^2}$  and  $\sigma_2'(u) = {\rm sech}^2(u)$,

and substituting from (1) and (2) respectively, the differential equations for $\sigma_1$ and $\sigma_2$ are

    $\sigma_1' = \sigma_1(1 - \sigma_1)$,    (3)

    $\sigma_2' = 1 - \sigma_2^2$.    (4)

The simplicity of these "Logistic Equations" (3), (4) permits easy computation of partial derivatives in the optimization algorithm.
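As an illustrative sketch (in Python, with function names chosen here for exposition), the two standard activations and the derivative evaluation via Eqs. (3) and (4) might be coded as follows; the point is that the backward pass recovers the derivatives from the stored outputs without re-evaluating the exponential.

    import math

    def sigma1(u):
        # Eq. (1): logistic function
        return 1.0 / (1.0 + math.exp(-u))

    def sigma2(u):
        # Eq. (2): hyperbolic tangent
        return math.tanh(u)

    def dsigma1_from_output(y):
        # Eq. (3): sigma1' = sigma1 * (1 - sigma1), with y = sigma1(u)
        return y * (1.0 - y)

    def dsigma2_from_output(y):
        # Eq. (4): sigma2' = 1 - sigma2**2, with y = sigma2(u)
        return 1.0 - y * y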


The new squash function

Our proposed activation function is

    $\sigma_e(u) = \frac{u}{1 + |u|}$    (5)

and we have

Proposition 1. The real function $\sigma_e$ is differentiable everywhere with derivative

    $\sigma_e'(u) = \frac{1}{(1 + |u|)^2}$

and satisfies a generalized logistic differential equation

    $\sigma_e' = (1 - |\sigma_e|)^2$.    (6)
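As a quick numerical spot-check of Proposition 1 before the proof, a minimal Python sketch (names chosen here for exposition) compares the closed-form derivative with the right-hand side of Eq. (6):

    def sigma_e(u):
        # Eq. (5): u / (1 + |u|)
        return u / (1.0 + abs(u))

    def dsigma_e(u):
        # Closed-form derivative from Proposition 1: 1 / (1 + |u|)**2
        return 1.0 / (1.0 + abs(u)) ** 2

    def dsigma_e_from_output(y):
        # Eq. (6): the derivative recovered from the output alone
        return (1.0 - abs(y)) ** 2

    # The two expressions agree, including at u = 0 where |u| has a corner.
    for u in (-5.0, -0.5, 0.0, 0.5, 5.0):
        assert abs(dsigma_e(u) - dsigma_e_from_output(sigma_e(u))) < 1e-12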

Proof. Note that $\sigma_e(u)$ is differentiable for $u \neq 0$ and that its left and right derivatives at $u = 0$ are both equal to 1. Since

    $|u| = \frac{|\sigma_e|}{1 - |\sigma_e|}$,

the differential equation (6) is satisfied. $\Box$

Equation (6) is no more complex than Eq. (3) or (4) and permits simple backpropagation code. The operation count for $\sigma_e$ is an order of magnitude smaller than for the exponential functions used in $\sigma_1$ or $\sigma_2$. In production backpropagation code at NeuroDyne, Inc. (as well as in the author's experiments with his own code and with Scott Fahlman's 1989 Quickprop code), the CPU time saved in network training is sufficiently substantial to warrant calling attention to $\sigma_e$. It also nicely satisfies the properties needed by Barron [2] in proving that sigmoids approximate at a rate independent of the dimension of the input vector. Any previous use of $\sigma_e$ and Eq. (6) in the backpropagation literature is unknown to the author.
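To make the operation-count remark concrete, a minimal single-neuron gradient step using $\sigma_e$ might look as follows (an illustrative Python sketch with names chosen here; it is not the NeuroDyne or Quickprop code referred to above). The activation itself costs only an absolute value, an addition, and a division, and the backward pass reuses the stored output through Eq. (6), with no exponential anywhere.

    def sigma_e(u):
        # Eq. (5)
        return u / (1.0 + abs(u))

    def neuron_step(w, x, y_desired, rate=0.1):
        # One gradient-descent update for y = sigma_e(sum_k w_k * x_k),
        # minimizing the squared error (y - y_desired)**2.
        u = sum(wk * xk for wk, xk in zip(w, x))
        y = sigma_e(u)                              # forward pass: no exponential
        # chain rule, with sigma_e'(u) = (1 - |y|)**2 taken from Eq. (6)
        delta = 2.0 * (y - y_desired) * (1.0 - abs(y)) ** 2
        return [wk - rate * delta * xk for wk, xk in zip(w, x)]

    w = [0.1, -0.2, 0.05]
    for _ in range(100):
        w = neuron_step(w, x=[1.0, 0.5, -1.0], y_desired=0.8)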

References

[1] K. Hornik, M. Stinchcombe, and H. White, "Multilayer feedforward networks are universal approximators," Neural Networks, vol. 2, pp. 359-366, 1989.

[2] A. R. Barron, "Neural net approximation," Proc. 7th Yale Workshop on Adaptive and Learning Systems, New Haven, CT, pp. 69-72, 1992.
