
REGULARIZATION TOOLS FOR TRAINING LARGE FEED-FORWARD NEURAL NETWORKS USING AUTOMATIC DIFFERENTIATION

JERRY ERIKSSON, MÅRTEN GULLIKSSON, PER LINDSTRÖM and PER-ÅKE WEDIN

Department of Computing Science, Umeå University, S-901 87 Umeå, Sweden
E-mail: [email protected], [email protected], [email protected], [email protected]

We describe regularization tools for training large-scale artificial feed-forward neural networks. We propose algorithms that explicitly use a sequence of Tikhonov regularized nonlinear least squares problems. For large-scale problems, new special-purpose automatic differentiation methods are used within a conjugate gradient method to compute a truncated Gauss-Newton search direction. The algorithms developed utilize the structure of the problem in different ways and perform much better than a Polak-Ribière-based method. All algorithms are tested using the benchmark problems and guidelines by Lutz Prechelt in the Proben1 package. All software is written in Matlab and gathered in a toolbox.

KEY WORDS: Neural network training, Tikhonov regularization, Automatic differentiation, Large-scale problems.

* Financial support has been received from the Swedish National Board of Industrial and Technical Development under grant NUTEK 8421-94-4603.

1 INTRODUCTION

The training phase of supervised feed-forward neural networks leads to very difficult unconstrained nonlinear least squares problems. The difficulties are due to the fact that the Jacobian matrix is rank deficient almost everywhere. By regularizing the original problem we get a less ill-conditioned problem with a solution limited in norm. For large problems, new special-purpose automatic differentiation algorithms for computing the Jacobian times a vector are used in a conjugate gradient method. In this paper, we propose optimization methods explicitly applied to the nonlinear regularized problem for large-scale problems. To be specific, we formulate and solve nonlinear Tikhonov regularized problems. In [12] it was shown theoretically and practically that this approach is superior to standard regularizing optimization techniques, such as Levenberg-Marquardt (LM) (trust region) methods [16] and truncated QR-methods such as subspace minimization [6, 7, 14].

In feed-forward neural network computations, the difference between the output vector, A, and the desired target vector, T, is called the error vector, E = T − A; see Section 1.1. However, we use the naming conventions from the field of numerical nonlinear optimization and write f instead of E. Then the nonlinear least squares problem is written as

\[
\min_{x \in \mathbb{R}^n} F(x) = \frac{1}{2} \sum_{i=1}^{m} f_i^2(x) \qquad (1)
\]
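To make the connection between (1), Tikhonov regularization and the truncated Gauss-Newton direction concrete, consider a regularized problem of the standard form

\[
\min_{x \in \mathbb{R}^n} \; \frac{1}{2}\|f(x)\|_2^2 + \frac{\mu}{2}\|x\|_2^2, \qquad \mu > 0.
\]

The following is a minimal sketch, not the authors' toolbox code, of how a truncated Gauss-Newton direction for this assumed form can be computed with the conjugate gradient method using only the products J*v and J'*w, such as those supplied by automatic differentiation. The names jvp, jtvp, mu, maxit and tol are illustrative, and the exact regularized formulation used in the paper (for example, a possible center point in the penalty term) may differ.

% Sketch: truncated Gauss-Newton direction for the Tikhonov-regularized
% problem  min_x 0.5*||f(x)||^2 + 0.5*mu*||x||^2  (an assumed standard form).
% The direction p approximately solves
%     (J'*J + mu*I) p = -(J'*f + mu*x)
% by the conjugate gradient method, touching J only through the products
% J*v and J'*w (e.g. supplied by automatic differentiation).
%
%   jvp(v)  - assumed handle returning J*v
%   jtvp(w) - assumed handle returning J'*w
function p = truncated_gn_direction(jvp, jtvp, f, x, mu, maxit, tol)
  g = jtvp(f) + mu * x;          % gradient of the regularized objective
  p = zeros(size(x));
  r = -g;                        % CG residual for the damped normal equations
  d = r;
  rho = r' * r;
  for k = 1:maxit
    Ad = jtvp(jvp(d)) + mu * d;  % (J'*J + mu*I)*d without forming J explicitly
    alpha = rho / (d' * Ad);
    p = p + alpha * d;
    r = r - alpha * Ad;
    rho_new = r' * r;
    if sqrt(rho_new) <= tol * norm(g)
      break                      % truncate: an inexact direction is accepted
    end
    d = r + (rho_new / rho) * d;
    rho = rho_new;
  end
end

Truncating the inner conjugate gradient iteration early is what makes such a scheme attractive for large networks: each inner iteration costs only two Jacobian products, and the regularization parameter mu keeps the system well conditioned even where J is rank deficient.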
