
REGULARIZATION TOOLS FOR TRAINING FEED-FORWARD NEURAL NETWORKS
PART II: Large-scale problems

JERRY ERIKSSON, MÅRTEN GULLIKSSON, PER LINDSTRÖM and PER-ÅKE WEDIN

Department of Computing Science, Umeå University, S-901 87 Umeå, Sweden
E-mail: [email protected], [email protected], [email protected], [email protected]

* Financial support has been received from the Swedish National Board for Industrial and Technical Development under grant NUTEK 8421-94-4603.

We describe regularization tools for training large-scale artificial feed-forward neural networks. In a companion paper (in this issue) we give the basic ideas and some theoretical results regarding the Gauss-Newton method compared to other methods, such as the Levenberg-Marquardt method, applied to small and medium size problems. We propose algorithms that explicitly use a sequence of Tikhonov regularized nonlinear least squares problems. For small- and medium-size problems the Gauss-Newton method is applied to the regularized problem. For large-scale problems, methods using new special purpose automatic differentiation are used in a conjugate gradient method for computing a truncated Gauss-Newton search direction. The algorithms developed exploit the structure of the problem in different ways and perform much better than the Polak-Ribière based method. All algorithms are tested using the benchmark problems and guidelines by Lutz Prechelt in the Proben1 package. All software is written in Matlab and gathered in a toolbox.

KEY WORDS: Neural network training, Tikhonov regularization, Automatic differentiation, Large-scale problems.
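
As a concrete illustration of the kind of problem the toolbox addresses, the following Matlab sketch evaluates a Tikhonov regularized training objective for a one-hidden-layer network. This is not the toolbox's own code: the weight layout, the function name reg_objective, the tanh/linear architecture, and the regularization parameter mu are assumptions made only for this example.

    % Hypothetical sketch: Tikhonov regularized training objective
    %   F_mu(x) = 0.5*sum(f_i(x)^2) + 0.5*mu*||x||^2,  with residual f = A(x) - T
    % (the sign of the residual does not affect the sum of squares).
    function [Fmu, f] = reg_objective(x, P, T, nin, nhid, nout, mu)
      % Unpack the weight vector x into the two weight matrices
      % (bias terms are omitted to keep the sketch short).
      W1 = reshape(x(1:nhid*nin), nhid, nin);        % input  -> hidden weights
      W2 = reshape(x(nhid*nin+1:end), nout, nhid);   % hidden -> output weights
      A  = W2 * tanh(W1 * P);                        % network output for all patterns P
      f  = A(:) - T(:);                              % error (residual) vector
      Fmu = 0.5*(f'*f) + 0.5*mu*(x'*x);              % regularized objective value
    end

Training in this setting amounts to minimizing F_mu for a sequence of values of the regularization parameter mu, as described in the abstract above.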

1 INTRODUCTION

The training phase of supervised feed-forward neural networks leads to very difficult unconstrained nonlinear least squares problems. The difficulty is due to the fact that the Jacobian matrix is rank deficient almost everywhere. By regularizing the original problem we get a less ill-conditioned problem with a solution limited in norm. For large problems, new special purpose automatic differentiation algorithms for computing the Jacobian times a vector are used in a conjugate gradient method. In this paper, we propose optimization methods explicitly applied to the regularized nonlinear problem for large-scale problems. To be specific, we formulate
and solve nonlinear Tikhonov regularized problems. In [7] (this issue) it was shown theoretically and practically that this approach is superior to standard optimization regularization techniques, such as the Levenberg-Marquardt (LM) or trust region methods [15], or the truncated QR methods used in the subspace minimization approaches described in [5, 6, 13]. We will use the same notation as in [7], which, for simplicity, is partly repeated below. In feed-forward neural network computations, the difference between the output vector A and the desired target vector T is named the error vector (E = T - A), see Section 1.1 in [7]. However, we use the naming conventions from the field of numerical nonlinear optimization and write f instead of E. Then the nonlinear least squares problem is written as

    \min_{x \in \mathbb{R}^n} F(x) = \frac{1}{2} \sum_{i=1}^{m} f_i^2(x),
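
To make the large-scale strategy mentioned above concrete, here is a minimal Matlab sketch of a truncated Gauss-Newton direction for the Tikhonov regularized problem, computed with the conjugate gradient method so that only Jacobian-vector products J*v and J'*w are needed. The handles jvp and jtvp (which the paper obtains via special purpose automatic differentiation), the stopping rule, and the iteration limit are placeholders for illustration, not the toolbox's actual interface.

    % Hypothetical sketch: truncated Gauss-Newton direction p from
    %   (J'*J + mu*I) p = -(J'*f + mu*x)
    % using conjugate gradients with matrix-vector products only.
    function p = truncated_gn_step(x, f, jvp, jtvp, mu, maxit, tol)
      % jvp(x,v) ~ J(x)*v and jtvp(x,w) ~ J(x)'*w, e.g. via automatic differentiation.
      Hv = @(v) jtvp(x, jvp(x, v)) + mu*v;   % regularized Gauss-Newton Hessian times v
      g  = jtvp(x, f) + mu*x;                % gradient of the regularized objective
      p  = zeros(size(x));
      r  = -g;                               % residual of the linear system
      d  = r;
      rho = r'*r;
      for k = 1:maxit
          q     = Hv(d);
          alpha = rho / (d'*q);
          p     = p + alpha*d;
          r     = r - alpha*q;
          rho_new = r'*r;
          if sqrt(rho_new) <= tol*norm(g)    % truncate: accept an inexact direction
              break
          end
          d   = r + (rho_new/rho)*d;
          rho = rho_new;
      end
    end

Because the Gauss-Newton matrix J'*J + mu*I is never formed explicitly, the memory cost of such a step stays linear in the number of weights, which is the point of combining conjugate gradients with Jacobian-vector products for large networks.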
