Ensemble Pruning using Reinforcement Learning - Semantic Scholar

Recommend Documents

Ensemble Methods for Reinforcement Learning ... - Semantic Scholar

agents applied the Q-Learning algorithm to learn the action-values in subspaces and have been ... to decide about the best action to take. As Q-Learning tend to.

Diversity Regularized Ensemble Pruning - Semantic Scholar

Key words: diversity, ensemble pruning, diversity regularization. 1 Introduction. Ensemble methods [33], which train multiple classifiers for one single task, are.

An Ensemble Pruning Primer - Semantic Scholar

Abstract Ensemble pruning deals with the reduction of an ensemble of pre- ... The last 12 years a large number of ensemble pruning methods have been.

Ensemble Algorithms in Reinforcement Learning

AbstractâThis paper describes several ensemble methods that combine multiple different reinforcement learning (RL) algorithms in a single agent. The aim is to ...

Extreme Learning Machine Ensemble Using ... - Semantic Scholar

computer science for over two decades. ... behavioral science and in clinic practice. Although humans ... The ensemble of an online sequential ELM (OS- ...... He received his B.E. degree in Computer Engineering from Pokhara University,.

Associative Reinforcement Learning using Linear ... - Semantic Scholar

the scenario we consider, learning proceeds in trials ..... trial t. Let progress = E(jjwt 0 vjj2 0 jjwt+1 0 vjj2) and best = maxi yt;i and drop the subscript t from all.

Associative Reinforcement Learning - Semantic Scholar

Reinforcement Learning, Associative Learning, Agents and Multi-agent ... we propose to enhance learning algorithms, reinforcement learning in particular, for.

Ensemble Network Architecture for Deep Reinforcement Learning

Feb 20, 2018 - significant better value evaluation and more stable and better performance on several classical control tasks at OpenAI Gym environment. 1.

A Study on Greedy Algorithms for Ensemble Pruning - Semantic Scholar

Jan 13, 2012 - pruning, selective ensemble, ensemble thinning and ensemble selection, ... Pruning these models while maintaining a high diversity among the.

Reinforcement Learning for Robots Using Neural ... - Semantic Scholar

Jan 6, 1993 - School of Computer Science. ELECT E ... The results of this dissertation rely on computer simulation, incl

Pruning Social Networks Using Structural ... - Semantic Scholar

Louis Licamele. Computer Science ... a set of events E, and a set of membership relations R. The most common ..... [5] D. Liben-Nowell and J. Kleinberg. The link ...

Learning to Drive a Bicycle using Reinforcement ... - Semantic Scholar

The second problem is not as straightforward as it may seem. The learning .... pieces of food (colored cans) and bring them to its nest. The trainer provides the ...

Reinforcement Learning for Robots Using Neural ... - Semantic Scholar

Jan 6, 1993 - AD-A261 434. Reinforcement Learning for ... robot, control, credit assignment, temporal difference, action

Reinforcement Learning using LCS in Continuous ... - Semantic Scholar

A basic LCS technique is developed and applied to a commonly studied RL ..... R. Sutton and A. Barto, An Introduction to Reinforcement Learning, MIT Press, ...

Pruning Social Networks Using Structural ... - Semantic Scholar

Pruning Social Networks Using Structural Properties and Descriptive Attributes. Lisa Singh. Computer Science Dept. Georgetown University. Washington, DC ...

Reinforcement Learning for Partially Observable ... - Semantic Scholar

AbstractâApproximate dynamic programming (ADP) is a class of reinforcement learning ..... The optimal control problem [17] is to find the policy uk = Î¼(xk) that ...

A Generalized Reinforcement-Learning Model - Semantic Scholar

Jan 18, 1996 - A Generalized Reinforcement-Learning Model: Convergence and Applicationa. Michael L. Littman1. Csaba Szepesva`ri2. Technical Report No ...

Reinforcement Learning for Spoken Dialogue ... - Semantic Scholar

automated synthesis of complicated dialogue systems from passive corpora ... TOOT, or whether the user prefers to hear their email ordered by date or sender in ...

Prioritized Sweeping Reinforcement Learning ... - Semantic Scholar

Sep 2, 2016 - Routing protocols for an ad hoc network [2] are generally .... the best estimate for delivering packet P to its final destination D. In order that ...

Sequence labeling with Reinforcement Learning ... - Semantic Scholar

searching for a globally optimal label sequence, we learn to construct this ... Sequence labeling is the generic task of assigning labels to the elements of a ...

Reinforcement Learning When Visual Sensory ... - Semantic Scholar

robot change motion m state evaluator reinforcement signal state x sensor actuator. Fig. 2 Structure of reinforcement learning system d dt d dt2. 2 rnd1 rnd2. +. ~.

Designing a Reinforcement Learning-based ... - Semantic Scholar

Abstract. This paper investigates the challenges posed by the application of reinforcement learning to large-scale strategy games. In this context, we present ...

Forgetting in Reinforcement Learning Links ... - Semantic Scholar

Oct 13, 2016 - Ayaka Kato1, Kenji Morita2*. 1 Department of ...... Takahashi YK, Roesch MR, Wilson RC, Toreson K, O'Donnell P, Niv Y, et al. Expectancy- ...

Reinforcement learning based dual-control ... - Semantic Scholar

Dec 18, 2008 - A dual-controller approach is undertaken where primary ...... Louis, MO, in 2002 and the M.S. degree in computer engineering from Univer-.

Ensemble Pruning using Reinforcement Learning - Semantic Scholar

Download PDF

16 downloads 0 Views 125KB Size Report

Comment

Ensemble Pruning using Reinforcement Learning. Ioannis Partalas, Grigorios Tsoumakas, Ioannis Katakis and Ioannis Vlahavas. Department of Informatics,.

Ensemble Pruning using Reinforcement Learning Ioannis Partalas, Grigorios Tsoumakas, Ioannis Katakis and Ioannis Vlahavas Department of Informatics, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece {partalas,greg,katak,vlahavas}@csd.auth.gr

Abstract. Multiple Classifier systems have been developed in order to improve classification accuracy using methodologies for effective classifier combination. Classical approaches use heuristics, statistical tests, or a meta-learning level in order to find out the optimal combination function. We study this problem from a Reinforcement Learning perspective. In our modeling, an agent tries to learn the best policy for selecting classifiers by exploring a state space and considering a future cumulative reward from the environment. We evaluate our approach by comparing with state-of-the-art combination methods and obtain very promising results.

1

Introduction

A very active research area during the recent years involves methodologies and systems for the production and combination of multiple predictive models. Within the Machine Learning community this area is commonly referred to as Ensemble Methods [1]. The success of this area is due to the fact that ensembles of predictive models offer higher predictive accuracy than individual models. The first phase of an Ensemble Method is the production of the different models. An ensemble can be composed of either homogeneous or heterogeneous models. Models that derive from different executions of the same learning algorithm are called Homogeneous. Such models can be produced by injecting randomness into the learning algorithm or through the manipulation of the training instances, the input attributes and the model outputs [2]. Models that derive from running different learning algorithms on the same data set are called Heterogeneous. The second phase of an Ensemble Method is the combination of the models. Common methods here include Selection, Voting, Weighted Voting and Stacking. Recent work [3], [4] has shown that an additional intermediate phase of pruning an ensemble of heterogeneous classifiers (and combining the selected classifiers with Voting) leads to increased predictive performance. This paper presents a Reinforcement Learning approach to the problem of pruning an ensemble of heterogeneous classifiers. We use the Q-learning algorithm in order to approximate an optimal policy of choosing whether to include or exclude each algorithm from the ensemble. We compare our approach against other pruning methods and state-of-the-art Ensemble methods and obtain very

promising results. Additionaly, the proposed approach is an anytime algorithm, meaning that we can obtain a solution at any point. The rest of this paper is structured as follows: Section 2 presents background information on reinforcement learning and heterogeneous classifier combination, focusing on material that will be later on reffered in the paper. Section 3 reviews related work on pruning ensembles of heterogeneous models, as well as on using reinforcement learning for the combination of different algorithms. Section 4 presents our approach and Section 5 the setup of the experiments for its evaluation. Section 6 discusses the results of the experiments and finally Section 7 concludes this work and points to future research directions.

2 2.1

Background Reinforcement Learning

Reinforcement Learning (RL) addresses the problem of how an agent can learn a behavior through trial-and-error interactions with a dynamic environment [5]. In an RL task the agent, at each time step, senses the environment’s state, st ∈ S, where S is the finite set of possible states, and selects an action at ∈ A(st ) to execute, where A(st ) is the finite set of possible actions in state st . The agent receives a reward, rt+1 ∈