Information States in Optimal Control and Filtering


Information States in Optimal Control and Filtering: A Lie Algebraic Theoretic Approach

Charalambos D. Charalambous¹ and Robert J. Elliott²

¹ Department of Electrical Engineering, McGill University, Montreal, P.Q., Canada H3A 2A7

² Department of Mathematical Sciences, University of Alberta, Edmonton, Alberta, Canada T6G 2G1

IEEE Transactions on Automatic Control: Submitted April 21, 1997; Revised April 17, 1998

Reference Number: 97-182

Abstract.

The purpose of this paper is twofold: (i) to introduce the sufficient statistic algebra, which is responsible for propagating the sufficient statistic, or information state, in the optimal control of stochastic systems, and (ii) to apply certain Lie algebraic methods widely used in nonlinear control theory to derive new results concerning finite-dimensional controllers. This enhances our understanding of the role played by the sufficient statistic. The sufficient statistic algebra enables us to determine a priori whether there exist finite-dimensional controllers; it also enables us to classify all finite-dimensional controllers. Relations to minimax dynamic games are delineated.

Key Words: Optimal Control, Filtering, Minimax, Partial Observations, Feynman-Kac, Lie Algebras, Sufficient Statistic, Finite-Dimensional.

¹ This author's work was supported by the Natural Sciences and Engineering Research Council of Canada under grant OGP0183720.

1 Introduction

The practical utility of the Duncan-Mortensen-Zakai (DMZ) equation is greatly appreciated in both nonlinear filtering and stochastic control problems with partial information. The DMZ equation of nonlinear filtering of diffusion processes is a linear, stochastic, partial differential equation (PDE) which describes in a recursive manner the evolution of the unnormalized conditional distribution of the state process, $\{x(t);\ t \ge 0\}$, given the observations, $\{y(t);\ t \ge 0\}$. If this distribution has a density function, say, $\{\rho(x,t);\ t \ge 0\}$, then [1]

$$\frac{d}{dt}\rho(x,t) = L_0\,\rho(x,t) + h(x)\,\rho(x,t)\,\frac{d}{dt}y(t), \qquad (x,t) \in \Re^n \times (0,T]. \tag{1.1}$$

Consequently, $\{\rho(x,s);\ 0 \le s \le t\}$ evolves forward in time with initial condition $\rho(x,0)$. Here, $L_0$ is a certain second-order differential operator related to the drift and diffusion coefficients of the state process, the Kolmogorov forward operator, and $h(x)$ is a zero-order differential operator related to the signal in the observations. In filtering problems one is concerned with conditional expectations

$$E\big[\Phi(x(t)) \mid \{y(s);\ 0 \le s \le t\}\big] = \frac{\int \Phi(z)\,\rho(z,t)\,dz}{\int \rho(z,t)\,dz} = \int \Phi(z)\,\rho^N(z,t)\,dz$$

[...]

$$\cdots = 0. \tag{4.88}$$
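The passage from the unnormalized conditional density of the Introduction to a conditional expectation can be illustrated numerically. The following is a minimal sketch, assuming a one-dimensional state, a uniform grid, and Riemann-sum quadrature. The function names `normalize_density` and `conditional_expectation`, the grid, and the Gaussian test density are hypothetical choices made for illustration and do not come from the paper.

```python
import numpy as np


def normalize_density(rho, dz):
    """Divide an unnormalized density by its total mass (Riemann sum)."""
    mass = np.sum(rho) * dz
    if mass <= 0.0:
        raise ValueError("unnormalized density must have positive mass")
    return rho / mass


def conditional_expectation(phi_vals, rho, dz):
    """Approximate the ratio of integrals: int(Phi * rho) / int(rho)."""
    return (np.sum(phi_vals * rho) * dz) / (np.sum(rho) * dz)


# Hypothetical example: a Gaussian bump centred at 1.0 with an arbitrary
# scale factor 3.7, standing in for an unnormalized conditional density.
z = np.linspace(-10.0, 12.0, 4401)   # grid symmetric about the mean 1.0
dz = z[1] - z[0]
rho = 3.7 * np.exp(-0.5 * (z - 1.0) ** 2)

rho_n = normalize_density(rho, dz)                # normalized density
mean_estimate = conditional_expectation(z, rho, dz)  # Phi(z) = z
```

The arbitrary scale factor cancels in the ratio, which is why it is enough to propagate the unnormalized density by a linear equation and normalize only when an estimate is needed.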

Similar to Theorem 2.4 (see [20]), the information state approach to this control problem yields:

$$J(u^*) = \inf_{u \in U_{ad}} E\left[\int_{\Re^n} \exp(\varphi(x))\,\rho(x,T)\,dx\right]. \tag{4.89}$$

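[...]

$$\frac{1}{\theta}\log \inf_{u \in U_{ad}} J(u) = \inf_{u \in U_{ad}}\ \sup_{(\alpha,\beta) \in D_{ad}} E^Q\Big\{\int_0^T \Big[\lambda(x(t),u(t,y)) - \frac{1}{\theta}|\alpha(t)|^2 - \frac{1}{\theta}|\beta(t)|^2\Big]\,dt + \varphi(x(T))\Big\}, \tag{4.95}$$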

where $E^Q[\cdot]$ denotes expectation with respect to the measure $P^Q$, under which the system (4.87), (4.88) becomes

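$$dx(t) = f(x(t))\,dt + \sum_{j=1}^{\ell} g_j(x(t))\,u_j(t,y)\,dt + \sigma(x(t))\,\alpha(t)\,dt + \sum_{j=1}^{n} \sigma_j(x(t))\,dw_j(t), \qquad x(0) \in \cdots$$

[...]

$$\cdots\ k; \quad 1 \le j \le \ell; \qquad w_{i,j} = \text{constant}, \quad 1 \le i,j \le n; \tag{4.101}$$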

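and

$$\lambda(x,u) = \sum_{i=1}^{n} Q_{i,i}\,x_i^2 + \sum_{i=1}^{\ell} R_{i,i}\,u_i^2 + \lambda_0(x) \ge 0, \qquad Q_{i,i} \ge 0, \quad R_{j,j} > 0, \quad \forall i,j. \tag{4.102}$$

1. If there exists an $\epsilon \ge 0$ such that

$$\cdots - 2\lambda_0 = \text{nonnegative quadratic function of } (x_1,\ldots,x_n), \tag{4.103}$$

then $L^S$ has dimension at most $2n+2$ and

$$L^S \subseteq L \doteq \mathrm{Span}\Big\{L_0 = \Big(\frac{1}{2}\sum_{i=1}^{n} D_i^2 - [x'\tilde{Q}x + mx + \delta]\Big),\ x_1,\ldots,x_n,\ D_1,\ldots,D_n,\ 1\Big\}, \tag{4.104}$$

where $\tilde{Q} = \tilde{Q}'$, $m \in$ (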