Stony Brook University, Stony Brook, NY 11794 (USA) phone: + 1 631 ... der Awards CCR-0220011 and CCF-0515246 and the Office of Naval Re- search under ...
14th European Signal Processing Conference (EUSIPCO 2006), Florence, Italy, September 4-8, 2006, copyright by EURASIP
ADVANCES IN COST-REFERENCE PARTICLE FILTERING M´onica F. Bugallo and Petar M. Djuri´c Department of Electrical and Computer Engineering Stony Brook University, Stony Brook, NY 11794 (USA) phone: + 1 631 632 8423, fax: + 1 631 632 8494, email: {monica, djuric}@ece.sunysb.edu
ABSTRACT Recently, we have proposed a particle filtering-type methodology, which we refer to as cost-reference particle filtering (CRPF). Its main feature is that it is not based on any particular probabilistic assumptions regarding the studied dynamic model. The concepts of particles and particle streams, however, are the same in CRPF as in standard particle filtering (SPF), but the probability masses of the particles are replaced with user defined costs. In this paper we propose some modifications of the original CRPF methodology. The changes allow for development of simpler algorithms, which may also be less computationally intensive and possibly more robust. We investigate several variants of CRPF and compare them with SPF. The advantages and disadvantages of the considered algorithms are illustrated and discussed through computer simulations of tracking of multiple targets which move along a two-dimensional space. 1. INTRODUCTION Cost Reference Particle Filtering (CRPF) is a generalization of standard particle filtering (SPF) in that it allows for recursive estimation of unobserved states of dynamic systems without use of probability distributions of the noises in the system and prior distributions of the states [7]. This methodology has already been successfully used in several applications including the positioning and tracking of one single target [2]. We have found that it has substantial advantages in terms of simplicity and robustness when compared to SPF algorithms [1, 2, 7]. In this paper, we continue the study of this class of PF methods and we investigate several variants of them. In particular, we propose to modify some of the steps of the original CRPF and make the resulting methods much simpler for use (by the practitioner) and much less computationally intensive. We achieve all this without degrading the performance of the CRPF methods. We tested the methods by applying them to the problem of tracking of multiple targets by using sensor measurements obtained in a sensor network. The measurements represent a superposition of signals that carry information about the positions of the various targets. The sensors send the sensed information to a fusion center that combines the received data from all the sensors and carries out necessary computations. We chose this problem because it is challenging and highly nonlinear, and because it has already been addressed by SPF [3, 5, 6]. The remaining of the paper is organized as follows. First, in Section 2, we briefly review the basic features of CRPF This work has been supported by the National Science Foundation under Awards CCR-0220011 and CCF-0515246 and the Office of Naval Research under Award N00014-06-1-0012.
and then describe the new algorithms. In Section 3, we provide the details of the multiple target tracking problem that is used for testing of the methods. In Section 4, we demonstrate their performance through computer simulations. Finally, in Section 5 we provide some concluding remarks. 2. CRPF AND ITS VARIANTS We have a dynamic model of a system described by state and observation equations whose general forms are given by xt yt
= =
fx (xt−1 ) + ut fy (xt ) + vt
(1) (2)
where t = 1, 2, ... denotes discrete time; xt and yt are the state and observation vectors, respectively; fx and fy are the statetransition and observation functions, respectively; and ut and vt are independent noise processes. We are interested in estimating the state vectors x0:t = [x0 , x1 , ..., xt ] from the observations y1:t = [y1 , ..., yt ]. We address nonlinear problems, that is, problems where at least the observation function fy (·) is nonlinear. Although, in this statement, we assumed that the noise is additive, the CRPF methods can be implemented in a similar way for other types of signal degradation. The SPF methods are based on the use of the Bayes’ theory and the knowledge of the noise distributions of the noises in (1) and (2). In many scenarios the knowledge of the distributions is questionable at best, and so our goal with CRPF was to track unknown states in time without using probabilistic assumptions of the noise. In order to estimate x0:t from y1:t in such situations, we considered a user-defined real cost function [7], C (x0:t |y1:t , λ ) = λ C (x0:t−1 |y1:t−1 ) + 4C (xt |yt )
(3)
where 4C (xt |yt ) is an incremental cost due to the presence of a new observation and 0 < λ < 1 is a forgetting factor that weights the history of previous data. The cost measures the quality of the state signal estimates according to the available observations and its recursive structure allows for sequential updating using solely the state and observation vectors at the current time instant. We also considered a one-step risk function, R(xt−1 |yt ) = 4C ( fx (xt−1 )|yt ) , which acts as a prediction of the incremental cost obtained from the previous state. With these two basic concepts, the CRPF techniques are sequentially built in a way similar to that of SPF algorithms [7]. It is apparent that many implementations of the CRPF methodology can be proposed for a single problem. In [1], the relationship between CRPF and SPF was studied (the values λ = 1 and λ = 0 provided the conditions) and
14th European Signal Processing Conference (EUSIPCO 2006), Florence, Italy, September 4-8, 2006, copyright by EURASIP
several modifications of the CRPF methodology were proposed. The new algorithms avoid resampling (which is required in SPF and the original CRPF and represents a bottleneck in hardware implementations), are simpler to use, and are less computationally intensive. Here we outline the most simplified algorithm presented in [1] (λ = 0, no resampling, and estimation of the state using the particle with the minimum cost) and we discuss several directions that can be taken for further simplifications. 2.1 Simplified CRPF Particles are initialized uniformly on a set, X0 ⊂ RLx with zero assigned costs. Given a set of M state trajectories and n o (m) (m) M associated costs up to time t, x0:t , Ct , the grid of m=1 state trajectories is randomly propagated when yt is observed by implementing the following steps: 1. Selection of the most promising trajectories. (a) For m = 1, 2, ..., M, let
the samples can also be considered. The resulting algorithm (which does not calculate risks either) is symbolized by crpfmean nr . For ways of constructing the probability mass function π (·), please see [7]. 3. Exploring the state space In order to explore the sample space more efficiently, we propose to sample P times using the same particle (each particle would have P children and therefore we would get M P particles in the generation step). The number of particles is brought back to M once the costs of all the particles are computed, the particles are ranked according to the costs, and the particles with smallest costs selected. This method is referred to as crpfnr−P . A summary of all the proposed algorithms is shown in the Table. State − position 300 200
(m) Rt+1
(m) = R(xt |yt+1 )
100
TARGET 1
0
(m)
y−coordinate (m)
where Rt+1 is a risk function. (m)
(b) Sort in increasing order the Rt+1 . (c) Replicate the M N trajectories. n o (m) (m) M A new particle filter denoted by xˆ 0:t , Cˆt 1.
tained as the result of this step 2. Propagation of new trajectories. For m = 1, ..., M, let
m=1
is ob-
−100 −200 TARGET 2
−300 −400 −500 −600 −600
=
where pt+1 is a probability density function chosen by the designer. As mentioned before, this is a simplified version of the CRPF (note that compared to equation (3) the cost here is only updated using the second term). (m)
min = arg min{C 3. Estimation of the state. Let xt+1 t+1 }, and therefore we choose as estimate of the state the particle with minimum cost.
2.2 New variants of the CRPF Here we introduce some further simplifications and improvements of the above described algorithm (labeled as crpf). 1. No risk calculation A possible reduction in computational burden can be obtained by eliminating the risk step and moving the selection procedure at the end of step two, once costs are calculated. We will denote this method by crpfnr . 2. Other estimation procedures The purpose of considering the minimum cost estimate was to avoid the calculation of a probability mass function to normalize the obtain costs and perform estimation. Other types of estimates like the weighted mean value of 1 Note that this CRPF approach avoids resampling by replacing it with simple ordering of the obtained risks in step 1 of the algorithm and replicating the corresponding M N (where N can be 2, 3, · · · ) particles with particles with the lowest risks.
−500
−400
−300 −200 −100 x−coordinate (m)
0
100
200
300
Figure 1: Target trajectories and their estimates. The estimates are very close to the true trajectories and cannot be discerned.
Magnitude of the velocity (m/s)
∼
(m) pt+1 (x|xˆ t ) (m) 4Ct+1 (xt+1 |yt )
Magnitude of the velocity (m/s)
(m) xt+1 (m) Ct+1
TARGET 1 25 20 15
true spf crpfnr crpfnr−P
10 5 0 0
25
50
75
100 Time (s)
125
150
175
200
125
150
175
200
TARGET 2 25 20 15
true spf crpfnr crpfnr−P
10 5 0 0
25
50
75
100 Time (s)
Figure 2: Magnitude of the velocity.
3. STATEMENT OF THE TRACKING PROBLEM Consider a network of N acoustic sensors deployed in a twodimensional field where K targets move according to a stan-
14th European Signal Processing Conference (EUSIPCO 2006), Florence, Italy, September 4-8, 2006, copyright by EURASIP
crpf
crpfmean nr
crpfnr
1. Select:
1. Propagate:
For m = 1, · · · , M
1. Propagate:
For m = 1, · · · , M
(m) (m) Rt+1 = R(xt |yt+1 )
For m = 1, · · · , M
(m) (m) xt+1 ∼ pt+1 (x|x ˆt ) (m) (m) (m) ˆ + 4Ct+1 (xt+1 |yt ) Ct+1 = λ Ct
(m) Sort in increasing order the Rt+1
For m = 1, · · · , M
2. Select: (m) Sort in increasing order the Ct+1
Replicate the M N trajectories
Replicate the M N trajectories
3. Estimate:
mean = ∑M x(m) π (m) xt+1 m=1 t+1 t+1
min = arg min{C (m) } xt+1 t+1
3. Estimate:
1, ..., P
(m,i) (m,i) xt+1 ∼ pt+1 (x|x ˆt ) i = 1, ..., P (m,i) (m,i) (m,i) ˆ + 4Ct+1 (xt+1 |yt ) Ct+1 = λ Ct
i=
2. Select:
(m) Sort in increasing order the Ct+1
3. Estimate:
(m) (m) xt+1 ∼ pt+1 (x|x ˆt ) (m) (m) (m) + 4Ct+1 (xt+1 |yt ) Ct+1 = λ Cˆt
For m = 1, · · · , M
(m) (m) xt+1 ∼ pt+1 (x|x ˆt ) (m) (m) (m) ˆ + 4Ct+1 (xt+1 |yt ) Ct+1 = λ Ct
2. Select:
Replicate the M N trajectories
2. Propagate:
crpfnr−P 1. Propagate:
(m,i) Sort in increasing order the Ct+1 Replicate the M N trajectories
3. Estimate:
min = arg min{C (m) } xt+1 t+1
min = arg min{C (m) } xt+1 t+1
4. SIMULATION RESULTS
dard model formulated as [5] xt
= Gx xt−1 + Gu ut
(4)
> > 4K indicates the position where xt> = [x> 1,t , . . . , xK,t ] ∈ R and the velocity of the targets in the field, i.e., xk,t = [x1,k,t x2,k,t x˙1,k,t x˙2,k,t ]> , k = 1, . . . , K. The transition matrices, Gx of size 4K × 4K, and Gu of size 4K × 2K, are block diagonal matrices with blocks
1 0 0 Gx = 0 0
0 1 0 0
Ts 0 1 0
T2 s 0 2 Ts 0 0 and Gu = 0 Ts 1 0
0 Ts2 2
0 Ts
where Ts is the sampling period. The noise in the state equation, ut ∈ R2K , accounts for small acceleration turbulences and is modeled as a Gaussian ³ process with zero mean ´and covariance matrix Cu = diag σu21,1 , σu21,2 , . . . , σu2K,1 , σu2K,2 . The n-th sensor at a known position, rn ∈ R2 , n = 1, . . . , N, receives the signal power transmitted from the targets that are present in the field according to [8], yn,t
= gn (xt ) + vn,t K
=
∑ ¯¯r
k=1
n = 1, . . . , N
Ψk d0α ¯
n − lk,t
¯α
+ vn,t ,
(5)
where gn (·) is a function that models the received signal power by the n−th sensor, Ψk is the emitted (known) power of the k−th target measured at a reference distance d0 , lk,t = [x1,k,t , x2,k,t ]> is the location of the k−the target at time t, α is an attenuation parameter that depends on the transmission medium and is considered known and the same for all sensors, and ¯ ¯ q ¯rn − lk,t ¯ = (r1,n − x1,k,t )2 + (r2,n − x2,k,t )2 . The observation noise, vn,t , assumed to be independent from ut , is modeled according to N (µv , σv2 ), where µv = σ 2 and σv2 = 2σ 4 /L, with σ 2 and L being the power of the background noise of one sample and the number of samples used to obtain the measured power, respectively. The objective is to track the targets in the field, i.e., estimate x0:t , using the measurements of the N sensors, yn,1:t , n = 1, 2, · · · N.
We now present simulation results that show the performance of the CRPF algorithms discussed in the previous section. We ran an experiment where we generated data according to model (4)-(5), which corresponded to the evolution of a system during T = 200s with sampling period Ts = 0.5s. We considered that the distribution of the state noise was ut ∼ N (0, .5I2K ); the parameters of the observation model were, Ψk = Ψ = 107 , d0 = 1m, and α = 2; and the observation noise distribution was generated with σ 2 = 0.02 and L = 100. The sensor network was composed of N = 16 sensors placed on a deterministic grid within the field (see Figure 1 for system configuration were the sensors are marked with dots.) We applied the proposed CRPF methodology for solving the tracking problem and, for comparison and benchmarking purposes, we also implemented the SPF algorithm [4]. For clarity, we only included in the Figures the results corresponding to the SPF algorithm (labeled spf), the crpfnr (CRPF that does not calculate the risks - see Table), and the crpfnr−P (CRPF that does not calculate the risk and propagates per each particle P = 10 children - see Table). The crpf (see [1]) and the crpfmean showed similar results. nr We used the following cost function for the CRPF algorithms: C (x0 ) = 0 4C (xt |yt ) = k yt − g(xt ) k where k · k denotes Euclidean distance. The propagation mechanism for the CRPF methods consisted of the sequence of Gaussian densities given by ´ ³ (m) (m) xt+1 ∼ N Gx xˆ t−1 , σt2 I2 , where the variance, σt2 , was a priori fixed. In Figure 1, we compare the SPF method with perfect knowledge of the noise distributions, and we present vehicle trajectories in the two-dimensional space resulting from a single simulation of the dynamic system. Both the SPF and the CRPF-type of algorithms were run with M = 1000 particles. It is apparent that all the algorithms remained locked to the vehicle position during the whole simulation interval. Figure 2 shows the magnitude of the velocity of each target in one realization of the system. Even though the curves are results of one run only, as expected, we already see that the algorithm that propagates 10 children per particle has a
14th European Signal Processing Conference (EUSIPCO 2006), Florence, Italy, September 4-8, 2006, copyright by EURASIP
crpfnr−P
5
0
50
100 150 Time (s)
mad y−coordinate
mad x−coordinate
0
200
10
5
0
50
100 150 Time (s)
200
50
100 150 Time (s)
0
50
100 150 Time (s)
50
100 150 Time (s)
200
0.5
0
0.5
0
200
1
0
1
0
50
100 150 Time (s)
200
50
100 150 Time (s)
200
TARGET 2 1.5
1.5
5
0
nr
crpfnr−P
0.5
200
10
0
1
0 0
TARGET 2 15
15
0
5
mad x−coordinate
0
10
spf crpf
mad y−coordinate
nr
mad x−coordinate
10
mad y−coordinate
mad x−coordinate
spf crpf
TARGET 1 1.5
1.5
mad y−coordinate
TARGET 1 15
15
50
100 150 Time (s)
200
1
0.5
0
0
Figure 3: Left: Mean absolute deviation of the position. Right: Mean absolute deviation of the velocity. smoother performance than the algorithm propagating only one child. In Figure 3 (left) we show the mean absolute deviations (mad) of the estimated positions of the targets. The errors were computed over 10 runs using the same target trajectories. It can be seen that the proposed variations of the CRPF algorithm provide good tracking results. Again, note that the crpfnr−P that explores the sample space more intensively, gets a remarkable reductions in the error. Similar results can be seen in Figure 3 (right) where the mean absolute deviations of the estimated velocities of the targets was plotted. 5. CONCLUSIONS We proposed three variants for simplified CRPF. Two of the modifications result in less computationally demanding algorithms, while the third approach is more computationally intensive but explores the states space better and as a result has improved performance. We studied the feasibility of the methods on the problem of tracking multiple objects which move along a certain two-dimensional area. The computer simulations showed that the new algorithms had performance that is similar to that of the SPF methods, or that it is even better. REFERENCES [1] M. F. Bugallo, J. M´ıguez, and P. M. Djuri´c, “Positioning by cost reference particle filters: Study of various implementations,” in the Proceedings of the International Conference on “Computer as a tool” (EUROCON), Belgrade (Serbia), 2005. [2] M. F. Bugallo, S. Xu, J. M´ıguez, and P. M. Djuri´c, “Maneuvering target tracking using cost-reference particle filtering,” in the Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), Montreal (Canada), 2004. [3] A. Doucet, N. de Freitas, and N. Gordon, Eds., Sequential Monte Carlo Methods in Practice, Springer, New York, 2001.
[4] A. Doucet, S. J. Godsill, and C. Andrieu, “On sequential Monte Carlo sampling methods for Bayesian filtering,” Statistics and Computing, pp. 197–208, 2000. [5] F. Gustaffson, F. Gunnarsson, N. Bergman, U. Forssel, J. Jansson, R. Karlsson, and P.-J. Nordlund, “Particle filtering for positioning, navigation, and tracking,” IEEE Transactions on Signal Processing, vol. 50, no. 2, pp. 425–437, 2002. [6] C. Hue, J.-P. Le Cadre, and P. Perez, “Sequential Monte Carlo methods for multiple target tracking and data fusion,” IEEE Transactions on Signal Processing, vol. 50, no. 2, pp. 309–325, 2002. [7] J. M´ıguez, M. F. Bugallo, and P. M. Djuri´c, “A new class of particle filters for random dynamical systems with unknown statistics,” EURASIP Journal on Applied Signal Processing, vol. 2004, no. 15, pp. 2278–2294, 2004. [8] X. Sheng and Y-H. Hu, “Maximum likelihood multiplesource localization using acoustic energy measurements with wireless sensor networks,” IEEE Transactions on Signal Processing, vol. 53, pp. 44–53, 2005.