High-Resolution Monitoring of Atmospheric ... - Semantic Scholar

4 downloads 18679 Views 1MB Size Report
the precision of the lost-cost sensors on the overall estimation accuracy. Index Terms—Air ...... degree in computer science from Monash Univer- sity, Melbourne ...
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING

1

High-Resolution Monitoring of Atmospheric Pollutants Using a System of Low-Cost Sensors Sutharshan Rajasegarar, Timothy C. Havens, Senior Member, IEEE, Shanika Karunasekera, Christopher Leckie, James C. Bezdek, Life Fellow, IEEE, Milan Jamriska, Ajith Gunatilaka, Alex Skvortsov, and Marimuthu Palaniswami, Fellow, IEEE

Abstract—Increased levels of particulate matter (PM) in the atmosphere have contributed to an increase in mortality and morbidity in communities and are the main contributing factor for respiratory health problems in the population. Currently, PM concentrations are sparsely monitored; for instance, a region of over 2200 km2 surrounding Melbourne in Victoria, Australia, is monitored using ten sensor stations. This paper proposes to improve the estimation of PM concentration by complementing the existing high-precision but expensive PM devices with low-cost lower precision PM sensor nodes. Our evaluation reveals that local PM estimation accuracies improve with higher densities of low-precision sensor nodes. Our analysis examines the impact of the precision of the lost-cost sensors on the overall estimation accuracy. Index Terms—Air pollution, Bayesian maximum entropy, geospatial analysis, kriging, particulate matter, spatiotemporal estimation, wireless sensor networks.

I. I NTRODUCTION

A

CCORDING to the World Health Organization, over 4.6 million people die annually as a result of air pollution. High levels of pollutants are a major cause of respiratory conditions such as asthma, bronchitis, and chronic obstructive pulmonary disease [1]–[3]. Concern for pollutants depends on how widespread they are, as well as the legal and regulatory framework of the locale in which they exist. The most widespread pollutants include carbon monoxide, volatile organic compounds, ozone, oxides of nitrogen, sulfur dioxide, and lead. Some primary pollutants can interact to form secondary pollutants, which are produced as a result of the Manuscript received March 19, 2013; revised June 4, 2013; accepted June 22, 2013. This work was supported in part by the Australian Research Council (ARC) Research Network on Intelligent Sensors, Sensor Networks and Information Processing, by the University of Melbourne Interdisciplinary Seed Grant, and by the ARC Grants LP120100529 and LE120100129. S. Rajasegarar, J. C. Bezdek, and M. Palaniswami are with the Department of Electrical and Electronic Engineering, The University of Melbourne, Parkville, Vic. 3010, Australia (e-mail: [email protected]; [email protected]; [email protected]). T. C. Havens is with the Departments of Electrical and Computer Engineering, Michigan Technological University, Houghton, MI 49931-1295 USA (e-mail: [email protected]). S. Karunasekera and C. Leckie are with the Department of Computing and Information Systems, The University of Melbourne, Parkville, Vic. 3010, Australia (e-mail: [email protected]; [email protected]). M. Jamriska, A. Gunatilaka, and A. Skvortsov are with the Human Protection and Performance Division, Defence Science and Technology Organisation, Melbourne, Vic. 3207, Australia (e-mail: [email protected]; [email protected]; [email protected]). Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org. Digital Object Identifier 10.1109/TGRS.2013.2276431

change in chemical balance in the atmosphere. Mixtures of pollutants (smog) can aggravate existing respiratory ailments such as asthma and bronchitis and/or increase the risk of respiratory problems, particularly on “smoggy” days. Elevated lead levels can affect the central nervous system and may impair intellectual development in children [4]. Many studies link air pollutants, such as particulate matter (PM), ozone, nitrogen dioxide, and carbon monoxide (CO), to increases in premature deaths and hospital admissions for people with existing heart and lung disease [4]. One specific form of pollution is PM, which is a fluid mixture of solids and liquids [5], [6]. In particular, PM in the air that has an aerodynamic diameter less than 10 μm, referred to as PM10, and those with a diameter less than 2.5 μm, referred to as PM2.5, are a population health hazard. The PM diameter is smaller than a human hair which averages about 70 μm [6]. PM particles are small enough to penetrate the deepest portions of the lungs, and the very smallest particles can be absorbed into the bloodstream [5]. Exposure to even low levels of PM can cause nasal congestion, sinusitis, throat irritation, coughing, wheezing, shortness of breath, and chest discomfort [6], [7]. Increased mortality and morbidity in communities with elevated PM10 concentrations have been reported by a variety of studies, and PM10 forecasting has been discussed in many articles [8]–[13]. A study conducted by the Harvard School of Public Health in six U.S. cities found that the subjects exposed to higher PM levels were 26% more likely to die prematurely than those exposed to lower concentrations [14]. Therefore, monitoring the PM levels in the environment is an important factor in maintaining good public health. Regulatory bodies around the world regularly monitor the levels of PM10 concentrations in the atmosphere. While expensive top-end pollution monitoring equipment that are capable of reliably measuring very low levels of pollutants are available, the current practice is that monitoring is done at a low spatial resolution due to the high cost of the equipment. For example, a region of over 2200 km2 surrounding Melbourne in Victoria, Australia, is monitored using ten sensor stations (see Fig. 1). The sparsity of these monitoring sites means that pollution levels estimated at locations far removed from the monitoring sites are unreliable, making it difficult to investigate the correlation of PM to population health outcomes. The sparsity of pollution sensors might be augmented by the use of complementary air quality models in both the temporal and spatial domains. The main objective of this paper is to investigate and propose the use of nodes equipped with low-cost low-precision PM

0196-2892 © 2013 IEEE

This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. 2

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING

Fig. 1. Geographical locations for the EPAV sensors. The horizontal (east–west) size of the domain is about 55 km, and the vertical (north–south) dimension is about 40 km.

sensors to supplement existing high-precision, often expensive, PM devices with the goal of improving the estimation at finer spatial and temporal resolutions. A more capable sensor network [15]–[29] will enable us to get a more accurate measure of the exposure of individuals to PM10 concentrations and, hence, better understand the impact of PM10 levels on respiratory illnesses. To achieve this objective, we perform simulation studies using synthetically generated PM10 concentration data that match the overall spatiotemporal characteristics of measured PM10 data. In particular, in this paper, we generated synthetic PM10 data that resemble the historical spatiotemporal PM10 data collected by the Environment Protection Authority Victoria (EPAV). We consider PM10 concentration time series from eight sensor stations located in the vicinity of Melbourne. Details of this EPAV sensor deployment can be found at the Environment Protection Authority (EPA) Web site [30]. To generate PM10 over the EPAV monitored region for this study, we propose two space–time (ST) covariance-based noise modeling algorithms: a Fourier transform-based method and a Cholesky decomposition-based method. Using the generated simulation data, we analyze how the estimation accuracy of PM10 levels at arbitrary locations is affected by the following: 1) spatial resolution of sensors and 2) sensor accuracy. This was done by estimating PM10 levels at a number of arbitrary locations using generated synthetic sensor measurements from high-precision EPAV sensors and a specified number of low-precision sensors placed at locations on a grid. The effect of the number and the accuracy of the low-cost sensors is studied. We used the Bayesian maximum entropy (BME) technique [31], which is a well-accepted technique for the estimation and prediction of spatiotemporally correlated data. Our results show the feasibility of using low-cost lowprecision sensors for improving the estimation accuracy of PM10 data. II. R ELATED W ORK Modeling networks of wireless pollutant sensors have recently been a topic of intensive research in applications spanning from environmental and technological monitoring to defense and security systems (see [32] and [33] and references therein). It has been demonstrated that such systems can be

described by a mathematical model that has some similarities with population biology. By employing this approach, a number of optimization strategies and operational protocols have been proposed to meet various operational criteria, including the tradeoff between parameters of an individual sensor and the coherent performance of the chemical sensor network and the integrated system (for details, see [32] and [33]). This paper contributes to this line of research by exploring the information gain that can be provided by the networking effect in a system of many sensors, each with low individual accuracy. A wide variety of sensors and models have been used for the estimation and prediction of air quality and concentrations of specific pollutants [34]–[36]. Slini et al. [8] used a stochastic autoregressive integrated moving average (ARIMA) model to study and make maximum ozone concentration forecasts in Athens, Greece. Diaz-Robles et al. [9] combined an ARIMA model with a neural network (NN) to build a hybrid model to forecast PM10 concentrations in Temuco, Chile. Slini et al. [10] described an operational air quality forecasting module for PM10 based on classification and regression trees and NN methods, which is capable of capturing PM10 concentration trends. Stadlober et al. [11] used multiple linear regression models that combined the information of the present day with meteorological forecasts of the next day to forecast daily PM10 concentrations for sites located in the three cities: Bolzano, Klagenfurt, and Graz, the three capitals in the provinces of South Tyrol, Carinthia, and Styria. Tsai et al. [12] studied PM2.5 and PM10 in Bangkok using simple regression and correlation analysis for data gathered in a shopping center and at a university. Yu et al. [37] studied the spatiotemporal distribution of PM2.5 across the Taipei area from 2005 to 2007 using a land use regression model. They established a quantitative relationship between PM2.5/PM10 and land use information for predicting the PM2.5 concentration. Epitropou et al. [38] used a fusion strategy to select environmental information from open access web resources of various types and data from monitoring stations in order to provide tailored information to the users. Forecast models are produced using published air quality information in the form of color-mapped georeferenced images and data from chemical weather databases. Multivariate geostatistics offers many alternatives for choosing the family of functions F and a best member F ∗ ∈ F to describe and model the observations. Good introductions to some of these methods are the texts by Clark [39] and Wackernagel [40]. The most popular and perhaps best known possibilities are variograms, ordinary, simple, and weighted kriging, principal component analysis, canonical analysis, and correspondence analysis. Christakos et al. [31], [41] present a method for estimating PM10 levels in the state of California based on an integrated ST domain using the BME mapping approach. Their BME method is a generalized model, in which kriging becomes one of the special cases. We use this method to estimate the concentrations at unknown spatiotemporal locations in the EPAV region. One of the primary challenges in many of these studies (including ours) is that analysis is based on measurements from a few sparsely located pollution monitors. For example, one such study in Santiago, Chile, used the average of just five

This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. RAJASEGARAR et al.: MONITORING OF ATMOSPHERIC POLLUTANTS USING A SYSTEM OF LOW-COST SENSORS

monitoring sites to obtain pollution data representing the whole city [42]. Localized variations of pollutant concentrations may not be adequately captured by a small number of sparsely located monitoring sites; therefore, in [42], Santiago is divided into five regions, each centered around a pollution monitoring station. This strategy seemed to improve correlations between pollutant levels and observed health effects throughout the city. Our approach is somewhat different. We first validate our model with simulated data at the nodes of a regularized rectangular grid that circumscribes the sensor stations shown in Fig. 1. Then, we use the time series at the EPAV sites as a basis for both interpolation and extrapolation over the spatial domain of Fig. 1. The statistical properties of PM (pollutant, aerosol, moisture, and dust) in the atmospheric boundary layer have been investigated; the pioneering work of Richardson and Batchelor [43], [44] is well documented. It is recognized that these properties and statistics of associated particle time series emerge from the physics of turbulent mixing and particle dispersion by random flow [45], [46]. This enables the development of physics-based frameworks for data analysis and implementation of rigorous computer simulations for the concentration time series (see [45], [47], [48], and references therein). In the context of the current study, two specific properties of particle statistics are most relevant: 1) The concentration time series are strictly positive (since measured values are simply counts of particles), and 2) they can be comprehensively characterized by the first two statistical moments [46] (since they are driven by the underlying Gamma distribution). The latter property becomes critical for the application of the BME model; see (7). III. P ROBLEM S TATEMENT AND A PPROACH A. Problem Statement We consider a ST domain where a general point in space and time is given by p = (s, t) ∈ S × T , where s = (x, y) is the geographical location and t is the time. A set of Nh highprecision sensors and Nl low-precision sensors is deployed at known geographical locations in the region, and measure PM10 concentrations over T discrete time points. Hence, a collection of nh = Nh × T high-precision sensor measurements and nl = Nl × T low-precision sensor measurements is collected at nd = nh + nl ST points. The aim is to estimate the PM10 concentrations at an arbitrary set of nk ST points, called the estimation points, based on the measured values from nd ST points. Note that nk = Nk × T , where Nk is the number of geographic (estimation) locations. We denote the ST random field realizations (PM10 concentrations) over the ST region as a collection of correlated random variables Zmap = {Zdata ∪ Zest } at points pmap = {pdata ∪ pest }, where Zdata = (Z1 , Z2 , . . . , Znd ), Zest = (Znd +1 , . . . , Znd +k , . . . , Znk ), pdata = (p1 , p2 , . . . , pnd ), and pest = (pnd +1 , . . . , pnd +k , . . . , pnk ). We define the root-mean-square estimation error s at a spatial location s over a time period T as   T 1   ˆ tι ))2 s = (Z(s, tι ) − Z(s, (1) T ι=1

3

where Z(s, tι ) is the true value of the PM concentration at an ˆ tι ) is the estimated value of the PM ST point (s, tι ) and Z(s, concentration. The overall root-mean-square error (over space and time)  for a given sensor configuration is defined as   Nk  T  1  ˆ i , tι ))2 . (Z(si , tι ) − Z(s (2) = Nk T i=1 ι=1 Our aim is to study the impact of the number of low-precision sensors (Nl ) and their accuracy on the estimation errors (, s ). In particular, we are interested in determining whether supplementing high-precision sensors with low-precision sensors can reduce the error. B. Our Approach To study the impact of low-precision sensors on estimation accuracy, we need to compare estimated PM values to actual concentrations (ground-truth data) at chosen locations. While the measurements from high-precision sensors are the best possible estimates of actual concentrations, due to the spatial sparseness of these measurements, there are not enough ground-truth data available to perform the intended analysis. Therefore, the approach that we have taken is to generate synthetic ground-truth data that resemble the characteristics of real data. In this paper, we use the real PM10 data available from eight sensors (later, we will identify the eight sensors that we are using that have significantly less missing data) deployed in the region around Melbourne, Australia, by the EPAV. We describe the techniques that we use for this purpose in Section IV. After generating synthetic ground-truth data, the simulated concentration time series corresponding to Nh EPAV sensor station locations are taken as the measurements from the highprecision sensors. We then apply a sensor model to simulate low-precision sensor measurements made at Nl additional locations. The sensor model allows us to mimic characteristics such as noise, saturation, linearity, and the minimum measurement threshold of sensors. Then, we use simulated measurements corresponding to both high- and low-precision sensors to estimate PM10 concentrations at a set of arbitrary known locations not coinciding with the existing sensor locations. We use the BME technique to estimate the concentrations at chosen locations. The BME technique is described in Section VI. IV. S YNTHETIC G ROUND -T RUTH DATA G ENERATION We propose two ST covariance-based noise modeling algorithms to generate ground-truth data that simulate PM10 concentrations over the EPAV monitored region. One is based on the fast Fourier transform (FFT) model, and the other is based on the Cholesky factorization model. We now formulate the two ST covariance-modeled noise schemes. A. FFT Model Assume a wide sense stationary (WSS) 2-D covariance model B(Δx, Δy) for the spatial region. If we assume that

This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. 4

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING

Fig. 2. Example simulated time-series data computed by the FFT model. A Gaussian covariance model was used with parameters υ = 1 and a = 5. (a) t = 1. (b) t = 2. (c) t = 3. (d) t = 4.

B is also isotropic, then it can be written as B(r), where r = x − y2 is the Euclidean or 2 norm of the vector (x, y). One 2 2 such model could be the Gaussian B(r) = υe−r /a , where υ and a are the parameters of the model. The most popular way to simulate WSS covariance-modeled noise is by using the FFT. Assume a grid of sensors S = {s11 , s12 , . . . , sn(m−1) , snm }, where sij = (xi , yj ) is the sensor location. First, we construct the n × m matrix B = [B(rij )]n×m , where rij = sij − ¯sij 2 and ¯sij is the center of the grid. Then, the covariance-modeled noise is constructed as follows: 1) Compute the 2-D FFT of B, ξB = FFT2D (B). 2) Compute the 2-D FFT of an n × m complex Gaussian white noise matrix W , ξW = FFT2D (W ). 3) Compute  the 2-D inverse FFT, V = IF F T2D ( (siB ) ◦ (siW )), where ◦ is the Hadamard product. This produces two noise draws: the real part and the imaginary part of V (here, we use the real part of V ). To add time dependence to this model, we simply construct an n × m × T matrix B = [rijw ]n×m×T , where, now, rijw = sijw − ¯sijw 2 and sijw = (xi , yj , tw ). The computation process is the same as above, except that 3-D FFTs and inverse FFTs are used, and a 3-D complex Gaussian white noise random matrix is drawn in step 2. Fig. 2 shows examples of FFT-simulated time-series data on a grid. The advantage of the FFT-based model is that it is computationally fast and easy to implement. However, the FFT-based model is limited to a grid arrangement, which may not be appropriate for simulations of real systems. Hence, we now move to a different type of model (that produces equivalent results for arbitrary sensor network locations). B. Cholesky Factorization Model Now, consider some collection of sensors S = {s1 , . . . , sn }, where si = (xi , yi ) is the sensor location (not necessarily grid based). First, we construct an n × n matrix, B = [B(rij )]n×n , where rij = si − sj 2 . B is positive semidefinite and thus can be factorized into B = RRT (Cholesky factorization). Draw an n-length vector β of unit-variance zero-mean normaldistributed random numbers, and compute φ = Rβ. The covariance of φ is Bφ = φφT  = Rββ T  RT = RIn RT = B

(3)

where · indicates the expected value. Hence, the covariance of φ is exactly B. The vector φ is a set of simulated sensor measurements at each location in X according to the statistical model B(r). Now, assume a time series of sensor measurements S = {S1 , . . . , ST }, where St is the tth set of measurements. Consider the time-based covariance model Bt (Δt). Construct the matrix Bt = [Bt (Δtij )]T ×T , where Δtij = |ti − tj |, factor it as Bt = Rt RtT , and make a random draw At = Rt A, where A is a T × n matrix of unit-variance zero-mean normally distributed random numbers. Each row of At is an n-length vector of Bt (0)-variance zero-mean normal-distributed random numbers. Therefore, (At )Ti (At )i  = Bt (0)In , ∀i, where (At )i is the ith row of At . Finally, we can create ST covariancemodeled noise by Φ = RATt , where Φ = {φ1 , . . . , φT } is the time series of simulated sensor measurements. Each φt has the spatial statistics of B(r), and the time series at each sensor location has the temporal statistics of Bt (Δt). The advantage of this method is that it is not constrained to a grid-based sensor arrangement (as the FFT model is). Fig. 3 illustrates using the Cholesky method to simulate the sensor measurements of the EPA Victoria sensor array (ten locations).

V. M ODEL F ITTING TO R EAL EPA DATA The first step to fitting a WSS spatially isotropic spatiotemporal covariance model to real data is to compute the variogram (covariance estimate) of the sensor measurements z(s, t). Since we are assuming an isotropic WSS model of the form B(r, t) = Bs (r)Bt (Δt), we can model the spatial and temporal covariances separately. The basic steps of this process are as follows. ˆs (r) and temporal vari1) Compute the spatial variogram B ˆ ogram Bt (Δt) from the data z(s, t). 2) (Optional) Interpolate the variogram(s) onto a uniform grid. ˆs (r) and 3) Regress parameterized models to each of B ˆr (Δt). B 4) Choose the best spatial model and the best temporal model (if more than one model were regressed to each variogram). In detail, the first step is to calculate the variograms, viz., covariance estimates. The isotropic spatial covariance estimate

This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. RAJASEGARAR et al.: MONITORING OF ATMOSPHERIC POLLUTANTS USING A SYSTEM OF LOW-COST SENSORS

5

Fig. 3. Example simulated PM10 concentrations at EPAV locations. The Gaussian covariance model parameters are υ = 1 and a = 14 for spatial covariance model and υ = 1 and a = 2 for temporal covariance model. (a) t = 1. (b) t = 2. (c) t = 3. (d) t = 4.

Fig. 4. (a), (b) Covariance estimates for EPAV. (c) Spatial covariance estimate interpolated onto a uniform grid using cubic interpolation. Covariance estimates are shown as circles, and interpolated values are shown as dots. (a) Spatial covariance. (b) Temporal covariance. (c) Interpolated spatial covariance. TABLE I PARAMETERIZED C OVARIANCE M ODELS

is computed as ˆs (r) = B

T   (z(si , t) − μz )(z(sj , t) − μz ) t=1 ∀i,j

T nr

(4)

where nr is the number of summed terms in the second summation (or the  number of covariance terms averaged for the lag r = si − sj 2 ) and μz is the mean of the sensor measurements. The temporal covariance estimate is computed as ˆt (Δt) = B

ns   (z(sm , ti ) − μz )(z(sm , tj ) − μz ) m=1 ∀i,j

ns nΔt

(5)

where ns is the number of sensors and nΔt is the number of terms in the second summation for the lag Δt = (ti − tj ). Fig. 4(a) and (b) shows the spatial and temporal covariances for the EPAV data (described in Section VII), estimated over 100 consecutive time slices. What we see right away in view (a) is that there are very few data points in the spatial lag region of 0 < r < 16.5. Also, the spatial covariance estimate shows that the correlation between sensor measurements has already settled to near 0 at the next lag point of r = 16.2. This imposed a difficult modeling task because the behavior of the PM10 concentrations between r = 0 and r = 16.5 is unknown (this is exactly the reason that a dense sensor network is needed). In contrast to the sparse spatial estimate, the temporal covariance estimate is relatively dense (obviously, we have temporal covariance estimates at integer-valued time lags Δt = 0, 1, 2, . . .). Interestingly, the temporal variogram exhibits a periodic nature, indicating strong correlation peaks at time lags of 12, 23, 36, and 48 h. This is intuitively pleasing as this corresponds to the periodic nature of the day. Step 2 of the model-fitting process

is used to alleviate the difficulty of fitting a model to the spatial covariance estimate shown in Fig. 4(a). Because there are no ˆs (r) onto a unidata points in 0 < r < 16.5, we interpolate B form grid r = 0, 1, 2, . . . as shown in Fig. 4(c). The interpolated data (using cubic interpolation) give the model-fitting algorithm data to work with the region 0 < r < 16.5. Is this really what the real-world covariance would look like? We do not know, but this at least gives us a reasonable estimate. In step 3, we ˆs (r) and regress a set of predetermined models to each of B ˆt (Δt). The models that we use are shown in Table I. To fit B ˆ ˆs (r) or a model to the covariance estimate B(q)—i.e., either B ˆt (Δt)—we seek to minimize the error function B  ˆ Em (Π) = n2q (B(q) − Bm (q, Π))2 (6) q

where q represents all possible lags, r or Δt, and Bm (q, Π) is one of the seven models shown in Table I. In essence, we are weighting each summation term in Em by the number of ˆ pairwise estimates used to compute B(q) at lag q. Recall that ˆs (r) onto a uniform grid before fitting to we first interpolated B the models. For the spatial covariance, we thus set nr = 1, ∀r. Note that, in (4), every nr = 1 as there is only one unique pair ˆt (Δt), of sensors for each spatial lag r. This is not the case for B

This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. 6

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING

Fig. 5. (a), (b) Best models for spatial and temporal covariance estimates (model shown as green continuous line). (c and d) Comparison of covariance model estimated from real data and sample-based covariance of simulated random field (simulation shown as green continuous line). (a) Spatial covariance model. (b) Temporal covariance model. (c) Spatial covariance. (d) Temporal covariance.

as each nΔt is different, with n0 = T , n1 = T − 1, n2 = T − 2, etc. The minimization of the function in (6) is accomplished using a quasi-Newton line search method. After we regress each model to each of Bs (r) and Bt (Δt), we choose the best model for each covariance estimate. Let Π∗m be the best parameters found by minimizing (6) for either the spatial or temporal covariance. Then, the best overall models are chosen by m∗ = arg minm Em (Π∗m ), which is solved for each of the spatial and temporal covariances. Fig. 5(a) and (b) ˆs (r) shows the best empirically determined model for each of B ˆ and Bt (Δt). The chosen model for the spatial covariance is the exponential model B1 (r, (1, 12)), and the chosen model for the temporal covariance is the exponential model B1 (Δt, (1, 2.4)). Note that, in both cases, υ = 1. This is because we normalize the data to unit variance. For the estimates shown in Fig. 4, ˆs (0) = B ˆt (0) = 147. B As Figs. 4(c) and 5(a) show, the interpolation method most likely overestimates the parameter a of the exponential spatial covariance model. However, given the absence of covariance estimates at spatial lags 0 < r < 16.5, we believe that any value 5 ≤ a ≤ 12 would provide a reasonable fit for these data. Fig. 5(b) shows that the exponential model fails to capture the periodic nature of the temporal covariance model. In the future, we plan to extend our modeling and simulation capabilities to more accurately match this periodic behavior. Fig. 5(c) and (d) shows the spatial and temporal covariance models compared to the sample-based covariance estimates of a random field generated by the FFT process described earlier. We generated 51 time slices of a 101 × 101 grid, based on the model parameters estimated from the EPAV data [shown in Fig. 5(a) and (b)]. Then, the sample-based covariance was estimated from the simulated field. The figure shows that the covariance of our simulated random field matches well with the model. VI. PM C ONCENTRATION E STIMATION In order to estimate the PM concentration at arbitrary locations, we use a recently developed approach called BME, which involves the concepts of Bayesian conditionalization and entropy [31], [49]. The definitive works on this are those by Serre and Christakos [31], [49], who have used this approach quite successfully for PM10 estimation in various locales [41]. The BME approach incorporates two prime knowledge bases (a collection of knowledge sources): 1) a general knowledge base G, such as that from physical laws, summary statistics, and scientific theories, and 2) specificatory knowledge or casespecific knowledge base , such as that obtained through expe-

rience with specific situations. The total knowledge base is H = G ∪ . Furthermore, the specific knowledge base accounts for both hard data (measured values that we are willing to concede are correct with near certainty) and soft data (interval or probabilistic data that are uncertain due to, for example, inexact knowledge, experience, intuition, low-precision sensor outputs, etc.) [49]. Let the ST random field realizations be represented as a collection of correlated random variables Zmap = (Z1 , Z2 , . . . , Znd , Zk ) at points pmap = (p1 , p2 , . . . , pnd , pk ). The aim of BME is to obtain a spatiotemporal map displaying the estimate at a point pk ∈ pest of the unknown value Zk based on the observed values Z1 , Z2 , . . . , Znd at locations p1 , p2 , . . . , pnd . For each point in pmap , the ST model assigns a probability value using the probability density function (pdf) fH (Zmap ). There are three main stages of knowledge acquisition, interpretation, and processing involved in the BME approach: (1) the prior stage; (2) the meta-prior stage; and (3) the integration or posterior stage. In the prior stage, ST analysis starts with the basic set of assumptions and the general knowledge base G. The expected information contained in the map Zmap is expressed  as E(IG [Zmap ]) = − fG (Zmap )log fG (Zmap ) dZmap , where fG is the multivariate pdf model associated with G before the specific knowledge base has been taken into consideration and E(.) denotes the expectation operator. The shape of the prior pdf fG is determined by maximizing the expected prior information subject to the physical constraints E(gα (Zmap )) =  gα (Zmap )fG (Zmap ) dZmap , where gα is a set of known functions of Zmap with g0 = 1, E(g0 (Zmap )) = 1, and α = 0, 1, 2, . . . , Nc . The total number of Nc is such that stochastic moments are included that involve all the ST points pi ∈ pmap of the map. In this paper, the G consists of the mean functions ´ ) given by m(p) and the covariance functions C(p, p C(p, p ´ ) = E[(Z(p) − m(p))(Z(p ) − m(p ))]∀p, p

(7)

where m(p) = E[Z(p)], ∀p. Hence, the gα is chosen such that the expectation E(gα (Zmap )) provides the ST mean and covariance functions throughout the ST domain of interest. The optimization is performed using the method of Lagrange multipliers, and then, the expression for the prior pdf becomes fG (Zmap ) = J −1 eYG [Zmap ] , where J = e−μ0 , YG [Zmap ] =  Nc α=1 μα gα (Zmap ), and μα are the Lagrange multipliers. In the meta-prior stage, the specificatory knowledge is considered. The pdf fG is updated by means of Bayesian conditionalization. In the integration or posterior stage, both of these knowledge are incorporated to obtain the

This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. RAJASEGARAR et al.: MONITORING OF ATMOSPHERIC POLLUTANTS USING A SYSTEM OF LOW-COST SENSORS

posterior pdf fH (Zmap ) = Q−1 Y [YG , , Zmap ], where Q is a normalization parameter. If we assume that the specific knowledge consists of the hard data and the probabilistic soft data, then the posterior operator Y becomes Y = Q−1 I eYG [Zmap ] dF [Zsof t ], where : Zsof t ∈ I,  P rob [zsof t ≤ ζ] = F (ζ) and Q = I dF (Zsof t )fG (Zdata ). Zsof t denotes the soft data, I denotes the domain of Zsof t , Zhard denotes the hard data, Zdata = {Zhard ∪ Zsof t } ⊂ Zmap , and F is the cumulative distribution function obtained from . Several possibilities exist to obtain the estimate Zˆk at an unknown point pk using fH (Zmap ). One such possibility is to use themean estimate at the point pk , which is given by Zˆk,mean = Zk fH (Zk )dZk . In this paper, this mean estimate is used to find the PM10 concentrations at the off-sensor locations. Note that this BME mean estimate minimizes the mean squared estimation error. Let the ST variability of the PM10 concentration Z be described in terms of a (centered) covariance function C(κ, τ ) = E[(Z(p) − m(p))((Z(p ) − m(p ))], where p − p = ((s − s ), (t − t )); κ = s − s ; τ = |t − t | and m is the mean. The quantities κ and τ are space and time lags and show that this covariance is spatially isotropic and stationary in time. For the case of κ = 0, C(0, τ ) is reduced to the single-point time series (observations from an individual sensor) and can easily be validated experimentally [45]. More specifically, the difference C(κ, τ ) − C(0, 0) should exhibit a linear dependence on its arguments at the limit of small values of κ, τ [46]. Since this property has been reliably recovered experimentally [45], it can be used as a supporting argument for the proposed model. We use C(κ, τ ) = υs υt e(−κ/as ) e(−τ /at ) to model the covariance structure in the evaluation. The covariance coefficients υs and υt provide information about variations in the PM10 distribution. The coefficient as is the range of spatial variation, and at represents temporal fluctuations. We match the coefficients to our generated simulation data, which are a representation of the EPAV PM10 data (as explained in Section V), and utilize the BME software for the estimation [50]. VII. E VALUATION

7

Fig. 6. Normalized error versus number of soft sensors (grid size).

coordinates ([1, 200] × [1, 200] × [1, 50]), where the x- and y-axis ranges are [1, 200] each and the third axis (time slices) range is [1, 50]. The grid spacing is 1 in all three directions. The spatial covariance model υs e(−r/aS ) has two parameters with values υs = 1 and as = 12, and the temporal covariance model υt e(−r/at ) has parameter values υt = 1 and at = 2.4. These parameters were selected based on the model fitted to the spatiotemporal variogram curves obtained from the real EPAV PM10 data (see Section V). B. Representation of High- and Low-Precision Sensors In order to study the impact of additional low-precision sensors on the estimation accuracy, we consider a deployment which has a combination of high-precision sensors as well as low-precision sensors. The high-precision sensors are located at the actual EPAV sensor locations, and their concentration values are set to the sensor concentrations of the generated data (i.e., consider them as ideal sensors with zero error). In the BME estimation, these are the hard data locations. To study the impact of low-precision sensors, we place them on a regular grid encompassing the EPAV sensor network. The number of low-precision sensors (Nl ) is varied by changing the grid size. The precision of the sensors is controlled by adding zeromean Gaussian noise with varying standard deviation. For BME estimation, these sensors are the soft data.

A. Representation of the EPAV Sensor Network

C. Effect of the Number of Low-Precision Sensors (Nl )

Our work is facilitated by introducing the artificial Cartesian coordinate system for the EPAV nodes (shown in Fig. 1) on a regular grid of size 200 × 200. The EPAV nodes considered in this evaluation are deployed at the following eight locations (including the coordinates): Alphington (141, 171), Box Hill (159, 156), Brighton (135, 122), Dandenong (177, 89), Mooroolbark (197, 170), Deer Park (89, 176), Footscray (115, 164), and Richmond (135, 156). The EPAV PM10 data (high-precision sensor data) were collected during the period between October 3, 2011 and April 20, 2012 at a sampling interval of 1 h. We use the FFT-based model (grid sensor-based scheme; see Section IV-A) for generating the ground-truth data for our experiments as the given sensor locations are already on a Cartesian grid. The ground-truth data are generated for a 3-D rectangular parallelepiped (rectangular box) region spanned by the

In this experiment, the estimation locations are a set of grid locations with a spatial grid spacing of 20 within the spatial grid of size [80, 200] × [80, 200]. The soft data locations are the grid locations with the number of horizontal and vertical spacings selected from a range of values between 1 and 100, i.e., we use square grids of size 1 × 1 to 100 × 100 for the low-precision sensors. If a soft data location coincides with a hard data location, the soft data sensor is removed. The soft data are modeled using a Gaussian pdf with the generated groundtruth value as the mean and a constant value of 0.2 as the standard deviation (γ). The estimated values are compared with the generated ground-truth values obtained at these estimation locations. The root mean square error (RMSE)  [see (2)] is computed across all the estimation locations that are noncoinciding with the soft or hard sensor locations and all the time instances, for each grid size of the soft sensors (number

This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. 8

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING

Fig. 7. RMSE (s ) intensity plots. Black stars represent the EPAV sensor locations. (a) Nl = 0. (b) Nl = 8 × 8. (c) Nl = 15 × 15. (d) Nl = 25 × 25.

of soft data locations). The normalized error for each grid size of the soft sensors is computed by dividing the RMSE  by the RMSE obtained for the hard only case, where there are no low-precision sensors deployed, i.e., Nl = 0. Fig. 6 shows the normalized error versus the number of soft sensors. As intuition would suggest, the normalized error decreases as the number of soft sensors is increased. The intensity plots in Fig. 7 show the root-mean-square estimation error s at spatial locations for the cases Nl = 0 (no low-precision sensors), Nl = 8 × 8, Nl = 15 × 15, and Nl = 25 × 25. The estimation locations used are a 50 × 50 grid. D. Effect of Sensor Noise on the Estimation In this experiment, the estimation locations are a set of grid locations with a spatial grid spacing of 20 within the spatial grid of size [80, 200] × [80, 200]. The soft data are modeled using a Gaussian pdf whose mean is the generated ground-truth value (using the FFT algorithm). The sensor noise, standard deviation γ, is changed in the range [0.001, 1] at intervals of 0.01. Five different soft sensor configurations are considered. Each configuration consists of soft sensor locations at the sets of equally spaced grid locations with a total number of soft sensors as 16 (4 × 4), 169 (13 × 13), 625 (25 × 25), 1600 (40 × 40), and 2116 (46 × 46). For each of these location sets, the RMSE  is computed across all estimation locations that are noncoinciding with the soft or hard sensor locations and all time instances for each noise level of the soft data. Fig. 8 shows the normalized error versus noise standard deviation γ for the five soft sensor grid sizes. The normalized error for each γ is computed by dividing the RMSE  by the RMSE obtained for the hard only case, where there are no lowprecision sensors deployed, i.e., Nl = 0. The normalized error increases with the standard deviation of the noise level of the soft sensors. Furthermore, the normalized error is observed to be lower within the region with a larger number (and, thus, denser array) of soft sensors. VIII. D ISCUSSION AND C ONCLUSION Monitoring the PM concentration at a fine-grained resolution is important for accurate estimation and use in health-related research, such as for asthma research. In this paper, we examined the effect of supplementing existing high-precision PM10 sensors with low-precision (and, thus, less expensive) simulated sensors. Not surprisingly, our evaluation reveals that the deployment of inexpensive low-precision sensors can improve the estimation accuracy in the monitored region. The number

Fig. 8.

Normalized error versus sensor noise standard deviation γ.

of additional sensor nodes that need to be deployed to obtain a given estimation accuracy is also analyzed. Furthermore, we have analyzed the maximum error that can be tolerated in the low-precision sensors to achieve a given level of overall PM10 estimation accuracy. In the future, we plan to deploy a wireless sensor network equipped with PM10 sensors within the Victorian region. This study will help us to meet a specified estimation accuracy while choosing the best tradeoff of sensor cost and placement density. R EFERENCES [1] Environmental Protection and Heritage council, “Asthma research: A background paper,” Co-operative Studies on Priority Air Quality and Health Related Issues, 2002. [Online]. Available: http://www.scew.gov. au/archive/air/pubs/aq_rsch__asthma_background_paper_200212.pdf [2] J. G. Watson, “Visibility: Science and regulation,” J. Air Waste Manage. Assoc., vol. 52, no. 6, pp. 628–713, Jun. 2002. [3] J. C. Chow, J. Watson, J. Mauderly, D. Costa, R. Wyzga, S. Vedal, G. Hidy, S. Altshuler, D. Marrack, J. Heuss, G. Wolff, C. Pope, and D. Dockery, “Health effects of fine particulate air pollution: Lines that connect,” J. Air Waste Manage. Assoc., vol. 56, no. 10, pp. 1368–1380, Oct. 2006. [4] Air Quality Criteria for Particulate Matter, US Environ. Protection Agency (US EPA), Washington, DC, USA, 1996. [5] S. H. Ling and S. F. van Eeden, “Particulate matter air pollution exposure: Role in the development and exacerbation of chronic obstructive pulmonary disease,” Int. J. Chron. Obstruct. Pulmon. Dis., vol. 4, pp. 233– 243, 2009. [6] A. Kimbrell, In-Car Air Pollution. The Hidden Threat to Automobile Drivers, Int. Center Technol. Assessment, Washington, DC, USA, Rep. 4. [Online]. Available: http://www.andrewkimbrell.org/doc/ In-carpollutionreport.pdf [7] D. Massey, A. Kulsrestha, and A. Taneja, “A study on indoor outdoor concentration of particulate matter in rural residential houses in India,” in Proc. 2nd Int. Conf. Environ. Comput. Sci., 2009, pp. 218–223. [8] T. Slini, K. Karatzas, and N. Moussiopoulos, “Statistical analysis of environmental data as the basis of forecasting: An air quality application,” Sci. Total Environ., vol. 288, no. 3, pp. 227–237, Apr. 2002. [9] L. Diaz-Robles, J. C. Ortega, J. S. Fu, G. D. Reed, J. C. Chow, J. G. Watson, and J. A. Moncada-Herrera, “A hybrid ARIMA and artificial neural networks model to forecast particulate matter in urban areas: The

This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. RAJASEGARAR et al.: MONITORING OF ATMOSPHERIC POLLUTANTS USING A SYSTEM OF LOW-COST SENSORS

[10] [11] [12]

[13]

[14]

[15]

[16] [17] [18] [19] [20] [21]

[22]

[23] [24] [25]

[26]

[27]

[28] [29] [30] [31]

case of Temuco, Chile,” Atmos. Environ., vol. 42, no. 35, pp. 8331–8340, Nov. 2008. T. Slini, A. Kaprara, K. Karatzas, and N. Moussiopoulos, “ PM10 forecasting for Thessaloniki, Greece,” Environ. Modell. Softw., vol. 21, no. 4, pp. 559–565, Apr. 2006. E. Stadlober, S. Hormann, and B. Pfeiler, “Quality and performance of a PM10 daily forecasting model,” Atmos. Environ., vol. 42, no. 6, pp. 1098– 1109, Feb. 2008. F. C. Tsai, K. R. Smith, N. Vichit-Vadakan, B. D. Ostro, L. G. Chestnut, and N. Kungskulniti, “Indoor/outdoor PM10 and PM2.5 in Bankok, Thailand,” J. Exposure Anal. Environ. Epidemiol., vol. 10, no. 1, pp. 15–26, Jan./Feb. 2000. L. Guoxing, S. Jing, J. S. Rohan, P. X. Chuan, Z. Maigeng, W. Xuying, C. Yue, S. Ross, and S. G. Reginald, “Temperature modifies the effects of particulate matter on non-accidental mortality: A comparative study of Beijing, China and Brisbane, Australia,” Public Health Res., vol. 2, no. 2, pp. 21–27, 2012. D. W. Dockery, C. A. Pope, X. Xu, J. D. Spengler, J. H. Ware, M. E. Fay, J. Benjamin, G. Ferris, and F. E. Speizer, “An association between air pollution and mortality in six cities,” New Engl. J. Med., vol. 329, no. 24, pp. 1753–1759, Dec. 1993. S. Rajasegarar, C. Leckie, and M. Palaniswami, “Detecting data anomalies in sensor networks,” in Security in Ad-hoc and Sensor Networks, R. Beyah, J. McNair, and C. Corbett, Eds. Singapore: World Scientific, Jul. 2009, pp. 231–260. J. C. Bezdek, S. Rajasegarar, M. Moshtaghi, C. Leckie, M. Palaniswami, and T. Havens, “Anomaly detection in environmental monitoring networks,” IEEE Comput. Intell. Mag., vol. 6, no. 2, pp. 52–58, May 2011. S. Rajasegarar, C. Leckie, and M. Palaniswami, “Anomaly detection in wireless sensor networks,” IEEE Wireless Commun., vol. 15, no. 4, pp. 34–40, Aug. 2008. S. Rajasegarar, C. Leckie, M. Palaniswami, and J. C. Bezdek, “Distributed anomaly detection in wireless sensor networks,” in Proc. ICCS, 2006, pp. 1–5. S. Rajasegarar, J. C. Bezdek, C. Leckie, and M. Palaniswami, “Elliptical anomalies in wireless sensor networks,” ACM Trans. Sens. Netw., vol. 6, no. 1, p. 28, Dec. 2009. M. Takruri, S. Rajasegarar, S. Challa, C. Leckie, and M. Palaniswami, “Spatio-temporal modelling based drift aware wireless sensor networks,” IET Wireless Sens. Syst., vol. 1, no. 2, pp. 110–122, Jun. 2011. S. Rajasegarar, C. Leckie, J. C. Bezdek, and M. Palaniswami, “Centered hyperspherical and hyperellipsoidal one-class support vector machines for anomaly detection in sensor networks,” IEEE Trans. Inf. Forensics Security, vol. 5, no. 3, pp. 518–533, Sep. 2010. A. Shilton, S. Rajasegarar, and M. Palaniswami, “Combined multiclass classification and anomaly detection for large-scale wireless sensor networks,” in Proc. IEEE 8th Int. Conf. Intell. Sens., Sens. Netw. Inf. Process., Melbourne, VIC, Austrlia, Apr. 2013, pp. 491–496. M. Moshtaghi, S. Rajasegarar, C. Leckie, and S. Karunasekera, “An efficient hyperellipsoidal clustering algorithm for resource-constrained environments,” Pattern Recognit., vol. 44, no. 9, pp. 2197–2209, Sep. 2011. M. Moshtaghi, T. Havens, L. Park, J. C. Bezdek, S. Rajasegarar, C. Leckie, M. Palaniswami, and J. Keller, “Clustering ellipses for anomaly detection,” Pattern Recognit., vol. 44, no. 1, pp. 55–69, Jan. 2011. S. Rajasegarar, A. Shilton, C. Leckie, R. Kotagiri, and M. Palaniswami, “Distributed training of multiclass conic-segmentation support vector machines on communication constrained networks,” in Proc. 6th Int. Conf. Intell. Sens., Sens. Netw. Inf. Process., Brisbane, QLD, Austrlia, Dec. 2010, pp. 211–216. M. Moshtaghi, J. C. Bezdek, T. C. Havens, C. Leckie, S. Karunasekera, S. Rajasegarar, and M. Palaniswami, “Streaming analysis in wireless sensor networks,” Wireless Commun. Mobile Comput., 2012. [Online]. Available: http://onlinelibrary.wiley.com/doi/10.1002/wcm.2248/pdf C. Oreilly, A. Gluhak, M. A. Imran, and S. Rajasegarar, “Online anomaly rate parameter tracking for anomaly detection in wireless sensor networks,” in Proc. IEEE Conf. Sens., Mesh Ad Hoc Commun. Netw., Seoul, Korea, Jun. 2012, pp. 191–199. J. Gubbi, R. Buyya, S. Marusic, and M. Palaniswami, “Internet of things (IoT): A vision, architectural elements, and future directions,” Future Gener. Comput. Syst., vol. 29, no. 7, pp. 1645–1660, Sep. 2013. Internet of Things, 2013. [Online]. Available: http://issnip.unimelb.edu. au/research_program/Internet_of_Things Environment Protection Authority Victoria Sensor Locations, Australia, 2012. [Online]. Available: http://www.epa.vic.gov.au/air/airmap/default. asp?mapView=melb&id=10213&tab=Particles M. L. Serre and G. Christakos, “Modern geostatistics: Computational BME analysis in the light of uncertain physical knowledge—The equus

[32] [33] [34] [35]

[36] [37]

[38]

[39] [40] [41]

[42] [43] [44] [45] [46] [47]

[48] [49] [50]

9

beds study,” Stochastic Environ. Res. Risk Assessment, vol. 13, no. 1/2, pp. 1–26, Apr. 1999. C. Mendis, A. Skvortsov, A. Gunatilaka, and S. Karunasekera, “Performance of wireless chemical sensor network with dynamic collaboration,” IEEE Sensors J., vol. 12, no. 8, pp. 2630–2637, Aug. 2012. S. Karunasekera, C. Mendis, A. Skvortsov, and A. Gunatilaka, “A decentralized dynamic sensor activation protocol for chemical sensor networks,” in Proc. 9th IEEE NCA, Jul. 15–17, 2010, pp. 218–223. H.-h. K. Burke, “Detection of regional air pollution episodes utilizing satellite digital data in the visual range,” IEEE Trans. Geosci. Remote Sens., vol. GE-20, no. 2, pp. 154–158, Apr. 1982. F. Cuccoli, L. Facheris, S. Tanelli, and D. Giuli, “Infrared tomographic system for monitoring the two-dimensional distribution of atmospheric pollution over limited areas,” IEEE Trans. Geosci. Remote Sens., vol. 38, no. 1, pp. 155–168, Jan. 2000. A. Srivastava, N. Oza, and J. Stroeve, “Virtual sensors: Using data mining techniques to efficiently estimate remote sensing spectra,” IEEE Trans. Geosci. Remote Sens., vol. 43, no. 3, pp. 590–600, Mar. 2005. H.-L. Yu, C.-H. Wang, M.-C. Liu, and Y.-M. Kuo, “Estimation of fine particulate matter in Taipei using land use regression and Bayesian maximum entropy methods,” Int. J. Environ. Res. Public Health, vol. 8, no. 6, pp. 2153–2169, Jun. 2011. V. Epitropou, L. Johansson, K. D. Karatzas1, A. Bassoukos, A. Karppinen, J. Kukkonen, and M. Haakana, “Fusion of environmental information for the delivery of orchestrated services for the atmospheric environment in the pescado project,” in Proc. Int. Congr. Environ. Modell. Softw. Manag. Resources Limited Planet, R. Seppelt, A. A. Voinovr, S. Lange, and D. Bankamp, Eds., 2012. [Online]. Available: http://www. iemss.org/society/index.php/iemss-2012-proceedings I. Clarke, Practical Geostastics. Columbus, OH, USA: Ecosse North Amer Llc, 1979. [Online]. Available: http://www.kriging.com/PG1979/ H. Wackernagle, Multivariate Geostatistics: An Introduction With Applications. Berlin, Germany: Springer-Verlag, 1998. G. Christakos, M. L. Serre, and J. Kovitz, “BME representation of particulate matter distributions in the state of California on the basis of uncertain measurements,” J. Geophys. Res., vol. 106, no. D9, pp. 9717–9731, May 2001. B. Ostro, J. M. Sanchez, C. Aranda, and G. S. Eskeland, “Air pollution and mortality,” in Policy Research Working Paper 1453. Washington, DC, USA: The World Bank, 1995, pp. 1–40. G. K. Batchelor, “Diffusion in a field of homogeneous turbulenceii. The relative motion of particles,” Math. Proc. Cambridge Philosop. Soc., vol. 48, no. 2, pp. 345–362, Apr. 1952. L. F. Richardson, “Atmospheric diffusion shown on a distance-neighbour graph,” Proc. R. Soc. Lond. A, vol. 110, no. 756, pp. 709–737, Apr. 1926. M. Jamriska, T. DuBois, and A. Skvortsov, “Statistical characterisation of bio-aerosol background in an urban environment,” Atmos. Environ., vol. 54, pp. 439–448, Jul. 2012. E. Yee and A. Skvortsov, “Scalar fluctuations from a point source in a turbulent boundary layer,” Phys. Rev. E, vol. 84, no. 3, pp. 036306-1– 036306-7, Sep. 2011. A. Gunatilaka and R. Gailis, “High fidelity simulation of hazardous plume concentration time series based on models of turbulent dispersion,” in Proc. 15th Int. Conf. Inf. FUSION, Singapore, Jul. 9–12, 2012, pp. 1838–1845. G. Boffetta, A. Celani, A. Crisanti, and A. Vulpiani, “Pair dispersion in synthetic fully developed turbulence,” Phys. Rev. E, vol. 60, no. 6, pp. 6734–6741, Dec. 1999. G. Christakos, Modern Spatiotemporal Geostatistics. New York, NY, USA: Dover, 2012. BME Library, 2012. [Online]. Available: http://www.unc.edu/depts/case/ BMELIB/

Sutharshan Rajasegarar received the B.Sc. Engineering degree (with first class honors) in electronic and telecommunication engineering from the University of Moratuwa, Moratuwa, Sri Lanka, in 2002 and the Ph.D. degree from The University of Melbourne, Parkville, Australia, in 2009. He is currently a Research Fellow with the Department of Electrical and Electronic Engineering, The University of Melbourne. His research interests include wireless sensor networks, anomaly/outlier detection, spatiotemporal estimations, Internet of things, machine learning, pattern recognition, signal processing, and wireless communication.

This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. 10

Timothy C. Havens (S’06–M’106–SM’10) received the Ph.D. degree in electrical and computer engineering from the University of Missouri, Columbia, MO, USA, in 2000. He is an Assistant Professor in the Departments of Electrical and Computer Engineering and Computer Science and the William and Gloria Jackson Assistant Professor of Computer Systems at Michigan Technological University, Houghton, MI, USA. Prior to his current position, he was an National Science Foundation/Computing Research Association Computing Innovation Fellow at Michigan State University, Lansing, MI, USA. Before working on his Ph.D., he was an Associate Technical Staff at the Massachusetts Institute of Technology, Lincoln Laboratory. His research interests include mobile robotics, explosive hazard detection, heterogeneous and big data, social networks, fuzzy sets, sensor networks, and data fusion. He has coauthored over 50 technical publications. Dr. Havens was the recipient of the Best Overall Paper Award at IEEE International Conference on Fuzzy Systems 2013, the IEEE Franklin V. Taylor Award, and a Best Paper Award from the Midwest Nursing Research Society. He is an Associate Editor of the IEEE T RANSACTIONS ON F UZZY S YSTEMS.

Shanika Karunasekera received the B.Sc degree in electronics and telecommunications engineering from the University of Moratuwa, Moratuwa, Sri Lanka, in 1990 and the Ph.D. degree in electrical engineering from the University of Cambridge, Cambridge, U.K., in 1995. From 1995 to 2002, she was a Software Engineer and a Distinguished Member of the Technical Staff at Lucent Technologies, Bell Labs Innovations, Murray Hill, NJ, USA. In January 2003, she joined the Department of Computing and Information Systems, The University of Melbourne, Parkville, Australia, where she is currently an Associate Professor. Her current research interests include decentralized algorithms for event and anomaly detection, and decentralized data aggregation.

Christopher Leckie received the B.Sc. degree, the B.E. degree (with first class honors) in electrical and computer systems engineering, and the Ph.D. degree in computer science from Monash University, Melbourne, Australia, in 1985, 1987, and 1992, respectively. He joined Telstra Research Laboratories in 1988, where he conducted research and development into artificial intelligence techniques for various telecommunication applications. In 2000, he joined The University of Melbourne, Parkville, Australia, where he is currently a Professor in the Department of Computing and Information Systems. His research interests include scalable data mining, network intrusion detection, bioinformatics, and wireless sensor networks.

James C. Bezdek (M’80–SM’90–F’92–LF’10) received the Ph.D. degree in applied mathematics from Cornell University, Ithaca, NY, USA, in 1973. His interests include optimization, pattern recognition, clustering in very large data, coclustering, and visual clustering. He retired in 2007. Dr. Bezdek is the past President of the North American Fuzzy Information Processing Society, the International Fuzzy Systems Association (IFSA), and the IEEE Computational Intelligence Society (CIS), the founding Editor of the International Journal Approximate Reasoning and the IEEE T RANSACTIONS ON F UZZY S YS TEMS , a Life Fellow of the and IFSA, and a recipient of the IEEE 3rd Millennium, IEEE CIS Fuzzy Systems Pioneer, IEEE technical field award Rosenblatt, and the Kampe de Feriet medals.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING

Milan Jamriska received the physical engineering degrees from Faculty of Nuclear Sciences and Physical Engineering of the Czech Technical University in Prague and the Ph.D. degree in aerosol physics from Queensland University of Technology, Qld., Australia. He is a Defence Science and Technology Organisation (DSTO) Senior Research Scientist with an interest in aerosol sciences and applied physics. He has been actively engaged in academia and scientific community in Australia and internationally published over 60 research publications. Currently, he works in the field of detection and protection against airborne chemical and biological hazards.

Ajith Gunatilaka received the B.Sc. (Engineering) degree (with first class honors) in electronics and telecommunication engineering from the University of Moratuwa, Moratuwa, Sri Lanka, in 1990 and the Ph.D. degree in electrical engineering from the Ohio State University, Columbus, OH, USA, in 2000. In November 2005, he joined the Human Protection and Performance Division of the Defence Science and Technology Organisation, where he is engaged in chemical, biological, and radiological (CBR) hazard modeling and CBR data fusion research.

Alex Skvortsov received the B.Sc. degree (with first class honors) in applied mathematic and the Ph.D. degree in theoretical physics in 1987 from the Moscow University of Applied Physics and Technology, Moscow, Russia. He has significant R&D experience in defense projects (sonar systems, turbulent flows, and artificial intelligence) on which he worked in academia and industry. He joined the Defence Science and Technology Organisation in 2005, where he conducted research and development in tracer dispersion, data fusion, source backtracking, and system biology. He is currently the Science Team Leader for Hazard Modelling. He has published over 50 papers.

Marimuthu Palaniswami (S’84–M’87–SM’94– F’12) received the M.E. degree from the Indian Institute of Science, Bangalore, India, the M.Eng.Sc. degree from The University of Melbourne, Parkville, Australia, and the Ph.D. degree from the University of Newcastle, Callaghan, Australia. He is currently a Professor with the Department of Electrical and Electronic Engineering, The University of Melbourne. He has published over 400 refereed research papers and leads one of the largest funded Australian Research Council, Research Network on Intelligent Sensors, Sensor Networks, and Information Processing programme. He has been a grants panel member for NSF, an advisory board member for the European FP6 grant center, a steering committee member for National Collaborative Research Infrastructure Strategy, Great Barrier Reef Ocean Observing System, Smart Environmental Monitoring and Analysis Technologies, and a board member for Information Technology and supervisory control and data acquisition companies. He has been funded by several ARC and industry grants (over 40 m) to conduct research in sensor network, Internet of things (IoT), health, environmental, machine learning, and control areas. He is representing Australia as a core partner in European Union FP7 projects such as SENSEI, SmartSantander, Internet of Things Initiative, and SocIoTal. His research interests include support vector machines sensors and sensor networks, IoT, machine learning, neural network, pattern recognition, signal processing, and control.