A network-based approach on detecting dredgers ... - SAGE Journals

7 downloads 0 Views 2MB Size Report
Marine Information Technology (Xiamen University), Ministry of. Education, Xiamen .... concentration based on the backscatter intensity. All the related studies ...... robots and systems, Edmonton, AL, Canada, 2–6 August. 2005, pp.277–282.
Research Article

A network-based approach on detecting dredgers’ illegal behavior of dumping dredged sediments Rongxin Zhang, Fei Yuan

International Journal of Distributed Sensor Networks 2018, Vol. 14(12) Ó The Author(s) 2018 DOI: 10.1177/1550147718818400 journals.sagepub.com/home/dsn

and En Cheng

Abstract Focusing on detecting the illegal behavior of dumping dredged sediments by dredgers at ports, a sensor network composed of two acoustic sources and several passive sensor arrays is proposed in this article. The approach can be divided into two parts: the local sensor decision model and the fusion center algorithm. Local sensors receive signals emitted from the acoustic sources and extract the corresponding Hilbert–Huang marginal spectrum, which is significantly different between the scenario of illegal dumping dredged sediments and the contrary. Local decision is made based on spectrum feature extraction and classification. In regard to the fusion center, a fusion strategy with a scanning window is adopted to make a system-level decision. With a proper window width and the corresponding Bayes optimum threshold, the proposed approach performs well in simulations, in terms of a low system-level false alarm probability and a low system-level miss alarm probability. Keywords Detection of dumping dredged sediments, Hilbert–Huang marginal spectrum, sensor network, data fusion

Date received: 25 February 2018; accepted: 18 November 2018 Handling Editor: Florentino Fdez-Riverola

Introduction Marine transportation, in its long history, has a close relationship with the maritime countries’ development. As a result, the port, which acts as a hinge linking the ocean and the land, is more and more important. Port is a complicated system composed of many facilities, one of which is called disposal site, that is, wastedumping zone for dredged sediments. Dredged sediments, which are sediments excavated from the bottom of waterway including ports and estuaries, are expected to be dumped to the disposal site by dredgers. The underwater excavation and dumping process is called dredging. Dredging is important to maintain the depth of waterway. If dredging is performed in a perfunctory manner or not done in time, it will bring hidden safety troubles for the navigation, shipping, and ship berthing. However, dredgers often dump dredged sediments on the road during their transportation instead of to

the prescribed disposal site. Such behavior is harmful and illegal. Therefore, we need a stable and reliable method to detect the misbehavior, viz illegal dumping. Insofar, as it can be ascertained, the related schemes can be classified into two types: ship-equipping and ship-equipping-free. Ship-equipping refers to the approaches depending on installing associated apparatus on board to detect the dumping behavior, for instance, the CCTV video monitoring device1 and the

Key Laboratory of Underwater Acoustic Communication and Marine Information Technology (Xiamen University), Ministry of Education, Xiamen, China Corresponding author: Fei Yuan, Key Laboratory of Underwater Acoustic Communication and Marine Information Technology (Xiamen University), Ministry of Education, Xiamen 361005, China. Email: [email protected]

Creative Commons CC BY: This article is distributed under the terms of the Creative Commons Attribution 4.0 License (http://www.creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/ open-access-at-sage).

2 monitoring recorder for marine wastes dumping.2 By contraries, ship-equipping-free approaches could work without equipping any devices on dredgers. Thought ship-equipping schemes can achieve better judgments, they would give rise to many problems, for example, ambiguity of liability. Thereby ship-equipping-free detecting has drawn the attention of multiple researchers. The most popular approach is to measure the waterline of dredgers with the starting point: sediments dumping lightens a dredger. Luo et al.3 advanced a method for automatic detection of ships’ waterline based on the image-processing method, using Canny operators to detect edges and Hough transform to detect the locations of water trace and the waterline after geometric correction. Liu et al.4 forwarded a texture spectrum method, using K-means clustering algorithm to segment ship waterline images. Gai et al.5 used a laser rangefinder to get the waterline, but the equipment is expensive, and the method has high requirements for the water turbidity and the measuring cycle period. Hua6 proposes five dynamic measuring methods to detect the waterline. These methods are little affected by wind and waves. Nevertheless, they have respective limitations: very limited detection range, prohibitive cost of necessary equipment, strict requirements for installation, and so on. Another way is to detect the presence of flowing sediments below the sea surface. The underlying principle is that flowing sediments will cause the suspended sediment concentration higher than usual. Therefore, we can monitor the suspended sediment concentration, or utilize the impacts of the concentration change to detect the presence of flowing sediments. If a dredger is detected having flowing sediments on its way to the disposal site, it is declared to dump dredged sediments illegally. This involves a subject known as underwater target detection. Unfortunately, the problem abovementioned has rarely been discussed. The detecting targets of most relative literatures are fishes or sea bed sediments,7–10 not the diffusional flowing sediments. Therefore, in order to meet the need of the background, we develop a new approach in the article based on marginal spectrum of Hilbert–Huang Transform (HHT) for local sensors to detect the event. Furthermore, a network, accompanied by a global fusion approach, is developed to monitor the whole ship channel. Detection fusion has drawn significant attention from academia. Niu and Varshney11 proposed a counting-rule-based approach which used the total number of detections coming from local sensors as the fusion statistic. A local vote decision fusion approach was introduced by Katenka et al.12 Applying the principle, whereby the minority is subordinate to the majority, the decision of a local sensor was voted by its entire neighborhood within a fixed distance. Song et al.13 proposed a scan statistic–based fusion approach, using a

International Journal of Distributed Sensor Networks scanning window to find the most significant detection cluster in a special part of the data set. In this article, we will show that the fusion strategy with scan statistic is suitable to our network-based application, which is detailed in the following section. The remainder of the article is organized as follows. Section ‘‘Related work’’ discusses the related work. Section ‘‘Detection approach for local sensors’’ presents our approach of detection for local sensors. Section ‘‘Global detection fusion’’ addresses the data fusion problem of our detection network. Section ‘‘Experimental results’’ presents our simulation results. Section ‘‘Conclusion and future work’’ draws the conclusion and discusses the future work.

Related work There are several researches on sediment-related issues. The issues can be divided into several types: sediment dredge-and-dump strategies,14 suspended sediment concentration monitoring,15 sediment pollution problem,16 and the ecological effect.17 And at present, Haimann et al.18 propose an ecologically oriented integrated monitoring system which consists of several stages, for example, preliminary assessment stage and postdumpling monitoring stage. In order to detect suspended sediment concentration, for both Cutroneo et al.15 and Blanco et al.,16 the dredger needs to be equipped with an acoustic Doppler current profiler (ADCP). ADCP is a kind of equipment which makes use of the Doppler effect of acoustic wave scattered back from particles within the water column to measure water current velocities over a depth range. That is, ADCP emits signal into the water and collects the signal reflected by the particles in the water. Therefore, one can estimate the suspended sediment concentration based on the backscatter intensity. All the related studies take the detection of the suspended sediment concentration as an indicator to guide the dredgers finishing the dredging operations effectively. The dredgers are assumed to do their duty and follow the instructions well. However, the dredgers in China are employed from third-party companies. And because of the chaos in the management, many dredgers try to find various excuses to remove the monitor facilities equipped on them, and dump dredged sediments on the road during their transportation instead of to the prescribed disposal site to reduce fuel consumption. Therefore, we need a ship-equipping-free approach to detect suspended sediment concentration and take the result as an indicator of whether the illegal dumping happened. Unfortunately, to the best of our knowledge, there is no reference focused on similar application background. Therefore, we strive to develop an effective approach to detect the illegal

Zhang et al.

3

dumping behavior and take advantage of network structure to extend the detection coverage area.

a‘ (v) = a‘, w + a‘, v + a‘, s

Detection approach for local sensors Typically, like Niu and Varshney11 and Katenka et al.,12 each sensor obtains an energy reading from the environment. However, the pattern is not applicable in our application, as the suspended sediments would not emit any energy signal. Therefore, we introduce position-fixed acoustic sources (equipped with underwater transducers), and develop a single-input singleoutput (SISO) detecting model for local sensors (equipped with hydrophones). The detecting signal transmitted by the acoustic sources periodically is composed of two parts: hyperbolic frequency modulated (HFM) signal as synchronization code (denoted by ssy (t)) and several evenly spaced chirp signals as detecting codes (denote a certain one by s(t) for convenience). Once the signal synchronization is completed at the receiving end (i.e. the local sensor), discriminating features are extracted from the signal segment corresponding to s(t). Then a specific classifier is adopted to make a judgment whether the dredger is dumping sediments or not.

Denote r(t) as the signal segment received corresponding to s(t) (S(v) in the frequency domain), and its frequency domain form is given by L1 X

ð4Þ

where a‘, w is the acoustical absorption coefficient of pure sea-water (dB/m), a‘, v is the viscous attenuation coefficient (dB/m), and a‘, s is the scattering loss coefficient (dB/m), which can be neglected, since it is far less than the other two. The exist of flowing dredged sediments will affect the value of a‘ (v) by bringing a big change to the value of a‘, v , which will further affect the suspended sediment concentration. Flowing dredged sediments under the sea surface are assumed to exist in forms of particle clouds. Figure 1 is a schematic illustration when a single acoustic ray travels through a particle cloud. Let us take the example of Xiamen Port in China. The concentration of the suspended sediments in the sea at Xiamen Port has consistently kept at a low level less than 0.1%, whereas it will locally exceed 1% in a short time when there is a dredger dumping sediments, which leads a‘, v in equation (4) to be about 10 times larger than normal. The semiempirical formula of a‘, w is expressed as21 

 ASfT f 2 Bf 2 a‘, w = (20 log e) 2 + (1  6:67 3 105 d) fT fT + f 2 ð5Þ

Model establishment

R(v) =

The acoustical absorption coefficient a‘ (v) under turbid sea-water conditions is given by

A‘ (v)S(v)ejvt‘ + W (v),

ð1Þ

‘=0

that is, we consider a multipath channel, where L is the number of paths decided by the Zielinski N-path model,19A‘ (v) is the attenuation coefficient of the ‘-th path, t‘ is the delay of the ‘-th path, and W (v) is the noise. The noise is not white but generated according to the typical mean spectrum level diagram of marine noise.20 In the shallow sea, A‘ (v) is mainly caused by the propagation loss, which can be expressed as21 A‘ (v) =

R‘ n

 10a‘ (v)x‘ =20

where e is the base of the natural logarithm, A and B are constant with the value 2.03 3 1026 and 3.38 3 1026, respectively, f is the frequency of the acoustic signal (kHz), d is the depth of the transducer, T is the temperature of the sea surface, we take it as 21.5°C at Xiamen Port, fT is the relaxation frequency with the value fT = 21:9 3 1061520=(T + 273) ’ 151 kHz, and S is the salinity (‰). When the seawater density rw =1025 kg=m3 , S =1:305(rw  1000)+(T  17:5)3 0:3 =33:825‰. The semiempirical formula of a‘, v is expressed as24

ð2Þ

x2‘

where x‘ is the propagation distance of path ‘, n is propagation index (we choose 1 here), and R‘ is the corresponding reflection loss coefficient decided by R‘ = ~rsn~rbm

ð3Þ

where ~rs 22 and ~rb 23 represent the reflection coefficients of sea surface and sea bed, respectively, and n and m are the corresponding occurrences of reflections.

Figure 1. Schematic illustration of the acoustic signal passing through the particle cloud. Dx‘ is the distance of the signal traveling through the particle cloud.

4

International Journal of Distributed Sensor Networks

Table 1. Parameter calculating formulas for a‘, v . Parameter

Calculating formula

Parameter

Calculating formula

P d r b m0

rs  (1  C)=r + l rs =rv rs  C + rv  (1  C) pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi v=(2m)

Q l ς m Clv

d(1  C)2 + C(1  C) + l 1=2 + 9=(4bas ) 1 )=(4bas ) 9(1 + (baps )ffiffiffiffiffiffi m0 (1 + q ClV + p  ClV ) Cl  rv  1000

p

(0:33T  25:91)=10000

q kc

qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi f21:482½(T  8:435) + 8078:4 + (T  8:435)2   1200g1 (0:5185T + 1:0675)=10000 C  Ks + (1  C)  Kv

rffiffiffiffi kc C(1  C) (1  s) §  v  rw  r a‘, v = pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 2(§2 + Q2 )  §2 + PQ + (§2 + P2 )(§2 + Q2 ) 2

2

ð6Þ

where C is the volume fraction of the solid particulate; v is the angular frequency of the signal (rad/s); kc is the volume compressibility of the solid–liquid mixture; rw is the density of pure sea-water (kg/m3); r is the density of the mixed liquid (kg/m3); and P, Q, l, and § are dimensionless parameters. Detailed parameter’s determination is specified in Table 1,24 where rs is the density of the solid particulate (kg/m3), as is the mean radius of the solid particulate (m), m is the kinematic viscosity of seawater, m0 is the kinematic viscosity of pure water, T is the sea surface temperature (°C), Clv is the chlorosity of seawater (kg/m3), Cl is the chlorinity of seawater (‰), Ks is the bulk modulus of the solid particulate (set to 7 3 109 Pa), and Kw is the bulk modulus of seawater (set to 2.43 3 109 Pa). As shown in Figure 1, we denote a‘, v and (a‘, v + Da‘, v ) as the viscous attenuation coefficients in the no-dredged-sediments area and the dredgedsediments area respectively, where Da‘, v is caused by the influence of the flowing sediments. Therefore, equation (2) can be written as ½(a‘, w + a‘, v )x‘ + Da‘, v Dx‘ : R‘ 20 A‘ (v) = pffiffiffiffi  10 x‘

ð7Þ

Local sensor node decision By performing HHT on r(t), we obtain the corresponding Hilbert marginal spectrum. HHT is a time– frequency analysis method advanced in 1998 by Norden E. Huang et al. Compared with the classical signal analysis methods (e.g. the Fourier transform), HHT breaks out the restrains of linearity and stationarity and overcomes the restriction of Heisenberg uncertainty principle.

Figure 2. Hilbert marginal spectrums of the signals received under different situations.

Denote RH (v) as the Hilbert marginal spectrum of r(t). RH (v) curves under different situations are shown in Figure 2, wherein sharp contrasts can be seen in terms of the concentration degree and the amplitude. In order to distinguish different situations, three features are chosen: the peak energy (denoted as Ep), the ratio between 3 dB bandwidth and 50 kHz (denoted as RB3 dB ), and the ratio between the energy within the frequency range from 20 to 25 kHz and the total energy (denoted as RBse ), all of which are described by 8 Ep > > > > < RB3 dB > > > > : RBse

= = =

maxfRH (v)g B3 dB 50 kHz E0 Etotal

ð8Þ

where E0 is the energy within the frequency range from 20 to 25 kHz, and Etotal is the total energy. Once the feature selection is done, a specific classifier is used to discern the scenario of illegal dumping sediments and the contrary. Thereafter, the local sensor declares its decision accordingly, that is, declares the

Zhang et al.

5

Figure 3. The sensor network is composed of two acoustic sources and M sensor arrays, each of which contains K passive sensors, in order to detect dredgers’ misbehavior of illegal dumping sediments.

value 1 when an illegal dumping behavior happened, and declares the value 0 for the opposite case.

Global detection fusion Optimal detection fusion rule with scan statistic As shown in Figure 3, the sensor network is composed of two acoustic sources and M sensor arrays. Each sensor array contains K passive sensors arranged in the vertical direction. The first bM=2c sensor arrays and source 2 are deployed on the same bank, and the rest sensor arrays and source 1 are deployed on the other bank. bc denotes the floor function that gives the greatest integer less than or equal to the corresponding variable. The sensor arrays are spaced evenly along the banks, and the sources are deployed at the 1/4 and 3/4 locations of their corresponding banks, respectively. Assume that each sensor array can only receive signals from its corresponding source, that is, the first bM=2c arrays are related to source 1, and the rest are related to source 2. Each sensor array has a cluster head (CH), which is responsible for collecting data from the sensors within the array through wire transmissions, and then relay the merged data to the fusion center using electromagnetic waves. That is, each sensor node is composed of a hydrophone for passive signal receiving, digital signal processing (DSP) units for feature selection and classification, and wire communication units to send data to its CH. And all of the CHs are equipped with wireless communication units, wire signal receiving units, and the necessary DSP units. Denote the binary data from the kth sensor of sensor array m (i.e. sensor (m,k)) as x(m, k) = f0, 1g and xmk in brief. xmk takes the value 1 when there is a positive detection; otherwise, it takes the value 0. The mth

CH counts the number of positive decisions made by the sensors f(m, k)gKk= 1 , and sends it to the fusion center periodically. When the fusion center receives local decisions transmitted from CHs (transmission time is negligible), it will make a final decision about whether there is a dredger dumping sediments illegitimately or not, and it will also know the approximate place where the event happens (we call a dumping-sediments event an event in brief in the article). The null hypothesis of no event happens is H0 : xmk = 0 for all mk combinations, while the alternative is simply H1 : xmk = 1 for some mk combinations. If all sensor decisions are counted to get a systemlevel decision, it is obvious that we can get an extreme low system-level false alarm probability (denoted by Pf ) and, unfortunately, a very low system-level detection probability (denoted by Pd ). This is because only a part of the sensors would be affected by the event. Thereby, we introduce a scanning window by which we can sacrifice the performance of Pf for higher Pd , that is, make a tradeoff between Pf and Pd . As shown in Figure 4, a counting rule within a scanning window is adopted in this article, using the total number of ‘‘1’’s wherein as a statistic. Denote the discrete width and height of the window by S and K, where 1 ł S ł M. The window will scan M  S + 1 times. The statistics within the window at the nth time is calculated as Dn =

n+ S1 X

K X

xmk

ð9Þ

m=n k =1

=

n+ S1 X m=n

Xm

ð10Þ

6

International Journal of Distributed Sensor Networks Note that jK1 j = IM (S), and jK0 j = S  K  IM (S), where j  j denotes the cardinality of the set. When pd .pf (generally, it holds in reality), by substituting and simplifying, we obtain the optimal decision rule described by H1

ð15Þ

I(S) 7 Th H0

where 1p

Th =

Figure 4. Data table at the fusion center. x(m, k) or xmk denotes the decision made by the kth sensor of sensor array m, that is, sensor (m, k). The solid circles refer that the corresponding decisions are ‘‘1’’s.

where D

Xm ¼

K X

xmk :

ð11Þ

P(H0 ) log P(H + S  K  log 1pdf 1) p (1p )

log pdf (1pdf )

:

ð16Þ

We assume that P(H0 )=P(H1 ) = 1, thus the first item of the numerator of equation (16) is zero. If the fusion center declares H1 D

m ¼  argm maxfXm jn ł m ł n + S  1g

ð17Þ

and it indicates that the event happens nearby the geometric plane passing the line which goes through the m*th sensor array and the corresponding acoustic source.

k=1

Performance analysis

The maximal scan statistic I(S) is defined as D

I(S) = Dn ¼ maxfDn j1 ł n ł M  S + 1g ð12Þ Pn + S1 where Dn = m Xm . = n We call the particular window with the maximal statistic I(S) the maximal window. It is obvious that each sensor may have different false alarm probability and detection probability. In this case, PrfDn = ij0 ł i ł S  Kg is intricate. For ease of calculation, we assume that 8m, k, pfmk = pf , and pdmk = pd . When S is given, rewrite the observations fxmk g within the maximal window to a vector x. The optimal decision rule is given by a likelihood ratio test as P(xjH1 ) P(H0 )(C10  C00 ) ? P(xjH0 ) H0 P(H1 )(C01  C11 ) H1

ð13Þ

where Cij represents the cost of choosing Hi when Hj is true. The quantity on the left side of the equation is the likelihood ratio, and the right side is the Bayes optimum threshold. Under the minimal error probability criterion (C00 = C11 = 0 and C10 = C01 = 1), the corresponding log-likelihood ratio test can be expressed as25 log

X X P(H1 ) pd 1  pd H1 + log + log ? 0 P(H0 ) pf 1  pf H 0 K K 1

D

where Ki ¼ f(m, k) : xmk = ig.

0

ð14Þ

According to the local detection model described in the previous section, we can get the knowledge of each sensor’s false alarm probability pf and the detection probability pd . Under H0 , local decisions made by sensors are independent and identically distributed (i.i.d.). Therefore, we have  PrfXm = ijH0 g =

 K i pf (1  pf )Ki i

ð18Þ

where 0 ł i ł K. The system-level false alarm probability is calculated as Pf = PrfI(S) ø ThjH0 g:

ð19Þ

Likewise, the system-level detection probability is calculated as Pd = PrfI(S) ø ThjH1 g:

ð20Þ

Because the local decisions made by the sensors under H1 are not i.i.d. but target dependent, we analyze Pd via simulations. The equations for calculating Pf are similar to those by Song et al.,13 thus we could use the recursive methods presented by Glaz26 and Karwe and Naus27 to estimate Pf as Song et al.13 did. However, the recursive methods are not efficient and usually take much time for calculations, especially when the parameters become

Zhang et al.

7

larger. Therefore, we make a trivial change here. Once Dn is calculated, a decision denoted by Dn is made based on equation (21)  Dn =

if Dn ø Th otherwise:

1 0

ð21Þ

=P

MS X+ 1

Then the covariance is given by 

Accordingly, equation (19) is rewritten as Pf = PrfI(S) ø ThjH0 g

P(C ø Th  k) = P(B ø Th  k) SKn X ij  S  K  nij  q : = pf (1  pf )SKnij q q q=Thk

Cov(Di , Dj ) =

!

ð22Þ

Dn ø 1jH0 :

E(Di Dj )  mi mj 0

if nij .0 otherwise:

ð27Þ

By plugging equation (27) into equation (23), the estimate of Pf is obtained.

n=1

Experimental results

Based on Katenka et al.,12 we arrive at 0

MS P+ 1

1

B C 1 mn B C n=1 C s ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi ffi Pf ’ 1  B B MS + 1 C P P @ A 2 dn + Cov(Di , Dj ) n=1

ð23Þ

i6¼j

Local detection

where Cov(Di , Dj ) is the covariance between Di and Dj , and (

mn = E(Dn jH0 ) = P(Dn = 1jH0 ) d2n = P(Dn = 1jH0 )(1  P(Dn = 1jH0 ))

where

ð24Þ

 SK j pf (1  pf )SKj . j = Th j Denote the collection of the sensors related to Dn by Ln, that is, Ln = f(m, k)jn ł m ł n + S  1, 1 ł k ł Kg. Cov(Di , Dj ) is the dependence between Di and Dj (j 6¼ i), which results from the intersection between Li and Lj. Let nij denote the number of the sensors within the intersection, then P(Dn = 1jH0 ) =

 nij =



PSK

(S  ji  jj)  K 0

if 1 ł ji  jj ł S  1 : otherwise

All the simulations are done by MATLAB. The parameters associated with the real environment (e.g. the mean radius of the solid particulate as) are decided according to the practical situations of Xiamen Port.

ð25Þ

Denote the number of positive decisions made in the intersection by A, the number of positive decisions made in Li but not in Lj by B and the number of positive decisions made in Lj but not in Li by C. Note that A, B, and C are independent. Therefore

Simulation parameters and performance metrics. The usable frequency range of the transducer is from 20 to 25 kHz, and the code duration is 10 ms. Other simulation parameters are listed in Table 2. The model to simulate the falling process of the dredged sediments is given in another paper,28 which is needed to decide the value of Dx‘ described in Figure 1. As it is a binary detection problem (binary hypothesis testing), the metrics should be the detection probability (i.e. pd , the higher the better) and the false alarm probability (i.e. pf , the lower the better). Numerical results. To start with, we discuss the effectiveness and reliability of the feature selection and classification processes. Figures 5–7 have shown that the three features described by equation (8) can be used to discern the scenarios of illegal dumping sediments (yellow box-plots shown in these figures) and the contrary (green box-plots shown in these figures). The values of each box-plot are obtained from the results over 103 simulation runs. As shown in Figure 5, the main value range of Ep (from the lower quartile Q1 to the upper quartile Q3) under the scenarios of illegal dumping

E(Di Dj ) = P(Di = Dj = 1) =

nij X

½P(A = k)P(B ø Th  k)P(C ø Th  k)

k=0

where  P(A = k) =

and

 nij k pf (1  pf )nij k k

ð26Þ

Table 2. Simulation parameters for local detection scheme. Parametric description

Symbol

Value

Sea surface temperature Density of pure seawater Mean radius of solid particulate Density of solid particulate Depth of the transducer

T rv as rs d

21.5°C 1.25 kg/m3 3.9 3 1026 m 2650 kg/m3 10 m

The depth of the navigation channel is set to 16 m.

8

International Journal of Distributed Sensor Networks

Figure 5. The peak energy values between the scenarios of illegal dumping sediments and the contrary (marked as the label ‘‘NoDump’’). Each box-plot is obtained from the results over 103 simulation runs: (a) the impact of different dumping volumes. The dumping location is110 m far away from the signal emitting end. (b) The impact of different dumping locations. The volume of the dredged sediments dumped is 50 m3. The dumping location is the distance from the signal-emitting end.

Figure 6. The ratios between 3 dB bandwidth and 50 kHz under the scenarios of illegal dumping sediments and the contrary (marked as the label ‘‘NoDump’’). Each box-plot is obtained from the results over 103 simulation runs: (a) the impact of different dumping volumes. The dumping location is110 m far away from the signal emitting end. (b) The impact of different dumping locations. The volume of the dredged sediments dumped is 50 m3. The dumping location is the distance from the signal-emitting end.

sediments is quite distinct from that under the contrary situations, and the ranges are little affected by the dumping volumes and the dumping locations. Similar results can be found in Figures 6 and 7. However, any one of the features would fail to discern the two kinds of scenarios with high accuracy, as their value ranges are not entirely different. Especially when the extreme outliers are generated, miscarriage of justices would be made. Therefore, the following classifiers are tested using all of the features to see which one is preferred: LibSVM, K nearest neighbors (KNN)

classifier, Naive Bayes classifier and discriminant analysis classifier. As shown in Figures 5–7, Ep is much bigger than the other two features, thereby Ep should be processed first. Because the highest value of Ep is not bigger than 20, so we use Ep/20 instead of Ep as a classification feature. A total of 6900 simulation results under 14 different conditions, that is, no dredged sediments dumped (3000 data) and 50 m3 dredged sediments dumped at 13 different distances from the emitting end (13 3 300 data), are used for training. Another 23,100 simulation results

Zhang et al.

9

Figure 7. The ratio between the energy within the frequency range from 20 to 25 kHz and total energy under the scenarios of illegal dumping sediments and the contrary (marked as the label ‘‘NoDump’’). Each box-plot is obtained from the results over 103 simulation runs: (a) the impact of different dumping volumes. The dumping location is110 m far away from the signal emitting end. (b) The impact of different dumping locations. The volume of the dredged sediments dumped is 50 m3. The dumping location is the distance from the signal-emitting end.

Figure 8. The recognition success rates of the local sensor detection approach proposed: (a) judged by a single detecting code and (b) judged by a complete detecting signal (three detecting codes).

under 25 different conditions, that is, no dredged sediments dumped (3000 data), 50 m3 dredged sediments dumped at 13 different distances from the emitting end (13 3 700 data), and 11 different volumes of dredged sediments dumped at the distance 110 m far away from the emitting end (11 3 1000 data), are used for predicting. Classification results are shown in Figure 8(a). The recognition success rates are judged by a single detecting code. Unfortunately, the false alarm probabilities (i.e. the detection probabilities for the scenario of no dredged sediments dumped in the figure) are too high, which are above 10% on the whole. It is worth noting

that, as mentioned before, there are several (here we choose three) evenly spaced detecting codes in a detecting signal. So the judgment should be made by the three detecting codes together. When the minority is subordinate to the majority, the recognition success rates judged by a complete detecting signal are shown in Figure 8(b). As depicted in Figure 8(b), false alarm probabilities (pf ) and miss alarm probabilities (1  pd ) are both less than 5%. When we emphasize a low false alarm probability, we can choose Naive Bayes classifier; otherwise, LibSVM is the best choice, as it obtains the highest detection probability pd .

10

International Journal of Distributed Sensor Networks

Figure 9. The relationship between Pf (Pd ) and the window width S, where pf = 0:2, pd = 0:8, and K = 3. Note that the value of Pf is related to the left y axis, and the value of Pd is related to the right y axis.

Figure 10. The relationship between the system-level false alarm probability Pf and the local sensor false alarm probability pf , where pd = 0:8, S = 10, and K = 3.

Global fusion We hope the system-level detection probability Pd is as high as possible and the system-level false alarm probability Pf is as low as possible. That is, Pd and Pf are the performance metrics of the system proposed. We choose M = 51, and K = 3. Each Pd value shown in the figures is obtained by 105 Monte Carlo runs. The relationship between Pf (Pd ) and the window width S is shown in Figure 9, where pf = 0:2 and pd = 0:8. Note that we use double y axes here. The left y axis belongs to Pf , and the right one belongs to Pd . We have the following observations: 1.

2.

Pd s and Pf s are decreasing functions of the window width S. This is not surprising because the larger S is, the more closely we can approach the limiting case of counting all sensor decisions within the whole network. Pf decreases more sharply than Pd . And when S . 9, Pf = 0. Therefore, we can accordingly find a topgallant balance value of S with a proper Pf and an acceptable Pd .

As indicated in Figure 8, local sensors can achieve high pd and low pf with whichever classifier used. However, due to the complicate properties of the marine environment, it is unrealistic to reach a comparable level. Therefore, we do not use the pd and pf values depicted in Figure 8 for the network simulations. Figure 10 depicts the relationship between the systemlevel false alarm probability Pf and the local sensor false alarm probability pf . The red broken curve Pf = pf is used for comparison. As pf grows, Pf increases quickly. And when pf ø 0:44, Pf is worse than

Figure 11. The relationship between Pf (Pd ) and the threshold Th, where pf = 0:2, pd = 0:8, S = 10, and K = 3. Note that the value of Pf is related to the left y axis, and the value of Pd is related to the right y axis. The corresponding Bayes optimum threshold is 16.

pf . We also study the influence of pd over Pd . Let pd increase from 0.2 to 0.9 with a step length of 0.1, and each corresponding Pd value is obtained by 1 million Monte Carlo runs with S = 10, pf = 0:2, and K = 3. The results show that all the system-level detection probabilities are over 0.94. It is clear from Figure 11 that the performance depends on the threshold Th. Double y axes are also adopted here. We notice that the Bayes threshold is optimal.

Conclusion and future work This article seeks an adequate solution to the problem of detecting dredgers’ illegal behavior of dumping

Zhang et al.

11

dredged sediments. The solution is composed of a local sensor detection approach and a fusion strategy. In the local sensor part, an acoustic signal propagation model for sediments dumping channel is built first, and then an effective detection approach is presented. The local false alarm probabilities and miss alarm probabilities are both less than 5% for all candidate classifiers, which could be as low as about 4% and 3%, respectively for LibSVM. When the lowest false alarm probability is preferred, Naive Bayes classifier is a better choice with about 2% false alarm probability. At the fusion center, a scan statistic is used to fuse the decisions made by the local sensors, by which we can make a tradeoff between the system-level false alarm probability Pf and the system-level detection probability Pd . Results show that we can get an accurate and reliable detection network by choosing a proper window width S and the corresponding Bayes optimum threshold. When M = 51, K = 3, pf = 0:2, and pd = 0:8, the optimal scanning window width S is 10, and the corresponding Bayes optimum threshold Th for the system-level binary hypothesis test is 16. The resultant system-level false alarm probability Pf is 0, and the system-level detection probability Pd is over 0.94. In the future, to offer a more flexible detection network, the sensors will be deployed on buoys or autonomous underwater vehicles, no longer fixed on the shore. Therefore, it is essential to make the sensors be able to plan their paths, in order to minimize the redundant coverage overlap, that is, maximize the overall detection area. Furthermore, when a dredger is working, the sensors are expected to follow the dredger. Once the algorithms are done, the deployments in the real environment (for example, a harbor) may be performed.

2. 3.

4.

5.

6.

7.

8.

9.

10.

11.

12.

13.

Declaration of conflicting interests The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

14.

15.

Funding The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Natural Science Foundation of China (grant nos: 61471308, 61571377, and 61771412) and the Fundamental Research Funds for the Central Universities (grant no.: 20720180068).

16.

17.

ORCID iD Fei Yuan

https://orcid.org/0000-0002-8614-8756

References 1. Harte N, Rankin A, Baugh G, et al. Detection of illegal dumping from CCTV at recycling centres. In:

18.

Proceedings of the international machine vision and image processing conference, Kildare, 5–7 September 2007, p.204. New York: IEEE. Li L, Feng Y and Xu J. Model hyq2-1 dumping data recorder. Ocean Technol 2001; 20(4): 54–56. Luo J, Shi CJ and Ran X. A new method for automatic detection of ship waterline. Ship Ocean Eng 2012; 92(2): 151–159. Liu RJ, Gao YB and Hao XG. Research on precise positioning of ship waterline based on texture spectrum. J West Anhui Univ 2009; 25(5): 1–4. Gai ZG, Zhao J, Yang L, et al. Development of a new laser intelligent water level measuring system. J Optoelectron Laser 2013; 24(3): 569–572. Hua S. Study on dynamic measurement method of ship draught. PhD Thesis, Wuhan University of Technology, Wuhan, China, 2008. Tong J, Miyamoto Y, Uchida K, et al. A client/server architecture remote fish finder system for a set net fishery. Fisheries Sci 2014; 80(6): 1159–1167. Creuze V, Jouvencel B and Baccou P. Using sound diffraction to determine the seabed slope. In: Proceedings of the IEEE/RSJ international conference on intelligent robots and systems, Edmonton, AL, Canada, 2–6 August 2005, pp.277–282. New York: IEEE. Sebastian A, Gerardo G, Andre L, et al. Evaluation of an efficient approach for target tracking from acoustic imagery for the perception system of an autonomous underwater vehicle. Int J Adv Robot Syst 2014; 11(1): 1–13. Hsiao YH, Chen CC, Lin SI, et al. Real-world underwater fish recognition and identification, using sparse representation. Ecol Inform 2014; 23(9): 13–21. Niu R and Varshney PK. Performance analysis of distributed detection in a random sensor field. IEEE T Signal Pr 2007; 56(1): 339–349. Katenka N, Levina E and Michailidis G. Local vote decision fusion for target detection in wireless sensor networks. IEEE T Signal Pr 2007; 56(1): 329–338. Song X, Willett P, Glaz J, et al. Active detection with a barrier sensor network using a scan statistic. IEEE J Ocean Eng 2012; 37(1): 66–74. Paarlberg AJ, Guerrero M, Huthoff F, et al. Optimizing dredge-and-dump activities for river navigability using a hydro-morphodynamic model. Water 2015; 7(7): 3943–3962. Cutroneo L, Castellano M, Ferranti MP, et al. Use of optical and acoustic instruments to study the turbid plumes generated by three different types of dredges during dredging activities inside and outside of a port. J Soil Sediment 2013; 13(9): 1645–1654. Blanco M, Fernandes D, Rizzi J, et al. The combined use of chemical and biochemical markers in Rutilus rutilus to assess the effect of dredging in the lower course of the Ebro river. Ecotoxicol Environ Safe 2018; 155: 9–16. Hauer C. Review of hydro-morphological management criteria on a river basin scale for preservation and restoration of freshwater pearl mussel habitats. Limnologica 2015; 50: 40–53. Haimann M, Hauer C, Tritthart M, et al. Monitoring and modelling concept for ecological optimized harbour dredging and fine sediment disposal in large rivers. Hydrobiologia 2018; 814(1): 89–107.

12 19. Zielinski A. Performance analysis of digital acoustic communication in a shallow water channel. IEEE J Ocean Eng 1995; 20(4): 293–299. 20. Chang X. Study on information receiver of multi-array organic Mine-Hunting Sonar system. PhD Thesis, Harbin Engineering University, Harbin, China, 2009. 21. Schulkin M and Marsh HW. Absorption of sound in seawater. Radio Electr Eng 2010; 25(6): 493–500. 22. Coates R. An empirical formula for computing the Beckmann-Spizzichino surface reflection loss coefficient. IEEE T Ultrason Ferr 2002; 35(4): 522–523. 23. Wang D and Shang E. Underwater acoustics. 2nd ed. Beijing, China: Science Press, 2013 (in Chinese). 24. Sun CW and Ma W. The study of acoustic property of high-concentration suspending liquid. Tech Acoust 1983; 2(1): 1–6 (in Chinese).

International Journal of Distributed Sensor Networks 25. Chair Z and Varshney P. Optimal data fusion in multiple sensor detection systems. IEEE T Aero Elec Sys 1986; 22: 98–101. 26. Glaz J NJI. Tight bounds and approximations for scan statistic probabilities for discrete data. Annal Appl Prob 1991; 1: 306–318. 27. Karwe VV and Naus JI. New recursive methods for scan statistic probabilities. Comput Stat Data Anal 1997; 23(3): 389–402. 28. Yuan F, Zhong Z, Wang J, et al. Modeling for the sedimentation of dredger and analysis of the influence on sound propagation characteristics. Chin J Acoust 2016; 41(3): 409–419.