WAVELET-BASED MOVING OBJECT SEGMENTATION USING ...

1 downloads 0 Views 1MB Size Report
KEY WORDS. Moving object ... key operation for content-based video coding [1] and ..... “Trevor,” extracted from the proposed method, are more accurate than ...
WAVELET-BASED MOVING OBJECT SEGMENTATION USING BACKGROUND REGISTRATION TECHNIQUE Tae Hyung Im, Il Kyu Eom, and Yoo Shin Kim Department of Electronics Engineering Pusan National University San 30, Jangieon-dong, Kumjeong-gu, Pusan, Korea, 609-735 E-mail : {imtae80, ikeom, kimys}@pusan.ac.kr in terms of implementation and enables the automatic detection of new appearances. The position and shape of the moving object is detected from the frame difference of two consecutive frames. The conventional change detection-based approaches [4-5, 10] are reliable for video sequences like the head-and-shoulder type of video. However, it has several drawbacks. First, the value of the frame difference depends on the speed of the object, so the quality of the segmentation cannot be maintained consistently if the speed of the object changes significantly in the sequence. It has a relatively poor result for video sequences like the surveillance type of video. Second, the uncovered background region [8] is detected as a moving object region from the frame difference information. To overcome these problems, an algorithm using the background registration technique [10] is reported. However, it is difficult to construct the background if the object motion is minute. Change detection for moving object segmentation is also achieved in the wavelet domain [6-7]. This method may reduce noise disturbance and speed change, but the segmenting coherence is still not ensured [9]. In this paper, we propose an efficiently wavelet-based moving object segmentation algorithm that simultaneously uses the inter-frame difference and background registration technique. The rest of this paper is organized as follows. In the next section, the proposed method is described in detail, followed by experimental results are shown in section 3. Finally, section 4 concludes this paper.

ABSTRACT This paper proposes wavelet-based moving object segmentation using change detection and background registration technique. The inter-frame difference for detecting change was calculated by using high frequency coefficients in wavelet subbands. The background was constructed with low frequency wavelet coefficients which were the approximated version of the original image. Combining detected changes and constructed backgrounds, we propose an efficient moving object segmentation algorithm. Our experimental results show that our proposed method obtains accurate video object planes (VOPs) in a video sequence. KEY WORDS Moving object segmentation, wavelet coefficient, change detection, background registration, video object plane

1. Introduction Moving object segmentation from a video sequence is a key operation for content-based video coding [1] and multimedia description. The MPEG-4 standard provides a content-based framework to achieve the goals of fast transmission, efficient storage, flexible manipulation and the reuse of visual content. However, the MPEG-4 does not provide specific techniques for VOP (video object plane) extraction. Nonetheless, it is an indispensable process for many digital video applications. Thus, the preprocessing used to decompose sequences into VOPs is an important issue. For VOPs extraction, object segmentation is one of the popular approaches. One of all video segmentation methods is to use spatial homogeneity [2-3] as the primary segmentation criterion. This algorithm is that the sequence is partitioned into some regions by mathematical morphology filters, and then the watershed algorithm is applied for region boundary decision. The result of this method can obtain the object boundary more precisely than other methods. However, the computation complexity is very high because both computationally intensive operation. The change detection method for inter-frame difference [4-7, 10] is a popular scheme because it is straightforward 576-118

2. Proposed method The wavelet transform decomposes an image or a video frame into high frequency and low frequency components. Since the high frequency coefficients represent the edges of an image, changes can be detected by using these coefficients. On the other hand, the low frequency components, which are an approximated version of the original signal, can be used as information for background registration. We have proposed a method for moving object segmentation using the change detection method [4-7] and background registration technique [10] in the wavelet domain. 84

frame n- 1 frame n

LL (- )

LH LL

HL LH

2.2 Background registration technique using LL subband

HH HL

HH

Background Re gistration T echnique

CDM n , d with Vth , d

Canny edge dete ction on initial object mask

Canny edge dete ction on CDM n , d

DEn , LL

DEn , LH

DEn , HL

The background registration technique constructs the background from several consequence frames in the image domain, and then extracts a moving object from the difference between the constructed background and the current frame. We consider the LL subband as an image, though it is an approximated version of the image with half-resolution. For the low frequency subband, we apply the background registration technique to build background. The frame difference mask of the LL image, FDn, LL (i, j ) , is obtained by thresholding the difference between coefficients in two LL subbands as follows:

DEn , HH

Merge four sub- band and upscale

DE n Moving object edge ( ME n ) of frame n

1 if | wn, LL (i, j ) − wn−1, LL (i, j ) |> Vth ,FD FDn , LL (i, j ) =  , (2) 0 otherwise

C urrent Edge m ap ( E n)

where

Vth , FD

is

a

threshold

of FDn, LL (i, j ) .If

FDn , LL (i, j ) = 0 , then the difference between two frames

is almost the same. Therefore, it is considered to be a background pixel. According to the frame difference mask of the past several frames, pixels that are not moving for a long time are considered as reliable background. The reliable background, BRn, LL (i, j ) is

Extract VOP of frame n

VOPn

Fig. 1. Overall block diagram of proposed moving object segmentation algorithm

defined as

2.1 Inter-frame difference in wavelet domain

 BRn−1, LL (i, j ) + 1 if FDn,LL(i,j) = 0 BRn ,LL (i, j ) =  0 otherwise

Fig. 1 shows the overall block diagram of the proposed method. Firstly, we performed a wavelet transformation to obtain four wavelet subbands. For the high frequency subband (LH, HL, and HH), we applied the change detection method to find the change detection mask, CDM n ,d (i, j ) as follows:

The BRn, LL (i, j ) value is accumulated until FDn, LL (i, j ) holds 0 value. At any time that FDn, LL (i, j ) is changed from 0 to 1, BRn, LL (i, j ) becomes 0. If the value in BRn , LL (i, j ) exceeds a predefined value, denoted by L,

1 if | wn ,d (i, j ) − wn−1,d (i, j ) |> Vth ,d CDM n ,d (i, j ) =  , (1) 0 otherwise

where wn,d (i, j ) (d = {LH , HL, HH }) is

the

(3)

then we calculate the background difference mask, BDn , LL (i, j ) . It is obtained by taking the difference between the current frame information stored; that is,

wavelet

coefficient at location (i,j) of frame n in each wavelet subband, and Vth,d is a threshold determined by the

and

the

background

1 if | BI n−1, LL (i, j ) − wn, LL (i, j ) |> Vth ,BD BDn , LL (i, j ) =  , (4) 0 otherwise

significant test [11]. If the difference between inter-frame coefficients is larger than the given threshold, then we can guess that there are moving objects. However, because the extracted CDM n ,d (i, j ) is the difference value for the

where BI n ,LL (i, j ) is the pixel value in the current frame that is copied to the corresponding pixel in the BRn, LL (i, j ) , and Vth, BD is a given threshold. Now we

edges, the boundary is thickly detected. To overcome this problem, the Canny edge detection method is applied to the CDM n ,d (i, j ) to extract the rough object shapes. The edge detected version of CDM n ,d (i, j ) is called by the

define the edges of significantly different pixels, DEn ,LL (i, j ) , as the Canny edge detection version of

edges of significantly different pixels, DEn,d (i, j ) . From

BDn ,LL (i, j ) .

the value of DEn,d (i, j ) , we can grasp the movement of the objects. 85

frame n

Background registration

frame n- 1

Frame difference

(a)

Background difference

Fram e difference m ask

Background difference m ask

Background registration m ask

Fig. 4 VOP extraction: (a) moving object edge, (b )extracted VOP after morphological operations.

Background difference

Then, we upscale the DEn, F (i, j ) in low resolution into original resolution to extract the moving object edge, MEn (i, j ) . An upscaled version of DEn,F(i,j) is referred to as DEn (i, j ) . That is,

Initialobject m ask

Fig. 2. Block diagram of background registration technique

DEn (i, j ) = upscale( DEn , F (i, j )) .

In the case of BRn, LL (i, j ) < L , we assume that the background is not constructed, so we use a frame difference mask, FDn, LL (i, j ) , which is calculated in the

(6)

Also, DEn (i, j ) is obtained by performing an inverse wavelet transformation on the four subband edge images. After obtaining DEn (i, j ) , we use Kim’s method [4-5] to find the moving object’s edge, MEn (i, j ) , which can be considered to be the performance measure of moving object segmentation. Kim’s method is to use the relation between DEn (i, j ) and En (i, j ) , where En (i, j ) represents the edge points detected by Canny edge detection in the current frame n .

first step to extract the edges of significantly different pixels. When BRn, LL (i, j ) < L , DEn, LL (i, j ) is obtained by the Canny edge detection of FDn, LL (i, j ) . Fig.2 shows the background registration technique in the wavelet domain. 2.3 Merging edge masks and finding moving edge The obtained four edges of significantly different pixels are 25% of the size of the original frame size. To get the object shape to the size of the original frame, we have to merge the four edges of significantly deferent pixels. To do so, we apply a union operation to four masks. If we let the merged edge mask be DEn, F (i, j ) , DEn , F = DEn , LL ∨ DEn , LH ∨ DEn , HL ∨ DEn , HH .

(b)

MEn (i, j ) = DEn (i, j ) ∧ En (i, j ) .

(7)

We must also consider the method using DEn (i, j ) , En (i, j ) and DEn (i, j ) ’s neighbor pixel for increasing the number of the extracted MEn (i, j ) . From Fig. 3, we find that moving object edges, MEn (i, j ) of “Trevor,” extracted from the proposed method, are more accurate than the previous method in the wavelet domain [7], which uses only a change detection algorithm.

(5)

2.4 Extraction of VOPs We obtained the moving object edge map MEn (i, j ) detected from DEn (i, j ) , as shown in Fig. 3(b). The VOPs are extracted from the moving object edge map. First, we declare the horizontal candidates if they are inside the first and last edge pixel in each row and the vertical candidates for each column. After finding both horizontal and vertical candidates, the intersection region using logical AND operation is marked as extracted VOPs. For video sequences like “Trevor,” which contain only a partial moving object, we declare the image boundary as moving object edge pixels if either horizontal or vertical

(a) (b) (c) Fig. 3. Extracted moving object edge: (a) original image, (b) conventional method [7]; (c) the proposed method

86

candidates touch the image boundary and conduct the process to find candidates. The false candidates of the extracted VOPs are removed by morphological closing operations. After eliminating false candidates, the noise of the background region is removed and the small uncovered region is covered by a morphological opening operation in post–processing. From Fig.4 we can see the final extracted VOPs.

“Trevor” have minute movement, the background registration method is of poor quality. That is, the moving object is misunderstood as background. But our method uses both the background registration method and a change detection algorithm to eliminate misleading background. From Fig.5 and Fig.6, our method found more VOPs than the previous method reported in [7]. Our method was also applied to a “hall monitor” in a 288 X 352 format, which is a surveillance type of video with relatively small objects and a complex background. The change detection method shows good results for the headand-shoulder type of video, but poor results for the surveillance type of sequence. Also we extracted more accurate VOPs as shown in Fig.7 and Fig.8. Fig.9 shows the number of moving object edge for two test videos. From Fig.9, we discovered that our proposed method has more moving edges than those of the method in [7].

3. Experiment results To simulate our proposed method, we performed a Haar wavelet transformation for each image frame. Afterwords, the proposed method was applied to “Trevor,” which is a typical head-and-shoulder video in a 256 X 256 format. From Fig. 3, we find that moving object edges of “Trevor,” extracted from the proposed method, are more accurate than the previous method in the wavelet domain [7]. Since head-and shoulder type of videos such as

(a) (b) (c) Fig. 5 VOP extraction for “Trevor” (frame 19): (a)original image; (b)conventional method[7] ;(c)proposed method

(a) (b) (c) Fig. 6 VOP extraction for “Trevor”: (a) frame # 23; (b) frame # 69 ;(c) frame # 97

(a) (b) (c) Fig. 7 VOP extraction for “Hall monitor” (frame # 41):(a) original image; (b) conventional method [7] , (c) proposed method 87

[4] C-Kim and J-N Hwang, Fast and automatic video object segmentation and tracking for content-based applications, IEEE Trans. Circuits Syst. Video Technol. 2002, 122-129 [5] C. Kim and J.-N. Hwang, Fast and robust moving object segmentation in video sequences, in Proc. Int. Conf. Image Processing(ICIP’99)., 1999, 131-134 [6] J.c. Huang and W.S. Hsieh, Double-change-detection method for wavelet-based moving-object segmentation, IEEE Electron. Lett., 2004 [7] J.C. Huang, and W.S Hsieh, Wavelet-based moving object segmentation, Electron. Lett., 1380-1382, 2003 [8] le Roux, L.J, van Schalkwyk, J.J.D, An overview of moving object segmentation in video images, communication and signal Processing,COMSIG Proceedings., South Africa Symposium, 53-57, 1991 [9] H Liu, X Chen, Y Chen, and C Xie, Double change detection method for moving-object segmentation based on clustering, IEEE ISCAS 2006 Circuits and Syst., 2006, 5027-5030 [10] Shoa-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Efficient Moving Object Segmentation Algorithm Using Background Registration Technique, IEEE Trans. Circuits Syst. Video Technol., 2002, 577-586 [11] Til Aacj, Andre Kaup, and Rudolf Mester, Statistical model-based change detection in moving video, Signal processing, 1993, 165-180

4. Conclusion In this paper, we have proposed a moving object segmentation algorithm using a change detection method and background registration technique in a wavelet domain. A low frequency subband has been applied to the background construction; in addition, the change detection method has been made by coefficients in a high frequency subband to overcome the drawbacks of the conventional method. Simulation results show that the proposed method can obtain more accurate moving object edges in both head-and-shoulder and surveillance video sequences.

Acknowledgement This work was supported by the 2nd-Phase Brain Korea (BK) 21 Program funded by the Ministry of Education, Korea

References [1] T.Silora, The MPEG-4 video standard verification model. IEEE Trans. Circuit Syst. Video Technol., 1997, 19-31. [2] Y. Tasiy, and A. Averbuch, Automatic segmentatin of moving objects in video sequences, IEEE Trans. Circuits Syst. Video Technol., 2002, 122-128 [3] A. Cavallavo, O. Steiger, and T. Ebrahimi, Tracking video objects in cluttered background, IEEE Trans. Circuits Syst. Video Technol., 2002, 122-128

(a) (b) (c) Fig. 8 VOP extraction for “Hall monitor”:(a) frame # 66; (b) frame # 131; (c) frame # 181;

(a) (b) Fig. 9 The number of moving object edge points: (a). “Trevor “; (b). “Hall monitor

88

Suggest Documents