EMBEDDED IMAGE CODING USING CONTEXT-BASED ADAPTIVE WAVELET DIFFERENCE REDUCTION Poonlap Lamsrichan and Teerapat Sanguankotchakorn, Member, IEEE
Telecommunications Field of Study, School of Engineering and Technology Asian Institute of Technology, Pathumthani, Thailand (phone: 662-524-5747, fax: 662-524-5730; e-mail:
[email protected]). ABSTRACT We propose a new still image coding technique which is a highly developed version of the renowned wavelet difference reduction (WDR). This new coder inherits the important properties from the original WDR, such as embeddedness, target compression ratio, region of interest support and the simplicity. The special treatment is performed in the significant pass by using context-based adaptable scanning order to predict the new significant coefficients. The prediction model is based on the compactness in subband coefficients and also the quadtree relation between the same orientation subbands. In binary mode, the coder is consistently better than SPIHT. A new conditional arithmetic coding (AC) using array of contexts is specifically designed for WDR-based coder. Rate-distortion results of the new coder in AC mode are superior to all other WDR-based coders in the literature and are very comparable to SPIHTAC and JPEG2000, the best coders, at all bit rates for all images in the test set.
Index Terms- Conditional arithmetic coding, context adaptive, image coding, wavelet difference reduction 1. INTRODUCTION
Wavelet difference reduction [I] is a simple embedded image coder which yields good image quality at high compression ratio, and fully supports resolution and SNR scalability properties. The performance of WDR is comparable in both objective and subjective quality with other best image coders in the literature, such as SPIHT [2] and Embedded Block Coding with Optimal Truncation (EBCOT) [3] adopted by JPEG2000 image compression standard. Further improvement of WDR are adaptively scanned WDR (ASWDR) in [4], and context modelled WDR (CMWDR) in [5] and [6]. These two algorithms use selfsimilarity across scale and/or energy compaction in each subband for scanning order adaptation in order to improve the compactness of the significant coefficients in the quantization process. The coefficients that are the neighbors or children of the significant coefficients will be placed before the ones of the insignificant coefficients.
1-4244-0481-9/06/$20.00 C2006 IEEE
In [7], another modified version of WDR called Adaptive WDR is proposed. AWDR employs advanced scanning order adaptation with many additional features and has comparable coding performance with the original WDR and all of its variants mentioned above. In this paper, we propose a method to improve the AWDR. The improved coding technique results in a better ratedistortion performance, especially in the arithmetic coding mode, than that of [7]. This is achieved by better-organized predefined scanning order and adaptation (see section 3), together with a more efficient arithmetic coding for bit-plane WDR (see section 4).
2. WAVELET DIFFERENCE REDUCTION
The original WDR maps the two-dimensional image coefficients into one-dimensional scanning order. Binary stream of output symbols of WDR-based coders have 4 possible symbols (+,-,0, and 1) for significance pass and 2 possible symbols (0 and 1) for refinement pass. In the significant pass, each significant coefficient is encoded with its sign (+ or - symbol) followed by distance along the scanning order between itself and the previous encoded coefficients in reduced form. The distance can be called difference reduction and are presented by string of 0 and/or 1 symbols. The further detail of WDR can be found in [1] and brief summaries can be found in [4]-[7]. 3. BINARY ENCODING: CONTEXT-BASED ADAPTABLE SCANNING ORDER OF WDR In this coding technique, called context-based adaptive wavelet difference reduction (CAWDR), the scanning order adaptation of wavelet difference reduction is based on statistical context modeling of the wavelet transform coefficients. The fundamental idea is similar to ASWDR and CMWDR. However, the context model is more advanced in taking number of significant coefficients into consideration. Moreover, a newly developed predefined scanning order and bit-plane header are integrated into the coder to provide good rate-distortion performance in binary encoding mode. The role of number of significant neighbors is utilized by priority weight. The cumulative weight for coefficients with
1137
Authorized licensed use limited to: Kasetsart University. Downloaded on January 11, 2010 at 02:32 from IEEE Xplore. Restrictions apply.
ICIP 2006
Vertical Subband s
C2a C2
N--C2a C2a Vertical Subband Clc Cla Clc Clb
S
Clb
Clc Cla Clc Clc Clb Clc
Cla Clb Cla
Cla S Cla
Clb S Clb
ClcClb Clc
Cla ClbbCla
Horizontal Subband Diagonal Subband (a)
*b C2b *-C2b C2b Diagonal Subband S
C2.C2a
C2a C2a
,
C2b C2b C2b C2b
Horizontal Subband .......................
C
All-lowpass subband and all highest level subbands in each orientation can be scanned by these new scanning orders directly. However, for lower subbands, they will be separated into subblocks, each with members of 2(Lmax-L)x2(Lmax-L) coefficients. Where Lmax is the highest decomposition level and L is the level of the current subband. That is, if Lmax = 5, all subbands in level L = 4 must be separated into subblocks each with 2x2 members, and with 4x4 in L = 3, and so on. The new scanning order will be assigned to intersubblocks ,then intra-subblocks. The algorithm also allows the predicted coefficients to be moved in front of all unpredicted coefficients (even with much higher decomposition level). The idea is to scan higher potential coefficients before the lower potential ones. The LIP of predicted coefficients (LIP N and LIPSC) for all subbands will be merged together, except for all subbands in the highest level. With this manner of prediction, the significant coefficients can be encoded with less number of bits. With all these properties, CAWDR in the binary mode is better than binary SPIHT at all bit rates. The results are shown in table I.
a
C2aC2aC2bC2b 2a CbC2 ....... C2a (b)
Fig. 1. Directional weights for neighbor and children of significant pixels (a) weights for neighbors (b) weights for children S = significant pixel, N = neighbor of significant pixel
significant contexts should be more than weight of coefficient with fewer significant contexts. The weight can be separated into Cla, Clb, Clc and C2a, C2b and shown in Fig. 1. In general, Cla > Clb > Clc. This is because Cla is weight for neighbor in the same direction with the subband orientation. List of insignificant pixels (LIP) is separated into three sets. 1) LIP_N contains the coefficients that have the highest weights and these coefficients are moved to the front part of scanning order, 2) LIP C contains the medium-weighted coefficients, and it comes after LIP N, and 3) the normal LIP contains the rest of the coefficients with no weight remain in the back part,. The original WDR use an end-marker as an indicator of the end of each bit-plane by locating an extra significant coefficient just outside the scanning order. Therefore, endmarker is the sign symbol followed by the distance to outside end of scanning order. A Bit-plane header is used in CAWDR, instead of the end-marker, to separate the bitplanes and to store the total number of significant coefficients encoded in each bit-plane. There is no extra cost using header because fewer bits are required for header than end-marker in many bit-planes, please see [7] for more detail of header. more
4. ARITHMETIC ENCODING FOR CAWDR: MULTIPLE CONTEXTS CONDITIONAL ENTROPY In the previous section, binary version of CAWDR performs quite well. However, a zero-order arithmetic coding [11] using only one context for each pass (four symbols for significance pass and two symbols for refinement pass) results in very little rate-distortion improvement, as can be seen in [6] and [7]. In significance pass, due to the efficient significant coefficients prediction scheme of CAWDR, each pair of sign (S Type) symbols (+ and -) are not likely to be separated by very long sequence of 0 and 1 symbols (or called difference symbol - D Type). Therefore, as length of sequence of consecutive D symbol gets longer, the probability that the next symbol is S symbol is increased and the skewness of probability in significance pass is also increased. The probability of S symbol and D symbol at symbol m (when the most recent sign symbol is located n symbol away) are:
P(X
)Sn= p(X. S Xm-n = SI Xm-1 ,...,IXm-n+l = D) (1)
P(Xm)D And if ni (a) (b) (c) Fig. 2. 8x8 New scanning order (a) zigzag, (b) vertical, and (c) horizontal
The new orders scan the quadtree coefficients together while well maintain the orientation of the subband. Fig. 2 shows the new scanning orders.
Where
P(Xm > n2,
-D
=Xm
=S,Xm X
)nl > P(Xm)n2
P(Xm
)nl
Xk
m+j
D) (2)
then it is more likely that
P(Xm
Xk
X
P(Xm )n2
is symbol at location k, and l is symbol located I symbols before k.
1138 Authorized licensed use limited to: Kasetsart University. Downloaded on January 11, 2010 at 02:32 from IEEE Xplore. Restrictions apply.
(3)
The more skewness will lead to smaller code length due to reduction in the entropy. Therefore, multiple-order conditional entropy coding [12] is considered. An array of contexts ( cs, all have four possible symbols) are used, one for each possible condition for encoding the output symbols. For example: contexts for the significance pass
ones presented in this paper. However, with that manner, one can not fully mention the embeddedness of the coder. TABLE I PSNR RESULTS OF CODERS IN BINARY MODE PSNR (dB) of binary coders WDR WDR ASWDR CMWDR SPIHT CAWDR Image [9] bpp (new) [8] [8] [6] (new) Lena 33.93 0.25 33.51 33.30 33.65 33.89 33.69 0.5 36.55 36.41 36.66 36.99 36.84 37.00 1 39.67 39.59 39.77 40.05 39.98 40.08 Barbara 0.25 27.45 27.24 27.44 28.20 27.76 28.26 0.5 31.46 31.23 31.47 32.15 31.67 32.19 1 36.64 36.07 36.87 37.35 37.31 36.95 Goldhill 0.25 30.27 30.22 30.25 30.27 30.22 30.31 32.71 32.76 0.5 32.59 32.57 32.72 32.78 1 35.83 35.84 36.05 36.15 36.09 36.00 Boat 30.60 0.25 30.51 30.39 30.53 30.51 30.48 34.05 0.5 33.86 33.59 33.96 33.93 33.90 1 38.42 38.22 38.47 38.60 38.49 38.46
(First sign) 0 + 1 0 .. symbols: + 0 1 0 S D D D S D S D D .. types: Contexts: CR CS Cs2 Cs3 Cs4 Cs Cs2 1 2 *
C, context is used when the most recent sign (S) symbol is at the n-th location earlier. CR is a context with two possible symbols. Note that: first symbol must be sign (only + or -). Where:
For the refinement pass output including bit-plane header section and the first sign symbol in each bit-plane, the zeroorder binary arithmetic coder using only one context (CR ) is exploited. With this very effective arithmetic coder, the performance of CAWDR and other WDR-based coder can be increased significantly and the results are shown in the next section. 5. EXPERIMENTAL RESULTS In this section, the coding performance in rate-distortion sense of CAWDR is presented and compared with the performance of other coders described in this paper. The fixed weight model [7] for significance prediction is used for binary CAWDR. Comparing different image coding techniques is always a difficult task because many papers presented results for different images even though they used the same names, such as Lena and Barbara. In order to provide a strict reference, the coding results in this paper are performed on the images from web site ks.uwaterloo.c ne..as.htlE All test images are 512x512 8 bpp which are Lena, Barbara, Goldhill, and Boat. The results of CMWDR from [6] can be used for comparison straightforwardly, since these images were also used in that paper. Results of WDR&ASWDR, SPIHT and JPEG2000 are real encoding/decoding of their reference software from [8], [9] and [10], respectively. For our proposed coder, CAWDR, anyone who is interested can mail us at the authors' mailing addresses. All images are transformed into 5-level two-dimensional discrete wavelet decomposition using the well-known 9/7 filters before encoding at 2 bpp, and then reconstructed at rate 0.0625, 0.125, 0.25, 0.5, 1 and 2 bpp from its own single file of 2 bpp encoding. The rationale behind this process is to test encoder/decoder in the embedded approach. Please be noted that the encoding and decoding exactly at the same bit rate (such as 0.5 bpp) may result in different PSNR values from
The PSNR results of all coders in binary mode are shown in table I. It is obvious that CAWDR performs better than all other coders. Especially in "Barb" image, the PSNR of CAWDR coder can reach 0.5 dB higher than PSNR of SPIHT in some bit rates. This reveals that CAWDR, which uses both neighborhood and parent-child relations, can handle orientation of wavelet coefficients better than SPIHT which uses only parent-child relation. To demonstrate their efficiency, we also show the PSNR results of a WDR coder using only our new scanning order, bit-plane header, and the new arithmetic coder (without scanning order adaptation). This coder is called WDR (new) in all tables. CAWDR uses the new multiple contexts conditional arithmetic encoding in significance pass. The number of contexts used is not so great (not more than 9 contexts) and depends on the size of bit-plane. The encoder and decoder can synchronously specify the size of bit-plane without any side information. This can be done by the information in the header- the total number of significant coefficients encoded in each bit-plane. The more encoded coefficients, the larger size of bit-plane and more contexts are likely to be used. WDR coder with end-marker cannot do this without extra side information. For symbol with sign symbol at m location earlier, where m is higher than the number of contexts (such as m > 9), the symbol will be encoded with the highest context available in that bit-plane. In table II, the PSNR results of all coders in arithmetic mode are presented, and the average of all images is shown in table III. The arithmetic encoded CAWDR can outperform SPIHT and JPEG2000 in many bit rates of many images. For example, PSNR results of "Lena" of CAWDR are the best among all coders for all bit rates.
1139 Authorized licensed use limited to: Kasetsart University. Downloaded on January 11, 2010 at 02:32 from IEEE Xplore. Restrictions apply.
TABLE II PSNR RESULTS OF CODERS IN ARITHMETIC MODE
Image Bit rate Lena 0.0625 0.125 0.25 0.5 1 2 Barbara 0.0625 0.125 0.25 0.5 1 2 Goldhill 0.0625 0.125 0.25 0.5 1 2 Boat 0.0625 0.125 0.25 0.5 1 2
TABLE III AVERAGE PSNR OF ALL IMAGES FROM TABLE II
PSNR (dB) of arithmetic coders WDR WDR ASWDR CMWDR SPIHT JPEG2000 CAWDR (new) [8] [8] [6] [9] [10] (new) 28.13 30.80 33.69 36.75 39.94 44.69
27.99 30.56 33.48 36.58 39.71 44.29
28.19 30.91 33.88 36.82 40.06 44.78
-
28.38 31.10 34.11 37.21 40.41 45.06
27.79 30.78 33.98 37.18 40.27 44.72
28.47 31.24 34.25 37.34 40.48 45.16
23.68 25.31 27.64 31.76 36.97 43.48
23.63 24.89 27.35 31.38 36.31 42.89
23.69 25.35 27.68 31.79 37.32 43.52
28.25 32.28 37.42 -
23.77 25.38 28.13 32.11 37.45 43.64
23.76 25.69 28.65 32.75 37.91 43.93
23.89 25.47 28.48 32.57 37.71 43.92
26.64 28.34 30.46 32.98 36.27 41.72
26.55 28.28 30.37 32.80 36.06 41.21
26.64 28.33 30.46 32.96 36.35 41.71
-
26.73 28.48 30.56 33.13 36.55 42.02
26.56 28.35 30.48 33.16 36.46 41.87
26.72 28.50 30.58 33.11 36.56 41.99
25.69 27.97 30.75 34.16 38.79 44.26
25.67 27.92 30.62 33.91 38.49 44.01
25.79 28.03 30.82 34.21 38.89 44.38
25.90 28.16 30.97 34.45 39.12 44.77
25.66 27.90 30.89 34.51 39.18 44.74
25.95 28.19 31.05 34.49 39.24 44.92
-
-
34.01 37.10 40.19
30.39 32.88 36.24 -
-
30.71 34.12 38.75 -
PSNR (dB) Image WDR WDR ASWDR CMWDRSPIHTJPEG2000 CAWDR Bit rate (new) [8] [8] [6] [9] [10] (new)
Average 0.0625 0.125 0.25 0.5 1 2
26.03 28.10 30.64 33.91 37.99 43.54
25.96 27.91 30.46 33.67 37.64 43.10
26.08 28.16 30.71 33.95 38.16 43.60
26.20 28.28 30.84 30.94 34.10 34.23 38.15 38.38 43.87
25.94 28.18 31.00 34.40 38.46 43.82
-
26.25 28.35 31.09 34.38 38.50 44.00
TABLE IV COMPARISON OF WDR VARATIONS IN ARITHMETIC MODE Image Lena
Total number of encoded significant coefficients WDR WDR ASWDR CMWDR CAWDR Bit rate (new) [8] [8] [6] (new)
Barbara Goldhill Boat
To provide full information, the total numbers of encoded significant coefficients are shown in table IV. Based on our proposed technique, this is the first time in the literature that a WDR-based coder can reach this high rate-distortion performance. The improvement is mainly from the new conditioning arithmetic coder. This is very similar to the case [2] that special type of arithmetic coder, additional to the highly refined binary output, can significantly enhance the performance of SPIHT over its predecessor, EZW.
[2]
[3]
[4] [5]
[6]
6. CONCLUSION CAWDR is very competitive, in PSNR sense, to SPIHT and JPEG2000. Moreover, it has lower computational complexity than JPEG2000 and supports ROI which can not be found in SPIHT. The rate-distortion improvement from CAWDR over WDR and all other WDR variants can not be considered marginal at all. CAWDR can reach 1.5 dB higher PSNR than the original WDR and is normally 0.3 dB better than CMWDR and ASWDR in all bit rates. 7. REFERENCES [1] J. Tian, and R.O. Wells, Jr., "Embedded image coding using waveletdifference reduction", in Wavelet Image and Video Compression, Top iwala, P., Ed., Kluwer Academic, Norwell, MA, 1998, pp. 289-301.
[7]
[8] [9]
0.25 0.5 1 0.25 0.5 1 0.25 0.5 1 0.25 0.5 1
11603 23413 48553 13006 26737 52862 11659 24649 52251 11175 24164 46744
10993 22279 45723 12008 24881 49031 11154 23888 50389 11174 23401 46745
12092 23875 49593 13033 26854 54303 11622 24512 52427 11174 24322 46745
12378 24934 50015 14147 28394 54743 11210 24150 51483 11151 24021 46739
13059 26263 53237 14719 29601 56501 12361 25189 54095 11579 24322 48791
A. Said, and W.A. Pearlman, "A new, fast, and efficient image codec based on set partitioning in hierarchical trees", IEEE Trans. on Circuits and Systems for Video Technology, 6(3), 1996, pp. 243-250. D. Taubman, E. Ordentlich, M. Weinberger, and G. Seroussi, "Embedded block coding in JPEG 2000," Signal Processing - Image Communication, volume 17 (number 1), 2002, pp. 49-72. J.S. Walker, "A lossy image codec based on adaptively scanned wavelet difference reduction", Optical Engineering, Vol. 39, No. 7, July, 2000, pp. 1891 - 1897. Y. Yuan and M. K. Mandal, "Context-modeled wavelet difference reduction coding based on fractional bit-plane partitioning," in Proc. Of IEEE ICIP, 2003, vol.2, pp. 251-254. Y. Yuan and M. K. Mandal, "Novel embedded image coding algorithms based on wavelet difference reduction," Vision, Image and Signal Processing, IEE Proceedings, Volume 152, Issue 1, 28 Feb. 2005, pp. 9 - 19. P. Lamsrichan and T. Sanguankotchakorn, "Adaptive wavelet difference reduction using high-order context modeling for embedded image compression," to be published in Proc. ofIEEE ICICS, 2005. WDR and ASWDR compressors are part of the FAWAV software at at and can be found package
http://www.uwec.edu/walkerjs/pages/resume.html SPIHT
programs
can
be
downloaded
http://www.cipr.rpi.edu/research/SPIHT/spiht3.html
from
[10] JPEG2000 reference software can be found at http://jpeg2000.epfl.ch/ [11] I.H. Witten, R.M. Neal, and J.G. Cleary, "Arithmetic coding for data compression", Communications of the ACM, 30, 1987, pp. 520-540. [12] A. Moffat, R.M. Neal, and I.H. Witten, "Arithmetic coding revisited", ACM Transaction on Information Systems, Vol. 16, No. 3, July 1998, pp. 256-294.
1140 Authorized licensed use limited to: Kasetsart University. Downloaded on January 11, 2010 at 02:32 from IEEE Xplore. Restrictions apply.