Application and Network Measurements For Multiple Description Coded Video Sequences F. Fitzek ∗ Aalborg University Head of Future Visions Niels Jernes Vej 12, room A5 204 DK-9220 Aalborg Denmark Phone: +45 9635 8678 Fax: +45 9815 1583 email:
[email protected] July 13, 2004 Technical Report R-04-1005 ISBN 87-90834-46-1 ISNN 0908-1224 Multiple description coding has received a lot of interest lately, because it allows to split one source of information in multiple entities, where each of the entities is decodable. The more of these entities are received the more information of the original source can be restored. This opens the door for new applications in many fields of mobile communications. Unfortunately the multiple description coding comes with the price of higher bandwidth requirements. This overhead is caused by the coding process itself and the related network transport. In this work we present a solid investigation of the overhead arising from the splitting process for video coding. Several well known and new video sequences in the QCIF and CIF video format were encoded for multiple descriptors using H.263 and H.26L encoder. The overhead caused by the encoding and underlying network protocols is given separately. Furthermore we investigate the quality of the decoded video in dependency of the received percentage of descriptors. Our results show that the overhead depends dramatically on the video codec chosen and that for eight descriptors the overhead differs from 5% to 200%. A main conclusion we draw here is that we have to consider the scenario carefully whether or not multiple description coding should be chosen over single description coding. ∗
The work of F. Fitzek was supported by the SAMSUNG project Joint Advanced Development Enabling 4G (JADE).
c
Aalborg University.
R-04-1005
Page 1
Contents 1 Introduction 1.1 Structure Of This Document . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
10 12
2 Methodology and Measurement Setup
13
3 Metric and Terminology
16
4 Coding Overhead Measurements 4.1 H.263 . . . . . . . . . . . . . . . . . . 4.1.1 QCIF . . . . . . . . . . . . . . 4.1.2 CIF . . . . . . . . . . . . . . . 4.2 H.26L . . . . . . . . . . . . . . . . . . 4.2.1 QCIF . . . . . . . . . . . . . . 4.2.2 CIF . . . . . . . . . . . . . . . 4.3 Impact of the Quantization Parameter
18 18 18 18 18 21 22 22
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
5 Network Overhead
25
6 Quality Measurements
28
7 Conclusion
31
8 Acknowledgment
31
A Additional Overhead Measurements A.1 H263 . . . . . . . . . . . . . . . . A.1.1 QCIF . . . . . . . . . . . A.1.2 CIF . . . . . . . . . . . . A.2 H26L . . . . . . . . . . . . . . . . A.2.1 QCIF . . . . . . . . . . . A.2.2 CIF . . . . . . . . . . . .
34 34 34 37 37 39 42
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
B Additional Quantization Measurements
44
C Additional Quality Measurements
50
c
Aalborg University.
R-04-1005
Page 2
List of Figures 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26
The carphone sequence in the QCIF format. . . . . . . . . . . . . . . . . . . . . . The bridge–close sequence in the CIF format. . . . . . . . . . . . . . . . . . . . . Splitter and encoding chain for J=3. . . . . . . . . . . . . . . . . . . . . . . . . . Merger for J=3. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Merger for J=3, where one descriptor is missing. . . . . . . . . . . . . . . . . . . Overhead of selected H.263 video sequences in the QCIF format for different number of sub–streams. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Number of INTRA, INTER, and INTER4V macro-blocks per frame for the bridgefar video sequence in the QCIF format. . . . . . . . . . . . . . . . . . . . . . . . Overhead of selected H263 video sequences in the CIF format for different number of sub–streams. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Number of INTRA, INTER, and INTER4V macro-blocks per frame for the bridgefar video sequence in the CIF format. . . . . . . . . . . . . . . . . . . . . . . . . Overhead of selected H.26L video sequences in the QCIF format for different number of sub–streams. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Percentage of I frames in one descriptor for different video lengths. . . . . . . . . Overhead of selected H26L video sequences in the CIF format for different number of sub–streams. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead for the container video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Bandwidth requirements for the container video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead for the foreman video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Bandwidth requirements for the foreman video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Transport of multiple description coded video and voice over RTP, UDP, and IP network layer. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Network Overhead (RTP/UDP/IPv4) for the foreman video sequence and two different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Network Overhead (RTP/UDP/IPv6) for the foreman video sequence and two different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . PSNR measurements using the foreman video sequences in the QCIF format versus percentage of successful received sub–streams (random approach). . . . . . . . . PSNR measurements using the foreman video sequences in the QCIF format versus percentage of successful received sub–streams (best case). . . . . . . . . . . . . . PSNR measurements using the foreman video sequences in the QCIF format versus percentage of successful received sub–streams (worst case). . . . . . . . . . . . . . PSNR values versus time for the bad case and a random case. . . . . . . . . . . . Comparison of PSNR degradation for different video sequences for J = 15. . . . . Overhead of multiple description coded video sequence bridge-close in the QCIF format using the H.263 encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence bridge-far in the QCIF format using the H.263 encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . .
c
Aalborg University.
R-04-1005
13 13 14 15 15 19 19 20 20 21 22 23 23 23 24 24 25 26 27 28 29 30 30 31 34 34
Page 3
27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49
Overhead of multiple description coded video sequence carphone in the QCIF format using the H.263 encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence claire in the QCIF format using the H.263 encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence container in the QCIF format using the H.263 encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence foreman in the QCIF format using the H.263 encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence grandma in the QCIF format using the H.263 encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence highway in the QCIF format using the H.263 encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence mthr dotr in the QCIF format using the H.263 encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence news in the QCIF format using the H.263 encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence salesman in the QCIF format using the H.263 encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence silent in the QCIF format using the H.263 encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence bridge-close in the CIF format using the H.263 encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence bridge-far in the CIF format using the H.263 encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence highway in the CIF format using the H.263 encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence mobile in the CIF format using the H.263 encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence paris in the CIF format using the H.263 encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence tempete in the CIF format using the H.263 encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence bridge-close in the QCIF format using the H.26L encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence bridge-far in the QCIF format using the H.26L encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence carphone in the QCIF format using the H.26L encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence claire in the QCIF format using the H.26L encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence container in the QCIF format using the H.26L encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence foreman in the QCIF format using the H.26L encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence grandma in the QCIF format using the H.26L encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . .
c
Aalborg University.
R-04-1005
34 34 35 35 35 35 36 36 36 36 37 37 37 37 38 38 39 39 39 39 40 40 40
Page 4
50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72
Overhead of multiple description coded video sequence highway in the QCIF format using the H.26L encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence mthr dotr in the QCIF format using the H.26L encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence news in the QCIF format using the H.26L encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence salesman in the QCIF format using the H.26L encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence silent in the QCIF format using the H.26L encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence bridge-close in the CIF format using the H.26L encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence bridge-far in the CIF format using the H.26L encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence highway in the CIF format using the H.26L encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence mobile in the CIF format using the H.26L encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence parisparis in the CIF format using the H.26L encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead of multiple description coded video sequence tempetetempete in the CIF format using the H.26L encoder. . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead for the bridge-close video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Bandwidth requirements for the bridge-close video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead for the bridge-far video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Bandwidth requirements for the bridge-far video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead for the carphone video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Bandwidth requirements for the carphone video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead for the claire video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Bandwidth requirements for the claire video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead for the container video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Bandwidth requirements for the container video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead for the foreman video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Bandwidth requirements for the foreman video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
c
Aalborg University.
R-04-1005
40 41 41 41 41 42 42 42 42 43 43 44 44 44 44 45 45 45 45 46 46 46 46
Page 5
73 74 75 76 77 78 79 80 81 82 83 84 85
86
87
88
89
90
91
Overhead for the grandma video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Bandwidth requirements for the grandma video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead for the highway video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Bandwidth requirements for the highway video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead for the mthr dotr video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Bandwidth requirements for the mthr dotr video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead for the news video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Bandwidth requirements for the news video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead for the salesman video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Bandwidth requirements for the salesman video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Overhead for the silent video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Bandwidth requirements for the silent video sequence in the QCIF format for different quantization values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . PSNR comparison for different modes and the worst case investigating carphone video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 2 . . . . . . . . . . . . . . . . . . . . . . . . . PSNR comparison for different modes and the worst case investigating carphone video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 3 . . . . . . . . . . . . . . . . . . . . . . . . . PSNR comparison for different modes and the worst case investigating carphone video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 5 . . . . . . . . . . . . . . . . . . . . . . . . . PSNR comparison for different modes and the worst case investigating carphone video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 10 . . . . . . . . . . . . . . . . . . . . . . . . . PSNR comparison for different modes and the worst case investigating carphone video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 15 . . . . . . . . . . . . . . . . . . . . . . . . . PSNR comparison for different modes and the worst case investigating carphone video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 20 . . . . . . . . . . . . . . . . . . . . . . . . . PSNR comparison for different modes and the worst case investigating claire video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 2 . . . . . . . . . . . . . . . . . . . . . . . . .
c
Aalborg University.
R-04-1005
47 47 47 47 48 48 48 48 49 49 49 49
50
51
51
52
52
53
53
Page 6
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
PSNR comparison for different modes and the worst case investigating claire video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 3 . . . . . . . . . . . . . . . . . . . . . . . . . PSNR comparison for different modes and the worst case investigating claire video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 5 . . . . . . . . . . . . . . . . . . . . . . . . . PSNR comparison for different modes and the worst case investigating claire video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 10 . . . . . . . . . . . . . . . . . . . . . . . . . PSNR comparison for different modes and the worst case investigating claire video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 15 . . . . . . . . . . . . . . . . . . . . . . . . . PSNR comparison for different modes and the worst case investigating claire video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 20 . . . . . . . . . . . . . . . . . . . . . . . . . PSNR comparison for different modes and the worst case investigating container video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 2 . . . . . . . . . . . . . . . . . . . . . . . . . PSNR comparison for different modes and the worst case investigating container video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 3 . . . . . . . . . . . . . . . . . . . . . . . . . PSNR comparison for different modes and the worst case investigating container video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 5 . . . . . . . . . . . . . . . . . . . . . . . . . PSNR comparison for different modes and the worst case investigating container video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 10 . . . . . . . . . . . . . . . . . . . . . . . . . PSNR comparison for different modes and the worst case investigating container video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 15 . . . . . . . . . . . . . . . . . . . . . . . . . PSNR comparison for different modes and the worst case investigating container video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 20 . . . . . . . . . . . . . . . . . . . . . . . . . PSNR comparison for different modes and the worst case investigating foreman video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 2 . . . . . . . . . . . . . . . . . . . . . . . . . PSNR comparison for different modes and the worst case investigating foreman video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 3 . . . . . . . . . . . . . . . . . . . . . . . . . PSNR comparison for different modes and the worst case investigating foreman video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 5 . . . . . . . . . . . . . . . . . . . . . . . . . PSNR comparison for different modes and the worst case investigating foreman video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 10 . . . . . . . . . . . . . . . . . . . . . . . . .
c
Aalborg University.
R-04-1005
54
54
55
55
56
56
57
57
58
58
59
59
60
60
61
Page 7
107 PSNR comparison for different modes and the worst case investigating foreman video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 15 . . . . . . . . . . . . . . . . . . . . . . . . . 108 PSNR comparison for different modes and the worst case investigating foreman video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 20 . . . . . . . . . . . . . . . . . . . . . . . . . 109 PSNR comparison for different modes and the worst case investigating highway video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 2 . . . . . . . . . . . . . . . . . . . . . . . . . 110 PSNR comparison for different modes and the worst case investigating highway video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 3 . . . . . . . . . . . . . . . . . . . . . . . . . 111 PSNR comparison for different modes and the worst case investigating highway video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 5 . . . . . . . . . . . . . . . . . . . . . . . . . 112 PSNR comparison for different modes and the worst case investigating highway video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 10 . . . . . . . . . . . . . . . . . . . . . . . . . 113 PSNR comparison for different modes and the worst case investigating highway video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 15 . . . . . . . . . . . . . . . . . . . . . . . . . 114 PSNR comparison for different modes and the worst case investigating highway video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 20 . . . . . . . . . . . . . . . . . . . . . . . . . 115 PSNR comparison for different modes and the worst case investigating silent video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 2 . . . . . . . . . . . . . . . . . . . . . . . . . 116 PSNR comparison for different modes and the worst case investigating silent video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 3 . . . . . . . . . . . . . . . . . . . . . . . . . 117 PSNR comparison for different modes and the worst case investigating silent video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 5 . . . . . . . . . . . . . . . . . . . . . . . . . 118 PSNR comparison for different modes and the worst case investigating silent video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 10 . . . . . . . . . . . . . . . . . . . . . . . . . 119 PSNR comparison for different modes and the worst case investigating silent video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 15 . . . . . . . . . . . . . . . . . . . . . . . . . 120 PSNR comparison for different modes and the worst case investigating silent video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 20 . . . . . . . . . . . . . . . . . . . . . . . . .
61
62
62
63
63
64
64
65
65
66
66
67
67
68
List of Tables 1
YUV QCIF . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
c
Aalborg University.
R-04-1005
13
Page 8
2
YUV CIF . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
c
Aalborg University.
R-04-1005
14
Page 9
1 Introduction MDC has emerged as an attractive coding scheme for robust transmission over channels with transient bursty error characteristics such as wireless systems during a deep fade. MDC is a great solution to this problem at the price of redundancy. The packet losses, which is most probable and can be correlated over wireless links, can cause severe degradation in the received video quality. The application of MDC can be classified into two main groups, namely path diversity and heterogeneous terminals. We give a short introduction of the related work. Path diversity Various network infrastructures such as multi–cast services in cellular networks, multi–hop networks and wired networks utilize MDC to provide path diversity. The study carried out in [1] combats errors encountered in the wireless channel by coding the highly compressed video data into multiple independently decodable streams send over different paths. An unbalanced MDC scheme to provide path diversity is proposed in [2]. This scheme creates unbalanced Multiple Description (MD) streams by adjusting the frame rate, quantization or spatial resolution of each stream and can achieve unbalanced rates of up to 2:1. Authors compare the conventional MDC system with their proposed system. Their results show that unbalanced MDC scheme provides improved reliability over multiple paths with unequal bandwidths. Authors examine the effectiveness of unbalanced MDC and path diversity for transmission of foreman and bus sequences. The recovered video quality for unbalanced rates of up to 1.75:1 and 1.76:1 are compared with that of SD scheme. Their results show that the proposed unbalanced MDC scheme is more robust to bursty errors and network congestions than SD scheme. The effectiveness of the combination of MDC with multiple path transport (MPT) for video qand image transmission over multi–hop wireless links is studied in [3]. Some applications have high bandwidth and stringent reliability requirements. For supporting these kind of applications, the sender may need to send the data stream over several paths to the destination [3]. The study in [3] addresses this need and the authors compare their proposed MDC–MPT system with a system which uses layered coding and asymmetrical paths for the base and enhancement layers. Apart from the standard Quarter Common Intermediate Format (QCIF) and Common Intermediate Format (CIF) sequences given in [4], authors present average Picture Signal to Noise Ratio (PSNR) for MDC for Susie, Football and Flower Garden video sequences. In [5], an MDC system for multi–hop networks is proposed. The application and network layers cooperate to provide more robustness against severe network conditions. The authors propose a multi–path selection method that chooses a set of paths to achieve more robust media transmission. The media content is transmitted over these intelligently selected multiple paths instead of shortest path or maximally disjoint paths. The simulation results in this paper show an average PSNR improvement of up to 8.1 dB than conventional MDC scheme. The challenge that the authors faced in this paper was finding a set of paths to minimize the cost function that estimated average streaming distortion in terms of network statistics, media characteristics and application requirements [6]. To remedy this problem, they proposed a fast heuristics–based solution by exploiting the infrastructure features of the internet in [6]. Their simulations run over several random internet topologies show that the proposed heuristic approach is capable of finding a good set of paths in a much shorter time than the methods they used in [5]. This heuristic is therefore very well suited for time critical applications such as VoIP and video conferencing. The application of MDC for multi–cast networks has been addressed in [7]. Multi–cast services addresses to a wide variety of receiver terminals having different bandwidth requirements. The study in [7] explores layered multiple description codes where base layer descriptions are trans-
c
Aalborg University.
R-04-1005
Page 10
mitted to users having low bandwidth capacities whereas the enhancement layer descriptions are transmitted to users having high bandwidth capacities. MDC is also being exploited in wired infrastructures for increased link capacity and reliability. For example, the wired connection to the base stations can have more capacity if MDC is used. MDC divides the data stream into several streams whereby the probability of buffer flow in wired links decreases [8]. Multiple path transport (MPT) scheme is proposed for wired links to provide path diversity. The source coding and traffic splitting is jointly designed for MPT schemes [9]. MDC and layered coding (LC) are the two options for source coding in MPT scheme [9]. Heterogeneous terminals MDC is exploited to support heterogeneous terminals. 4G mobile systems are envisioned to offer wireless services to a wide variety of mobile terminals ranging from cellular phones and Personal Digital Assistants (PDAs) to Laptops [10]. The flexibility of the bandwidth assigned to each descriptor and the number of descriptors assigned to end users makes MDC a very attractive coding scheme for 4G networks. Terminals use the same bandwidth and decide on the number of descriptors that they can receive [11]. In [12], a new MDC technique called Multiple Description Scalable Coding (MDSC) is proposed. MDSC can simply be described as a combination of MDC and scalable coding. This technique addresses receiving device characteristics and bandwidth variations of the channel and also enables tradeoffs between throughput, redundancy and complexity which is not possible with non–scalable MDC schemes. In MDSC, the number and the composition of descriptions are changed dynamically to make the proposed system very robust to changing channel characteristics [12]. For example, in [?] layered multiple description scheme is proposed. Layered codes incorporate sequences of layers in order to provide bandwidth heterogeneity and to cope with dynamic network congestion [?]. Authors use base layer descriptions for low bandwidth clients whereas they use both base and enhancement layer descriptions for high bandwidth clients. Therefore, we give plots of overhead versus sub–streams generated, video content or number of sub–streams successfully received. Other Work in this field and their shortcomings! In [13], the performance of a single stream based voice communication is compared with that of multiple description coding (multistream communications) using Forward Error Correction (FEC) scheme. Therefore, the payload data rate of single stream and multistream schemes are identical and MDC does not increase the overall data rate of the payload compared to single stream scheme. In this way, the added overhead of packet headers as a result of transmitting multiple streams are compared. The motion vector data is the most important part of the compressed video stream [14]. It is therefore very crucial to send it without errors. In [14], a multiple description motion coding (MDMC) algorithm is proposed. This algorithm ensures robust delivery of the motion vector data to the destination. Authors present their simulation results only for the Foreman video sequence. After making a literature search, we realized that there is a shortage of performance results of MDC with all of the video sequences given in [4]. We present the performance investigations of MDC for wireless networks using twelve of the video sequences given in [4]. The goal of this paper is to show the drawbacks of MDC in terms of overhead and quality measurements. In literature, authors usually give exact results for the overhead generated by the MDC process. For example, in [13], it was reported that the overall data rate of multiple streaming is 11 % higher than single–streaming. However, for better understanding and more solid investigation into the overhead generated by MDC, no exact overhead values should be given as it is strongly dependent on the video content. We are investigating the overhead generated by encoding and
c
Aalborg University.
R-04-1005
Page 11
network for IPv6. In the following section our measurement setup and the terminology used is introduced.
1.1 Structure Of This Document The paper is structured as followed: In Section 2 we introduce the methodology for our results achieved. The metric and terms which are used throughout the paper are addressed in Section 3 Our results are presented in Section 4 for the overhead measurements and in Section 6 for the quality measurements.
c
Aalborg University.
R-04-1005
Page 12
2 Methodology and Measurement Setup Our investigations are based on raw video sequences in the 4:2:0 YUV format, which are widely used in the research community and available at [15]. A video frame consists out of multiple pixels. Each pixel can be represented in two different color spaces: RGB or YUV. In the RGB color space, the pixel is composed of three colors: Red (R), Green (G) and Blue (B). In the YUV color space, the pixel is represented by its luminance (Y), chrominance (U), and saturation (V) [16]. Our investigations are based on raw video sequences in the YUV format, which are widely used in the research community and available at [4]. Two video formats are under investigation, namely Quarter Common Intermediate Format (QCIF) (an example is given in Figure 1) and Common Intermediate Format (CIF) (an example is given in Figure 2). QCIF and CIF are representing two different video formats with 176x144 pixels and 352x288 pixels, respectively [17]. Twelve QCIF and six CIF video sequences with different video length and content were used. The video sequences with frame length and a short information are given in Table 1 for the QCIF format and 2 for CIF format.
Figure 1: The carphone sequence in the QCIF format. Figure 2: The bridge–close sequence in the CIF format.
seq. name bridge-close bridge-far carphone claire container foreman grandma highway mthr dotr news salesman silent
no. of frames 2000 2101 382 494 300 400 870 2000 961 300 449 300
Table 1: YUV QCIF info Charles Bridge (Karl` uv most) oldest bridge in Prague. Charles Bridge (Karl` uv most) oldest bridge in Prague. Man talking at the phone. Female talking to the camera. Ship leaving the habor. Man speaking to the camera. Grandma in front of the camera. Driving over highway. Mom and daughter speaking to the camera. News studio and two speakers. Salesman in his office. Woman doing sign language.
As MDC splits the video stream into multiple descriptors, we have chosen the frame based
c
Aalborg University.
R-04-1005
Page 13
seq. name bridge-close bridge-far highway mobile paris tempete
Table 2: YUV CIF info Charles Bridge (Karl` uv most) oldest bridge in Prague. Charles Bridge (Karl` uv most) oldest bridge in Prague. Driving over highway. Train is moving Two people talking to each other. Moving Cam
no. of frames 2000 2101 2000 300 1065 260
approach. By means of a splitter, the raw video sequences are splitted into J sub–sequences, such that the first sub-sequence contains picture 1,J+1,2J+1, and so on. The J sub-sequences are feeded into different video encoder. In this work we only use the H.263 and H.26L. An illustration of the splitting and encoding process is given in Figure 3 for J=3. ..... J+1
1
H.26x Encoder
D1
..... J+2
2
H.26x Encoder
D2
..... J+3
3
H.26x Encoder
D3
raw video sequence J+3 J+2 J+1
J
.....
3
2
1
Figure 3: Splitter and encoding chain for J=3.
The properties of the encoded sub-streams such as amount of data and robustness are measured and evaluated. In the first step we are interested in the overhead that arises from the splitting and encoding process. Obviously the inter frame differences will increase with larger J, which in turn results in smaller compression gains as the video encoder has more data to work on. In our former work we have reported about the bandwidth requirements of the example video sequences for H.263 [18] and H.26L [19] for a single stream. Than we investigate how sensitive the overhead is in dependency of the encoder’s settings. Once the video is encoded with multiple description we are measuring the quality in terms of PSNR in case of missing or corrupted descriptors. For this purpose we need a merger as given in Figure 4. Each descriptor is decoded individually and than merged to a single stream. This stream will be conveyed towards the application. To calculate the PSNR values in dependency of the percentage of successfully received descriptors we select a sub–set of descriptors and feed this to the merger. In case descriptors are lost, frames will be missing in the video sequence. As a a very simple error concealment, we are freezing the last successful received frame until we have a new one. By freezing we mean that we make a copy of the last received frame for the next frame until an update frame is received. This procedure is shown in Figure 5 in case descriptor 2 is missing. In this case frame 1, J + 1, . . . will displayed two times. In case the descriptors are error prone they are decoded and merged afterwards.
c
Aalborg University.
R-04-1005
Page 14
D1
H.26x Decoder
..... J+1
1
D2
H.26x Decoder
..... J+2
2
D3
H.26x Decoder
..... J+3
3
J+3 J+2 J+1
J
.....
3
2
1
J
.....
3
1
1
Figure 4: Merger for J=3.
D1
H.26x Decoder
..... J+1
1
D2
D3
J+3 J+2
H.26x Decoder
..... J+3
J
3
Figure 5: Merger for J=3, where one descriptor is missing.
c
Aalborg University.
R-04-1005
Page 15
3 Metric and Terminology For the performance evaluation we refer to the following terminology: single descriptor stream The single descriptor stream is the encoded version with J = 1. multiple descriptor stream The multiple descriptor stream is the encoded version with J > 1. Thus multiple sub–streams exists, where each of the sub–streams is decodable for itself. sub–streams One encoded entity of the splitted video sequence. quantization parameter (QP) The Discrete Cosine Transform (DCT) is used to convert a block of pixels into a block of transform coefficients. The transform coefficients represent the spatial frequency components of the original block. Then the quantization is achieved as follows [16]: 1. Coefficients with an absolute value smaller than the quantizer threshold are set to zero (i.e., they are considered to be in the so–called dead zone. 2. Coefficients with an absolute value larger than or equal to the quantizer threshold are divided by twice the QP and rounded to the nearest integer. The larger the step size, the larger the compression gain, as well as the loss of information. The trade–off between compression and decodable image quality is controlled by setting the QP and quantizer threshold [16]. For the performance evaluation we refer to the following metric: bandwidth BSD represents the bandwidth of single description coding. Bi is the bandwidth of each descriptor i. overhead The overhead is defined as the amount of data by which the splitted streams of one video sequence are increased in comparison to the single stream. In all results we refer to the overhead produced by the encoding process. Further overhead such as the network overhead are addressed in Section 5. The overhead θ is defined as PJ
θ=
i=1 Bi
BSD
−1
(1)
video quality We use the picture signal to noise ratio (PSNR). The PSNR represents the objective video quality each video frame by a single number. A video frame is composed by N · M pixels (where M is the length and N the height of the frame). Each pixel is presented by one luminance value and a set of pixels by two chrominance values. Because the human eye is more sensible to the change in luminance we focus only on this parameter. The
c
Aalborg University.
R-04-1005
Page 16
mean squared error (MSE) and the PSNR in decibels are computed by the following two equations: P
M SE =
[f (i, j) − F (i, j)]2
∀i,j
N· M 255 P SN R = 20 · log10 √ , M SE
(2) (3)
where f (i, j) represents the original source frame and F (i, j) the reconstructed possibly error–prone frame containing N by M pixels. As mentioned before, wireless links have smaller bandwidth than wired links, therefore we concentrate on the transmission of videos in the CIF and QCIF format.
c
Aalborg University.
R-04-1005
Page 17
4 Coding Overhead Measurements The results are presented for the video sequences given in Table 1 and 2 with values of J=1 . . . 20. For illustration purposes, we highlight only some results in this section and refer to the appendix for complete results. Two different encoders are used, namely H.263 [20] and H.26L [21, 22, 23]. In the first subsection we present the results of the H.263 encoder.
4.1 H.263 For the encoding process we used mainly the default settings of the tmn software []. We changed the value for the skip frames to zero (-S 0) such that each frame of the video sequences was encoded. Furthermore unrestricted motion vector mode (annex D), syntax-based arithmetic coding (annex E), and advanced prediction mode were switched on. This was motivated by former work we have done in this field achieving comparable results. The quantization parameter was not changed and therefore equal to 10.1 The encoded streams contain only one I frame at the beginning and P frames are used afterwards. 4.1.1 QCIF In Figure 6 the overhead of selected H.263 video sequences in the QCIF format for different number of sub–streams is given. We report two main insights: i.) Increasing J results in a larger overhead and ii.) The overhead itself depends dramatically on the video content chosen. While the container sequence has an overhead three times larger than single encoded video in case J ≥ 18, the overhead of the bridge-far sequence is below 100% for all J ≤ 20. For all video sequences the overhead is less than 50% if only two sub-streams are used. H.263 allows for several types of macro-block coding schemes, such as INTRA mode (coding non-predicted DCT coefficients), INTER mode (predictive coding using 1 motion vector) and INTER4V (predictive coding using 4 motion vectors). In Figure 7 the number of INTRA, INTER, and INTER4V macro-blocks per frame for the bridge-far video sequence in the QCIF format is given. 4.1.2 CIF In Figure 8 the overhead of all H.263 video sequences in the CIF format for different number of sub–streams is given. Figure 9 the number of INTRA, INTER, and INTER4V macro-blocks per frame for the bridgefar video sequence in the CIF format is given.
4.2 H.26L For the overhead calculation of H.26L encoded video streams we used a quantization parameter of 31 as default parameter2 . Later we will investigate even the impact of this parameter by varying these values between 1 and 51. We have agreed on a group of picture (GoP) structure
1
Note, that the quantization parameters chosen differ between H.263 and H.26L. This is intended as the H.263 uses quantization parameter between 1 and 31, while the range for H.26L is between 1 and 51. 2 The give quantization value QP refers to the I frame, the P frames are encoded than with QP-1, thus resulting in higher quality. This decision is based on the default settings of the decoder.
c
Aalborg University.
R-04-1005
Page 18
Comparison QCIF H263 3.5 3
overhead
2.5 2 1.5 1 0.5 0 0
2
4
6
foreman salesman
8 10 12 number of sub-streams
14
container bridge-close
16
18
20
bridge-far
Figure 6: Overhead of selected H.263 video sequences in the QCIF format for different number of sub–streams.
Bridge-Far QCIF H263
mean number of macroblock per frame
10
1
0.1
0.01
0.001
0.0001 0
5
inter
10 number of sub-streams inter4v
15
20
intra
Figure 7: Number of INTRA, INTER, and INTER4V macro-blocks per frame for the bridge-far video sequence in the QCIF format.
c
Aalborg University.
R-04-1005
Page 19
Comparison CIF H263 2 1.8 1.6
overhead
1.4 1.2 1 0.8 0.6 0.4 0.2 0 0
2
4
6
bridge-close bridge-far
8 10 12 number of streams
14
highway mobile
16
18
20
tempete paris
Figure 8: Overhead of selected H263 video sequences in the CIF format for different number of sub–streams.
Bridge-Far CIF H263
mean number of macroblock per frame
100
10
1
0.1
0.01
0.001 0
5
inter
10 number of sub-streams inter4v
15
20
intra
Figure 9: Number of INTRA, INTER, and INTER4V macro-blocks per frame for the bridge-far video sequence in the CIF format.
c
Aalborg University.
R-04-1005
Page 20
with one I frame and eleven P frames. As for H.263 we distinguish the same two video formats, namely QCIF and CIF. 4.2.1 QCIF In Figure 10 the overhead of selected H.26L video sequences in the QCIF format for different number of sub–streams is given. As for the H.263 encoder the overhead increase with larger number of descriptors. The highest gain is achieved switching from one descriptor to a second one. In general the overhead caused by the H.26L is less than for the H.263 encoder. Because of different encoder settings this comparison my be unfair, but it is not the main goal to compare the two encoders, but to look into the overhead. Therefore we want to note this observation.
Comparison QCIF H26L 1.6 1.4 1.2
overhead
1 0.8 0.6 0.4 0.2 0 0
2
foreman salesman
4
6
8 10 12 number of sub-streams container bridge-close
14
16
18
20
bridge-far
Figure 10: Overhead of selected H.26L video sequences in the QCIF format for different number of sub–streams. In contrast to the H.263 measurements the curves are not that smooth. This effect can be explained by the used GoP structure. While the encoded video stream with only one descriptor consists out of exactly 1/12 I frames and 11/12 P frames, the encoded stream with multiple descriptors has a much higher ratio of I frames as given in Figure 11. The percentage λ of I frames in one descriptor is calculated as given in Equation 4: h
λ=
N GoP ∗J
i
, (4) N/J where N is the number of frames of the original video sequence, GoP the number of frames per GoP, and J the number of descriptors of the video sequence. The operator [x] returns the smallest integer that is not less than its argument.
c
Aalborg University.
R-04-1005
Page 21
For a video length of 300 frames (as for container, silent, news, etc) the percentage of I frames increases by 44% for 12 descriptors. This explains the gaps in Figure 10. The effect vanish with larger video sequences and lower quantization values (see investigation of the quantization parameters).
Dependency of the video length
percentage of I frames in one descriptor
0.14
0.13
0.12
0.11
0.1
0.09
0.08 1
2
3
300
4
5
6
7
8 9 10 11 12 13 14 15 16 17 18 19 20 number of sub-streams 2000
infinity
Figure 11: Percentage of I frames in one descriptor for different video lengths.
4.2.2 CIF In Figure 12 the overhead of selected H26L video sequences in the CIF format for different number of sub–streams is given for all six video sequences introduced in Table 2.
4.3 Impact of the Quantization Parameter In Figure 69 the overhead for the container video sequence in the QCIF format for different quantization values between 1 and 51 is given. Simultaneously in Figure 70 we present also the bandwidth requirements for the container video sequence in the QCIF format for the same quantization values. In Figure 71 and 72 the overhand and the bandwidth requirements are given now for the foreman video sequence.
c
Aalborg University.
R-04-1005
Page 22
Comparison CIF H26L 1.2 1 0.8
overhead
0.6 0.4 0.2 0 -0.2 -0.4 0
2
4
6
8 10 12 number of streams
bridge-close bridge-far
14
16
highway mobile
18
20
tempete paris
Figure 12: Overhead of selected H26L video sequences in the CIF format for different number of sub–streams.
Container QCIF Container QCIF
1e+07
1.8 1.6
bandwidth [bytes]
1.4
overhead
1.2 1 0.8
1e+06
100000
0.6 0.4 0.2 10000 0
0 0
QP 1
2
4
QP 11
6
QP 21
8 10 12 number of sub-streams QP 31
14
16
18
QP 1 QP 41
4
6
QP 11
QP 21
8 10 12 number of sub-streams QP 31
14
QP 41
16
18
20
QP 51
QP 51
Figure 13: Overhead for the container video sequence in the QCIF format for different quantization values.
c
Aalborg University.
2
20
Figure 14: Bandwidth requirements for the container video sequence in the QCIF format for different quantization values.
R-04-1005
Page 23
Foreman QCIF Foreman QCIF
1e+07
1.6 1.4
bandwidth [bytes]
1.2
overhead
1 0.8 0.6
1e+06
100000
0.4 0.2 10000 0
0 0
QP 1
2
4
QP 11
6
QP 21
8 10 12 number of sub-streams QP 31
14
16
18
QP 1 QP 41
4
6
QP 11
QP 21
8 10 12 number of sub-streams QP 31
14
QP 41
16
18
20
QP 51
QP 51
Figure 15: Overhead for the foreman video sequence in the QCIF format for different quantization values.
c
Aalborg University.
2
20
Figure 16: Bandwidth requirements for the foreman video sequence in the QCIF format for different quantization values.
R-04-1005
Page 24
5 Network Overhead After investigating the overhead caused by the encoding approach, we want to highlight the overhead caused by the underlying communication system. Once the multiple descriptors are available they have to be conveyed to the receiver. In the IP world this is mostly done by using the real time protocol (RTP) in conjunction with the real time control protocol (RTCP). Both protocols transport their packet via the user datagram protocol (UDP) to the underlying Internet protocol (IP) layer. As given in Figure 17 the multiple description coded sources arrive at the RTP layer. Each data segment is packatized into an RTP packet. The different streams are distinguished by the SS
Application
Video MD stream 1
Video MD stream 2
Video MD stream ...
Video MD stream J
Audio MD stream 1
Audio MD stream 2
RTCP rep
RTCP rep
RTCP rep
socket
socket
RTP RTCP rep
RTCP rep
RTCP rep
UDP socket
socket
socket
socket
IP Figure 17: Transport of multiple description coded video and voice over RTP, UDP, and IP network layer.
Each layer will add an additional overhead to each data chunk that is transmitted. Using the RTP/UDP/IP suite will result in an overhead of 40 byte. Header compression schemes such as Van JAcobsen Header Compression (VJHC) ?? or Robust Header Compression (ROHC) ?? will mitigate the overhead. Interested readers are referred to ?? for header compression overview. The use of ROHC, disabling the UDP checksum and doing some more refinements, would decrease the overhead from 40 bytes to 2 bytes per frame. Without these refinements the compressed header is about 6 byte as it is shown in [24]. Is has to be
c
Aalborg University.
R-04-1005
Page 25
mentioned here, that header compression can be used only in case of single hop communication. In case of multi hop some forms of tagging is needed to be capable to route the packet through the network. In Figure 18 and 19 the network overhead for the foreman video sequence and three different quantization values (namely 1, 31, and 51) is given for IP version 4 and IP version 6, respectively. It can be seen that the overhead increases dramatically due to the network overhead and not due to the encoder overhead if large quantization values are used. For small quantization values the impact becomes less. Splitting the video sequence into 20 sub–streams lead to an overhead of 1.3 introduced by the encoder. If the network overhead of IPv6 is also taken under consideration the overhead increases up to 9.2. Therefore, header compression is needed.
Foreman QCIF 9 8
network overhead
7 6 5 4 3 2 1 0 0
2
4
QP 1 QP 31
6
8 10 12 number of sub-streams QP 51 QP 1 + Network
14
16
18
20
QP 31 + Network QP 51 + Network
Figure 18: Network Overhead (RTP/UDP/IPv4) for the foreman video sequence and two different quantization values.
c
Aalborg University.
R-04-1005
Page 26
Foreman QCIF 12
network overhead
10
8
6
4
2
0 0
2
4
QP 1 QP 31
6
8 10 12 number of sub-streams QP 51 QP 1 + Network
14
16
18
20
QP 31 + Network QP 51 + Network
Figure 19: Network Overhead (RTP/UDP/IPv6) for the foreman video sequence and two different quantization values.
c
Aalborg University.
R-04-1005
Page 27
6 Quality Measurements After the overhead measurements we investigate the quality of the video in dependency of successfully received sub–streams. As metric for the video quality we use the picture signal to noise (PSNR). Here the differences between two pictures are taken into account pixel–wise. For the quality measurements we focus only the H.26L encoder. The MDC is done by splitting the video stream on the basis of frames. Video quality is given by the Picture to Signal Noise (PSNR) values. The calculation is done as given in [17]. The PSNR calculations were done using the videometer tool [25]. The videometer tool calculates the PSNR values and is able to freeze video frames in case the following frames are lost. This is important to have some sort of error resilience. In Figure 20 the PSNR measurements using the foreman video sequences in the QCIF format versus percentage of successful received sub–streams is given. The successful received sub– streams are chosen randomly. For each combination (number of sub–streams and percentage), we have repeated the simulations multiple times with a confidence interval of 99%.
Foreman PSNR (Random) 36
PSNR [dB] of Y component
34 32 30 28 26 24 22 20 18 0
2
0.1
0.2
3
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received substreams 5
10
15
0.9
1
20
Figure 20: PSNR measurements using the foreman video sequences in the QCIF format versus percentage of successful received sub–streams (random approach).
By these results we observe that the PSNR values degrades for a larger number of J even if the same percentage of sub–streams is received. As an example we look at the point were 50% of the sub–streams are received for J = 2 and J = 20. With J = 2 the PSNR value is almost 31 dB, while J = 20 leads to a value slightly above 29 dB. The reason is that for J = 2 we alternating loose and receive a frame, while for J = 20, we might end up in not receiving any frame for a long time and than receiving ten frames in a row. The latter case we refer to as worst case, while the former one is referred to as best case. If we now could force the system operate in the best case, results as in Figure 21 can be
c
Aalborg University.
R-04-1005
Page 28
received. Now the PSNR values are mainly the same if the percentage of received sub-streams is also the same.
foreman PSNR (best case) 36 34
video quality [PSNR]
32 30 28 26 24 22 20 18 0
2
0.1
0.2
3
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams 5
10
15
0.9
1
20
Figure 21: PSNR measurements using the foreman video sequences in the QCIF format versus percentage of successful received sub–streams (best case).
On the other side if we have a look at the worst case we see the results in Figure 22. In case of J = 20 and 50% of the sub–streams are received the PSNR value is only 27 dB. As an example we compare the case where one stream out of two is received with the example where five out of ten are received (even though if they have the same percentage of received streams). The first example will always result in better quality as between two missing frames one correct frame will be received. In the second example the gap can be five frames. By means of the results achieved, we come up with an optimization scheme, where the receiver informs the sender or the sender is already aware (e.g. analyzing the RTP sender/receiver reports) which streams are received successfully and rearrange the streams at sender side. As an example if the first ten out of twenty streams are received successfully, while the other half is missing, the sender could rearrange the streams in that way that i.) the streams are mixed randomly or ii.) sending the odd frames over the existing channels and the even frames are skipped. For the foreman example this would lead to an improvement of 2.25 dB if the rearrangement is done randomly and 3.32 dB if the rearrangement is done in smarter way by assuring that the odd frames are received and the even frame are skipped. In Figure 23 the PSNR values versus time for the bad and a random case are given. It can be seen that that the random approach has significant higher PSNR values if the bad case do not receive any information resulting in a dramatic degradation of the PSNR values. In Figure 24, PSNR measurements versus percentage of successfully received sub–streams for a total of J = 15 descriptors are given. The results are obtained using 6 different video
c
Aalborg University.
R-04-1005
Page 29
foreman PSNR (worst case) 36 34
video quality [PSNR]
32 30 28 26 24 22 20 18 0
0.1
2
0.2
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams
3
5
10
15
0.9
1
20
Figure 22: PSNR measurements using the foreman video sequences in the QCIF format versus percentage of successful received sub–streams (worst case).
PSNR comparison 36
PSNR [dB] of Y component
34 32 30 28 26 24 22 20 18 10
15
20
25
30 frame
dedicated
35
40
45
50
random
Figure 23: PSNR values versus time for the bad case and a random case.
c
Aalborg University.
R-04-1005
Page 30
sequences. The foreman video sequence includes high motion. Whereas clarie and container video sequences include relatively lower motion than foreman. As we can observe in the figure, if the motion in a given video sequence is high, the slope of the PSNR degradation curve is also high. For example, for a loss of 60 % out of J = 15 descriptors, the PSNR degradation is 6 dB for foreman, 2 dB for claire and 0.5 dB for container video sequence. Thus, we conclude that the more the motion in a given video sequence, the more important is to receive as many descriptors as possible for a given J.
Compare PSNR (Random) for 15 descriptors 40 38 PSNR [dB] of Y component
36 34 32 30 28 26 24 22 20 0
0.1
0.2
foreman highway
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received substreams silent container
0.9
1
claire carphone
Figure 24: Comparison of PSNR degradation for different video sequences for J = 15.
7 Conclusion 8 Acknowledgment This work was done within the Jade project supported by SAMSUNG, Korea. The script for splitting and Merging was done by Patrick Seeling. Parts of the introduction were taken from one of our WWRF contribution in 2004. We further would like to thank Axel Meyer for his advice in video post-processing. Furthermore we would to thank Sterica Rein for the technical support generating the video sequences highway, bridge-far, and bridge-close in Eastern Europe. Also we would like to thank all anonymous people which helped us to realize this work.
c
Aalborg University.
R-04-1005
Page 31
References [1] J. G. Apostolopoulos, “Error Resilient Video Compression Through The Use of Multiple States,” in IEEE Int’l Conf. on Image Processing, Vancouver, BC, Canada, September 2000, vol. 3, pp. 352–355. [2] J. G. Apostolopoulos and S. J. Wee, ,” . [3] N. Gogate, D. M. Chung, S. S. Panwar, and Y. Wang, “Supporting Image and Video Applications in a Multihop Radio Environment Using Path Diversity and Multiple Description Coding,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 12, no. 9, pp. 777–834, September 2002. [4] “Video traces for network performance evaluation: http://trace.eas.asu.edu/yuv/yuv.html.
YUV 4:2:0 Video Sequences,”
¨ Erg¨ [5] A.C. Beˇ g en, Y. Altunba¸sak, and O. un, “Multi-path Selection for Multiple Description Encoded Video Streaming,” in IEEE Int’l Conf. on Communications, Anchorage,Alaska,USA, May 2003, vol. 3, pp. 1583–1589. ¨ Erg¨ [6] A. C. Beˇ g en, Y. Altunba¸sak, and O. un, “Fast Heuristics for Multi-Path Selection for Multiple Description Encoded Video Streaming,” in IEEE Int’l Conf. on Multimedia and Expo, Baltimore, Maryland, July 2003, pp. 517–520. [7] P. A. Chou, H. J. Wang, and V. N. Padmanabhan, “Layered Multiple Description Coding,” in In Proc. Packet Video Workshop, April 2003. [8] Vivek K Goyal, “Multiple Description Coding: Compression Meets the Network,” IEEE SIGNAL PROCESSING MAGAZINE, vol. 01, pp. 74–93, September 2001. [9] Yao Wang, Shivendra Panwar, Shunan Lin, and Shiwen Mao, “Wireless video transport using path diversity: multiple description vs layered coding,” Image Processing Proceedings, vol. 1, pp. I–21 – I–24, September 2002. [10] R. Prasad and S. Hara, Multicarrier Techniques for 4G Mobile Communications, Universal Personal Communications. Artech House, 2003. [11] Frank H.P. Fitzek, Ba¸sak Can, Ramjee Prasad, DS Park, and Youngkwon Cho, “Application of Multiple Description Coding in 4G Wireless Communication Systems,” WWRF 8 bis, February 2004. [12] M. Schaar and D. S. Turaga, “Multiple Description Scalable Coding Using Wavelet-Based Motion Compensated Temporal Filtering,” in IEEE Int’l Conf. on Image Processing, Barcelona, Spain, September 2003, vol. 3, pp. 489–492. [13] Y. J. Liang, E. G. Steinbach, and B. Girod., “Multi-stream Voice over IP Using Packet Path Diversity,” in IEEE 4th Workshop on Multimedia Signal Processing, 2001. [14] C. S. Kim and S. U. Lee, “Multiple Description Coding of Motion Fields for Robust Video Transmission,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 11, no. 9, pp. 999–1010, September 2001.
c
Aalborg University.
R-04-1005
Page 32
[15] Arizona State University, http://trace.eas.asu.edu.
“Video traces for network performance evaluation,”
[16] Frank Fitzek, Patrick Seeling, and Martin Reisslein, Wireless Internet, chapter Video Streaming in Wireless Internet, Number ISBN: 0849316316, Electrical Engineering & Applied Signal Processing Series. March 2004. [17] F.H.P. Fitzek, P. Seeling, and M. Reisslein, Wireless Internet, chapter Video Streaming in Wireless Internet, Electrical Engineering & Applied Signal Processing Series. CRC Press, 2004, To be published. [18] F.H.P. Fitzek and M. Reisslein, “MPEG–4 and H.263 Video Traces for Network Performance Evaluation,” Tech. Rep., Technical University of Berlin, 2000, TKN–00–06. [19] P. Seeling, F. Fitzek, M. Reisslein, and A. Wolisz, “H26L Video Traffic Measurements,” Tech. Rep. TKN-02-000, TKN, August 2002, www.acticom.info. [20] ITU-T/SG15, “Recommendation H.263, video coding for low bitrate communication,” 1996. [21] JVT, “JM / TML Software Encoder/decoder,” http://bs.hhi.de/ suehring/tml/, 2002, H.26L Software Coordination by Carsten Suehring. [22] Th. Wiegand, “H.26L Test Model Long–Term Number 9 (TML-9) draft0,” ITU-T Study Group 16, Dec. 2001. [23] Thomas Wiegand, “H26lstandard,” Tech. Rep., JVT, 2002. [24] P. Seeling, F.H.P. Fitzek, S. Hendrata, and M. Reisslein, “Video Quality Evaluation for Wireless Transmission with Robust Header Compression,” in Fourth International Conference on Information, Communications & Signal Processing and Fourth IEEE Pacific-Rim Conference on Multimedia (ICICS-PCM 2003). IEEE, december 2003. [25] F.H.P. Fitzek, P. Seeling, and M. Reisslein, “VideoMeter tool for YUV bitstreams,” Tech. Rep. acticom-02-001, acticom – mobile networks, Germany, October 2002.
c
Aalborg University.
R-04-1005
Page 33
A Additional Overhead Measurements A.1 H263 A.1.1 QCIF In this section the overhead of multiple description coded video sequence for 12 famous sequences in the QCIF format are given.
Bridge-Close QCIF H263
Bridge-Far QCIF H263
1.5
1 0.8
1
0.6
overhead
overhead
0.4 0.5
0
0.2 0 -0.2 -0.4
-0.5
-0.6 -0.8
-1
-1 0
5
10 number of sub-streams
total data
15
20
0
data per stream
5
10 number of sub-streams
total data
15
20
data per stream
Figure 25: Overhead of multiple description Figure 26: Overhead of multiple description coded video sequence bridge-close in coded video sequence bridge-far in the QCIF format using the H.263 enthe QCIF format using the H.263 encoder. coder.
Claire QCIF H263
Carphone QCIF H263 3.5
2
3 1.5 2.5 2 overhead
overhead
1
0.5
0
1.5 1 0.5 0
-0.5 -0.5 -1
-1 0
5
total data
10 number of sub-streams
15
0
20
5
total data
data per stream
10 number of sub-streams
15
20
data per stream
Figure 27: Overhead of multiple description Figure 28: Overhead of multiple description coded video sequence claire in the coded video sequence carphone in QCIF format using the H.263 enthe QCIF format using the H.263 encoder. coder.
c
Aalborg University.
R-04-1005
Page 34
Container QCIF H263
Foreman QCIF H263
3.5
2
3 1.5 2.5 1 overhead
overhead
2 1.5 1 0.5
0.5
0
0 -0.5 -0.5 -1
-1 0
5
10 number of sub-streams
total data
15
20
0
data per stream
5
10 number of sub-streams
total data
15
20
data per stream
Figure 29: Overhead of multiple description Figure 30: Overhead of multiple description coded video sequence container in coded video sequence foreman in the the QCIF format using the H.263 enQCIF format using the H.263 encoder. coder.
Highway QCIF H263
Grandma QCIF H263 1.5
3 2.5
1 2
overhead
overhead
1.5 1 0.5
0.5
0
0 -0.5 -0.5 -1
-1 0
5
total data
10 number of sub-streams
15
0
20
5
total data
data per stream
10 number of sub-streams
15
20
data per stream
Figure 31: Overhead of multiple description Figure 32: Overhead of multiple description coded video sequence highway in the coded video sequence grandma in QCIF format using the H.263 enthe QCIF format using the H.263 encoder. coder.
c
Aalborg University.
R-04-1005
Page 35
News QCIF H263 3
2.5
2.5
2
2
1.5
1.5 overhead
overhead
Mthr_Dotr QCIF H263 3
1 0.5
1 0.5
0
0
-0.5
-0.5
-1
-1 0
5
10 number of sub-streams
total data
15
20
0
data per stream
5
10 number of sub-streams
total data
15
20
data per stream
Figure 33: Overhead of multiple description Figure 34: Overhead of multiple description coded video sequence mthr dotr in coded video sequence news in the QCIF format using the H.263 enthe QCIF format using the H.263 encoder. coder.
Silent QCIF H263
Salesman QCIF H263 2.5
3.5 3
2
2.5 1.5 overhead
overhead
2 1.5 1
1 0.5
0.5 0 0 -0.5
-0.5
-1
-1 0
5
total data
10 number of sub-streams
15
0
20
5
total data
data per stream
10 number of sub-streams
15
20
data per stream
Figure 35: Overhead of multiple description Figure 36: Overhead of multiple description coded video sequence silent in the coded video sequence salesman in QCIF format using the H.263 enthe QCIF format using the H.263 encoder. coder.
c
Aalborg University.
R-04-1005
Page 36
A.1.2 CIF In this section the overhead of multiple description coded video sequence for 6 famous sequences in the CIF format are given.
Bridge-Far 1.2
1
1
0.8
0.8
0.6
0.6
0.4
0.4 overhead
overhead
Bridge-Close 1.2
0.2 0
0.2 0
-0.2
-0.2
-0.4
-0.4
-0.6
-0.6
-0.8
-0.8
-1
-1 0
5
10 number of sub-streams
total data
15
20
0
data per stream
5
10 number of sub-streams
total data
15
20
data per stream
Figure 37: Overhead of multiple description Figure 38: Overhead of multiple description coded video sequence bridge-close in coded video sequence bridge-far in the CIF format using the H.263 enthe CIF format using the H.263 encoder. coder.
Mobile
Highway 0.8
2
0.6 1.5 0.4 0.2 overhead
overhead
1
0.5
0 -0.2 -0.4
0
-0.6 -0.5 -0.8 -1
-1 0
5
total data
10 number of sub-streams
15
0
20
5
total data
data per stream
10 number of sub-streams
15
20
data per stream
Figure 39: Overhead of multiple description Figure 40: Overhead of multiple description coded video sequence mobile in the coded video sequence highway in the CIF format using the H.263 encoder. CIF format using the H.263 encoder.
A.2 H26L
c
Aalborg University.
R-04-1005
Page 37
Paris
Tempete
2
1 0.8
1.5
0.6 0.4 overhead
overhead
1
0.5
0
0.2 0 -0.2 -0.4 -0.6
-0.5
-0.8 -1
-1 0
5
total data
10 number of sub-streams
15
20
0
data per stream
5
total data
10 number of sub-streams
15
20
data per stream
Figure 41: Overhead of multiple description Figure 42: Overhead of multiple description coded video sequence paris in the coded video sequence tempete in the CIF format using the H.263 encoder. CIF format using the H.263 encoder.
c
Aalborg University.
R-04-1005
Page 38
A.2.1 QCIF In this section the total amount of data and the amount per sub-stream for 12 famous video sequences in the QCIF format are given.
Bridge-Close QCIF H26L
Bridge-Far QCIF H26L
0.8
0.2
0.6 0 0.4 -0.2 overhead
overhead
0.2 0 -0.2 -0.4
-0.4
-0.6
-0.6 -0.8 -0.8 -1
-1 0
5
10 number of sub-streams
total data
15
20
0
data per stream
5
10 number of sub-streams
total data
15
20
data per stream
Figure 43: Overhead of multiple description Figure 44: Overhead of multiple description coded video sequence bridge-close in coded video sequence bridge-far in the QCIF format using the H.26L the QCIF format using the H.26L encoder. encoder.
Carphone QCIF H26L
Claire QCIF H26L
1.2
1.5
1 0.8
1
0.6
overhead
overhead
0.4 0.2 0 -0.2
0.5
0
-0.4 -0.5
-0.6 -0.8 -1
-1 0
5
total data
10 number of sub-streams
15
20
0
data per stream
5
total data
10 number of sub-streams
15
20
data per stream
Figure 45: Overhead of multiple description Figure 46: Overhead of multiple description coded video sequence carphone in coded video sequence claire in the the QCIF format using the H.26L QCIF format using the H.26L enencoder. coder.
c
Aalborg University.
R-04-1005
Page 39
Container QCIF H26L
Foreman QCIF H26L
2
1.5
1.5
1
overhead
overhead
1
0.5
0.5
0
0 -0.5
-0.5
-1
-1 0
5
10 number of sub-streams
total data
15
20
0
data per stream
5
10 number of sub-streams
total data
15
20
data per stream
Figure 47: Overhead of multiple description Figure 48: Overhead of multiple description coded video sequence container in coded video sequence foreman in the the QCIF format using the H.26L QCIF format using the H.26L enencoder. coder.
Highway QCIF H26L 1
0.8
0.8
0.6
0.6
0.4
0.4
0.2
0.2
overhead
overhead
Grandma QCIF H26L 1
0 -0.2
0 -0.2
-0.4
-0.4
-0.6
-0.6
-0.8
-0.8 -1
-1 0
5
total data
10 number of sub-streams
15
0
20
5
total data
data per stream
10 number of sub-streams
15
20
data per stream
Figure 49: Overhead of multiple description Figure 50: Overhead of multiple description coded video sequence highway in the coded video sequence grandma in QCIF format using the H.26L enthe QCIF format using the H.26L coder. encoder.
c
Aalborg University.
R-04-1005
Page 40
Mthr_Dotr QCIF H26L
News QCIF H26L
2
1.5
1.5
1
overhead
overhead
1
0.5
0.5
0
0 -0.5
-0.5
-1
-1 0
5
10 number of sub-streams
total data
15
20
0
data per stream
5
10 number of sub-streams
total data
15
20
data per stream
Figure 51: Overhead of multiple description Figure 52: Overhead of multiple description coded video sequence mthr dotr in coded video sequence news in the QCIF format using the H.26L enthe QCIF format using the H.26L coder. encoder.
Silent QCIF H26L
Salesman QCIF H26L 1.2
1
1
0.8
0.8
0.6
0.6
0.4
0.4 overhead
overhead
1.2
0.2 0
0.2 0
-0.2
-0.2
-0.4
-0.4
-0.6
-0.6
-0.8
-0.8 -1
-1 0
5
total data
10 number of sub-streams
15
0
20
5
total data
data per stream
10 number of sub-streams
15
20
data per stream
Figure 53: Overhead of multiple description Figure 54: Overhead of multiple description coded video sequence silent in the coded video sequence salesman in QCIF format using the H.26L enthe QCIF format using the H.26L coder. encoder.
c
Aalborg University.
R-04-1005
Page 41
A.2.2 CIF In this section the total amount of data and the amount per sub-stream for 6 famous video sequences in the CIF format are given.
Bridge-Close
Bridge-Far 2.5
2
2
1.5
1.5 overhead
overhead
2.5
1 0.5
1 0.5
0
0
-0.5
-0.5
-1
-1 0
5
10 number of sub-streams
total data
15
20
0
data per stream
5
10 number of sub-streams
total data
15
20
data per stream
Figure 55: Overhead of multiple description Figure 56: Overhead of multiple description coded video sequence bridge-close in coded video sequence bridge-far in the CIF format using the H.26L enthe CIF format using the H.26L encoder. coder.
Highway
Mobile
1
1
0.8
0.8 0.6
0.6
0.4 overhead
overhead
0.4 0.2 0 -0.2
0.2 0 -0.2 -0.4
-0.4
-0.6
-0.6
-0.8
-0.8
-1 0
5
total data
10 number of sub-streams
15
20
0
data per stream
5
total data
10 number of sub-streams
15
20
data per stream
Figure 57: Overhead of multiple description Figure 58: Overhead of multiple description coded video sequence highway in coded video sequence mobile in the the CIF format using the H.26L enCIF format using the H.26L encoder. coder.
c
Aalborg University.
R-04-1005
Page 42
Paris
Tempete
2.5
1.5
2 1
overhead
overhead
1.5 1 0.5
0.5
0
0 -0.5 -0.5 -1
-1 0
5
total data
10 number of sub-streams
15
20
0
data per stream
5
total data
10 number of sub-streams
15
20
data per stream
Figure 59: Overhead of multiple description Figure 60: Overhead of multiple description coded video sequence parisparis in coded video sequence tempetetemthe CIF format using the H.26L enpete in the CIF format using the H.26L encoder. coder.
c
Aalborg University.
R-04-1005
Page 43
B Additional Quantization Measurements
Bridge-Close QCIF Bridge-Close QCIF
1e+08
0.7 0.6 1e+07 bandwidth [bytes]
0.5
overhead
0.4 0.3
1e+06
0.2 100000 0.1 0 10000 0
-0.1 0
2
QP 1
4
QP 11
6
QP 21
8 10 12 number of sub-streams QP 31
14
16
18
QP 1 QP 41
2
4
6
20 QP 11
QP 21
8 10 12 number of sub-streams QP 31
14
16
QP 41
18
20
QP 51
QP 51
Figure 61: Overhead for the bridge-close video sequence in the QCIF format for different quantization values.
Figure 62: Bandwidth requirements for the bridge-close video sequence in the QCIF format for different quantization values.
Bridge-Far QCIF Bridge-Far QCIF
1e+08
0.2 0.18 0.16
1e+07 bandwidth [bytes]
0.14
overhead
0.12 0.1 0.08
1e+06
0.06 100000
0.04 0.02 0
10000 0
-0.02 0
QP 1
2
4
QP 11
6
QP 21
8 10 12 number of sub-streams QP 31
14
16
18
QP 1 QP 41
QP 11
4
6
QP 21
8 10 12 number of sub-streams QP 31
14
QP 41
16
18
20
QP 51
QP 51
Figure 63: Overhead for the bridge-far video sequence in the QCIF format for different quantization values.
c
Aalborg University.
2
20
Figure 64: Bandwidth requirements for the bridge-far video sequence in the QCIF format for different quantization values.
R-04-1005
Page 44
Carphone QCIF Carphone QCIF
1e+07
1.2
bandwidth [bytes]
1
overhead
0.8
0.6
1e+06
100000
0.4
0.2 10000 0
0 0
QP 1
2
4
QP 11
6
QP 21
8 10 12 number of sub-streams QP 31
14
16
18
QP 1 QP 41
2
4
6
20 QP 11
QP 21
8 10 12 number of sub-streams QP 31
14
16
QP 41
18
20
QP 51
QP 51
Figure 65: Overhead for the carphone video sequence in the QCIF format for different quantization values.
Figure 66: Bandwidth requirements for the carphone video sequence in the QCIF format for different quantization values.
Claire QCIF Claire QCIF
1e+07
1.4 1.2
bandwidth [bytes]
overhead
1 0.8 0.6
1e+06
100000
0.4 0.2 10000 0
0 0
QP 1
2
4
QP 11
6
QP 21
8 10 12 number of sub-streams QP 31
14
16
18
QP 1 QP 41
QP 11
4
6
QP 21
8 10 12 number of sub-streams QP 31
14
QP 41
16
18
20
QP 51
QP 51
Figure 67: Overhead for the claire video sequence in the QCIF format for different quantization values.
c
Aalborg University.
2
20
Figure 68: Bandwidth requirements for the claire video sequence in the QCIF format for different quantization values.
R-04-1005
Page 45
Container QCIF Container QCIF
1e+07
1.8 1.6
bandwidth [bytes]
1.4
overhead
1.2 1 0.8
1e+06
100000
0.6 0.4 0.2 10000 0
0 0
QP 1
2
4
QP 11
6
QP 21
8 10 12 number of sub-streams QP 31
14
16
18
QP 1 QP 41
2
4
6
20 QP 11
QP 21
8 10 12 number of sub-streams QP 31
14
16
QP 41
18
20
QP 51
QP 51
Figure 69: Overhead for the container video sequence in the QCIF format for different quantization values.
Figure 70: Bandwidth requirements for the container video sequence in the QCIF format for different quantization values.
Foreman QCIF Foreman QCIF
1e+07
1.6 1.4
bandwidth [bytes]
1.2
overhead
1 0.8 0.6
1e+06
100000
0.4 0.2 10000 0
0 0
QP 1
2
4
QP 11
6
QP 21
8 10 12 number of sub-streams QP 31
14
16
18
QP 1 QP 41
QP 11
4
6
QP 21
8 10 12 number of sub-streams QP 31
14
QP 41
16
18
20
QP 51
QP 51
Figure 71: Overhead for the foreman video sequence in the QCIF format for different quantization values.
c
Aalborg University.
2
20
Figure 72: Bandwidth requirements for the foreman video sequence in the QCIF format for different quantization values.
R-04-1005
Page 46
Grandma QCIF Grandma QCIF
1e+08
1 0.9 1e+07 bandwidth [bytes]
0.8
overhead
0.7 0.6 0.5 0.4
1e+06
100000
0.3 0.2 0.1
10000 0
0 0
2
QP 1
4
QP 11
6
QP 21
8 10 12 number of sub-streams QP 31
14
16
18
QP 1 QP 41
2
4
6
20 QP 11
QP 21
8 10 12 number of sub-streams QP 31
14
16
QP 41
18
20
QP 51
QP 51
Figure 73: Overhead for the grandma video sequence in the QCIF format for different quantization values.
Figure 74: Bandwidth requirements for the grandma video sequence in the QCIF format for different quantization values.
Highway QCIF Highway QCIF
1e+08
1 0.9 0.8
1e+07 bandwidth [bytes]
0.7
overhead
0.6 0.5 0.4
1e+06
0.3 100000
0.2 0.1 0
10000 0
-0.1 0
QP 1
2
4
QP 11
6
QP 21
8 10 12 number of sub-streams QP 31
14
16
18
QP 1 QP 41
QP 11
4
6
QP 21
8 10 12 number of sub-streams QP 31
14
QP 41
16
18
20
QP 51
QP 51
Figure 75: Overhead for the highway video sequence in the QCIF format for different quantization values.
c
Aalborg University.
2
20
Figure 76: Bandwidth requirements for the highway video sequence in the QCIF format for different quantization values.
R-04-1005
Page 47
Mthr_Dotr QCIF Mthr_Dotr QCIF
1e+08
1.6 1.4 1e+07 bandwidth [bytes]
1.2
overhead
1 0.8
1e+06
0.6 100000 0.4 0.2 10000 0
0 0
QP 1
2
4
QP 11
6
QP 21
8 10 12 number of sub-streams QP 31
14
16
18
QP 1 QP 41
2
4
6
20 QP 11
QP 21
8 10 12 number of sub-streams QP 31
14
16
QP 41
18
20
QP 51
QP 51
Figure 77: Overhead for the mthr dotr video sequence in the QCIF format for different quantization values.
Figure 78: Bandwidth requirements for the mthr dotr video sequence in the QCIF format for different quantization values.
News QCIF News QCIF
1e+07
1.4 1.2
bandwidth [bytes]
overhead
1 0.8 0.6
1e+06
100000
0.4 0.2 10000 0
0 0
QP 1
2
4
QP 11
6
QP 21
8 10 12 number of sub-streams QP 31
14
16
18
QP 1 QP 41
QP 11
4
6
QP 21
8 10 12 number of sub-streams QP 31
14
QP 41
16
18
20
QP 51
QP 51
Figure 79: Overhead for the news video sequence in the QCIF format for different quantization values.
c
Aalborg University.
2
20
Figure 80: Bandwidth requirements for the news video sequence in the QCIF format for different quantization values.
R-04-1005
Page 48
Salesman QCIF Salesman QCIF
1e+07
1.2 1
bandwidth [bytes]
overhead
0.8 0.6 0.4
1e+06
100000
0.2 0 10000 0
-0.2 0
QP 1
2
4
QP 11
6
QP 21
8 10 12 number of sub-streams QP 31
14
16
18
QP 1 QP 41
2
4
6
20 QP 11
QP 21
8 10 12 number of sub-streams QP 31
14
16
QP 41
18
20
QP 51
QP 51
Figure 81: Overhead for the salesman video sequence in the QCIF format for different quantization values.
Figure 82: Bandwidth requirements for the salesman video sequence in the QCIF format for different quantization values.
Silent QCIF Silent QCIF
1e+07
1.2
bandwidth [bytes]
1
overhead
0.8
0.6
1e+06
100000
0.4
0.2 10000 0
0 0
QP 1
2
4
QP 11
6
QP 21
8 10 12 number of sub-streams QP 31
14
16
18
QP 1 QP 41
QP 11
4
6
QP 21
8 10 12 number of sub-streams QP 31
14
QP 41
16
18
20
QP 51
QP 51
Figure 83: Overhead for the silent video sequence in the QCIF format for different quantization values.
c
Aalborg University.
2
20
Figure 84: Bandwidth requirements for the silent video sequence in the QCIF format for different quantization values.
R-04-1005
Page 49
C Additional Quality Measurements
carphone PSNR comparison 36 35.5
video quality [PSNR]
35 34.5 34 33.5 33 32.5 32 31.5 0.5
Mode F/T
0.6 0.7 0.8 0.9 percentage of successfully received sub-streams Mode R
1
worst case
Figure 85: PSNR comparison for different modes and the worst case investigating carphone video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 2
c
Aalborg University.
R-04-1005
Page 50
carphone PSNR comparison 36
video quality [PSNR]
35
34
33
32
31
30 0.3
0.4
Mode F/T
0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 86: PSNR comparison for different modes and the worst case investigating carphone video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 3
carphone PSNR comparison 36 35
video quality [PSNR]
34 33 32 31 30 29 28 27 0.2
0.3
Mode F/T
0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 87: PSNR comparison for different modes and the worst case investigating carphone video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 5
c
Aalborg University.
R-04-1005
Page 51
carphone PSNR comparison 36
video quality [PSNR]
34
32
30
28
26
24 0.1
0.2
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams
Mode F/T
Mode R
0.9
1
worst case
Figure 88: PSNR comparison for different modes and the worst case investigating carphone video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 10
carphone PSNR comparison 36
video quality [PSNR]
34 32 30 28 26 24 22 0
0.1
Mode F/T
0.2
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 89: PSNR comparison for different modes and the worst case investigating carphone video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 15
c
Aalborg University.
R-04-1005
Page 52
carphone PSNR comparison 36
video quality [PSNR]
34 32 30 28 26 24 22 0
0.1
Mode F/T
0.2
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 90: PSNR comparison for different modes and the worst case investigating carphone video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 20
claire PSNR comparison 38.6 38.5
video quality [PSNR]
38.4 38.3 38.2 38.1 38 37.9 37.8 37.7 37.6 0.5
Mode F/T
0.6 0.7 0.8 0.9 percentage of successfully received sub-streams Mode R
1
worst case
Figure 91: PSNR comparison for different modes and the worst case investigating claire video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 2
c
Aalborg University.
R-04-1005
Page 53
claire PSNR comparison 38.6 38.4
video quality [PSNR]
38.2 38 37.8 37.6 37.4 37.2 37 36.8 36.6 0.3
0.4
Mode F/T
0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 92: PSNR comparison for different modes and the worst case investigating claire video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 3
claire PSNR comparison 38.5 38
video quality [PSNR]
37.5 37 36.5 36 35.5 35 34.5 0.2
0.3
Mode F/T
0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 93: PSNR comparison for different modes and the worst case investigating claire video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 5
c
Aalborg University.
R-04-1005
Page 54
claire PSNR comparison 39
video quality [PSNR]
38 37 36 35 34 33 32 0.1
0.2
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams
Mode F/T
Mode R
0.9
1
worst case
Figure 94: PSNR comparison for different modes and the worst case investigating claire video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 10
claire PSNR comparison 39 38
video quality [PSNR]
37 36 35 34 33 32 31 30 0
0.1
Mode F/T
0.2
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 95: PSNR comparison for different modes and the worst case investigating claire video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 15
c
Aalborg University.
R-04-1005
Page 55
claire PSNR comparison 39 38 37 video quality [PSNR]
36 35 34 33 32 31 30 29 28 0
0.1
Mode F/T
0.2
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 96: PSNR comparison for different modes and the worst case investigating claire video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 20
container PSNR comparison 34.9 34.85
video quality [PSNR]
34.8 34.75 34.7 34.65 34.6 34.55 34.5 34.45 34.4 0.5
Mode F/T
0.6 0.7 0.8 0.9 percentage of successfully received sub-streams Mode R
1
worst case
Figure 97: PSNR comparison for different modes and the worst case investigating container video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 2
c
Aalborg University.
R-04-1005
Page 56
container PSNR comparison 34.9 34.8 34.7 video quality [PSNR]
34.6 34.5 34.4 34.3 34.2 34.1 34 33.9 33.8 0.3
0.4
Mode F/T
0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 98: PSNR comparison for different modes and the worst case investigating container video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 3
container PSNR comparison 34.8 34.6 34.4 video quality [PSNR]
34.2 34 33.8 33.6 33.4 33.2 33 32.8 32.6 0.2
0.3
Mode F/T
0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 99: PSNR comparison for different modes and the worst case investigating container video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 5
c
Aalborg University.
R-04-1005
Page 57
container PSNR comparison 35 34.5
video quality [PSNR]
34 33.5 33 32.5 32 31.5 31 30.5 30 0.1
0.2
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams
Mode F/T
Mode R
0.9
1
worst case
Figure 100: PSNR comparison for different modes and the worst case investigating container video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 10
container PSNR comparison 35
video quality [PSNR]
34 33 32 31 30 29 28 0
0.1
Mode F/T
0.2
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 101: PSNR comparison for different modes and the worst case investigating container video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 15
c
Aalborg University.
R-04-1005
Page 58
container PSNR comparison 35 34
video quality [PSNR]
33 32 31 30 29 28 27 26 0
0.1
Mode F/T
0.2
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 102: PSNR comparison for different modes and the worst case investigating container video sequences in the QCIF format versus percentage of successful received sub– streams. Number of descriptors is 20
foreman PSNR comparison 34.5 34
video quality [PSNR]
33.5 33 32.5 32 31.5 31 30.5 0.5
Mode F/T
0.6 0.7 0.8 0.9 percentage of successfully received sub-streams Mode R
1
worst case
Figure 103: PSNR comparison for different modes and the worst case investigating foreman video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 2
c
Aalborg University.
R-04-1005
Page 59
foreman PSNR comparison 35
video quality [PSNR]
34 33 32 31 30 29 28 0.3
0.4
Mode F/T
0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 104: PSNR comparison for different modes and the worst case investigating foreman video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 3
foreman PSNR comparison 35 34
video quality [PSNR]
33 32 31 30 29 28 27 26 25 0.2
0.3
Mode F/T
0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 105: PSNR comparison for different modes and the worst case investigating foreman video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 5
c
Aalborg University.
R-04-1005
Page 60
foreman PSNR comparison 36 34
video quality [PSNR]
32 30 28 26 24 22 20 0.1
0.2
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams
Mode F/T
Mode R
0.9
1
worst case
Figure 106: PSNR comparison for different modes and the worst case investigating foreman video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 10
foreman PSNR comparison 36 34
video quality [PSNR]
32 30 28 26 24 22 20 18 0
0.1
Mode F/T
0.2
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 107: PSNR comparison for different modes and the worst case investigating foreman video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 15
c
Aalborg University.
R-04-1005
Page 61
foreman PSNR comparison 36 34
video quality [PSNR]
32 30 28 26 24 22 20 18 0
0.1
Mode F/T
0.2
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 108: PSNR comparison for different modes and the worst case investigating foreman video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 20
highway PSNR comparison 37
video quality [PSNR]
36.5
36
35.5
35
34.5
34 0.5
Mode F/T
0.6 0.7 0.8 0.9 percentage of successfully received sub-streams Mode R
1
worst case
Figure 109: PSNR comparison for different modes and the worst case investigating highway video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 2
c
Aalborg University.
R-04-1005
Page 62
highway PSNR comparison 36.5 36
video quality [PSNR]
35.5 35 34.5 34 33.5 33 32.5 0.3
0.4
Mode F/T
0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 110: PSNR comparison for different modes and the worst case investigating highway video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 3
highway PSNR comparison 37
video quality [PSNR]
36
35
34
33
32
31 0.2
0.3
Mode F/T
0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 111: PSNR comparison for different modes and the worst case investigating highway video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 5
c
Aalborg University.
R-04-1005
Page 63
highway PSNR comparison 37 36
video quality [PSNR]
35 34 33 32 31 30 29 0.1
0.2
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams
Mode F/T
Mode R
0.9
1
worst case
Figure 112: PSNR comparison for different modes and the worst case investigating highway video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 10
highway PSNR comparison 37 36
video quality [PSNR]
35 34 33 32 31 30 29 28 0
0.1
Mode F/T
0.2
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 113: PSNR comparison for different modes and the worst case investigating highway video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 15
c
Aalborg University.
R-04-1005
Page 64
highway PSNR comparison 37 36
video quality [PSNR]
35 34 33 32 31 30 29 28 27 0
0.1
Mode F/T
0.2
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 114: PSNR comparison for different modes and the worst case investigating highway video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 20
silent PSNR comparison 34.5
video quality [PSNR]
34
33.5
33
32.5
32 0.5
Mode F/T
0.6 0.7 0.8 0.9 percentage of successfully received sub-streams Mode R
1
worst case
Figure 115: PSNR comparison for different modes and the worst case investigating silent video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 2
c
Aalborg University.
R-04-1005
Page 65
silent PSNR comparison 34.5 34
video quality [PSNR]
33.5 33 32.5 32 31.5 31 30.5 0.3
0.4
Mode F/T
0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 116: PSNR comparison for different modes and the worst case investigating silent video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 3
silent PSNR comparison 35
video quality [PSNR]
34 33 32 31 30 29 28 0.2
0.3
Mode F/T
0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 117: PSNR comparison for different modes and the worst case investigating silent video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 5
c
Aalborg University.
R-04-1005
Page 66
silent PSNR comparison 35 34
video quality [PSNR]
33 32 31 30 29 28 27 26 0.1
0.2
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams
Mode F/T
Mode R
0.9
1
worst case
Figure 118: PSNR comparison for different modes and the worst case investigating silent video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 10
silent PSNR comparison 35 34
video quality [PSNR]
33 32 31 30 29 28 27 26 25 0
0.1
Mode F/T
0.2
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 119: PSNR comparison for different modes and the worst case investigating silent video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 15
c
Aalborg University.
R-04-1005
Page 67
silent PSNR comparison 36
video quality [PSNR]
34 32 30 28 26 24 22 0
0.1
Mode F/T
0.2
0.3 0.4 0.5 0.6 0.7 0.8 percentage of successfully received sub-streams Mode R
0.9
1
worst case
Figure 120: PSNR comparison for different modes and the worst case investigating silent video sequences in the QCIF format versus percentage of successful received sub–streams. Number of descriptors is 20
c
Aalborg University.
R-04-1005
Page 68