Pseudo Video Sequence Coding of Integral Imaging based 3D Images
There is a significant
amount of inherent redundancy in an integral image. Applying a 2D-image coding
scheme to an integral image fail to extract and reduce this redundancy
efficiently. We have proposed a pre-processing and compression scheme that aims
to enhance the compression efficiency of integral images. The scheme first
transforms the still integral image into a pseudo video sequence consisting of
sub-images, which is then compressed using an H.264 video encoder. Thereby
leveraging on the efficient temporal and spatial coding tools of the
H.264-standard for coding integral images.
The improvement in compression efficiency of using this scheme is evaluated and
presented. Different parameterizations are also investigated, which affects the
coding efficiency at low range bitrates. An average PSNR increase of 5.7 dB or
more, compared to JPEG 2000, is observed on a set of reference images. Compared
to other coding schemes the introduced artifacts are distributed more
homogenously within the depicted 3D-volume.
Proposed coding scheme
To lay the groundwork we first define an integral imaging based 3D-image (II-frame) with a resolution of M x N pixels as
II(m, n) = [IIR, IIG, IIB]T, (1)
where m = 0, 1, ..., M-1 and n = 0, 1, ... N-1 are the horizontal and vertical position of a II-frame pixel, which constitute a RGB-triplet. In our simulations we assume a lens array with rectangular lenslets, but the proposed scheme is applicable to hexagonal and lenticular lenslets as well provided that the number of views are sufficiently high. This gives K·L elementary images (EI) with a resolution of U x V pixels that are defined as
EIk,l (u
, v) = II(k·U + u, l·V + v), (2)where k = 0, 1, ..., K-1 and l = 0, 1, ... L-1 is the position of each EI and u= 0, 1, ..., U-1 and v = 0, 1, ... V-1 is the pixel positions within every EI.
As a first step in our proposed coding scheme we extract the set of sub-images (SI) according to
SIu,v(
k, l) = II(k·U + u, l·V + v). (3)where consequently each of the U·V SIs have a resolution of K x L pixels. Figure 1 shows examples of the above defined images II, EI and SI.
![]() |
![]() |
![]() |
![]() |
(a) II-frame | (b) EI | (c) SI | (d) 2D-image |
Figure 1. Synthesized II-frame divided into its component parts. (d) shows the scene captured by a 2D-camera merely for reference.
The SIs are then considered as pictures in a pseudo video sequence (PVS) according to
PV S(t) = SIu(t),v(t)(k, l) = II(k·U + u(t), l·V + v(t)), (4)
where t is the pseudo time index, or picture number, of the PVS. Selecting different functions u(t) and v(t), or selection orders, results in PVSs with different coding properties. The PVS is then coded using a H.264/MPEG-4 AVC encoder. The three steps are illustrated in Figure 2.
Figure 2. Block diagram of the proposed coding scheme
Results
The proposed coding scheme outperforms JPEG2000 and other proposed PVS-schemes for resolution prioritizing II-techniques where the number of lenslets is large. For depth prioritizing II, with a lower number of high resolution EI, the benefits are less. A proper choice of selection order for the proposed scheme becomes more important at low bit rates. Compared to the references the proposed coding scheme distributes the coding artifacts homogenously over the 3D-scene's depth.
Proposed
scheme vs. 2D-image coding and other PVS-based coding schemes ("Car")
Different
selection orders' effect on coding efficiency ("Twins")
Lens array properties effect on coding efficiency
("Cuboid")
Induced coding artifacts ("Apples", front view @ 0.15 bpp)
Original (24
bpp)
Proposed SI-based PVS coding scheme
EI-based PVC coding
scheme
JPEG2000
Publications
"Multiview image coding scheme transformations: artifact characteristics and effects on perceived 3D quality"
Roger Olsson and Mårten Sjöström
Proceedings of Stereoscopic Displays and Applications, January 2010
Web page accompanying paper
"Empirical Rate-Distortion Analysis of JPEG2000 3D
and H.264/AVC Coded Integral Imaging Based 3D-Images"
Roger Olsson
Proceedings of 3DTV Conference, IEEE/EURASIP/MPEG-IF,
May 2008
Web page accompanying paper
"A novel quality metric for evaluating
depth distribution of artifacts in coded still 3D images",
Roger Olsson and
Mårten Sjöström
Electronic Imaging - Stereoscopic Displays and Applications, IS&T/SPIE,
January 2008
"A depth dependent
quality metric for evaluation of coded integral imaging based 3D-images",
Roger Olsson
"Evaluation
of Combined Pre-Processing and H.264-Compression Schemes for 3D Integral Images",
Roger Olsson, Mårten
Sjöström
and Youzhi Xu,
Proceedings of Electronic Imaging - VCIP, SPIEIS&T/SPIE, January
2007
"A
Combined Pre-processing and H.264-compression Scheme for 3D Integral Images"
Roger Olsson, Mårten Sjöström and Youzhi Xu,
Proceedings of ICIP, IEEE, October 2006