Improved seam carving for stereo image resizing

Yue, Bin; Hou, Chun-ping; Zhou, Yuan

doi:10.1186/1687-1499-2013-116

Research
Open access
Published: 26 April 2013

EURASIP Journal on Wireless Communications and Networking volume 2013, Article number: 116 (2013)

Abstract

When stereo images are shown on three-dimensional (3D) display devices with different aspect ratios, resizing algorithms designed for single images can distort the shape and depth of the stereo image’s main content. This paper proposes a novel method for retargeting stereo image pairs without distorting important objects in the scene while maintaining the consistency between the left and right images. We extend the seam carving algorithm to stereo images. The novelty of our method is that important objects are determined by jointly considering the gradient intensities and the visual fusion area. The retargeted stereo pair has a feasible 3D interpretation that is similar to the original one. Our method protects the important content and reduces the visual distortion in each of the images as well as the depth distortion. Experimental results demonstrate that the proposed method effectively guarantees the geometric consistency of resized stereo images.

1. Introduction

With the development of three-dimensional (3D) display technology, many kinds of 3D display devices are on the market, from cell phones to IMAX screens. These devices have resolutions with various aspect ratios. When the same stereo image is displayed on screens with different aspect ratios, it needs to be resized without distorting the shape and depth of the main content.

Normal image scaling only resizes images uniformly and ignores the distortion of the main content. Cropping protects the retained content but discards many pixels at the image periphery. Image resizing methods attempt to adapt the image content to the screen without distorting the main objects in the scene. Seam carving [1] is an effective method for image resizing and is termed content-aware resizing: the operator retargets the image to a new size while taking the main content into account. When the aspect ratio of a single image is changed, seam carving gives much better results than scaling and cropping. However, seam carving determines seams only from the change of intensities around the seam. If the intensities of the main content vary less than those of the background, the carved or inserted seams will pass through the main content and distort it.

Existing resizing methods are effective for single images. However, these methods do not give enough protection to the important content, and using them on stereo images will distort the depth perception. For stereo images, the resizing operator must consider not only the distortion of shape but also the distortion of depth. Depth perception is derived from the small differences in the location of homologous, or corresponding, points in the image pair incident on the retinas of the eyes [2]. A general stereo image is composed of two planar images of the same scene taken from different viewpoints. The difference between corresponding points in the two views generates disparity in the stereo image. When the left and right eyes each view a different viewpoint, the observer perceives depth according to the stereo image disparity. Clearly, stereo image resizing should consider the correlation between the images of a stereo pair, retargeting both images without distorting the depth perception of the main content.

Two retargeting algorithms have been adapted to work on stereo image pairs. In [3], stereo matching results are fused into the seam carving framework to preserve the consistency between the left and right images, and both image retargeting and depth adjustment algorithms are discussed. Similarly, a method for retargeting a pair of stereo images was proposed in [4]. It takes into account the visibility relations between pixels in the image pair and mostly preserves the geometric consistency by generalizing seam carving to simultaneously carve a pair of seams in both images while reducing distortion in appearance and depth.

In this paper, we present a stereo image resizing method based on seam carving. The proposed method first calculates the disparity map of the stereo image pair and then segments the main content of the image within the scope of human vision, which is determined with Panum's fusional area [5]. Finally, it resizes the stereo image pair with the seam carving algorithm while protecting the main content.

This paper is organized as follows: In Section 2, Panum's fusional area of stereo image is calculated. Section 3 proposes our method for stereo image resizing based on seam carving. Experimental results and analysis are shown in Section 4. Finally, Section 5 concludes this paper.

2. Panum’s fusional area of stereo image

Panum showed that when the two eyes are aimed at a point F, only the objects within an area around F are seen as single fused images. This area is named Panum's fusional area [5].

For 3D display devices, Panum's fusional area is illustrated in Figure 1. The distances between the edges of Panum's fusional area and the eyes are denoted by z_f and z_b, respectively. z_f and z_b can be respectively calculated as

z_{f} = \frac{e}{2} \cot (\arctan \frac{e}{2 v} + \frac{|\deg|}{2}),

(1)

z_{b} = \frac{e}{2} \cot (\arctan \frac{e}{2 v} - \frac{|\deg|}{2}),

(2)

where v is the viewing distance, e is the eye separation, and deg is the angular disparity of Panum's fusional area which was discussed by Krol and van der Grind [6].

The extents of Panum's fusional area in front of and behind the screen are denoted by P_f and P_b, respectively. Then we have

P_{f} = v - z_{f},

(3)

P_{b} = v - z_{b} .

(4)

Panum's fusional area of 3D display devices is expressed as [P_b, P_f]. Because the perceived depth depends on the disparity of the stereo image [2], [P_b, P_f] is converted into the disparity range [D_b, D_f], where D_b and D_f denote the minimum and the maximum disparity of Panum's fusional area of the stereo image, respectively.
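As an illustration, Equations (1) to (4) can be evaluated numerically. The following Python sketch is not part of the original method description; the function name `panum_depth_range` is hypothetical, the angular disparity is assumed to be given in radians, and v and e are assumed to share the same length unit. The conversion from [P_b, P_f] to [D_b, D_f] is not specified in the text and is therefore not included.

```python
import math


def panum_depth_range(v, e, deg):
    """Evaluate Eqs. (1)-(4): return (z_f, z_b, P_f, P_b).

    v   : viewing distance
    e   : eye separation (same unit as v)
    deg : angular disparity of Panum's fusional area, in radians (assumption)
    """
    half = abs(deg) / 2.0
    base = math.atan(e / (2.0 * v))
    if base - half <= 0.0:
        # Eq. (2) has no finite positive solution in this case.
        raise ValueError("angular disparity too large for this viewing setup")
    z_f = (e / 2.0) / math.tan(base + half)  # Eq. (1), cot(x) = 1 / tan(x)
    z_b = (e / 2.0) / math.tan(base - half)  # Eq. (2)
    p_f = v - z_f                            # Eq. (3)
    p_b = v - z_b                            # Eq. (4)
    return z_f, z_b, p_f, p_b
```

With a zero angular disparity, both z_f and z_b reduce to the viewing distance v, which is a quick sanity check of the formulas.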

Many stereo images are obtained using parallel stereo cameras. If these stereo image pairs are not shifted horizontally to change the disparity, the perceived depth will only be in front of the screen [7]. For this case, Agarwal and Blake have discussed Panum's fusional area in [8].

3. The proposed method

The input to our method is a pair of rectified images I_L and I_R. The proposed method retargets I_L and I_R to a new size. The new stereo image is composed of the paired images I′_L and I′_R.

In this paper, we describe the method for reducing the stereo image width. Carving and inserting are reciprocal operations, and vertical resizing is analogous to horizontal resizing.

The proposed method resizes the stereo image as follows:

1. Calculate the disparity map (D) of the stereo image, then segment the main content based on D and Panum's fusional area of the stereo image, [D_b, D_f].
2. Calculate and select the seam of the left image (S_L), then pick the seam of the right image (S_R) using the disparity map D. Carve S_L and S_R from I_L and I_R, respectively.
3. Repeat step 2 until the image width has been reduced by the required amount.

3.1. Main content segmentation

The image is segmented into the main content and the background according to whether the disparity of the corresponding objects lies in Panum's fusional area ([D_b, D_f]).

The disparity map (D) of the stereo image is calculated by the belief propagation (BP) algorithm [9]. We consider the disparity map with respect to I_L, which is taken to be the reference image. The disparity map (D) is shown in Figure 2.

The BP algorithm we used for calculating the disparity map is based on belief propagation and mean shift segmentation [10]. The disparity map and the reference image (I_L) are segmented into m objects. The objects and their average disparities are denoted by $o_{L}^{i}$ and $d_{L}^{i}$ , respectively, i = 1, 2,…, m. If $d_{L}^{i}$ is in [D_b, D_f], $o_{L}^{i}$ is regarded as the main content, $o_{L}^{i} \in O_{main content}$ . If $d_{L}^{i}$ is not in [D_b, D_f], $o_{L}^{i}$ is regarded as the background, $o_{L}^{i} \in O_{background}$ . That is,

o_{L}^{i} \in \begin{cases} O_{\text{main content}}, & d_{L}^{i} \in [D_{b}, D_{f}] \\ O_{\text{background}}, & d_{L}^{i} \notin [D_{b}, D_{f}] \end{cases} .

(5)

Figure 3 depicts the result of the main content segmentation. The main objects are reserved, and the background is removed from the left image.
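The classification rule of Equation (5) can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation: it assumes that a segment label map (for example, from mean shift segmentation) and a disparity map of the same size are already available as NumPy arrays, and the names `main_content_mask`, `labels`, and `disparity` are hypothetical.

```python
import numpy as np


def main_content_mask(labels, disparity, d_b, d_f):
    """Return a boolean (H, W) mask that is True on main-content pixels.

    labels    : int array (H, W), segment ids 0..m-1 (assumed given)
    disparity : float array (H, W), disparity map D w.r.t. the left image
    d_b, d_f  : disparity bounds of Panum's fusional area, [D_b, D_f]
    """
    labels = np.asarray(labels)
    disparity = np.asarray(disparity, dtype=float)
    m = int(labels.max()) + 1
    # Per-segment sum and pixel count give the average disparity d_L^i.
    sums = np.bincount(labels.ravel(), weights=disparity.ravel(), minlength=m)
    counts = np.bincount(labels.ravel(), minlength=m)
    mean_disp = sums / np.maximum(counts, 1)
    # Eq. (5): a segment is main content if its mean disparity is in [D_b, D_f].
    is_main = (counts > 0) & (mean_disp >= d_b) & (mean_disp <= d_f)
    return is_main[labels]
```

Averaging per segment rather than thresholding each pixel keeps every object entirely in one class, which matches the object-level decision described above.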

3.2. Seam selection and carving

A seam is an optimal eight-connected path of pixels on a single image from top to bottom (vertical) that contains exactly one pixel in each row, which guarantees that the image remains rectangular when seams are removed. In [1], an energy function defines the cost of a seam. The optimal seam S^* which minimizes this seam cost is selected:

S^{*} = \min_{s} E (s) = \min_{s} \sum_{i = 1}^{n} e_{HoG} (I (s_{i})),

(6)

where e_HoG is the energy with Histogram of Gradients, which is defined as follows:

e_{HoG} (I) = \frac{|\frac{\partial}{\partial x} I| + |\frac{\partial}{\partial y} I|}{\max (HoG (I (x, y)))} .

(7)
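The seam selection of Equation (6) is commonly solved by dynamic programming. The Python sketch below illustrates this under stated simplifications: the energy is only the gradient magnitude in the numerator of Equation (7), without the HoG normalization, and the function names `gradient_energy` and `find_vertical_seam` are hypothetical.

```python
import numpy as np


def gradient_energy(gray):
    """L1 gradient magnitude |dI/dx| + |dI/dy| of a grayscale image.

    Simplification: the HoG normalization of Eq. (7) is omitted.
    """
    gray = np.asarray(gray, dtype=float)
    dy, dx = np.gradient(gray)  # derivatives along rows, then columns
    return np.abs(dx) + np.abs(dy)


def find_vertical_seam(energy):
    """Return seam[r] = column of the minimum-cost 8-connected vertical seam."""
    energy = np.asarray(energy, dtype=float)
    h, w = energy.shape
    cost = energy.copy()
    # Forward pass: cumulative minimum cost of reaching each pixel from the top.
    for r in range(1, h):
        prev = cost[r - 1]
        left = np.concatenate(([np.inf], prev[:-1]))
        right = np.concatenate((prev[1:], [np.inf]))
        cost[r] += np.minimum(np.minimum(left, prev), right)
    # Backward pass: trace the cheapest path from the bottom row upwards.
    seam = np.empty(h, dtype=int)
    seam[-1] = int(np.argmin(cost[-1]))
    for r in range(h - 2, -1, -1):
        c = seam[r + 1]
        lo, hi = max(c - 1, 0), min(c + 2, w)
        seam[r] = lo + int(np.argmin(cost[r, lo:hi]))
    return seam
```

The returned array holds one column index per row, so consecutive entries differ by at most one, which is the eight-connectivity constraint.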

In our method, the seam is selected by both the energy function and main content protection. Main content protection makes the seam bypass the main content so that its proportions are not altered.

Let S_L denote the seam in I_L and S_R denote the seam in I_R. S_L and S_R are correlative by the disparity map D. The disparity map shows the correspondence of each pixel in the left and right images. So S_R can be obtained by S_L and D.
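A minimal Python sketch of this coupling is given below. It assumes the common convention for rectified pairs with the left image as reference, namely that a left pixel at column x corresponds to the right pixel at column x − D in the same row; the text does not state the sign convention, so this is an assumption. The function names `right_seam_from_left` and `remove_seam` are hypothetical.

```python
import numpy as np


def right_seam_from_left(seam_left, disparity):
    """Map a left-image seam to the right image using the disparity map D.

    Assumption: x_R = x_L - D(row, x_L), clipped to the image width.
    """
    seam_left = np.asarray(seam_left)
    disparity = np.asarray(disparity)
    h, w = disparity.shape
    rows = np.arange(h)
    d = np.rint(disparity[rows, seam_left]).astype(int)
    return np.clip(seam_left - d, 0, w - 1)


def remove_seam(image, seam):
    """Delete one pixel per row; works for (H, W) and (H, W, C) arrays."""
    image = np.asarray(image)
    h, w = image.shape[:2]
    keep = np.ones((h, w), dtype=bool)
    keep[np.arange(h), seam] = False
    return image[keep].reshape((h, w - 1) + image.shape[2:])
```

After both seams are removed, the disparity map itself would also need one column removed per row to stay aligned with the left image; that bookkeeping is omitted here.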

S_L is computed with the energy function. This gradient-based energy measures the energy of the pixels that lie on a seam and is known as the backward energy. Selecting the seam with the minimum energy aims to minimize the artifacts introduced in the resulting image.

If S_L does not cross the main content, S_L and its correlative seam S_R are removed to retarget the stereo image.

If S_L crosses the main content, its crossing part is replaced. Let S_cross be the part of the seam that crosses the main content and S_not cross be the part that does not. Let S_E denote the new part of the seam used to replace S_cross. S_E is selected beside the edge of the object; S_E and S_cross have the same vertical height, and their start points and end points have the same ordinate values. That is, if S_cross runs from point a(i₁, j₁) to point b(i₂, j₂), S_E runs from point c(k₁, j₁) to point d(k₂, j₂). Let S′_L denote the new seam that consists of S_E and S_not cross. From S′_L and D, the new seam of the right image (S′_R) is obtained. Figure 4 shows the coupled seams S′_L and S′_R. S′_L and S′_R are not necessarily continuous:

S'_{L} = \begin{cases} S_{L}, & \text{if } S_{L} \text{ does not cross the main content} \\ S_{L} - S_{\text{cross}} + S_{E}, & \text{if } S_{L} \text{ crosses the main content} \end{cases} .
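One possible way to build S′_L is sketched below in Python. The text only states that S_E lies beside the edge of the object and shares its rows with S_cross, so the concrete rule used here, moving each crossing pixel to the nearest background column in the same row, is an assumption made for illustration. The name `detour_seam` and the boolean mask `main_mask` are hypothetical.

```python
import numpy as np


def detour_seam(seam, main_mask):
    """Replace the part of a seam that crosses the main content.

    seam      : int array (H,), one column index per row (S_L)
    main_mask : bool array (H, W), True on main-content pixels
    Assumption: each crossing pixel moves to the nearest background
    column in its own row, so the row (ordinate) never changes.
    """
    seam = np.asarray(seam).copy()
    main_mask = np.asarray(main_mask, dtype=bool)
    h, _ = main_mask.shape
    for r in range(h):
        c = seam[r]
        if not main_mask[r, c]:
            continue  # this pixel belongs to S_not cross, keep it
        free = np.flatnonzero(~main_mask[r])
        if free.size == 0:
            continue  # whole row is main content, nothing to detour to
        seam[r] = free[np.argmin(np.abs(free - c))]
    return seam
```

Because each row is handled independently, the resulting seam may jump by more than one column between rows, consistent with the remark that S′_L and S′_R are not necessarily continuous.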

Since the left and right images are captured from different views, some pixels around objects are occluded [11]. Therefore, removing S′_L and S′_R has little effect on depth perception.

The energy function measures the changing intensities of gradients. When the change of aspect ratio is large, many seams need to be removed from the stereo image. In this scenario, seams may cluster around objects with smooth gradients and lead to cracks in the resized stereo image. To prevent this, when n seams need to be removed, we compute 2n − 1 seams at once and remove n of them alternately.
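The alternate selection can be sketched in a few lines of Python. The ordering of the 2n − 1 candidate seams is not specified in the text; the sketch below assumes they are held in a list in the order in which they were computed and keeps every other one. The name `pick_alternate_seams` is hypothetical.

```python
def pick_alternate_seams(candidates):
    """From 2n - 1 candidate seams, keep every other one (n seams).

    Assumption: 'candidates' is a list ordered as the seams were computed.
    """
    if len(candidates) % 2 == 0:
        raise ValueError("expected an odd number (2n - 1) of candidate seams")
    return candidates[::2]
```

For n = 4, seven candidates are computed and the seams at positions 0, 2, 4, and 6 are removed, so neighbouring candidates are never both carved.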

For some stereo images with large main content, excessive main content protection will cause overflow or seriously distort the global structure of the stereo images. To address this problem, we set a threshold ratio based on the image content. If the width to be removed exceeds this ratio of the image width, the seams of the left image are defined only by the energy function, without considering main content protection.

4. Experimental results and analysis

We tested our method on standard stereo images obtained from the Middlebury stereo datasets. We reduced the width of Tsukuba [12] and Reindeer [13] by 20% using both our method and single-image seam carving. Figures 5 and 6 compare our results with the single-image seam carving results.

As can be observed from Figures 5 and 6, the main content is not impacted by resizing using our proposed method. The depth perception of important content is preserved well. It is also clearly seen that a significant depth distortion is caused when naive independent retargeting of each image is considered.

5. Conclusions

Since independently retargeting each image of a stereo image pair distorts the geometric structure of the main content, in this paper, we proposed a new seam carving method that works on stereo images without distorting the depth perception of the main content. We extended seam carving to work on a pair of stereo images and showed experimentally that the proposed method gives a geometrically consistent result for the main content. Our method takes advantage of both appearance and perceived depth and can deal with stereo images that are difficult to process with single-image seam carving. The experimental results showed that our method gives satisfactory results on stereo images.

References

1. Avidan S, Shamir A: Seam carving for content-aware image resizing. ACM Transactions on Graphics (TOG) 2007, 26(3):10. 10.1145/1276377.1276390
2. Holliman N: 3D Display Systems. Bristol: IOP; 2004.
3. Utsugi K, Shibahara T, Koike T, Takahashi K, Naemura T: Seam carving for stereo images, in 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON). Volume 2010. IEEE: Piscataway; 2010:1-4.
4. Basha T, Moses Y, Avidan S: Geometrically consistent stereo seam carving, in 2011 IEEE International Conference on Computer Vision (ICCV). IEEE: Piscataway; 2011:1816-1823.
5. Panum PL: Physiologische Untersuchungen über das Sehen mit zwei Augen. Nabu: Charleston; 2010.
6. Krol JD, van der Grind WA: Rehabilitation of a classical notion of Panum’s fusional area. Perception 1982, 11(5):615-619. 10.1068/p110615
7. Son JY, Yeom S, Lee DS, Lee KH, Park MC: A stereoscopic camera model of focal plane detector array. Journal of Display Technology 2011, 7(5):281-288.
8. Agarwal A, Blake A: Dense stereo matching over the Panum band. IEEE Transactions on Pattern Analysis and Machine Intelligence 2010, 32(3):416-430.
9. Klaus A, Sormann M, Karner K: Segment-based stereo matching using belief propagation and a self-adapting dissimilarity measure, in 18th International Conference on Pattern Recognition, 2006 (ICPR 2006). Volume 3. IEEE: Piscataway; 2006:15-18.
10. Comaniciu D, Meer P: Mean shift: a robust approach toward feature space analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence 2002, 24(5):603-619. 10.1109/34.1000236
11. Geiger D, Ladendorf B, Yuille A: Occlusions and binocular stereo, in Computer Vision—ECCV'92. Berlin: Springer; 1992:425-433.
12. Scharstein D, Szeliski R: A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. International Journal of Computer Vision 2002, 47(1/2/3):7-42.
13. Scharstein D, Pal C: Learning conditional random fields for stereo. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007). Minneapolis; 2007.

Acknowledgments

The authors are grateful to the anonymous referees for the constructive comments. This study was funded by The National High Technology Research and Development Program of China (2012AA03A301), The National Natural Science Foundation of China (60932007, 61201179), Ph.D. Programs Foundation of the Ministry of Education of China (20110032110029), and Key Projects in the Tianjin Science & Technology Pillar Program (11ZCKFGX02000).

Author information

Authors and Affiliations

School of Electronic Information Engineering, Tianjin University, Tianjin, 300072, China
Bin Yue, Chun-ping Hou & Yuan Zhou

Corresponding author

Correspondence to Yuan Zhou.

Additional information

Competing interests

The authors declare that they have no competing interests.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Yue, B., Hou, Cp. & Zhou, Y. Improved seam carving for stereo image resizing. J Wireless Com Network 2013, 116 (2013). https://doi.org/10.1186/1687-1499-2013-116

Received: 09 March 2013
Accepted: 08 April 2013
Published: 26 April 2013
DOI: https://doi.org/10.1186/1687-1499-2013-116
