
Infrared and visible image fusion based on nonlinear enhancement and NSST decomposition

Abstract

In multi-scale geometric analysis (MGA)-based fusion methods for infrared and visible images, adopting the same representation for the two types of images leaves the thermal radiation target in the fused image inconspicuous and hard to distinguish from the background. To solve this problem, a novel fusion algorithm based on nonlinear enhancement and non-subsampled shearlet transform (NSST) decomposition is proposed. Firstly, NSST is used to decompose the two source images into low- and high-frequency sub-bands. Then, the wavelet transform (WT) is used to decompose the high-frequency sub-bands into approximate sub-bands and directional detail sub-bands. The “average” fusion rule is applied to the approximate sub-bands, and the “max-absolute” fusion rule is applied to the directional detail sub-bands. The inverse WT is then used to reconstruct the high-frequency sub-bands. To highlight the thermal radiation target, we construct a nonlinear transform function to determine the fusion weight of the low-frequency sub-bands, whose parameters can be further adjusted to meet different fusion requirements. Finally, the inverse NSST is used to reconstruct the fused image. The experimental results show that the proposed method can simultaneously enhance the thermal target in infrared images and preserve the texture details in visible images, and that it is competitive with or even superior to state-of-the-art fusion methods in terms of both visual and quantitative evaluations.

1 Introduction

Image fusion technology, which aims to combine images obtained from different sensors to create a single, information-rich fused image [1], has been widely used in medical imaging [2, 3], remote sensing [4,5,6], object recognition [7, 8], and detection [9]. Among the combinations of different types of images, infrared and visible image fusion has attracted increasing attention [10]. Infrared images record the thermal radiation of the scene, so the target in an infrared image is prominent and obvious. However, infrared images have little detail information, low contrast, poor visual effect, and poor imaging performance. In contrast, visible images provide abundant detail information, but the target can be inconspicuous and is easily affected by smoke, bad weather conditions, and other factors. Therefore, fusing the two types of images compensates for the insufficient imaging capabilities of infrared and visible sensors [11]. The final fused image can possess clearer scene information as well as better target characteristics [12].

There are seven main categories of fusion methods: multi-scale geometric analysis (MGA)-based, sparse representation-based [13,14,15], neural network-based [16, 17], subspace-based [18], saliency-based [19], hybrid models [20], and other methods. Among them, MGA-based methods are the most popular. MGA-based methods assume that an image can be represented by different coefficients at different scales. These methods decompose the source images into low- and high-frequency sub-bands, combine the corresponding sub-bands with specific fusion rules, and reconstruct the fused image with the inverse MGA transform [21]. The key to MGA-based methods is the MGA transform, which determines how much useful information can be extracted from the source images and integrated into the fused image. Popular transforms used for decomposition and reconstruction include the wavelet transform (WT) [22], wedgelet transform [23], curvelet transform [24, 25], contourlet transform [26], NSCT [27, 28], shearlet transform (ST) [29], non-subsampled shearlet transform (NSST) [30], and so on. Owing to its shift invariance, high sensitivity, strong directionality, fast operation, and multi-directional processing, NSST has been widely used in image fusion [31]. Many studies have shown that NSST is more consistent with human visual characteristics than other MGA transforms, which gives the fused images better visual effects [32]. However, it may be inappropriate for infrared and visible image fusion. In infrared images, the target information is significant and easy to detect and recognize, while in visible images the detailed information is mainly carried by gradients. Therefore, adopting the same representation for the two types of images renders the thermal radiation target inconspicuous and hard to distinguish from the background. In MGA-based fusion methods, it is difficult to keep the thermal radiation of infrared images and the appearance information of visible images simultaneously.

To overcome this problem, we propose a new fusion algorithm based on nonlinear enhancement and NSST decomposition for infrared and visible images. Firstly, NSST is used to decompose the two source images into low- and high-frequency sub-bands. Then, the high-frequency sub-bands are fused with a WT-based method. To highlight the target, we construct a nonlinear transform function to determine the fusion weight of the low-frequency sub-bands, whose parameters can be further adjusted to meet different fusion requirements. Finally, the inverse NSST is used to reconstruct the fused image. The experiments demonstrate that the proposed method can not only enhance the thermal target in infrared images, but also preserve the texture details in visible images. The presented method is competitive with or even superior to other methods in terms of both visual and quantitative evaluations.

The rest of this paper is organized as follows. The theoretical basis and implementation steps of NSST are reviewed in Section 2. The details of the proposed image fusion method are described in Section 3. Experimental results and comparisons are presented in Section 4. The main conclusions are drawn in Section 5.

2 Related works

NSST is one of the most suitable multi-scale geometric analysis tools for fusion applications. NSST provides an elegant, sparse image representation that captures edges and much detail information, and it does not introduce artifacts or noise when the inverse NSST is performed. In addition, the shearlet coefficients are well localized in tight frames at various locations and scales with anisotropic orientations. This enables a successful fusion process and produces higher image quality with clearer image details and edges [33].

2.1 Basic principle of NSST

The shearlet construction is based on non-subsampled pyramid filter banks, which provide the multi-scale decomposition, and on directional filtering generated by shear matrices, which provides the multi-directional localization. When the dimension n = 2, the affine system with composite dilations AAB(ψ) is defined as [10]:

$$ {A}_{AB}\left(\psi \right)=\left\{{\psi}_{j,l,k}(x)={\left|\det A\right|}^{j/2}\psi \left({B}^l{A}^jx-k\right):j,l\in Z,k\in {Z}^2\right\} $$
(1)

where ψ ∈ L2(R2), A and B are 2 × 2 invertible matrices, and |det B| = 1. If AAB(ψ) forms a Parseval tight frame for L2(R2), the elements of the system are called composite wavelets. For any f ∈ L2(R2), there is

$$ {\sum}_{j,k,l}{\left|<f,{\psi}_{j,l,k}>\right|}^2={\left\Vert f\right\Vert}^2 $$
(2)

Among them, the matrices Aj and Bl are associated with scale transformations and geometric transformations (such as rotation and shear operations), respectively.

Taking \( {A}_a=\left(\begin{array}{cc}a& 0\\ 0& \sqrt{a}\end{array}\right) \) and \( {B}_s=\left(\begin{array}{cc}1& s\\ 0& 1\end{array}\right) \), the shearlet system can be written as follows:

$$ \left\{{\psi}_{ast}(x)={a}^{-\frac{3}{4}}\psi \left({A}_a^{-1}{B}_s^{-1}\left(x-t\right)\right):a\in {R}^{+},s\in R,t\in {R}^2\right\} $$
(3)

Equation (3) is a shearlet system, and ψast(x) is a shearlet.
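As a small numerical illustration (not from the paper), the two matrices above can be examined directly: Aa applies anisotropic (parabolic) scaling, while Bs applies an area-preserving shear that controls orientation; composing them, as in Eq. (1), produces the directional frequency supports described below.

```python
import numpy as np

a, s = 4.0, 1.0
A_a = np.array([[a, 0.0],
                [0.0, np.sqrt(a)]])   # anisotropic scaling: one axis is stretched more
B_s = np.array([[1.0, s],
                [0.0, 1.0]])          # shear matrix: controls the orientation

print(B_s @ A_a)                       # combined action of the form B^l A^j in Eq. (1)
print(np.linalg.det(B_s))              # 1.0: shearing preserves area (|det B| = 1)
```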

Figure 1 shows the tiling of the frequency plane induced by the shearlets and the frequency supports of the shearlet elements. It can be seen from Fig. 1 that each element \( {\hat{\psi}}_{j,l,k} \) is supported on a pair of trapezoids of approximate size \( {2}^j\times {2}^{2j} \), oriented along a line with slope \( l{2}^{-j} \).

Fig. 1 The tiling of the frequency plane induced by the shearlets and the frequency supports of the shearlet elements

2.2 Implementation steps

The NSST can be realized through two steps:

(1) Multi-scale decomposition. The non-subsampled pyramid (NSP) filter bank decomposes each source image into a set of high- and low-frequency sub-images to attain multi-resolution decomposition. Firstly, the source image is decomposed into low- and high-frequency coefficients with the NSP. Then, the NSP decomposition at each layer iterates on the low-frequency component obtained by the upper-layer decomposition to capture the singular points. Because there is no down-sampling operation, the sub-band images have the same size as the source image. Finally, for j-level decomposition, we obtain one low-pass image and j band-pass images.

(2) Directional localization. The shearlet filter bank decomposes the high-frequency sub-images to attain multi-directional decomposition. Firstly, the pseudo-polar coordinates are mapped to Cartesian coordinates. Then, the “Meyer” wavelet is used to construct the window function and generate the shearlet filters. Finally, each sub-band image is convolved with the “Meyer” window function to obtain the directional sub-band images.

The two-level decomposition structure is shown in Fig. 2. The NSP decomposes the source image f into a low-pass filtered image \( {f}_a^1 \) and a high-pass filtered image \( {f}_d^1 \). In each iteration, the NSP further decomposes the low-pass filtered image from the upper layer until the specified number of decomposition levels is reached. Finally, one low-frequency image and a series of high-frequency images are obtained. (A toy sketch of this iteration follows Fig. 2.)

Fig. 2 Two-level decomposition diagram of NSST
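As referenced above, the sketch below is a toy stand-in for the NSP iteration: it uses plain Gaussian low-pass filters rather than the actual non-subsampled “maxflat” filter bank, so it only illustrates the structure (iterate on the low-pass branch, no down-sampling, all sub-bands keep the source size), not the real NSST filters.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def toy_nsp(image, levels=2, sigma=1.0):
    """Illustrative non-subsampled pyramid (NOT the real NSP filters).

    Each level splits the current low-pass image into a smoother
    approximation and a band-pass residual; nothing is down-sampled,
    so every output has the same shape as `image`.
    """
    low = image.astype(float)
    bands = []
    for j in range(levels):
        smoother = gaussian_filter(low, sigma * 2 ** j)   # widen the filter per level
        bands.append(low - smoother)                      # high-frequency residual
        low = smoother                                    # iterate on the low-pass only
    return low, bands  # one low-frequency image and `levels` high-frequency images
```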

3 Proposed method

In this section, we introduce the process of the proposed method and discuss the setting of its parameter. The low- and high-frequency components obtained from the NSST decomposition represent different feature information: the low-frequency components carry the approximate features of the source image, while the high-frequency components carry the detailed features. The approximate parts of the images provide most of the visually significant information and contrast information, whereas the detailed parts provide the contour and edge information. Therefore, we use different fusion rules for the low- and high-frequency components. According to the stage of the image data to be fused and the degree of information extraction in the fusion system, image fusion can be divided into three levels: pixel level, feature level, and decision level. The proposed method operates at the pixel level. The specific fusion scheme is shown in Fig. 3, and a high-level code sketch is given after the figure. The steps of the proposed method are as follows:

  • Step 1: Decompose the infrared and visible images with NSST into low- and high-frequency coefficients.

  • Step 2: Fuse low-frequency coefficients based on nonlinear enhancement algorithm.

  • Step 3: Fuse high-frequency coefficients based on WT-based method.

  • Step 4: Apply inverse NSST to obtain the fused image.

Fig. 3 The diagram of the fusion scheme
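The skeleton below restates Steps 1–4 in Python as a rough sketch. The NSST itself is left abstract: nsst_decompose and nsst_reconstruct are hypothetical callables standing in for an NSST toolbox (the authors’ experiments use MATLAB), and fuse_low / fuse_high are the rules detailed in Sections 3.1 and 3.2.

```python
def fuse_images(ir, vis, nsst_decompose, nsst_reconstruct, fuse_low, fuse_high):
    """High-level sketch of the proposed scheme (Steps 1-4).

    nsst_decompose(img)          -> (low, [high_1, ..., high_J])   # hypothetical
    nsst_reconstruct(low, highs) -> fused image                    # hypothetical
    fuse_low / fuse_high implement the rules of Sections 3.1 and 3.2.
    """
    low_ir, highs_ir = nsst_decompose(ir)        # Step 1: NSST decomposition
    low_vis, highs_vis = nsst_decompose(vis)
    low_f = fuse_low(low_ir, low_vis)            # Step 2: nonlinear-enhancement rule
    highs_f = [fuse_high(h_ir, h_vis)            # Step 3: WT-based rule per sub-band
               for h_ir, h_vis in zip(highs_ir, highs_vis)]
    return nsst_reconstruct(low_f, highs_f)      # Step 4: inverse NSST
```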

3.1 Low-frequency sub-band fusion

The low-frequency components reflect the contour information of the image and contain most of the energy of the original image [34]. The weighted average method is commonly used to fuse low-frequency sub-bands; however, an unreasonable fusion weight causes loss of source-image information or poor image performance. To address this problem, we introduce a fusion strategy that constructs a nonlinear transform function to determine the fusion weight of the low-frequency sub-bands.

In infrared images, the target information is significant. Due to the large gray values, the target is easy to detect and recognize. In order to highlight the target in the fused image, we extract the coefficients in the low-frequency component of the infrared image to determine the low-frequency fusion weight.

The absolute value of each coefficient of the low-frequency component is taken as follows:

$$ R=\left|{\mathrm{LFC}}_{\mathrm{IR}}\right| $$
(4)

where LFCIR represents the low-frequency sub-band of the infrared image after decomposition, R represents the salient infrared characteristic distribution, and Rmean is the mean value of LFCIR. When R is larger than Rmean, the pixel can be considered a bright point; when R is smaller than Rmean, it can be considered a dark point. Bright points are regarded as the target, while dark points are regarded as background. In order to highlight the target, a nonlinear transform function is introduced to control the degree of enhancement. The nonlinear transform function is as follows:

$$ S\left(\lambda \right)=1-\frac{1}{1+{\left(\frac{R}{R_{\mathrm{mean}}}\right)}^{\lambda }} $$
(5)

where the parameter λ belongs to (0, ∞).

The low-frequency information fusion weight can be expressed as:

$$ {C}_{\mathrm{IR}}=S\left(\lambda \right) $$
(6)
$$ {C}_{\mathrm{VIS}}=1-{C}_{\mathrm{IR}} $$
(7)

where CIR is the fusion weight of the infrared image, CVIS is the fusion weight of the visible image, and both belong to [0, 1].

As shown in Eqs. (5)–(7), the parameter λ directly affects the fusion weight of the infrared image. Therefore, we can adjust λ to control the proportion of infrared features in the fused image. In particular, the larger the value of CIR, the more obvious the target is. To strengthen the thermal radiation target, the value of CIR should be relatively large.

The final low-frequency sub-band fusion result can be obtained as follows:

$$ \mathrm{LFC}\_\mathrm{F}={C}_{\mathrm{IR}}\ast {\mathrm{LFC}}_{\mathrm{IR}}+{C}_{\mathrm{VIS}}\ast {\mathrm{LFC}}_{\mathrm{VIS}} $$
(8)

where LFC_F represents the low-frequency component of the fused image and LFCVIS represents the low-frequency component of the decomposed visible image.
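A minimal NumPy sketch of Eqs. (4)–(8), assuming the two low-frequency sub-bands are supplied as equal-sized arrays; lam is the parameter λ discussed in Section 3.3, and the small eps guards against a zero mean.

```python
import numpy as np

def fuse_low(lfc_ir, lfc_vis, lam=10.0, eps=1e-12):
    """Nonlinear-enhancement fusion of the low-frequency sub-bands."""
    r = np.abs(lfc_ir)                                        # Eq. (4): infrared saliency
    r_mean = r.mean()                                         # bright/dark threshold
    c_ir = 1.0 - 1.0 / (1.0 + (r / (r_mean + eps)) ** lam)    # Eqs. (5)-(6)
    c_vis = 1.0 - c_ir                                        # Eq. (7)
    return c_ir * lfc_ir + c_vis * lfc_vis                    # Eq. (8)
```

Because S(λ) maps each pixel independently, pixels whose |LFCIR| lies well above Rmean receive a weight close to 1 and dominate the fused low-frequency band.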

3.2 High-frequency sub-band fusion

High-frequency components reflect detailed information, such as the edges and contours of the source image. To obtain more detailed information, we use a WT-based method to fuse the high-frequency sub-bands of the infrared and visible images. Firstly, the WT is used to decompose the high-frequency sub-bands into approximate sub-bands (LFCIR and LFCVIS) and directional detail sub-bands (HFCIR and HFCVIS). Here, the Haar wavelet is selected as the WT basis, and the number of decomposition levels is set to 1. Then, the “average” fusion rule is applied to the approximate sub-bands. The approximate sub-band fusion rule is defined as follows:

$$ \mathrm{LFC}\_\mathrm{F}=\frac{\left|{\mathrm{LFC}}_{\mathrm{IR}}\right|+\left|{\mathrm{LFC}}_{\mathrm{VIS}}\right|}{2} $$
(9)

The “max-absolute” fusion rule is applied to the directional detail sub-bands. The directional detail sub-band fusion rule can be expressed as follows:

$$ \mathrm{HFC}\_\mathrm{F}=\left\{\begin{array}{l}{\mathrm{HFC}}_{\mathrm{IR}}\kern0.48em ,\left|{\mathrm{HFC}}_{\mathrm{IR}}\right|>\left|{\mathrm{HFC}}_{\mathrm{VIS}}\right|\\ {}{\mathrm{HFC}}_{\mathrm{VIS}},\mathrm{otherwise}\end{array}\right. $$
(10)

where LFC_F and HFC_F represent the fused approximate and directional detail sub-bands of the high-frequency sub-band images, respectively.

Finally, the inverse WT is implemented on LFC _ F and HFC _ F to get the high-frequency sub-bands of the fused image.
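A hedged sketch of the WT-based high-frequency rule (Eqs. (9)–(10)) using PyWavelets, assuming a single-level Haar decomposition as stated above; the function would be applied independently to each pair of NSST high-frequency sub-bands.

```python
import numpy as np
import pywt

def fuse_high(hfc_ir, hfc_vis, wavelet="haar"):
    """Fuse one pair of NSST high-frequency sub-bands via a 1-level WT."""
    ca_ir, (ch_ir, cv_ir, cd_ir) = pywt.dwt2(hfc_ir, wavelet)
    ca_vis, (ch_vis, cv_vis, cd_vis) = pywt.dwt2(hfc_vis, wavelet)

    ca_f = (np.abs(ca_ir) + np.abs(ca_vis)) / 2.0        # Eq. (9): "average" rule

    def max_abs(x, y):                                   # Eq. (10): "max-absolute" rule
        return np.where(np.abs(x) > np.abs(y), x, y)

    details_f = (max_abs(ch_ir, ch_vis),
                 max_abs(cv_ir, cv_vis),
                 max_abs(cd_ir, cd_vis))
    return pywt.idwt2((ca_f, details_f), wavelet)        # inverse WT reconstruction
```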

3.3 Analysis of parameter

In the nonlinear enhancement method, one main parameter influences the enhancement performance, namely λ. In this section, we draw the curve of the enhancement weight CIR under different values of λ, as shown in Fig. 4. The intensity of the target pixel in the fused image is determined by the value of CIR: the larger the value of CIR, the more evident the target is.

Fig. 4 The CIR curve under different values of the parameter λ

As shown in Fig. 4, the CIR curve over the abscissa R (the gray level of the pixel) is S-shaped, which shows that the target pixels obtain larger enhancement than the background pixels. Moreover, the curve becomes steeper as the parameter λ increases. Therefore, it is convenient to adjust λ to obtain different fusion results.
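To make the effect of λ concrete, the short check below (an illustration, not from the paper) evaluates CIR = S(λ) at a background-like point (R = 0.5·Rmean) and a target-like point (R = 2·Rmean); as λ grows, the two weights are pushed toward 0 and 1, i.e., the S-curve becomes steeper.

```python
def c_ir(ratio, lam):
    """Infrared fusion weight S(lambda) as a function of R / R_mean (Eq. 5)."""
    return 1.0 - 1.0 / (1.0 + ratio ** lam)

for lam in (5, 10, 30):
    print(lam, round(c_ir(0.5, lam), 4), round(c_ir(2.0, lam), 4))
# lam=5 : 0.0303 / 0.9697   lam=10: 0.001 / 0.999   lam=30: ~0 / ~1
```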

Figure 5 shows the fused images for λ values of 5, 10, 30, 50, 100, and 200. As seen in Fig. 5, the pixel intensity distribution of the infrared image is strengthened as λ increases. However, when λ becomes too large, distortion appears in the fused image. The parameter λ should therefore be chosen appropriately to meet different fusion requirements. In this paper, the value of λ is 10. The proposed algorithm is summarized in Table 1.

Fig. 5 Fused images under different values of the parameter λ. a Infrared image; b visible image; c–h fused images at λ = 5, 10, 30, 50, 100, and 200, respectively. The differences are highlighted in red rectangles

Table 1 Algorithmic module

4 Experimental results and discussion

4.1 Experimental scheme

To evaluate the performance of the proposed algorithm, two groups of experiments were carried out. Firstly, we compare the proposed method with six MGA-based methods. Then, we compare our method with five other advanced methods. Finally, qualitative and quantitative analyses of the experimental results are performed. The infrared and visible images to be fused are collected from the TNO Image Fusion Dataset. Our experiments are performed in MATLAB on a computer with a 2.6 GHz Intel Core CPU and 4 GB of memory.

4.2 Fusion quality evaluation

4.2.1 Subjective evaluation

Subjective evaluation methods assess the quality of the fused image according to the evaluator’s own experience and perception. To some extent, this is a relatively simple, direct, fast, and convenient approach. However, its lower efficiency and poorer real-time performance limit its practical applications. Table 2 shows the commonly used subjective evaluation criteria.

Table 2 Subjective evaluation criteria

4.2.2 Objective evaluation

According to the objects being compared, the objective evaluation indicators of image fusion quality can be divided into three categories: the characteristics of the fused image itself, the relationship between the fused image and a standard reference image, and the relationship between the fused image and the source images [10]. We use A, B, and F to denote the infrared, visible, and fused images, respectively, and R to denote the ideal reference image. The five objective evaluation parameters we use are as follows.

(1) Entropy (E)

E can be directly used to measure the richness of image information. The larger the E value, the better the fusion effects are. The calculation formula is shown in Eq. (11):

$$ E=-\sum \limits_{i=0}^{L-1}{p}_i\;{\mathit{\log}}_2\kern0.24em {p}_i $$
(11)

where L is the total number of gray levels of the image, and pi is the probability of gray value i in the image.
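A minimal NumPy sketch of Eq. (11), assuming an 8-bit image (L = 256):

```python
import numpy as np

def entropy(img, levels=256):
    """Shannon entropy of the gray-level histogram (Eq. 11)."""
    hist, _ = np.histogram(img, bins=levels, range=(0, levels))
    p = hist / hist.sum()
    p = p[p > 0]                       # skip empty bins (0 * log 0 is taken as 0)
    return float(-np.sum(p * np.log2(p)))
```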

(2) Average gradient (AG)

AG is used to reflect the micro-detail contrast and texture variation in the image. The larger the AG value, the more gradient information the fused image contains. The calculation formula is shown in Eq. (12):

$$ \Delta \overline{G}=\frac{1}{M\times N}\sum \limits_{m=1}^M\sum \limits_{n=1}^N\sqrt{\frac{\Delta {F}_x^2\left(m,n\right)+\Delta {F}_y^2\left(m,n\right)}{2}} $$
(12)

where ΔFx is the difference in the x direction of the fused image F, and ΔFy is the difference in the y direction.
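A sketch of Eq. (12) using first-order differences (the text does not fix the exact difference operator, so this is one reasonable choice):

```python
import numpy as np

def average_gradient(f):
    """Average gradient (Eq. 12) with forward differences."""
    f = f.astype(float)
    dfx = np.diff(f, axis=1)[:-1, :]   # horizontal differences, cropped to match
    dfy = np.diff(f, axis=0)[:, :-1]   # vertical differences, same shape as dfx
    return float(np.mean(np.sqrt((dfx ** 2 + dfy ** 2) / 2.0)))
```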

(3) Standard deviation (SD)

SD is used to reflect the distribution of pixel gray values and the contrast of the fused image. It is defined as follows:

$$ \mathrm{SD}=\sqrt{\frac{1}{M\times N}\sum \limits_{m=1}^M\sum \limits_{n=1}^N{\left(F\left(m,n\right)-\overline{F}\right)}^2} $$
(13)
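Eq. (13) is the standard deviation of the fused image about its mean gray value; in NumPy it reduces to a one-liner:

```python
import numpy as np

def standard_deviation(f):
    """Standard deviation of pixel intensities (Eq. 13)."""
    return float(np.std(f.astype(float)))
```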
(4) Spatial frequency (SF)

SF is used to reflect the overall activity of the image in the spatial domain. The solution of SF is defined in Eq. (16). The larger the SF, the better the fusion effects are.

$$ \mathrm{RF}=\sqrt{\frac{1}{M\times N}\sum \limits_{m=1}^M\sum \limits_{n=1}^N{\left[F\left(m,n\right)-F\left(m,n-1\right)\right]}^2} $$
(14)
$$ \mathrm{CF}=\sqrt{\frac{1}{M\times N}\sum \limits_{m=1}^M\sum \limits_{n=1}^N{\left[F\left(m,n\right)-F\left(m-1,n\right)\right]}^2} $$
(15)
$$ \mathrm{SF}=\sqrt{{\mathrm{RF}}^2+{\mathrm{CF}}^2} $$
(16)

where RF and CF are the row and column frequencies of the image, respectively.
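A sketch of Eqs. (14)–(16); up to the boundary convention of the sums, both RF and CF are root-mean-square first differences:

```python
import numpy as np

def spatial_frequency(f):
    """Spatial frequency (Eq. 16) from the row and column frequencies."""
    f = f.astype(float)
    rf = np.sqrt(np.mean(np.diff(f, axis=1) ** 2))   # Eq. (14): row frequency
    cf = np.sqrt(np.mean(np.diff(f, axis=0) ** 2))   # Eq. (15): column frequency
    return float(np.sqrt(rf ** 2 + cf ** 2))         # Eq. (16)
```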

(5) Edge information retention (QAB/F)

QAB/F measures the amount of edge information that is transferred from the source image to the fused image. QAB/F is defined as follows:

$$ {Q}^{\mathrm{AB}/\mathrm{F}}=\frac{\sum_{\forall m,n}\left({Q}_{m,n}^{\mathrm{AF}}{w}_{m,n}^A+{Q}_{m,n}^{\mathrm{BF}}{w}_{m,n}^B\right)}{\sum_{\forall m,n}\left({w}_{m,n}^A+{w}_{m,n}^B\right)} $$
(17)

where wA and wB denote the weights of the importance of the infrared and visible images to the fused image, and QAF and QBF are calculated from the edges. A large QAB/F means that considerable edge information is transferred to the fused image; for a perfect fusion result, QAB/F is 1.
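The full Xydeas–Petrović metric models how well both edge strength and edge orientation are preserved; the hedged sketch below is only a simplified stand-in for Eq. (17), in which QAF and QBF are approximated by the ratio of Sobel gradient magnitudes and the weights wA, wB by the source gradient magnitudes themselves.

```python
import numpy as np
from scipy import ndimage

def q_abf_simplified(a, b, f, eps=1e-12):
    """Simplified stand-in for Q^{AB/F} (Eq. 17), not the exact metric."""
    def grad_mag(x):
        x = x.astype(float)
        return np.hypot(ndimage.sobel(x, axis=1), ndimage.sobel(x, axis=0))
    ga, gb, gf = grad_mag(a), grad_mag(b), grad_mag(f)
    q_af = np.minimum(ga, gf) / (np.maximum(ga, gf) + eps)  # edge-strength preservation
    q_bf = np.minimum(gb, gf) / (np.maximum(gb, gf) + eps)
    return float((q_af * ga + q_bf * gb).sum() / ((ga + gb).sum() + eps))
```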

4.3 Experiments and results

4.3.1 Comparison with MGA-based methods

In the first group of experiments, we used the presented method to fuse five typical infrared and visible image pairs from the TNO dataset, namely “Men in front of house,” “Bunker,” “Sandpath,” “Kaptein_1123,” and “barbed_wire_2.” In addition, six MGA-based methods are selected for comparison, including WT [23], TEMST [35], NSST with weighted average [36], NSST with WT [37], NSCT with WT [38], and CURV with WT [39].

The key to MGA-based fusion schemes is the selection of the transform. WT- and CURV-based methods produce block artifacts, reduce the contrast of the image, and cannot capture abundant directional information. The NSCT-based method captures the geometry of image edges well, but the number of directions at each level is fixed. In NSST-based methods, the number of directions can be set arbitrarily, so more detailed information can be obtained; however, the more directions, the longer the running time. We also replaced the LP with NSST in TEMST as a comparative experiment.

In the proposed method, the pyramid filter for NSST is set to “maxflat,” the decomposition level of NSST is set to 3, and the numbers of directions are set to {4,4,4}. The high-frequency sub-bands are decomposed into 1 level by the WT (with the Haar basis). The results are shown in Fig. 6. The first two rows in Fig. 6 show the infrared and visible images. The seven remaining rows show the fused images of our method, TEMST, NSST with weighted average, WT, NSST with WT, NSCT with WT, and CURV with WT. The subjective and objective evaluation parameters introduced earlier are used to analyze the fusion results.

Fig. 6 Fusion results on five typical infrared and visible images in the TNO dataset. a Men in front of house. b Bunker. c Sandpath. d Kaptein_1123. e barbed_wire_2. From top to bottom: infrared image, visible image, fused images of our method, TEMST, NSST with weighted average, WT, NSST with WT, NSCT with WT, and CURV with WT

The values of the five assessment indicators (i.e., E, AG, SD, SF, and QAB/F) on the five typical infrared and visible image pairs are shown in Fig. 7. The larger their values, the better the fusion effects are.

Fig. 7 Comparison of five evaluation parameters, including E, AG, SD, QAB/F, and SF. The seven methods are our method, TEMST, NSST with weighted average, WT, NSST with WT, NSCT with WT, and CURV with WT, evaluated on five pairs of datasets

4.3.2 Comparison with the state-of-the-art methods

In this part, seven typical infrared and visible image pairs from the TNO dataset (i.e., Men in front of house, Bunker, Soldier_behind_smoke_1, Nato_camp_sequence, Kaptein_1123, Lake, and Barbed_wire_1) are chosen to evaluate the effectiveness of the proposed method. We compare the proposed method with five other advanced methods: the guided filtering-based weighted average technique (GF) [40], multi-resolution singular value decomposition (MSVD) [41], fourth-order partial differential equations (FPDE) [42], different resolutions via total variation (DRTV) [43], and visual attention saliency guided joint sparse representation (SGJSR) [44].

The fused images are shown in Fig. 8. The values of the five evaluation metrics on the seven infrared and visible image pairs are shown in Fig. 9.

Fig. 8 Qualitative fusion results on seven sets of typical infrared and visible images in the TNO dataset. a Men in front of house; b Bunker; c Soldier_behind_smoke_1; d Nato_camp_sequence; e Kaptein_1123; f Lake; g Barbed_wire_1. From top to bottom: infrared image, visible image, the results of GF, MSVD, DRTV, SGJSR, FPDE, and our method. A small area (the red rectangle) in each fusion result is enlarged and shown at the bottom

Fig. 9 Comparison of five evaluation parameters, including E, AG, SD, QAB/F, and SF. The six methods are GF, MSVD, DRTV, SGJSR, FPDE, and our method

4.3.3 Results and discussion

As seen in Figs. 6, 7, 8, and 9, all twelve methods can fuse infrared and visible images effectively. In the other MGA-based methods, the fused image is dark and the target is not prominent, which can be clearly seen from the sky in the images “Men in front of house” and “Kaptein_1123” in Fig. 6. In contrast, the proposed method yields clearly and easily identifiable target information. In terms of the objective evaluation parameters, our method generally scores higher than the other methods, as seen in Fig. 7. In short, the presented method is superior to the other MGA-based methods.

Compared with the five advanced methods, the presented method achieves the best visual quality, as shown in Fig. 8. However, the objective evaluation parameters (i.e., E, AG, SD, SF, and QAB/F) in Fig. 9 fluctuate: our method does not always obtain the highest values, but it provides more stable image quality. In all, our method is competitive with the five advanced fusion methods.

5 Conclusions

In this study, we propose a new fusion algorithm for infrared and visible images based on nonlinear enhancement and NSST decomposition. The experiments demonstrate that this algorithm can not only retain the texture details of the visible image but also highlight the targets in the infrared image. Compared with other MGA-based and advanced algorithms, it is competitive or even superior in terms of qualitative and quantitative evaluation. The fusion performance is also beneficial for target detection and tracking in complex environments.

Availability of data and materials

The datasets used and analyzed during the current study are available from the corresponding author on reasonable request.

Abbreviations

MGA:

Multi-scale geometric analysis

NSST:

Non-subsampled shearlet transform

WT:

Wavelet transform

CURV:

Curvelet transform

NSCT:

Non-subsampled contourlet transform

TEMST:

Target-enhanced multiscale transform decomposition

GF:

Guided filtering-based weighted average technique

MSVD:

Multi-resolution singular value decomposition

FPDE:

Fourth order partial differential equations

DRTV:

Different resolutions via total variation

SGJSR:

Visual attention saliency guided joint sparse representation

References

  1. W. Liu, Z. Wang, A novel multi-focus image fusion method using multiscale shearing non-local guided averaging filter [J]. Signal Process. (2020). https://doi.org/10.1016/j.sigpro.2019.107252

  2. S.M. Darwish, Multi-level fuzzy contourlet-based image fusion for medical applications [J]. IET Image Process. 7(7), 694–700 (2013)

  3. P.H. Venkatrao, S.S. Damodar, HWFusion: Holoentropy and SP-Whale optimisation-based fusion model for magnetic resonance imaging multimodal image fusion [J]. IET Image Process. 12(4), 572–581 (2018)

  4. X. Wei, Adaptive remote sensing image fusion under the framework of data assimilation [J]. Opt. Eng. 50(6), 067006 (2011)

  5. G. Simone, A. Farina, F.C. Morabito, et al., Image fusion techniques for remote sensing applications [J]. Information Fusion 3(1), 3–15 (2002)

  6. W. Li, X. Hu, J. Du, et al., Adaptive remote-sensing image fusion based on dynamic gradient sparse and average gradient difference [J]. Int. J. Remote Sens. 38(23), 7316–7332 (2017)

  7. R. Raghavendra, C. Busch, Novel image fusion scheme based on dependency measure for robust multispectral palmprint recognition [J]. Pattern Recogn. 47(6), 2205–2221 (2014)

  8. R. Singh, M. Vatsa, A. Noore, Integrated multilevel image fusion and match score fusion of visible and infrared face images for robust face recognition [J]. Pattern Recogn. 41(3), 880–893 (2008)

  9. J. Han, B. Bhanu, Fusion of color and infrared video for moving human detection [J]. Pattern Recogn. 40(6), 1771–1784 (2007)

  10. Z. Zhou, M. Dong, X. Xie, et al., Fusion of infrared and visible images for night-vision context enhancement [J]. Appl. Opt. 55(23), 6480 (2016)

  11. M. Ding, L. Wei, B. Wang, Research on fusion method for infrared and visible images via compressive sensing [J]. Infrared Phys. Technol. 57, 56–67 (2013)

  12. S. Gao, W. Jin, L. Wang, Objective color harmony assessment for visible and infrared color fusion images of typical scenes [J]. Opt. Eng. 51(11), 117004 (2012)

  13. Q. Zhang, Y. Fu, H. Li, et al., Dictionary learning method for joint sparse representation-based image fusion [J]. Opt. Eng. 52(5), 057006 (2013)

  14. M. Wang, Z. Mi, J. Shang, et al., Image fusion-based video deraining using sparse representation [J]. Electron. Lett. 52(18), 1528–1529 (2016)

  15. X. Fengtao, J. Zhang, L. Pan, et al., Robust image fusion with block sparse representation and online dictionary learning [J]. IET Image Process. 12(3), 345–353 (2018)

  16. W. Kong, J. Liu, Technique for image fusion based on nonsubsampled shearlet transform and improved pulse-coupled neural network [J]. Opt. Eng. 52(1), 017001 (2013)

  17. G. Wang, H. Tang, B. Xiao, et al., Pixel convolutional neural network for multi-focus image fusion [J]. Information Sciences: An International Journal 433/434, 125–141 (2018)

  18. S. Li, Z. Yao, W. Yi, Frame fundamental high-resolution image fusion from inhomogeneous measurements [J]. IEEE Trans. Image Process. 21(9), 4002–4015 (2012)

  19. D.P. Bavirisetti, R. Dhuli, Two-scale image fusion of visible and infrared images using saliency detection [J]. Infrared Phys. Technol. 76, 52–64 (2016)

  20. L. Petrusca, P. Cattin, V. De Luca, et al., Hybrid ultrasound/magnetic resonance simultaneous acquisition and image fusion for motion monitoring in the upper abdomen [J]. Investig. Radiol. 48(5), 333–340 (2013)

  21. W. Kong, Technique for gray-scale visual light and infrared image fusion based on non-subsampled shearlet transform [J]. Infrared Phys. Technol. 63, 110–118 (2014)

  22. Z. Zhou, M. Tan, Infrared image and visible image fusion based on wavelet transform [J]. Adv. Mater. Res. 756-759(2), 2850–2856 (2013)

  23. D.L. Donoho, Wedgelets: nearly minimax estimation of edges [J]. Ann. Stat. 27(3), 859–897 (1999)

  24. F.E. Ali, I.M. El-Dokany, A.A. Saad, et al., A curvelet transform approach for the fusion of MR and CT images [J]. J. Mod. Opt. 57(4), 273–286 (2010)

  25. L. Guo, M. Dai, M. Zhu, Multifocus color image fusion based on quaternion curvelet transform [J]. Opt. Express 20(17), 18846 (2012)

  26. M.N. Do, M. Vetterli, The contourlet transform: an efficient directional multiresolution image representation [J]. IEEE Trans. Image Process. 14(12), 2091–2106 (2005)

  27. G. Bhatnagar, Q. Wu, Z. Liu, Directive contrast based multimodal medical image fusion in NSCT domain [J]. IEEE Transactions on Multimedia. 15(5), 1014–1024 (2013)

  28. Y. Li, Y. Sun, X. Huang, et al., An image fusion method based on sparse representation and sum modified-laplacian in NSCT domain [J]. Entropy 20(7), 522 (2018)

  29. Z. Fan, D. Bi, S. Gao, et al., Adaptive enhancement for infrared image using shearlet frame [J]. J. Opt. 18(8), 085706 (2016)

  30. P. Ganasala, V. Kumar, Multimodality medical image fusion based on new features in NSST domain [J]. Biomed. Eng. Lett. 4(4), 414–424 (2015)

  31. W. Kong, B. Wang, Y. Lei, Technique for infrared and visible image fusion based on non-subsampled shearlet transform and spiking cortical model [J]. Infrared Phys. Technol. 71, 87–98 (2015)

  32. L. Xu, G. Gao, D. Feng, Multi-focus image fusion based on non-subsampled shearlet transform [J]. IET Image Process. 7(6), 633–639 (2013)

  33. Q. Miao, C. Shi, P. Xu, et al., A novel algorithm of image fusion using shearlets [J]. Opt. Commun. 284(6), 1540–1547 (2011)

  34. Y. Zhang, L. Zhang, X. Bai, et al., Infrared and visual image fusion through infrared feature extraction and visual information preservation [J]. Infrared Phys. Technol. 83, 227–237 (2017)

  35. J. Chen, X. Li, L. Luo, G. Mei, J. Ma, Infrared and visible image fusion based on target-enhanced multiscale transform decomposition [J]. Inf. Sci. (2020). https://doi.org/10.1016/j.ins.2019.08.066

  36. X. Liu, W. Mei, H. Du, Structure tensor and nonsubsampled shearlet transform based algorithm for CT and MRI image fusion [J]. Neurocomputing. 235, 131–139 (2017)

  37. Z. Qu, Y. Xing, Y. Song, An image enhancement method based on non-subsampled shearlet transform and directional information measurement [J]. Information 9(12), 308 (2018)

  38. Y. Wu, H. Zhang, F. Zhang, et al., Fusion of visible and infrared images based on non-sampling contourlet and wavelet transform [J]. Appl. Mech. Mater. 3360(1200), 1523–1526 (2014)

  39. G.G. Bhutada, R.S. Anand, S.C. Saxena, Edge preserved image enhancement using adaptive fusion of images denoised by wavelet and curvelet transform [J]. Digital Signal Processing 21(1), 118–130 (2011)

  40. S. Li, X. Kang, J. Hu, Image fusion with guided filtering [J]. IEEE Trans. Image Process. 22(7), 2864–2875 (2013)

  41. V.P.S. Naidu, Image fusion technique using multi-resolution singular value decomposition [J]. Def. Sci. J. 61(5), 479–484 (2011)

  42. D.P. Bavirisetti, G. Xiao, G. Liu, Multi-sensor image fusion based on fourth order partial differential equations, in 2017 20th International Conference on Information Fusion (Fusion), Xi’an, pp. 1–9 (2017). https://doi.org/10.23919/ICIF.2017.8009719

  43. Q. Du, X. Han, et al., Fusing infrared and visible images of different resolutions via total variation model [J]. Sensors (Basel, Switzerland) (2018). https://doi.org/10.3390/s18113827

  44. B. Yang, S. Li, Visual attention guided image fusion with sparse representation [J]. Optik - International Journal for Light and Electron Optics 125(17), 4881–4888 (2014)


Acknowledgements

The research is supported by the National Natural Science Foundation of China under NO. 61805021 and the Department of Science and Technology of Jilin Province under NO. JJKH20191196KJ.

Funding

This work is supported in part by the Natural Science Foundation of China under NO. 61805021 and in part by the Department of Science and Technology Plan Projects of Jilin Province under NO. JJKH20191196KJ.

Author information

Authors and Affiliations

Authors

Contributions

XX is the main writer of this paper. She proposed the main idea and constructed the nonlinear function. LC (the second author) and LC (the third author) completed the simulation experiments and the comparisons with other algorithms. XT gave some important suggestions for the simulation. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Xiaoxue Xing.

Ethics declarations

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Xing, X., Liu, C., Luo, C. et al. Infrared and visible image fusion based on nonlinear enhancement and NSST decomposition. J Wireless Com Network 2020, 162 (2020). https://doi.org/10.1186/s13638-020-01774-6
