You are on page 1of 12

b i o s y s t e m s e n g i n e e r i n g 2 1 9 ( 2 0 2 2 ) 1 6 5 e1 7 6

Available online at www.sciencedirect.com

ScienceDirect

journal homepage: www.elsevier.com/locate/issn/15375110

Research Paper

Improving rice nitrogen stress diagnosis by


denoising strips in hyperspectral images via deep
learning

Yueming Zhu a,b, Alwaseela Abdalla a,b, Zheng Tang a,b, Haiyan Cen a,b,*
a
College of Biosystems Engineering and Food Science, and State Key Laboratory of Modern Optical Instrumentation,
Zhejiang University, Hangzhou, 310058, China
b
Key Laboratory of Spectroscopy Sensing, Ministry of Agriculture and Rural Affairs, Hangzhou, 310058, China

article info
It is critical to assess crop nitrogen status for precision fertilization and crop management.
Article history: Hyperspectral imaging (HSI) has been proved as one of the valuable sensing technologies
Received 3 October 2021 for plant nitrogen stress detection, while exploring efficient methods for hyperspectral
Received in revised form image analysis remains challenging due to the high dimensionality, information redun-
1 April 2022 dancy, and noise interference. This study aims to develop a de-striping convolution neural
Accepted 3 May 2022 network (DS-CNN) to remove strip noise in hyperspectral images and design a nitrogen
Published online 20 May 2022 diagnosis CNN (ND-CNN) for rice leaves. The DS-CNN included an image encoder, a strip
dropout bottleneck, and a decoder to remove the strip noise in hyperspectral images.
Keywords: Activation function Leaky-ReLU was introduced to activate the potential neurons to keep
Deep learning the real text features of the hyperspectral image. Element-wise feature addition mecha-
Hyperspectral imaging nism was also applied to guarantee the full range of band images that reached the best
Strip denoising structural and textural similarities. We constructed six datasets with different noise scales
Rice crop to explore the potential of DS-CNN. The best performance of DS-CNN was on the lowest
Nitrogen strip noise dataset (s ¼ 0.02), with the mean squared error (MSE) lower than 2  104,
highest structure similarity index metric (SSIM) of 0.99, and peak signal-to-noise ratio
(PSNR) of around 36 dB on the validation dataset. Moreover, we compared the nitrogen
diagnosis performance before and after denoising. The ND-CNN developed from the
denoised dataset not only avoided the overfitting but also improved the accuracy of ni-
trogen stress diagnosis of rice leaves.
© 2022 IAgrE. Published by Elsevier Ltd. All rights reserved.

nitrogen status of the crops can be estimated using either


1. Introduction laboratory-based chemical analysis, visual observation by
experts, or soil plant analysis development (SPAD) chlorophyll
The measurement of nitrogen (N) stress is a prerequisite for meter. Although chemical analysis is the most accurate and
the quantitative assessment of crops. Traditionally, the precise method, it has the limitations of being invasive,

* Corresponding author. College of Biosystems Engineering and Food Science, Zhejiang University, Hangzhou, China.
E-mail address: hycen@zju.edu.cn (H. Cen).
https://doi.org/10.1016/j.biosystemseng.2022.05.001
1537-5110/© 2022 IAgrE. Published by Elsevier Ltd. All rights reserved.
166 b i o s y s t e m s e n g i n e e r i n g 2 1 9 ( 2 0 2 2 ) 1 6 5 e1 7 6

destructive, and preventing instantaneous nitrogen mea- Wang, & Zhu, 2003; Pan & Chang, 1992)and wavelet analysis
surements. The SPAD is a simple and portable diagnostic tool (Torres & Infante, 2001), can only perform well on the periodic
that measures the greenness or relative content of leaves stripe noise and may affect the structural details of the image
using an empirical formula. This method, however, has that have similar frequencies as the stripe resulting in blur-
limited success in the large-scale area because it is time- ring and or ringing artifacts of the output images. The
consuming (Abdalla, Cen, Wan, Mehmood, & He, 2021). As statistical-based methods, such as histogram equalization
an alternative technique, hyperspectral imaging (HSI) tech- and moment matching, rely on the statistical properties of the
niques combined with machine learning (and its subclass digital sensor and assume that the mean and standard devi-
deep learning) have been considered one of the most impor- ation of the digital sensors are consistent and the image scene
tant remote and proximal techniques to instantaneously is homogenous, when such assumption is violated the image
monitor plant nutrient status none destructively and none- details can be damaged during destriping. Hence, there is a
invasively. Compared to other imaging techniques, the need to have a common and robust de-striping framework
advantage of HSI can extract both physiological and structural that will directly estimate stripe-free images from the stripe
information associated with nitrogen status (Jung, Song, images without considering the intrinsic properties of the
Hong, & Shim, 2021; Sabzi, Pourdarbani, Rohban, Garcı́a- stripe noise. Recent advances in deep learning networks,
Mateos, & Arribas, 2021). particularly convolutional neural networks (CNN), have
Nitrogen stress can be sensed by spectral reflectance in attracted the attention of researchers to apply this approach
both visible and near-infrared (NIR) regions (400e1000 nm), A to solve such an ill-posed problem. The first deep learning
significant correlation between plant nitrogen status and method (named SNRCNN) is proposed by Kuang, Sui, Chen,
spectral reflectance has been reported in recent literature. For and Gu (2017) to remove the stripe noise from infrared im-
example, Wang et al. (2021) employed unmanned aerial ages. It is a simple and shallow CNN architecture, which is
vehicle-based hyperspectral imagery to estimate the nitrogen evaluated on simulated non-uniform stripes and tends to
of the paddy rice at leaves and plant levels taking into account over-smooth the output image. Deshpande, Patale, and Roy
the growth stage, high correction between vegetation indices (2021) proposed a global residual-based deep neural network
that are derived from the hyperspectral images, and nitrogen to remove the stripe noise from remote sensing imagery, the
status have been found. Similar works have been reported by image quality is further improved using multilevel wavelet
Li, Li, Elsayed, Hu, and Schmidhalter (2021), their results decomposition and frequency domain filtering as a post-
showed that the reflectance in the 550 nm wavelength region processing step. A wavelet-based deep neural network pro-
has a significant influence on differentiating between various posed by Guan, Lai, and Xiong (2019) separates the stripe noise
N treatments. While other studies indicate that the total N in from the scene. This method preserves the image details
plants can be determined at 671 nm and 780 nm wavelengths without using post-processing but the details along the stripe
(Tros et al., 2021). direction are over-smoothed. This over-smoothing cause
The application of HSI, however, is hampered by different degradation of image quality in remotely sensed data and
sources of noise, including but limited to, atmospheric effects reduce its usefulness for any application in the automated
and internal sensor noises. The atmospheric effects can be analysis process. Therefore, for remote sensing appl, there is a
eliminated by using close-range HSI under controlled illumi- need to have a common and robust method to remove the
nation conditions, but the internal hyperspectral sensor stripe noise from hyperspectral images while maintaining the
noises, such as thermal-driven noise (i.e., strip noise), quan- structural and textural details without using post-processing
tization noise, and shot (i.e., photon) noise are the main to the output image. With this motivation, a new deep
challenges associated with the use of HSI for extraction learning network for hyperspectral image de-striping with
meaningful information from the plants. The strip noise improved performance has been proposed in this study,
resulting from an increasing temperature of the sensor (i.e., namely a de-striping convolution neural network (DS-CNN).
due to poor heat dissipation) can cause severe degradation in The main contributions of this article can be summarized
image quality, but also significantly influence the perfor- as follows; (1) we proposed a de-striping convolution neural
mance of HSI-based nutrient status diagnosis of the crops (Liu, network (DS-CNN) for removing strip noise in the HSI while
Cai, Li, Ning, & Jiang, 2022; Sellami & Tabbone, 2022) and other preserving the structural details of the output image. Our DS-
level tasks based on hyperspectral imaging. Therefore, strip CNN architecture consists of an encoder, a strip dropout
noise removal has become an essential and inevitable pre- bottleneck, and a decoder. (2) a new activation function, so-
processing step before the subsequent image analysis and called Leaky-ReLU, was proposed to activate the potential
applications of the HSI. Eliminating the strip noise may neurons that help in maintaining good image quality and
significantly improve the performance and robustness of HSI preventing the over-smoothing problem. Additionally, an
applications (Dao, Mantripragada, He, & Qureshi, 2021). element-wise feature addition mechanism was applied to
Generally, strip noise can be divided into two classes; peri- ensure that the bands have the same structural and textural
odical and non-periodical noises. A periodical stripe noise is details. (3) extensive experiments on simulated and real data
more prevalent and difficult to handle. Recently, several de- demonstrate that our method outperforms many existing
stripping algorithms have been developed for remote state-of-the-art de-stripping methods in terms of quantitative
sensing applications and can be grouped into several cate- assessments. (4) finally, as proof of concept and to represent
gories; such as filtering-based methods, statistical-based the practical challenge of the proposed method in real plant
methods, and learning-based methods. Filtering-based phenotyping applications, the nitrogen status of the plants
methods, such as Fourier transform (Chen, Shao, Guo, was diagnosed using real de-stripped HSI.
b i o s y s t e m s e n g i n e e r i n g 2 1 9 ( 2 0 2 2 ) 1 6 5 e1 7 6 167

2. Materials and methods N* ¼ minn Eðo; sÞ þ RðnÞ (2)

where E(o,s) is a task-related metric, s is the strip noise cor-


2.1. Experimental design rupted image, o is the original clean image, R(n) is a regulari-
zation function learned from data (Sidorov & Hardeberg, 2019).
The experiment was implemented at the rice-production While, Lempitsky, Vedaldi, and Ulyanov (2018) consider that
Functional Area (29 310 5.3500 N, 120 60 6.1200 E) which is located this regularization can be extracted with the CNN network
at Anhua Town, Zhuji City, Zhejiang Province, China with an parameters optimization process, which can be applied to the
annual temperature of 16.3  C and 16 m altitude above the sea. noise removal problem depicted by the equation below.
The rice (Yongyou 1540) in the experimental site was divided
 
into 25 subplots, each of them with 18  10 m2 , all of the rice n* ¼ fq* ðsÞ; where q* ¼ argminn E fq ðsÞ; n0 (3)
plants were transplanted in the June of 2019 at the pre-
in which f is a CNN architecture, s is a specific input of noise,
planning stage. Five different stress of N fertilizers (N0, N1,
meanwhile, an expected clean image can be restored with the
N2, N3, and N4 ¼ 0: 0.5: 1: 1.5: 2) were applied in the form of
optimization of the neural network f, the weights, and other
urea, with five repetitions at different growth stages including
parameters can be learned from the training and optimization
40% at pre-planting stage, 30% at the tillering stage and 30% at
process. Generally, we can optimize f's performance by mean
the booting stage.
squared error (MSE) for loss function L which can be depicted
as:
2.2. Hyperspectral image acquisition
1 Xn

The hyperspectral imaging system used in this study was LðtÞ ¼ kMðSi ; tÞ  ðSi  Oi Þk2f (4)
2n i¼1
developed by AgroOptics and Imaging Lab at Zhejiang Uni-
versity, it covers the visible and near-infrared spectral range where Si and Oi are the striped and clean image pairs in the
of 400e1000 nm. The system consists of a CMOS camera (Zyla- dataset, t are the parameters to be trained, f is the Frobenius
4.2 P, Andor Technology, UK.), an imaging spectrograph norm function. The whole architecture of DS-CNN is shown in
(ImSpector V10E, Spectral Imaging Ltd., Oulu, Finland) with a Fig. 1 which will be explained in the following.
spectral resolution of 2.8 nm, a high-quality lens (Schneider The proposed DS-CNN employs 3 functional modules
Kreuznach Xenoplan 1.4/17e0903), two tungsten halogen namely encoder, dropout block, and decoder. The input of DS-
lamps (Fibre-Lite DC950 Illuminator, 150 W, Dolan Jenner In- CNN is optimized as 128  128  1, note that we have a single
dustries Inc. Boxborough, MA, USA.) and a stepper motor channel feature map, each encoder layer in the encoder block
driven conveyer belt. owns a different set of filters with optimized contractible sizes
Hyperspectral images of rice leaves were collected during and numbers, as depicted in order for this up-sampling
the initial heading stage and initial filling stage on 28 August computation is applied to save essential features of the
and 21 September 2019, respectively. The nitrogen supplica- input image, these abundant features from the encoder are
tion at these two growth stages is very crucial to the final yield sent to the decoder by layer addition for twice. The multiple
production in terms of both quality and quantity (Cen et al., filters with different sizes and numbers are also applied at the
2019). beginning of the dropout block after the dense layers, three
Before the imaging acquisition, the surface of the leaves deconvolution layers are applied to drop out more noise fea-
was cleaned to avoid the interference of dust and water, the tures. In the decoding block, the batch-normalization layer is
resolution parameter was set to 1024  237 (spatial  spectral) applied twice to activate the expected parameters. The final
mode. The white and dark reference images were used for denoised image is reconstructed as the same size as the input,
hyperspectral image correction to remove the environmental the more detailed explanation of DS-CNN will be explained in
effects by equation (1): the following.
The encoder itself can be applied for image feature
IO  B extraction as shown in Fig. 1, we tuned different kernel sizes
IC ¼  100% (1)
WB
for down-sampling with stride 2 rather than pooling as pre-
where IC is the corrected hyperspectral image, IO is the original vious research reported (Mao, Shen, & Yang, 2016) for the four
raw image, B is the image collected under dark conditions convolutional layers in the encoder block. During the training,
(with 0% reflectance), and W is the reflectance image obtained the MSE of the model on different filter sizes are recorded and
by the standard whiteboard (Isuzu Optics Corp., Taiwan, compared, the PSNR and SSIM are also evaluated as image
China) with a white surface (99% reflectance) under the quality monitors. The training time and resource cost increase
halogen lamp. After calibration, the image segmentation was rapidly as the filter size increases the computation
performed using the threshold method to mask out the region complexity. Filter size 3  3 showed an outperforming de-strip
of interest (ROI). result than others, which is an outperformed filter size for DS-
CNN after tuning.
2.3. DS-CNN architecture Dropout is arguably the most prominent regularization
technique practically (Cai et al., 2019) to pre overfitting, we
The noise removal can be expressed as an inverse task construct the model dropout as a bottleneck structure, two
formulated as an energy minimization problem (Chen, convolutional layers (4  4 followed by 2  2 convolution)
Papandreou, Kokkinos, Murphy, & Yuille, 2018) as: with Leaky-ReLU activation function are arranged after both
168 b i o s y s t e m s e n g i n e e r i n g 2 1 9 ( 2 0 2 2 ) 1 6 5 e1 7 6

Fig. 1 e The architecture design of the De-Striping Convolution Neural Network (DS-CNN).

of the convolutional layers, one flatten layer and two dense x is the feature value in the arbitrary index from the con-
layers are followed as final feature representation, which volutional layers, it is a widely used activation function
was transferred to the final decoder block by the adversely because it costs less but works faster than other activation
convolutional computation, this design was proved to be functions such as tanh, sigmoid, max out, etc. (LeCun, Bengio, &
stable and effective after practical tuning to reach the Hinton, 2015). But ReLU sets the x to zero when x<0, thus
advanced strip removal performance evaluated by MSE, making the neuron dead which improves the sparseness of
PSNR and sent SSIM. the network. However, ReLU has been proved limitations for
The decoder block takes features from the encoder and image denoising, because it fails to identify the noise from the
dropout blocks to reconstruct the clean image, we used the image text feature, especially under heavy and complicated
conv2D transpose layer as a basic unit according to the noise conditions, it cannot retain the positive image features
encoder dimensions and arrangements, essential features as well as remove the noise features effectively (Shang, Sohn,
from the encoder and dropout block are combined twice ac- Almeida, & Lee, 2016).
cording to the feature map dimension. At the end of each convolution layer followed by the leaky-
Skip connection was used twice in the model to transfer ReLU (leaky version of Rectified Linear Unit) function, a non-
the textural feature across the dropout block, it ensures the saturating activation function that overcomes the “dying
maximum flow of features between the encoder and decoder ReLU” problem given by
block. Usually, gradient information can be lost when passing 
ax jx < 0
through many layers, the motivation behind layer addition is f ðxÞ ¼ (6)
x jx  0
that it has an uninterrupted gradient flow from the forward
layer to the after, which can also contribute to tackling the where a is a number that can be very small (we used 0.01) to
vanishing gradient problem. It enables an alternative way to avoid negative values than ReLU, our tuning proved leaky-ReLU
ensure feature reusability in the same dimension as the to be more effective and stable for strip denoising.
former layers. Flatten and dense layers are also applied to Our model tries to reduce the model complexity by using
prevent overfitting after the two skip connections to enforce very small receptive fields (3  3) in every convolutional layer
the DS-CNN reached its gradient edge as early as possible. resulting in a model size of only 23 M. Small size receptive
Activation functions should be selected according to the fields allow DS-CNN to have a large number of convolutional
practical performance, previous studies have reported the layers. Training such a deep network may have a lot of com-
ReLU activation function as follow: plications not only overfitting and high computational cost
 but also have degradation problems in the training accuracy.
0 jx < 0
f ðxÞ ¼ (5) To resolve this issue, we added a skip connection to our
x jx  0
network, which means that some of the layers in the network
b i o s y s t e m s e n g i n e e r i n g 2 1 9 ( 2 0 2 2 ) 1 6 5 e1 7 6 169

are skipped and feed the output of one layer as the input to the Therefore, the data augmentation was adopted to artificially
next layers. Such a technique was introduced to solve the increase the number of the training dataset, 10 times image
degradation problem in different deep learning architectures augmentation was applied with a spatial dimension of
such as ResNet and U-Nets. Another method used to boost the 128  128 covering all of the bands by random cropping. We
performance of the DS-CNN and improve the convergence is simulated strip noised data by adding different scales of strip
that each convolutional layer was followed by a Leaky ReLU noise on the clean hyperspectral images, for each hyper-
(LReLU) activation function which prevents neurons that have spectral image, we add random strip noise level s to each band
values less than zero to die during the training progress and image x, so the simulated striped image y at waveband w
no training will happen to these neurons. Yw¼Xwþs where Xw is the noise-free image and s is the
simulated strip noise. Totally six noisy datasets were gener-
2.4. ND-CNN architecture ated according to the noise level s ¼ 0.02, 0.05, 0.10, 0.15, 0.20,
and 0.25, with the clean image as a reference to calculate PSNR
The nitrogen diagnosis model namely ND-CNN which is and SSIM, DS-CNN was trained on all the datasets to exploring
designed to diagnose the nitrogen status of rice plants is the ultimate denoising capability.
constructed by two convolutional up-sampling blocks as
feature generators, each of that contains two convolutional 3.2. Implementation details
layers and one max-pooling layer, a fully connected layer, and
two dense layers at the end as dropout function to handle with ROI mask process for HSI preprocessing was built with the
the final prediction (see Fig. 2). Environment for Visualizing Images (ENVI) software (ENVI 4.6)
The convolutional kernel size is set to 3  3 which has been (Research Systems Inc., Boulder Co., USA) and batch pro-
proved an excellent performance for image classification, the cessing with MATLAB R2018b (The MathWorks, Natick, MA,
max-pooling layer can reduce the dimensionality of feature USA). The DS-CNN was implemented by a deep learning
maps with a pool size of 2  2, dense layers are applied to framework namely Keras, it offers a GPU version to speed up
dropout the neurons before the final classification. ReLU is the large-scale image computation. All experiments were
applied as an activation function between each of the conducted on the Windows server with an Intel(R) Core(TM)
convolution layers. Softmax is applied as activation for final CPU mode i7-7800X @ 3.50 GHz CPU, the memory of 32 GB, a
classification, the learning rate is set to 1  103. GPU namely NVIDIA GeForce GTX 1080 Ti with 11 GB memory.
Considering the GPU resource and memory limitation, the
training batch size was set to 128 during the training process.
3. Model training

3.1. Simulated dataset construction 4. Results and discussion

The HSI image acquisition is time-consuming and requires a To evaluate the validity and generality of DS-CNN, one loss
large amount of storage capacity making it difficult to obtain a indicator and two image quality indicators were introduced to
large number of images at one time, our total number of monitor the model's performance. We used mean squared
training data contains 98 clean HSI images with a spatial and error (MSE) (X. Liu, Tanaka, & Okutomi, 2013) as back-
spectral resolution of (1024  1024) and 237, respectively. propagation, which can be defined as follows:

Fig. 2 e Framework of nitrogen diagnosis CNN (ND-CNN).


170 b i o s y s t e m s e n g i n e e r i n g 2 1 9 ( 2 0 2 2 ) 1 6 5 e1 7 6

For a given reference image X and processed image Y, both the strips effectively and produce an image with high quality,
of the same size m  n, the MSE between X and Y is defined by: see Fig. 3 (b) and (c).
Table 1 reports the quantitative indicators of testing sam-
m1 X
1 X n1
2 ples on trained DS-CNN, it is clear that the DS-CNN reached
MSE ¼ ðXði; jÞ  Yði; jÞÞ (7)
mn i¼0 j¼0 outstanding performance on the dataset with noise level
s ¼ 0.02, it is also clear that image contaminated with slight
For DS-CNN, the minimization of the MSE was imple-
noise can be removed with higher image quality indicated by
mented by the Adam optimizer. Besides, peak signal-to-noise
the quantitative index from Table 1.
ratio (PSNR) and structure similarity index metric (SSIM) (Hore
& Ziou, 2010) were introduced to evaluate the denoising
4.2. Hyperspectral strip removal performance
quality. PSNR is an objective index, higher PSNR usually in-
comparison
dicates an increased image quality after denoising, which is
defined as the following:
There are several reported methods for stripe noise removal in
 2
 HSI. Here, we have selected the most four relevant methods to
255
PSNRðX; YÞ ¼ 10 log10 (8) demonstrate the contributions of our work. One is filtering-
MSEðX; YÞ
based method and others are learning-based methods as fol-
The SSIM is a quality metric used for image processing lows. (1) Wavelet-Domain Low-Rank/Group-Sparse (WDLRGS)
expressed by loss of correlation, image luminance distortion, destriping method proposed by N. Liu, Li, Tao, Fowler, and
and image contrast distortion, it is independent of both image Yang (2019). This method leveraged a group-sparse model of
brightness and contrast, which can be used to evaluate the wavelet coefficients of a hyperspectral image with a low-rank
initial feature and structural difference between two images. stripe model to destripe subbands of a 2-diminational discrete
The SSIM is defined as follows: wavelet transform (2D-DWT). The destriping process, as
formulated by an Alternating Direction Method of Multipliers -
SSIMðX; YÞ ¼ lðX; YÞcðX; YÞsðX; YÞ (9)
based low-rank/group-sparse optimization, was deployed
where exclusively in the 2D-DWT subbands wherein the stripes from
a pushbroom sensor are concentrated. (2) Wavelet Deep
2uX uY þ C1
lðX; YÞ ¼ (10) Neural Network (SNRWDNN) proposed by Shan, Liu, Mu, Cao,
u2X þ u2Y þ C1
and He (2019) for Stripe Noise Removal. This approach treats
the stripe noise removal as a transform coefficients prediction
2sX sY þ C2
cðX; YÞ ¼ (11) problem in wavelet domain and utilize the directional char-
s2X þ s2Y þ C2
acteristics of the stripe to suppress the noise with less detail
loss. (3) Dual deep convolutional neural network (DDCNN) for
sXY þ C3
sðX; YÞ ¼ (12) hyperspectral image denoising proposed by Shan et al. (2019).
sX sY þ C3
This approach utilizes a new activation function exchange
Luminance l (X, Y) comparison function indicates the dif- information between the two CNNs. (4) Residual DNN with
ference of the two images’ mean luminance (mX and mY Þ, the post filtering (RDCNN) proposed by Deshpande et al. (2021), it
maximal of this index is 1 and equal to 1 only if mX ¼ mY . cðX; YÞ can adopt residual learning strategy in the context of global
is the indicator of contrast of the image after and before image residual learning method to destrip and denoise images from
processing, this contrast is presented by the standard devia- the estimated noise residue. It works using a single residual
tion sX and sY , this indicator is maximal to 1 and equal to 1 unit to predict the residual noise from the known noise pat-
only if sX ¼ sY . The expression sðX; YÞ reveals the correlation terns and reiterates model training to minimize the loss
coefficient after image denoising. The covariance between X function. These methods are designed either an excellent strip
and Y is sXY , the reasonable values of SSIM are in [0,1]. C1 , noise filter or a functional optimizer to solving the inverse
C2 and C3 are positive constants used to avoid a null problem.
denominator. As shown in Table 2, the proposed method shows
improved performance in terms of both Peak Signal-to-Noise
4.1. Training results on denoising Ratio (PSNR) and Structural Similarity Index Metric(SSIM)
quality measures on all noise scales when compared with the
The MSE is applied as loss indicator for DS-CNN, Adam is used state-of-the-art methods.
as optimization function, it adapts the learning rate of every
parameter, which employs an exponentially decaying average 4.3. Hyperspectral strip removal visualization
of previous gradients in the network, the learning rate was set
to 1  103 to initialize the training, the model reached its For the simulated noise scale datasets, a DS-CNN denoising
balanced peak after 1250 trainging epochs within 26 h. As result is shown in Fig. 4, the original case in Fig. 4 is the clean
showed in Fig. 3 (a), the average loss decreased to 1  104 after image, different noise scale images are shown under original
500 epochs which manifests that DS-CNN can learn the dif- case in Fig. 4(a), the denoised image with corresponding model
ference between the noised image and clean image, in order to are shown in (b) under original case. The outperform of lower
produce an high quality estimated image, DS-CNN should strip noise scale denoising performance in the Fig. 4 also
drop the noise without change the text feature at the same conclued that DS-CNN produced an outstanding performance
time, the PSNR and SSIM curve indicate the model can remove on the dataset with lower strip noise.
b i o s y s t e m s e n g i n e e r i n g 2 1 9 ( 2 0 2 2 ) 1 6 5 e1 7 6 171

Fig. 3 e De-Striping Convolution Neural Network training curves. (a) Average loss (b) Average peak signal to noise ratio
(PSNR) and (c) Average structural similarity index metric (SSIM) on the dataset with strip noise scace s ¼ 0.02.

strip removal and Fig. 5(b) illstrates a promising image details


Table 1 e The validated peak signal to noise ratio (PSNR), after denoising.
Average structural similarity (SSIM), and mean squared As the strip noise are varying among different spectral
error (MSE) on the datasets with coordinate noise scale.
regions, we have tested systems on another band in the visible
s ¼ 0.02 s ¼ 0.05 s ¼ 0.1 s ¼ 0.15 s ¼ 0.2 s ¼ 0.25 region to further confirm its robustness. Fig. 5 visualized the
PSNR 33.82 32.53 30.09 28.77 27.61 27.90 noise removal using different spectrum bands including the
SSIM 0.96 0.93 0.88 0.85 0.81 0.81 21st, 62nd, and 104th bands (482.49 nm, 558.91 nm, and
MSE 41.98 70.77 118.90 146.78 177.28 181.79 671.94 nm, respectively) which represent RGB image as shown
on the left side of Fig. 5 and 66th band (482.49 nm) indicating
An important benefit of the proposed method is the that the noise was removed from all bands and the textural
robustness in the accuracy of generated noise-free images feature was fully recovered after noise removal without
from the images subjected to different levels ranging from causing over-smoothing or damaging the boundaries of the
0.02 to 0.25. Fig. 4 shows the generated noise-free image from image.
the noisy image (Fig. 4a) compared to the noise-free image of
the same subject (Fig. 4b). Visually, it can be observed that the 4.4. Application on real hyperspectral images
denoised images generated by the proposed model have very
good quality in all selected bands; all the strip noises were To represent the practical challenge of our proposed deep
effectively removed from the input image. It is worth learning model, it was also evaluated on real hyperspectral
mentioning that when the noise scale is large, the quality is images contaminated by strip noise generated by hyper-
slightly degraded while maintaining the texture and struc- spectral systems during the experiment. As shown in Fig. 6,
tural information of the image and there is no over-smoothing our proposed model has the unprecedented capability of
or distortion was observed in the denoised image indicating removing the noise that is generated due to the malfunction of
that the generated noise-free images can be used to describe the hyperspectral imaging systems.
any biological phenomena such as nutrient health status of
the plants, as further explained in section 5. These visualiza-
tion results agree well with quantitative evaluation as pre- 5. Improvement of nitrogen stress diagnosis
sented in Table 2.
Figure 5 shows the denoising results simulated by the As one of the potential technologies, HSI has been applied to
hyperspectral data set image with false color (a) and RGB various aspects of plant research. Literature review shows
channels (b) from simulated dataset. The size of the selected that well-known nitrogen absorption bands can produce
image is 128  128, the original image in Fig. 5(a) is the 66th comparative results on plant nitrogen evaluation (Berger et al.,
band. The RGB image in Fig. 5(b) consists 21st, 62nd and 104th 2020; Sun et al., 2020), the nitrogen sensitive absorption bands
band image, the Fig. 5(a) manifests a clear text features after are mostly located in the VIS area, which showed lower

Table 2 e The average PSNR (in dB)/SSIM value of stripe noise cancellation performance on set of 10 hyperspectral images.
Noise Level PSNR/SSIM after strip noise cancellation
SNRWDNN WDLRGS DDCNN RDCNN Ours
s ¼ 0.05 27.04/0.79 29.74/0.91 29.52/0.94 33.18/0.95 33.82/0.96
s ¼ 0.1 24.50/0.82 29.34/0.87 26.67/0.90 30.07/0.93 32.53/0.94
s ¼ 0.15 22.87/0.60 27.88/0.85 27.37/0.85 28.17/0.87 30.09/0.88
s ¼ 0.20 20.35/0.49 24.57/0.82 25.84/0.83 26.45/0.85 28.77/0.86
s ¼ 0.25 18.13/0.45 22.63/0.78 23.36/0.80 25.29/0.80 27.61/0.82
172 b i o s y s t e m s e n g i n e e r i n g 2 1 9 ( 2 0 2 2 ) 1 6 5 e1 7 6

Fig. 4 e De-Striping Convolution Neural Network (DS-CNN) Denoising visulization. (a) The orginal clean image and 6 noise
scale images on selected bands, and (b) the original clean image and DS-CNN denoised image respectively.

reflectance and transmission but higher absorption by chlo- As shown in Fig. 7, in terms of the training process, the
rophyll a and chlorophyll b, plant leave reflectance also stripe-free image dataset showed a remarkable improvement
showed red edge between the visible area and near-infrared in both accuracy and validation loss, and a trivial difference
area (720e950 nm) (Berger et al., 2020; Wheeler, Levia, & between the training and validation accuracy was observed in
Vargas, 2020). Generally, an increase of nitrogen content the stripe-free dataset compared to the dataset that is
supply for the crop can be revealed by near-infrared region contaminated by the stripe noise; indicated that model
reflectance increase and visible region reflectance decrease trained on the noisy dataset more susceptible to overfitting
(Liang et al., 2018; Mishra et al., 2017), these advanced un- problem.
derstanding of plant spectral reflectance can be the guidance In terms of classification accuracy (Fig. 8), the stripe-free
for further research to reveal the relationship between the image dataset has also shown significant improvement in
absorption band images and the nitrogen contents of plant. distinguishing five levels of nitrogen stresses. The accuracy
Note that our initial purpose was evaluating the nitrogen was improved from 84.44% to 99.56%.
stress from rice leaves, inspired by the spectral sensitivities The classification confusion matrix in Fig. 8 showed the
and strong capabilities of machined learning, we generated ND-CNN performance on the dataset (a) before strip removal
the dataset with the sensitive band images for plant nitrogen and dataset (b) after strip removal by DS-CNN, it is a
stress diagnosis. We concatenated band image R560 from the remarkable improvement for nitrogen stress diagnosis on the
visible area, band image R810, and R950 from the near-infrared dataset (b) compared to (a), for the nitrogen stress diagnosis
area as ND-CNN input images, with 10 times random cropping on the dataset (a) without remove strips, five grades of nitro-
as data augmentation with the hyperspectral image before gen stress were misclassified to some extent, which had been
and after denoising. improved substantially on the dataset after denoised, except a
The ND-CNN model was firstly trained on the noisy dataset few samples from N0 and N3 been misclassified, the rest 3
as mentioned at the beginning which led to overfitting, classes (N1, N2, and N4) are predicted with 100% accuracy.
although we tried to adjust hyperparameters such as con- To show the improvements for nitrogen diagnosis practi-
volutional layers, filters even activation functions, overfitting cally, t-distributed stochastic neighbor embedding (t-SNE)
remain unchanged see Fig. 7(a). While the accuracy of the ND- (Van der Maaten & Hinton, 2008) visualization of feature
CNN remarkably increased on the denoised dataset by DS-CNN extraction by ND-CNN on the dataset (a) before strip removal
shown in Fig. 7(b), ND-CNN successfully avoided overfitting on and (b) dataset after strip removal by DS-CNN are shown in
the dataset (b) with higher accuracy and lower loss. Fig. 9, here we observed that the results of confusion matrix in
b i o s y s t e m s e n g i n e e r i n g 2 1 9 ( 2 0 2 2 ) 1 6 5 e1 7 6 173

Fig. 5 e Example of De-Striping Convolution Neural Network (DS-CNN) validation performance on the 6 noise levels. (a) The
denoising comparison with specific 66th band on one of the hyperspectral image patch (b) selected RGB (R:21st, G:62nd,
B:104th band) image before and after denoised on real leaf HSI image patch.

Fig. 8 is consistent with the t-SNE 2D map in Fig. 9, on the removal can be distributed in a map within 80  80 area rather
dataset (a) the 5 feature maps corresponding to 5 nitrogen than 160  160 area compared to the clustering on the dataset
stress overlapped each other which means a high number of before strip removal, Besides, the distribution of feature
misclassifications, the whole map are distributed in a larger groups from the dataset (b) is separated without any over-
space than the map extracted from the dataset (b). The t-SNE lapping, which indicates an out-performing nitrogen diag-
feature clustering of ND-CNN on the dataset after strip nosis result on the dataset (b).
174 b i o s y s t e m s e n g i n e e r i n g 2 1 9 ( 2 0 2 2 ) 1 6 5 e1 7 6

Fig. 6 e De-Striping Convolution Neural Network (DS-CNN) denoising on real strip noised hyperspectral image case. Top row
for noised hyperspectral band images, and the bottom row for DS-CNN denoised image. Note that the false color is chosen
for clear visualization. (For interpretation of the references to color in this figure legend, the reader is referred to the Web
version of this article.)

Fig. 7 e The accuracy and loss of nitrogen diagnosis CNN (ND-CNN) on dataset before (a) after (b) denoising.

Fig. 8 e The confusion matrix of Nitrogen Diagnosis CNN (ND-CNN) on (a) test dataset before denoising and after (b).
b i o s y s t e m s e n g i n e e r i n g 2 1 9 ( 2 0 2 2 ) 1 6 5 e1 7 6 175

Fig. 9 e 2D t-distributed stochastic neighbor embedding (t-SNE) feature visualizations after Nitrogen Diagnosis
Conculutional Neuron Network (ND-CNN) on the dataset (a) before and (b) after denoising by De-Striping Convolutional
Neural Network (DS-CNN).

6. Conclusions Acknowledgments

This paper proposed an advanced deep learning networks This research was supported by the Key Research and Devel-
architecture for stripe noise removal in a hyperspectral image opment Program of China (2021YFD2000104).
acquired using a close-range imaging system. Extensive ex-
periments on simulated and real data demonstrate that the
proposed model can effectively remove the stripe noise from references
the input image without employing any postprocessing step
while maintaining structural and textural details of the output
image. Integration of the element-wise feature concatenation Abdalla, A., Cen, H., Wan, L., Mehmood, K., & He, Y. (2021).
Nutrient status diagnosis of infield oilseed rape via deep
mechanism and dropout block helps to boost deep learning
learning-enabled dynamic model. IEEE Transactions on
performance and hence improve the image quality and over- Industrial Informatics, 17(6), 4379e4389. https://doi.org/10.1109/
smoothing issues. Our method outperforms several existing TII.2020.3009736
state-of-the-art destriping methods in terms of both peak ret, J.-B., Hank, T., Wocher, M., Mauser, W.,
Berger, K., Verrelst, J., Fe
signal-to-noise ratio (PSNR) and structural similarity index et al. (2020). Retrieval of aboveground crop nitrogen content
metric (SSIM) quality measures. To represent the practical with a hybrid machine learning method. International Journal of
Applied Earth Observation and Geoinformation, 92, 102174. https://
challenge of the proposed DS-CNN for stripe noise removal,
doi.org/10.1016/j.jag.2020.102174
we developed another deep learning model, so-called ND- ret, J.-B., Wang, Z., Wocher, M.,
Berger, K., Verrelst, J., Fe
CNN, to diagnose the nutrient status of the plants using the Strathmann, M., et al. (2020). Crop nitrogen monitoring:
stripe-free images generated by the DS-CNN model. The re- Recent progress and principal developments in the context of
sults showed that the nutrient status diagnosis was signifi- imaging spectroscopy missions. Remote Sensing of Environment,
cantly improved after stripe noise removal. Considering all 242, 111758. https://doi.org/10.1016/j.rse.2020.111758
types of noise in one study might be a broad topic and we also Cai, S., Shu, Y., Chen, G., Ooi, B. C., Wang, W., & Zhang, M. (2019).
Effective and efficient dropout for deep convolutional neural
believe that there is no universal deep learning model that is
networks. ArXiv, 1e12. Retrieved from http://arxiv.org/abs/
suitable for all types of noises. According to our experience,
1904.03392.
training a single deep learning model to perform multiple Cen, H., Wan, L., Zhu, J., Li, Y., Li, X., Zhu, Y., et al. (2019). Dynamic
tasks without defining a specific objective function (such as monitoring of biomass of rice under different nitrogen
ones used to solve cross-domain problems) would signifi- treatments using a lightweight UAV with dual image-frame
cantly reduce the accuracy. In the future, the proposed model snapshot cameras. Plant Methods, 15(1), 32.
can be tuned and extended to be universally suitable for many Chen, L.-C., Papandreou, G., Kokkinos, I., Murphy, K., &
Yuille, A. L. (2018). DeepLab: Semantic image segmentation
types of noises.
with deep convolutional nets, atrous convolution, and fully
connected CRFs. IEEE Transactions on Pattern Analysis and
Machine Intelligence, 40(4), 834e848. https://doi.org/10.1109/
Declaration of competing interest TPAMI.2017.2699184
Chen, J., Shao, Y., Guo, H., Wang, W., & Zhu, B. (2003). Destriping
The authors declare that they have no known competing CMODIS data by power filtering. IEEE Transactions on Geoscience
and Remote Sensing, 41(9), 2119e2124. https://doi.org/10.1109/
financial interests or personal relationships that could have
TGRS.2003.817206
appeared to influence the work reported in this paper.
176 b i o s y s t e m s e n g i n e e r i n g 2 1 9 ( 2 0 2 2 ) 1 6 5 e1 7 6

Dao, P. D., Mantripragada, K., He, Y., & Qureshi, F. Z. (2021). Mao, X.-J., Shen, C., & Yang, Y.-B. (2016). Image restoration using
Improving hyperspectral image segmentation by applying convolutional auto-encoders with symmetric skip
inverse noise weighting and outlier removal for optimal connections. Retrieved from http://arxiv.org/abs/1606.08921.
scale selection. ISPRS Journal of Photogrammetry and Remote Mishra, P., Asaari, M. S. M., Herrero-Langreo, A., Lohumi, S.,
Sensing, 171, 348e366. https://doi.org/10.1016/j.isprsjprs. Diezma, B., & Scheunders, P. (2017). Close range hyperspectral
2020.11.013 imaging of plants: A review. Biosystems Engineering, 164, 49e67.
Deshpande, A. M., Patale, S. R., & Roy, S. (2021). Removal of line https://doi.org/10.1016/j.biosystemseng.2017.09.009
striping and shot noise from remote sensing imagery using a Pan, J.-J., & Chang, C.-I. (1992). Destriping of Landsat MSS images
deep neural network with post-processing for improved by filtering techniques. Photogrammetric Engineering & Remote
restoration quality. International Journal of Remote Sensing, Sensing, 58(10), 1417e1423.
42(19), 7357e7380. https://doi.org/10.1080/ Sabzi, S., Pourdarbani, R., Rohban, M. H., Garcı́a-Mateos, G., &
01431161.2021.1957512 Arribas, J. I. (2021). Estimation of nitrogen content in cucumber
Guan, J., Lai, R., & Xiong, A. (2019). Wavelet deep neural network plant (Cucumis sativus L.) leaves using hyperspectral imaging
for stripe noise removal. IEEE Access, 7, 44544e44554. https:// data with neural network and partial least squares
doi.org/10.1109/ACCESS.2019.2908720 regressions. Chemometrics and Intelligent Laboratory Systems,
Hore, A., & Ziou, D. (2010). Image quality metrics: PSNR vs. SSIM. 217, 104404. https://doi.org/10.1016/j.chemolab.2021.104404
In In 2010 20th international Conference on pattern Sellami, A., & Tabbone, S. (2022). Deep neural networks-based
recognition (pp. 2366e2369). IEEE. https://doi.org/10.1109/ relevant latent representation learning for hyperspectral
ICPR.2010.579. image classification. Pattern Recognition, 121, 108224. https://
Jung, J. G., Song, K. E., Hong, S. H., & Shim, S. I. (2021). doi.org/10.1016/j.patcog.2021.108224
Hyperspectral characteristics of an individual leaf of wheat Shang, W., Sohn, K., Almeida, D., & Lee, H. (2016). Understanding
grown under nitrogen gradient. Plants, 10(11), 2291. https:// and improving convolutional neural networks via
doi.org/10.3390/plants10112291 concatenated rectified linear units. 33rd International Conference
Kuang, X., Sui, X., Chen, Q., & Gu, G. (2017). Single infrared image on Machine Learning, ICML, 5, 3276e3284, 2016.
stripe noise removal using deep convolutional networks. IEEE Shan, W., Liu, P., Mu, L., Cao, C., & He, G. (2019). Hyperspectral
Photonics Journal, 9(4), 76924e76934. https://doi.org/10.1109/ image denoising with dual deep CNN. IEEE Access, 7,
JPHOT.2017.2717948 171297e171312. https://doi.org/10.1109/ACCESS.2019.2955810
LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. Sidorov, O., & Hardeberg, J. Y. (2019). Deep hyperspectral prior:
Nature, 521(7553), 436e444. https://doi.org/10.1038/ Single-image denoising, inpainting, super-resolution. In 2019
nature14539 IEEE/CVF international conference on computer vision workshop
Lempitsky, V., Vedaldi, A., & Ulyanov, D. (2018). Deep image prior. (ICCVW) (pp. 3844e3851). IEEE. https://doi.org/10.1109/
In 2018 IEEE/CVF conference on computer vision and pattern ICCVW.2019.00477.
recognition (pp. 9446e9454). IEEE. https://doi.org/10.1109/ Sun, D., Xu, H., Weng, H., Zhou, W., Liang, Y., Dong, X., et al.
CVPR.2018.00984. (2020). Optimal temporal-spatial fluorescence techniques for
Liang, L., Di, L., Huang, T., Wang, J., Lin, L., Wang, L., et al. (2018). phenotyping nitrogen status in oilseed rape (Brassica napus
Estimation of leaf nitrogen content in wheat using new L.). Journal of Experimental Botany. https://doi.org/10.1093/jxb/
hyperspectral indices and a random forest regression eraa372
algorithm. Remote Sensing, 10(12), 1940. https://doi.org/10.3390/ Torres, J., & Infante, S. O. (2001). Wavelet analysis for the
rs10121940 elimination of striping noise in satellite images. Optical
Li, F., Li, D., Elsayed, S., Hu, Y., & Schmidhalter, U. (2021). Using Engineering, 40(7), 1309e1314. https://doi.org/10.1117/1.1383996
optimized three-band spectral indices to assess canopy N Tros, M., Mascoli, V., Shen, G., Ho, M. Y., Bersanini, L., Gisriel, C. J.,
uptake in corn and wheat. European Journal of Agronomy, 127, et al. (2021). Breaking the red limit: Efficient trapping of long-
126286. https://doi.org/10.1016/j.eja.2021.126286 wavelength excitations in chlorophyll-f-containing
Liu, R., Cai, W., Li, G., Ning, X., & Jiang, Y. (2022). Hybrid dilated photosystem I. Inside Cosmetics, 7(1), 155e173. https://doi.org/
convolution guided feature filtering and enhancement 10.1016/J.CHEMPR.2020.10.024
strategy for hyperspectral image classification. IEEE Geoscience Van der Maaten, L., & Hinton, G. (2008). Visualizing Data using t-
and Remote Sensing Letters, 19. https://doi.org/10.1109/ SNE. Journal of Machine Learning Research, 9(86), 2579e2605.
LGRS.2021.3100407 Wang, L., Chen, S., Li, D., Wang, C., Jiang, H., Zheng, Q., et al.
Liu, N., Li, W., Tao, R., Fowler, J. E., & Yang, L. (2019). (2021). Estimation of paddy rice nitrogen content and
Hyperspectral stripes removal with wavelet-domain low- accumulation both at leaf and plant levels from UAV
rank/group-sparse decomposition. In 2019 10th Workshop on hyperspectral imagery. Remote Sensing 2021, 13(15), 2956.
hyperspectral Imaging and signal processing: Evolution in remote https://doi.org/10.3390/RS13152956
sensing (WHISPERS) Vol. 2019eSepte (pp. 1e4). IEEE. https:// Wheeler, K. I., Levia, D. F., & Vargas, R. (2020). Visible and near-
doi.org/10.1109/WHISPERS.2019.8921401. infrared hyperspectral indices explain more variation in
Liu, X., Tanaka, M., & Okutomi, M. (2013). Single-image noise level lower-crown leaf nitrogen concentrations in autumn than in
estimation for blind denoising. IEEE Transactions on Image summer. Oecologia, 192(1), 13e27. https://doi.org/10.1007/
Processing, 22(12), 5226e5237. https://doi.org/10.1109/ s00442-019-04554-2
TIP.2013.2283400

You might also like