Machine Learning for Cotton LAI Retrieval
Machine Learning for Cotton LAI Retrieval
sciences
Article
Comparison of Machine Learning Regression
Algorithms for Cotton Leaf Area Index Retrieval
Using Sentinel-2 Spectral Bands
Huihui Mao 1,2 , Jihua Meng 1, *, Fujiang Ji 1,2 , Qiankun Zhang 1,3 and Huiting Fang 1,2
1 Key Laboratory of Digital Earth Sciences, Institute of Remote Sensing and Digital Earth, Chinese Academy
of Sciences, Beijing 100101, China; maohh@[Link] (H.M.); jifj@[Link] (F.J.);
zhang_q_k@[Link] (Q.Z.); fanght@[Link] (H.F.)
2 University of Chinese Academy of Sciences, Beijing 100049, China
3 College of Geomatics and Geoinformation, Guilin University of Technology, Guilin 541004, China
* Correspondence: mengjh@[Link]; Tel.: +86-010-6486-9473
Received: 2 March 2019; Accepted: 2 April 2019; Published: 7 April 2019
Abstract: Leaf area index (LAI) is a crucial crop biophysical parameter that has been widely used in
a variety of fields. Five state-of-the-art machine learning regression algorithms (MLRAs), namely,
artificial neural network (ANN), support vector regression (SVR), Gaussian process regression (GPR),
random forest (RF) and gradient boosting regression tree (GBRT), have been used in the retrieval of
cotton LAI with Sentinel-2 spectral bands. The performances of the five machine learning models are
compared for better applications of MLRAs in remote sensing, since challenging problems remain in
the selection of MLRAs for crop LAI retrieval, as well as the decision as to the optimal number for
the training sample size and spectral bands to different MLRAs. A comprehensive evaluation was
employed with respect to model accuracy, computational efficiency, sensitivity to training sample
size and sensitivity to spectral bands. We conducted the comparison of five MLRAs in an agricultural
area of Northwest China over three cotton seasons with the corresponding field campaigns for
modeling and validation. Results show that the GBRT model outperforms the other models with
respect to model accuracy in average (R2 = 0.854, RMSE = 0.674 and MAE = 0.456). SVR achieves
the best performance in computational efficiency, which means it is fast to train, and to validate that
it has great potentials to deliver near-real-time operational products for crop management. As for
sensitivity to training sample size, GBRT behaves as the most robust model, and provides the best
model accuracy on the average among the variations of training sample size, compared with other
models (R2 = 0.884, RMSE = 0.615 and MAE = 0.452). Spectral bands sensitivity analysis with dCor
(distance correlation), combined with the backward elimination approach, indicates that SVR, GPR
and RF provide relatively robust performance to the spectral bands, while ANN outperforms the
other models in terms of model accuracy on the average among the reduction of spectral bands (R2
= 0.881, RMSE = 0.625 and MAE = 0.480). A comprehensive evaluation indicates that GBRT is an
appealing alternative for cotton LAI retrieval, except for its computational efficiency. Despite the
different performance of the ML models, all models exhibited considerable potential for cotton LAI
retrieval, which could offer accurate crop parameters information timely and accurately for crop
fields management and agricultural production decisions.
Keywords: leaf area index (LAI); machine learning; Sentinel-2; sensitivity analysis; training sample
size; spectral bands
1. Introduction
Leaf area index (LAI), which characterizes the structure and functioning of vegetation, is usually
defined as half of the total green leaf area per unit horizontal ground surface area [1,2]. LAI is one
of the most important vegetation biophysical parameters, and a key variable for climate modeling,
evapotranspiration modeling and crop modeling, and it is recognized as an Essential Climate Variable
(ECV) by the Global Climate Observing System [3–8]. LAI has a wide range of applications regarding
agricultural fields, and it has been demonstrated to be an essential indicator for crop growth monitoring
and key variables for crop yield forecasting [9–11]. Therefore, it is of special relevance to retrieve LAI
in a timely and accurate manner.
Remote sensing techniques provide promising alternatives to obtaining crop biophysical
parameters by high temporally and spatially continuous means over large areas. To date, there
are mainly three categories of methods developed to retrieve LAI based on optical remote sensing
data, which are statistical methods, physically based methods, and hybrid methods [12–14]. Statistical
methods can be further divided into parametric and non-parametric regression methods. Parametric
regression methods usually consist of an explicit relationship between biophysical parameters
and vegetation indices, while non-parametric regression methods define regression models learnt
from the training dataset [15]. Non-parametric algorithms can be split into linear and non-linear
regression methods; the latter is also commonly referred to as machine learning regression algorithms
(MLRAs). While physically-based methods are applications of physical laws establishing cause-effect
relationships, a hybrid method combines elements of non-parametric statistics and physically-based
methods [13], whereas these two methods are both sophisticated models that demand a large number
of parameters, which are usually difficult to obtain in practice. Empirical parametric models typically
make use of a limited number of spectral bands [13,16]. However, nonparametric models can make
full use of spectral information, and directly learn the input-output relationships from a given training
dataset, which makes these models attractive alternatives for crop LAI retrieval.
With the development of remote sensing, more and more optical remote sensing satellites have
been launched (e.g., Landsat 8, Sentinel-2, and Chinese GF-1, GF-2 and newly launched GF-6), which
ensures the availability of high spatial, high temporal resolution satellite remote sensing data, and
correspondingly, high dimensional (spatial, temporal and spectral) of remote sensing data amounts to
large data volume, which also poses great challenges for more efficient, robust and accurate algorithms
in a wide variety of applications with remote sensing.
Recently, machine learning (ML), a broad subfield of artificial intelligence, has attracted
considerable attention in remote sensing applications for classification and regression problems, and
encouraging results have been obtained [17–23]. With advances in computer technology and associated
techniques, ML has drawn tremendous interest in a variety of fields to address complex problems.
ML can be broadly defined as computational methods using experience to improve performance or to
make accurate predictions [24]. ML has been extensively applied to biophysical parameter retrievals
due to the ability to accurately approximate robust relationships between input-output data, which
provides tremendous opportunities for remote sensing-based applications. Considering ML regression
algorithms, a more efficient, robust, and accurate model for crop LAI retrieval should be established.
Despite the considerable advances in ML for remote sensing applications, challenging problems remain
in the selection of MLRAs for crop LAI retrieval among the variety of ML algorithms available, as well
as the optimal number of training sample size and spectral bands to different MLRAs.
As for the versatile ML algorithms, artificial neural network (ANN), support vector regression
(SVR), Gaussian process regression (GPR) and random forest (RF) are reportedly effective for crop LAI
retrieval [25–28]. However, gradient boosting regression tree (GBRT), a highly robust ML algorithm
for a wide range of applications, is capable of achieving high levels of accuracy for regression
problems [29–31] and to our knowledge, has not been investigated for LAI retrieval. Further studies
should be conducted to assess the performance of the GBRT model in crop LAI retrievals.
Appl. Sci. 2019, 9, 1459 3 of 25
Many studies have been dedicated to crop LAI retrieval using MLRAs. However, there are a
limited number of academic studies involving comparisons of different MLRAs for crop LAI retrievals
using remote sensing. Apparently, none of these studies have focused on multispectral remote sensing
data, and none of these studies have evaluated the different impact factor together, to conduct a
comprehensive comparison.
In addition, the validation of global LAI products are important procedures to ensure the
application of the products in a wide range of fields [32]. Regional high-resolution LAI maps can be
used as a reference LAI map to validate the global LAI products, which calls for efficient, robust and
accurate algorithms for LAI retrieval.
The objective of this study is to compare the performance of five advanced MLRAs (ANN, SVR,
GPR, RF and GBRT) for cotton LAI retrieval in a relatively comprehensive manner. We conducted
the study over the entire growth period of cotton using Sentinel-2 spectral bands and corresponding
ground data. Specifically, the following research questions are addressed:
(1) Which of the five MLRAs perform best with regard to model accuracy?
(2) Which is the fastest model during the training and validation processes?
(3) How does the number of training sample size influence the performance of the five MLRAs?
(4) How does the number of spectral bands influence the performance of the five MLRAs?
(5) Which is the best model in consideration of model accuracy, computational efficiency,
sensitivity to training sample size, and sensitivity to spectral bands together?
(6) How accurate are the global LAI products in Northwest China?
2. Materials
Central
Band Bandwidth (nm) Spatial Resolution (m) Purpose
Wavelength (nm)
B1 443 20 60 Atmospheric correction (aerosol scattering).
Sensitive to vegetation senescing, carotenoid,
Appl. Sci. 2019, 9, x FOR PEER REVIEW 4 of 25
B2 1 490 65 10 browning and soil background; atmospheric
correction (aerosol scattering).
Table 2. Sentinel-2 satellite imagery spectral band characteristics [35].
Green peak, sensitive to total chlorophyll in
B3 560 35 10
Central Spatial vegetation.
Bandwidth
B4 Band Wavelength
665 30 Resolution 10 Purpose
Maximum chlorophyll absorption.
(nm)
(nm) (m)
Position of red edge; consolidation of
B1 443 20 60 Atmospheric correction (aerosol scattering).
B5 705 15 20 atmospheric corrections/fluorescence
Sensitive to vegetation senescing, carotenoid,
baseline.
B2 1 490 65 10 browning and soil background; atmospheric
Position of red edge; atmospheric correction,
B6 740 15 20 correction (aerosol scattering).
retrieval of aerosol load.
Green peak, sensitive to total chlorophyll in
B3 560 35 10 Leaf Area Index (LAI), edge of the Near
B7 783 20 20 vegetation.
B4 665 30 10 Infrared plateau.
Maximum chlorophyll absorption.
B8 842 115 Position
10 of red edge; consolidation of atmospheric
LAI.
B5 705 15 20
corrections / NIR
fluorescence
plateau, baseline.
sensitive to total chlorophyll,
Position of red edge; atmospheric
biomass, LAI andcorrection,
protein; water vapor
B8A B6 740
865 15 20 20 20 retrieval of aerosol
absorption load. retrieval of aerosol load
reference;
Leaf Area Index (LAI), edge of the Near Infrared
and type.
B7 783 20 20
plateau.
Water vapor absorption, atmospheric
B9 B8 945
842 115 20 10 60 LAI. correction.
NIR plateau, sensitive to total chlorophyll, biomass,
Detection of thin cirrus for atmospheric
B10 B8A 1380
865 20 30 20 LAI60and protein; water vapor absorption reference;
correction.
retrieval of aerosol load and type.
B9 945 20 90 60 Water Sensitive to lignin, starch and forest above
B11 1610 20 vapor absorption, atmospheric correction.
B10 1380 30 60 ground
Detection of thin cirrus biomass; snow/ice/cloud
for atmospheric correction. separation.
Assessment
Sensitive to lignin, starch and forestof Mediterranean
above ground vegetation
B11 1610 90 20 conditions; separation.
distinction of clay soils for the
biomass; snow/ice/cloud
B12 2190 180 20
Assessment monitoring
of Mediterranean of soilconditions;
vegetation erosion; distinction
distinction of claybetween livemonitoring
soils for the biomass, dead biomass and soil,
of soil
B12 2190 180 20 e.g., for burn scars mapping.
erosion; distinction between live biomass, dead
1 The spectral bands in bold are ones used in this study.
biomass and soil, e.g. for burn scars mapping.
1 The spectral bands in bold are ones used in this study.
color band composition of R (8) G (4) B (3), with standard deviation stretch (right). The red color in the
Sentinel image represents the vegetation (mainly crops and a few trees), the grey color represents the
desert and bare land (right). The green dots represent the quadrats (including three sample points),
and the blue polygons represent agricultural field boundaries (lower left).
Appl.
Appl. [Link]. 2019,
2019, 9, x9,FOR
x FOR PEER
PEER REVIEW
REVIEW 5 of5 25
of 25
colorcolor band
band composition
composition of (8)
of R R (8) G (4)
G (4) B (3),
B (3), with
with standard
standard deviation
deviation stretch
stretch (right).
(right). TheThe
redred color
color in in
thethe Sentinel
Sentinel image
image represents
represents thethe vegetation
vegetation (mainly
(mainly crops
crops and
and a few
a few trees),
trees), thethe grey
grey color
color represents
represents
the
the2019, desert
desert and bare land (right). The green dots represent the quadrats (including
and bare land (right). The green dots represent the quadrats (including three sample three sample
Appl. Sci. 9, 1459 5 of 25
points), and the blue polygons represent agricultural field boundaries
points), and the blue polygons represent agricultural field boundaries (lower left). (lower left).
Figure 2. Field campaign cotton LAI descriptive statistics on three observation dates, shown with
Figure
Figure
boxplots. 2. Field
2. The
Fieldbox campaign
campaign
extends fromcotton
cotton
theLAILAI descriptive
descriptive
lower statistics
statistics
to upper quantile on on
values three
three
of observation
observation
the dates,
dates,
field observation shown
shown
data, with
with
with a
boxplots.
boxplots.
line at theThe The box extends
box extends
median, from
from the
the whiskers the lower
lowerfrom
extend to upper
to upper quantile
quantile
the box to show values
values of the
of the of
the range field
field observation
theobservation data, with
data, with
data. Flier points are
a line
those
a line atthe
past
at thethe median,
end
median, of thethe whiskers
whiskers,
whiskers extend
named
extend asfrom
from thethe
outliers
box box
(red to
to showshow
dots). thethe range
range of the
of the data.
data. Flier
Flier points
points areare
those past the end of the whiskers, named as outliers
those past the end of the whiskers, named as outliers (red dots). (red dots).
2.2. Sentinel-2 Data and Preprocessing
[Link]. Sentinel-2
Sentinel-2
The remote DataData
andand
sensing
Preprocessing
Preprocessing
data used in this study were Sentinel-2 imagery. Sentinel-2 is a constellation
TheThe
of satellites, remote
Sentinel-2A
remote sensing
sensing and
datadatausedused in in
Sentinel-2B, thisthis
whichstudy
study were
were
were Sentinel-2
launched
Sentinel-2 by imagery.
the
imagery. European Sentinel-2
Sentinel-2 Space is ais a constellation
Agency (ESA)
constellation
of of
on satellites,
June 2015
satellites, Sentinel-2A
and March
Sentinel-2A and and
2017, Sentinel-2B,
respectively.
Sentinel-2B, which
which Each were
were launched
satellite
launched byby
carries theathe European
MultiSpectral
European SpaceSpace Agency
Instrument
Agency (ESA)(ESA)onon
(MSI)
June
that 2015
provides and
a March
variety of2017, respectively.
spectral bands Each
covering satellite
the carries
visible, neara
June 2015 and March 2017, respectively. Each satellite carries a MultiSpectral Instrument (MSI) that MultiSpectral
infrared and Instrument
shortwave (MSI)
infrared that
provides
bands. The a variety
MSI of
containsspectral
four bands
bands covering
at 10 m,the
sixvisible,
bands near
at 20infrared
m and
provides a variety of spectral bands covering the visible, near infrared and shortwave infrared bands. Theand
threeshortwave
bands atinfrared
60 m bands.
[36]. It The
is
of
MSIMSI
great contains
importance
contains fourfour bands
thatatthe
bands at 10
10 MSI
m, m, six bands
sixincorporates
bands at 20
at 20 m m
three
and and three
bands
three inbands
bands at 60
mm
theatred-edge
60 [36].
[36]. It isIt of
region, is of great
centered
great importance
at 705,
importance
740that
andthe783 MSInm, incorporates
and two three
Shortwave bands in the
Infrared red-edge
(SWIR) region,
bands centered
centered
that the MSI incorporates three bands in the red-edge region, centered at 705, 740 and 783 nm, and at
at 705,
1610 740
and and
2190 783 nmnm, at and
20two
mtwo
Shortwave
(S2-20
Shortwavem). Many Infrared
Infrared studies(SWIR)
(SWIR) have bands
revealed
bands centered
centeredthat at at
the 1610
red-edge
1610 and and 2190
bands
2190 nmnm at at
and 20
20SWIRmm (S2-20
bands
(S2-20 m). m).
haveMany Many studies
thestudies
potential have
to
have
revealed
improve the that the
accuracy red-edge
of LAI bands
retrievals and SWIR
[37–39], bands
which have
open the
great potential
revealed that the red-edge bands and SWIR bands have the potential to improve the accuracy of LAI to
opportunities improvefor the
crop accuracy
LAI of
retrievals LAI
retrievals
considering [37–39],
Sentinel-2’s whichhighopen great
revisit opportunities
frequency. The for crop
Sentinel-2 LAI retrievals
spectral
retrievals [37–39], which open great opportunities for crop LAI retrievals considering Sentinel-2’s high band considering
characteristics Sentinel-2’s
are shown high
by revisit
Table
revisit frequency.
2.
frequency. The
The Sentinel-2
Sentinel-2 spectral
spectral band band characteristics
characteristics areare shown
shown byby Table
Table 2. 2.
The acquired Sentinel-2 imagery products are Level-1C products, which areare
TheThe acquired
acquired Sentinel-2
Sentinel-2 imagery
imagery products
products areare Level-1C
Level-1C products,
products, which
which are toptop
top of of atmosphere
of atmosphere
atmosphere
(TOA)
(TOA) reflectances
(TOA) reflectances
reflectances[40]. [40]. Atmospheric
Atmospheric
[40]. Atmospheric correctioncorrection
correctionwas was was
conducted conducted
conductedusing using using
the Sen2Cor the
the Sen2Cor Sen2Cor
(2.5.5, ESA, (2.5.5,
(2.5.5, ESA,
Frascati,
ESA,
Frascati,
Italy,
Frascati, Italy,
2018)Italy,
and 2018)
Sentinel
2018) andand Sentinel
Application
Sentinel Application
Platform
Application (SNAP) Platform
toolbox
Platform (SNAP) toolbox
(6.0.1,toolbox
(SNAP) ESA, (6.0.1,
Frascati,
(6.0.1, ESA,
Italy,
ESA, 2018)Frascati,
provided
Frascati, Italy,
Italy,
by 2018) provided by the ESA to produce Level-2A bottom
2018) provided by the ESA to produce Level-2A bottom of atmosphere (BOA) products [41,42]. of
the ESA to produce Level-2A bottom of atmosphere of
(BOA) atmosphere
products (BOA)
[41,42]. products
A flowchart[41,42].AA
flowchart
Sentinel-2
flowchart of Sentinel-2
preprocessing
of Sentinel-2 ispreprocessing
presented in
preprocessing is presented
is Figure
presented 3. in in Figure
Figure 3. 3.
Figure
Figure
Figure 3. Flowchart
Flowchart
3. Flowchart
3. of of the
of the
the preprocessing
preprocessing
preprocessing of of Sentinel-2
of Sentinel-2
Sentinel-2 imagery.
imagery.
imagery.
In this study, aiming at making full use of the spectral bands of Sentinel-2 (especially the red-edge
and SWIR bands), we focus on S2-20 m (the same size as our quadrats) with 10 spectral bands
Appl. Sci. 2019, 9, 1459 6 of 25
considering the red-edge and SWIR spectral bands of Sentinel-2 imagery. A resampling process was
performed using the Nearest Neighbor method with 4 spectral bands (B2, B3, B4 and B8) from 10 m
to 20 m in the SNAP toolbox. Three of the atmospheric spectral bands at 60 m were not used in
this study because these bands contributed to atmospheric applications, such as aerosol correction
(B1), water vapor correction (B9) and cirrus detection (B10) [42]. The reflectance data collected on
the Sentinel-2 images were using Extract by Points on the ArcToolbox of ArcGIS Desktop software
(10.5, Environmental Systems Research Institute, Redlands, CA, USA, 2016), the points used to extract
data on the Sentinel-2 images are the center GPS coordinate of the three sample points that represent
one quadrat.
3. Methods
ML algorithms can automatically learn the relationships in any given data between input
(reflectances) and output (LAI). To identify the performance of five popular ML algorithms for
cotton LAI retrieval, regression models were established based on artificial neural network (ANN),
SVR, Gaussian process regression (GPR), random forest (RF) and gradient boosting regression tree
(GBRT), at S2-20 m for the whole growth period of cotton (using all 117 quadrats data over the three
observation dates).
All the ML models were implemented using the Scikit-learn package [49], which is an open-source
Python [50] module project that integrates a wide range of prevalent ML algorithms [51]. All
hyperparameter tuning of the models is based on GridSearchCV in the Scikit-learn package, which
can evaluate all possible combinations of hyperparameter values using five-fold cross-validation to
determine the best combination of hyperparameter values (the hyperparameter combination that
has the best accuracy of the model in terms of root-mean-square error (RMSE)). Cross-validation are
model validation techniques to obtain reliable and stable models. This study implemented a five-fold
cross-validation, basically, the training datasets are split into five smaller sets, and a model is trained
using four of the folds as training data, then the resulting model is validated on the remaining part of
the data, the processes continues to circulate five times, and finally, the performance measure estimated
by five-fold cross-validation is the average of the values computed in the loop. Training and testing
sampling distribution has a great impact on machine learning regression algorithms (MLRAs), and
some previous studies demonstrated that 70%/30% split option is appropriate for model training and
validation [52–54], nonetheless, other studies argue that the 80%/20% split option is preferable [27,55].
In our study, to validate the performance of the ML models, all the datasets were randomly split into
Appl. Sci. 2019, 9, 1459 7 of 25
75% (n = 87) for model training, and 25% (n = 30) for model validation. Regression models that achieve
satisfactory performance in training datasets may fail to predict unseen datasets, and therefore, a
model that performs well at both training and unseen testing datasets is referred to as having excellent
generalization ability. This type of model could be used for crop LAI retrieval applications.
where Nh is the number of neurons in the hidden layer, m specifies the number of layers, and N
denotes the number of input neurons. The number of neurons in the hidden layer was set to ten in
this study according to Equation (1). In the input layer, the input variables include 10 spectral bands.
Other important hyperparameters were optimized using GridSearch and 5-fold cross-validation. The
remainder of hyperparameters for the ANN model remain defaults. The structure of the neural
network used in this study is presented in Figure 4. The hyperparameter values adopted in this study
Appl. Sci. 2019, 9, x FOR PEER REVIEW 8 of 25
are listed in Table 4.
Figure 4. The architecture of backpropagation (BP) neural networks used in this study for cotton
Figure 4. The architecture of backpropagation (BP) neural networks used in this study for cotton LAI
LAI retrieval.
retrieval.
Table 4. Parameter settings to determine the optimal hyperparameters for the artificial neural network
(ANN) model.
where C > 0 is a regularization parameter that gives more weight to minimizing the flatness or error.
The principal hyperparameter of SVR is the kernel, as it defines the kernel functions of the model.
The radial basis function (RBF) was selected as the kernel function because it has been found to be
efficient and accurate for regression problems [68,69]. The RBF kernel is described as follows:
k xi − x j k 2
!
k xi , x j = exp − (4)
2σ2
The SVR model is easy to establish because only two hyperparameters must be tuned: Penalty
parameter C and the kernel coefficient gamma. The optimal values of the two hyperparameters that
were optimized using GridSearch and 5-fold cross-validation are presented in Table 5. The remainder
of hyperparameters for the SVR model remain defaults.
Table 5. Parameter settings to determine the optimal hyperparameters for the support vector regression
(SVR) model.
k x, x0 = E ( f (x) − m(x)) f x0 − m x0
(6)
f (x) ∼ GP m(x), k x, x0
(7)
where θ0 is the scaling factor, σ is the length-scale, and θ1 corresponds to the independent
noise component.
Beyond the kernel functions, alpha and n_restarts_optimizer, are significant for the GPR model.
The results of hyperparameter tuning using GridSearch and 5-fold cross-validation are shown in
Table 6 as follows. The remainder of hyperparameters for the GPR model are set as defaults.
Table 6. Parameter settings to determine the optimal hyperparameters for the Gaussian process
regression (GPR) model.
as bagging. RF models can provide feature importance estimates, which enables insight into feature
selection processes.
Key hyperparameters include n_estimators, max_depth, min_samples_split and
min_samples_leaf. Hyperparameter tuning results using GridSearch and 5-fold cross-validation are
displayed in Table 7. The remainder of the RF model hyperparameters are set as defaults.
Table 7. Parameter settings to determine the optimal hyperparameters for the random forest
(RF) model.
M
F(x) = ∑ γm h m ( x ) (9)
m =1
where hm ( x ) represents the basis learners in boosting. Then, GBRT builds the additive model in a
forward stagewise fashion:
Fm ( x ) = Fm−1 ( x ) + γm hm ( x ) (10)
At each stage, the decision tree hm ( x ) is chosen to minimize the loss function L, given the current
model Fm−1 and its fitting of Fm−1 ( xi ):
n
Fm ( x ) = Fm−1 ( x ) + arg min ∑ L(yi , Fm−1 ( xi ) + h( x )) (11)
h i =1
The initial model F0 is problem specific. Gradient boosting attempts to solve this minimization
problem numerically via the steepest descent. The steepest descent direction is the negative gradient
of the loss function evaluated at the current model Fm−1 , which can be calculated for any differentiable
loss function as follows:
n
Fm ( x ) = Fm−1 ( x ) − γm ∑ ∇F L(yi , Fm−1 (xi )) (12)
i =1
Appl. Sci. 2019, 9, 1459 11 of 25
The GBRT algorithm can provide feature relevance information and partial dependence, where
this partial dependence shows the dependence among the target response and the most important
features (not shown in this study). However, despite the GBRT’s demonstrated satisfactory accuracy
for versatile domains of regression problems [29,75,76], to our knowledge, this algorithm has not been
previously applied to crop LAI retrievals with remote sensing. GBRT should be studied as it might be
a promising alternative in crop LAI retrieval with remote sensing.
With regard to GBRT model hyperparameters, loss, n_estimators, max_depth, min_samples_split
and min_samples_leaf are selected for hyperparameter tuning using GridSearch and 5-fold
cross-validation, the results are exhibited in Table 8. The remainder of the GBRT model
hyperparameters are set as defaults.
Table 8. Parameter settings to determine the optimal hyperparameters for the gradient boosting
regression tree (GBRT) model.
nsamples −1
1
MAE(y, ŷ) =
nsamples ∑ |yi − ŷi | (15)
i =0
nsamples −1
∑ i =0 (yi − ŷi )2
R2 (y, ŷ) = 1 − nsamples −1
(16)
∑ i =0 ( y i − y )2
1
𝑀𝐴𝐸(𝑦, 𝑦) = |𝑦 − 𝑦 | (15)
𝑛
∑ (𝑦 − 𝑦 )
Appl. Sci. 2019, 9, 1459 𝑅 (𝑦, 𝑦) = 1 − 12(16)
of 25
∑ (𝑦 − 𝑦)
where 𝑦 = ∑nsamples −1 𝑦 , 𝑦 is the estimated cotton LAI value, 𝑦 is the measured cotton
where y = n 1 ∑i=0 yi , ŷi is the estimated cotton LAI value, yi is the measured cotton LAI
𝑛
LAI value andsamples is the number of validation datasets. The higher the 𝑅 , the smaller the RMSE
value and nsamples is the number of validation datasets. The higher the R2 , the smaller the RMSE and
and MAE, and thus the higher the model precision and accuracy.
MAE, and thus the higher the model precision and accuracy.
4.
4. Results
Results
4.1.
4.1. Performance
Performance Evaluation
Evaluation
To avoidskew
To avoid skewresults
results caused
caused by random
by the the random
samplingsampling of training
of training anddatasets,
and testing testing wedatasets,
performedwe
performed 20 random repetitions of the five ML regression models (87 samples for training
20 random repetitions of the five ML regression models (87 samples for training and 30 for testing). The and 30
for testing). The
performance performance
of five ML models of five
withML models
respect to Rwith
2 respect
, RMSE 𝑅 , RMSE
andtoMAE and MAE
is displayed is displayed
in Figure 5. GBRT in
Figure 5. GBRT surpasses the other models 2 on the average (𝑅 = 0.854, 𝑅𝑀𝑆𝐸
surpasses the other models on the average (R = 0.854, RMSE = 0.674 and MAE = 0.456), however, GBRT= 0.674 and 𝑀𝐴𝐸 =
0.456), however, GBRT acts less robust to the training/testing random split 2according
acts less robust to the training/testing random split according to the distribution of R , RMSE and MAE. to the
distribution of 𝑅
RF delivers the , RMSE
worst and MAE.
accuracy on theRF delivers
average (R2the worst RMSE
= 0.807, accuracy on the
= 0.781 average
and MAE (𝑅 and𝑅𝑀𝑆𝐸
= 0.807,
= 0.545), it also
=acts
0.781 and 𝑀𝐴𝐸 = 0.545), and it also acts less robust to the training/testing
less robust to the training/testing random split. Nonetheless, SVR achieves a desirable result whilerandom [Link]
Nonetheless, SVR robust
also acts reasonably achieves a training/testing
to the desirable resultrandom while split
it also
(R =acts
2 0.835,reasonably
RMSE = 0.730 robust to the
and MAE =
training/testing random split (𝑅 = 0.835, 𝑅𝑀𝑆𝐸 = 0.730 and 𝑀𝐴𝐸 = 0.550), which indicates
0.550), which indicates that SVR is highly stable to the random sampling processes. Overall, all models that SVR
is highlysatisfactory
achieved stable to performances,
the random which sampling processes.
indicates that MLOverall, all are
algorithms models achieved
appealing methodssatisfactory
for cotton
performances,
LAI [Link] indicates that ML algorithms are appealing methods for cotton LAI estimation.
4.2. Computational
4.2. Computational Efficiency
Efficiency
Beyond the
Beyond the model
model accuracy
accuracy of
of the
the five
five models
models for
for the
the training
training and
and testing
testing random
random split,
split, it
it is
is of
of
particular relevance to compare the computational efficiency (time required to
particular relevance to compare the computational efficiency (time required to the model during the model during
training and
training and validation
validation processes)
processes) of of the
the models,
models, asas it
it is
is an
an important criterion for
important criterion for operational
operational
algorithms. All models were implemented in a Python environment on an Intel(R) Xeon(R) CPUCPU
algorithms. All models were implemented in a Python environment on an Intel(R) Xeon(R) E5-
E5-2620
2620 v22.10
v2 @ @ 2.10
GHz GHz processor
processor andand installed
installed memory
memory (RAM)ofof32.0
(RAM) 32.0GB.
[Link]
Thecomputational
computationalefficiency
efficiency
is recorded
is recorded from
from the
the 20
20 repetitions
repetitions in
in Section
Section 4.1.
4.1. The
The averaged
averaged results
results of
of 20
20 repetitions
repetitions are
are illustrated
illustrated
in Figure 6. Large differences among the five ML models are clearly found. SVR performs incredibly
fast (less than 1 s). However, GBRT is frustrating, owing to the large amounts of its hyperparameter
tuning processes. In general, SVR is able to deliver near real time operational products with a highly
efficient processing speed, whereas the GBRT model is not recommended for this kind of application,
because the GBRT model is computationally more demanding. RF, GPR and ANN showed moderate
performances in terms of computational efficiency.
in Figure 6. Large differences among the five ML models are clearly found. SVR performs incredibly
fast (less than 1 s). However, GBRT is frustrating, owing to the large amounts of its hyperparameter
tuning processes. In general, SVR is able to deliver near real time operational products with a highly
efficient processing speed, whereas the GBRT model is not recommended for this kind of application,
Appl. Sci. 2019,
because the 9, 1459 model is computationally more demanding. RF, GPR and ANN showed moderate
GBRT 13 of 25
performances in terms of computational efficiency.
model accuracy improves overall with an increasing training sample size, while GBRT provides the
most robust performance for the training samples, and the best model accuracy among the variations
in training
Appl. 9, 1459 size on average (𝑅 = 0.884, 𝑅𝑀𝑆𝐸 = 0.615 and 𝑀𝐴𝐸 = 0.452, calculated by14all
Sci. 2019, sample of the
25
8 groups of training sample size).
Figure 8. Distance correlation among the Sentinel-2 spectral bands and corresponding cotton LAI.
Figure 8. Distance correlation among the Sentinel-2 spectral bands and corresponding cotton LAI.
Figure 8. Distance correlation among the Sentinel-2 spectral bands and corresponding cotton LAI.
Figure 10. Comprehensive comparison of five machine learning regression algorithms (MLRAs), with
Figure 10. Comprehensive comparison of five machine learning regression algorithms (MLRAs), with
different metrics using a radar chart.
different metrics using a radar chart.
Table 10. Metrics used in a radar chart and the corresponding descriptions.
Table 10. Metrics used in a radar chart and the corresponding descriptions.
Metrics in the Radar Chart Corresponding Description
Metrics in the Radar Chart Corresponding Description
M1 Fitness (R2 )
M1 Fitness (𝑹𝟐 )
M2 Accuracy (RMSE)
M3M2 Accuracy (RMSE)
Accuracy (MAE)
M4 M3 Accuracy
Computational (MAE)efficiency
M5M4 Computational
Sensitivity sample size (R2 )
efficiency
to training
𝟐
M6M5 Sensitivity
Sensitivitytoto
training
trainingsample
sample size
size )
(𝑹(RMSE)
M7M6 Sensitivity
Sensitivity to training
to training sample
sample size size
(RMSE)(MAE)
M8M7 Sensitivity
Sensitivity to spectral
to training sample bands (R2 )
size (MAE)
M9M8 Sensitivity 𝟐
Sensitivity toto spectral
spectral bands
bands (𝑹(RMSE)
)
M10M9 Sensitivity to spectral bands
Sensitivity to spectral bands (RMSE) (MAE)
M10 Sensitivity to spectral bands (MAE)
4.6. Final LAI Maps
4.6. Final LAI Maps
The best performing ML regression model has been applied to map cotton LAI with Sentinel-2
The as
imagery, best performing
analyzed ML regression
in previous sections, model
we use has
the been
GBRTapplied
model to tomap
mapcotton
cottonLAI
LAIinwith Sentinel-2
the study area.
imagery,
The fieldsasin analyzed
the studyin previous
area sections,
are regular, we use theaGBRT
we conducted manual model to map cotton
vectorization based LAI in the study
on Google Earth
area. The
image datafields in theaccurate
to obtain study area are regular,
agricultural weboundaries,
fields conducted anda manual vectorization
Landsat 8 image databased
wereoncollected
Google
Earth
to image
update thedata to as
fields, obtain
Googleaccurate agricultural
Earth image fields
data has poorboundaries, and LandsatTotally,
temporal information. 8 image data fields
40,211 were
collected to update the fields, as Google Earth image data has poor temporal information.
were collected by this way in the study area. Figure 11 displayed the agricultural fields boundaries. Totally,
40,211
A fieldscrop
per-field were collected bywas
classification thisperformed
way in the to
study area.
extract Figure
cotton 11 displayed
fields. The finalthe agricultural
cotton LAI mapfields
was
boundaries.
obtained A per-field
by masking thecrop classification
results was performed
using the extracted to extract
cotton fields. cotton
Figure fields. The
12 presents the final cotton
Appl. Sci. 2019, 9, x FOR PEER REVIEW 17 of 25
Appl. Sci. 2019, 9, x FOR PEER REVIEW 17 of 25
Appl. Sci. 2019, 9, 1459 17 of 25
LAI map was obtained by masking the results using the extracted cotton fields. Figure 12 presents
LAI map was obtained by masking the results using the extracted cotton fields. Figure 12 presents
the final cotton LAI map. From Figure 12, it is clear to find that there is an increasing trend of cotton
the final
LAI [Link]
From LAI map.
Figure 12,From Figure 12, it is clear to find that there is an increasing trend of cotton
LAI growth, as revealed byitFigure
is clear
[Link] find that there is an increasing trend of cotton LAI growth, as
LAI growth,
revealed as revealed
by Figure 2. by Figure 2.
Figure Scatter
13.13.
Figure plots
Scatter plotsamong
among GEOV3, MODISLAI
GEOV3, MODIS LAI products
products andand upscaled
upscaled Sentinel-2
Sentinel-2 LAI inLAI
the in the 2018
2018
cotton season.
cotton season.
Appl. Sci. 2019, 9, 1459 19 of 25
5. Discussion
Over the last decade, there has been a considerable increase in the introduction of ML algorithms
to remote sensing for a wide range of fields. Crop biophysical parameters (e.g., LAI) are key variables
for a wide range of applications. With the progress of remote sensing techniques, we are able to acquire
high dimensional (spatial, temporal, and spectral) remote sensing data, which demands more efficient,
accurate and robust algorithms in a wide variety of applications with remote sensing. In this context,
there is great potential for ML algorithms to be used in a wide range of remote sensing applications.
The diversity of available ML algorithms poses great challenges for the selection of MLRAs, as
well as the decision of optimal number of training sample size and spectral bands to different MLRAs.
Besides, another significant problem that may arise involves hyperparameter tuning in the application
of ML algorithms. In general, experience is required to obtain satisfactory results for ML algorithms,
and experiments with a large number of datasets may be needed, which are generally not available.
Accordingly, it is necessary to perform a comparison of different fashionable ML models to support
better remote sensing applications. In this study, we focus on the comparison of five well-known
ML algorithms for cotton LAI retrievals with Sentinel-2 imagery, because these algorithms have a
wide range of applications in remote sensing. Additionally, the hyperparameter tuning results of five
prominent ML models are provided. Our study could provide support for associated remote sensing
studies based on ML algorithms.
Furthermore, a comparatively great fluctuation is observed at 7 among all the models except
for GPR in Figure 9, the previous removing spectral band is the red-edge band (B7), which indicates
that having low dCor values do not necessarily correspond to being less important for LAI retrieval.
Notably, ensemble methods (RF and GBRT) and GPR models have a great benefit of delivering
feature importance information (not shown in our study), which provides insight into the greatest
contributing features of the model. Such information could be used for better model interpretation,
and this information is also useful for feature selection processes when applied to models that contain
a great number of features.
In related studies, Verrelst, et al. [55] compared four ML algorithms (NN, SVR, KRR and GPR)
using simulated Sentinel-2 and 3 data to assess three biophysical parameters (Chlorophyll content, LAI
and FVC). GPR outperformed the other regression methods for the majority of Sentinel configurations,
whereas in our study, GPR performed worse than the other regression methods. Results that differ from
our study may be for the following reasons. First, we used real Sentinel-2 data rather than simulation
data, and it may be difficult to represent the performance of real Sentinel-2 data with simulation data.
Second, we focus on cotton crops over the entire growth period rather than on various crop types. In
addition, there may be different hyperparameter settings between models. Finally, there are differences
between the Sentinel-2 bands, and we used 10 spectral bands (SWIR included). Yuan, et al. [83]
compared the RF, ANN and SVM regression models for soybean LAI retrieval using unmanned aerial
vehicle (UAV) hyperspectral data with different sampling methods. The results showed that RF is
suitable for the whole growth period of soybean LAI estimation, while ANN is appropriate for a
single growth period. Siegmann and Jarmer [84] compared SVR and RFR for wheat LAI estimation
using hyperspectral data, and the results showed that SVR provided the best performance of the entire
dataset. Different from previous studies, we considered the GBRT model due to its highly robust
performance over a wide range of applications. We further explored the sensitivity of different ML
algorithms to training sample size and Sentinel-2 spectral bands. The results show that GBRT achieves
the best performance, and GBRT was more robust for the training samples than the other models.
Despite the promising results revealed by five ML regression methods, there are some limitations
of our study, and further studies are needed to assess the generalization ability of the five ML regression
methods. Firstly, we focus on only one crop type, and the performance of ML regression methods for
other crop types has not been validated. It is necessary to evaluate ML algorithms with various crop
types, provided there are enough field measurement LAI data of different crop types available.
Appl. Sci. 2019, 9, 1459 20 of 25
Secondly, some other crop biophysical parameters (e.g., Chlorophyll content and FVC) demand
evaluation as well as ML regression methods, as different results may appear with different ML
algorithms and different crop biophysical parameters. Thirdly, there are limitations with Sentinel-2
spectral bands sensitivity analysis, since some spectral bands are highly correlated, and redundancies
remain among the spectral bands, so this process would be worth further investigations to explore
a better spectral bands sensitivity analysis. Additionally, there are limitations to our quadrats in the
field campaigns, as we chose only three sample points to represent a quadrat (20 m × 20 m). Finally,
the performance of five MLRAs has not yet been evaluated for large amounts of data (e.g., tens of
thousands of data) due to the limited number of samples in our study, however, it remains to be seen
whether a favorable performance can be obtained. In consideration of the ability of ML techniques, it is
expected to obtain a better accuracy of the regression models provided there are enough samples, and
a turning point should be obtained, as has been demonstrated in previous studies [19]. While before
the turning point, the model accuracy increased rapidly with the increase in the training sample size,
whereas, after the turning ponits, the model behaves more stable, in other words, the model accuracy
remmained unchanged with the increasing of training sample szie. Future studies are required to
investigate this problem.
With the advance in ML technology, deep learning, a new subfield of ML, has been applied
successfully to many remote sensing domains, especially with a large quantity of data [85]. It is not
necessary to construct complex ML models (deep learning-based models) in our study, as traditional
ML algorithms have the capability of establishing efficient, accurate and robust estimation models for
cotton LAI retrieval. However, as there are greater amounts of data available, deep learning techniques
may be promising alternatives for handling such large volume datasets and complex relationships
among the input datasets, which deserve further study.
6. Conclusions
With the challenges of selecting ML algorithms for crop LAI estimation, the results of this
study have great implications for the selection of appropriate ML models from the diversity of
available ML algorithms, and at the same time, these same results provide the optimal number of
training sample size and spectral bands of Sentinel-2 for each model required for cotton LAI retrieval.
Regarding the comparison of different ML models for cotton LAI retrieval employed in our study, the
GBRT model outperforms the other ML models according to our results. Our findings increase the
potential for cotton LAI retrieval with Sentinel-2 imagery, and may be transferrable to other associated
problems related to agricultural remote sensing applications. On the other hand, the GBRT model is
computationally demanding, which may be a significant problem with a large scale of data. However,
GBRT can challenge the model accuracy and computational efficiency selection problems. Considering
the computational efficiency, SVR exhibits considerable computational superiority over the other
ML models.
Regarding the sample size sensitivity of ML models, model accuracy increases with the growth of
the training sample size, and the GBRT produces the most robust performance for the training samples
with respect to the standard error and the best model accuracy on average.
In terms of spectral band sensitivity analysis, the distance correlation results showed that the
SWIR and visible bands have great potential to improve the accuracy of cotton LAI retrievals. By
using dCor combined with the backward elimination method, the model accuracy decreases with
the reduction in spectral bands. SVR, GPR and RF perform robustly with the spectral bands, and
ANN provides the best accuracy on average. The minimum number of bands required for cotton LAI
retrieval are recognized as 6 (ANN), 6 (SVR), 5 (RF), 5 (GBRT) and 8 for GPR.
A comprehensive evaluation has been employed to identify the performance of five ML models,
considering a combination of model accuracy, computational efficiency, sensitivity to training sample
size and sensitivity to spectral bands. The comprehensive performance of the models is identified as
GBRT > ANN > SVR > RF > GPR.
Appl. Sci. 2019, 9, 1459 21 of 25
Despite the different performances of the five ML regression models, MLRAs are promising
ways to retrieve cotton LAI with Sentinel-2 imagery because the MLRAs all achieved encouraging
accuracies. With profound applications for a diversity of ML algorithms in remote sensing, MLRAs
may provide positive effects for remote sensing applications in terms of classification, regression, and
other associated problems.
Author Contributions: Conceptualization, J.M. and H.M.; Methodology, H.M.; Formal Analysis, H.M.;
Investigation, H.M., F.J., Q.Z. and H.F.; Data curation, H.M.; Writing—Original Draft Preparation, H.M.;
Writing—Review and Editing, J.M.; Visualization, H.M.
Funding: This research was funded by GF6 Project under grant No. 30-Y20A03-9003-17/18 and
09-Y20A05-9001-17/18; the National Natural Science Foundation of China (41871261); and the open fund of the Key
Laboratory of Oasis Eco-agriculture, Xinjiang Production and Construction Group (201701).
Acknowledgments: We are grateful to Agricultural College of Shihezi University for providing help in field
campaigns. We would like to acknowledge the authors who contributed to the development of Scikit-learn project.
We acknowledge the Copernicus Global Land Service, and the Land Processes Distributed Active Archive Center
(LP DAAC) within the NASA Earth Observing System Data and Information System (EOSDIS) for providing
the global LAI products. We are thankful to the three anonymous reviewers for their insightful comments and
suggestions helped to clarify and improve the manuscript.
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Chen, J.M.; Black, T.A. Defining leaf-area index for non-flat leaves. Plant Cell Environ. 1992, 15, 421–429.
[CrossRef]
2. Garrigues, S.; Lacaze, R.; Baret, F.; Morisette, J.T.; Weiss, M.; Nickeson, J.E.; Fernandes, R.; Plummer, S.;
Shabanov, N.V.; Myneni, R.B.; et al. Validation and intercomparison of global Leaf Area Index products
derived from remote sensing data. J. Geophys. Res.-Biogeosci. 2008, 113. [CrossRef]
3. Asner, G.P.; Scurlock, J.M.O.; Hicke, J.A. Global synthesis of leaf area index observations: Implications for
ecological and remote sensing studies. Glob. Ecol. Biogeogr. 2003, 12, 191–205. [CrossRef]
4. Buermann, W.; Dong, J.R.; Zeng, X.B.; Myneni, R.B.; Dickinson, R.E. Evaluation of the utility of satellite-based
vegetation leaf area index data for climate simulations. J. Clim. 2001, 14, 3536–3550. [CrossRef]
5. Yuan, H.; Dai, Y.J.; Xiao, Z.Q.; Ji, D.Y.; Shangguan, W. Reprocessing the MODIS Leaf Area Index products for
land surface and climate modelling. Remote Sens. Environ. 2011, 115, 1171–1187. [CrossRef]
6. Van den Hurk, B.; Viterbo, P.; Los, S.O. Impact of leaf area index seasonality on the annual land surface
evaporation in a global circulation model. J. Geophys. Res.-Atmos. 2003, 108, 4191. [CrossRef]
7. Cheng, Z.Q.; Meng, J.H.; Wang, Y.M. Improving spring maize yield estimation at field scale by assimilating
time-series HJ-1 CCD data into the WOFOST model using a new method with fast algorithms. Remote Sens.
2016, 8, 303. [CrossRef]
8. Systematic Observation Requirements for Satellite-Based Products for Climate 2011 Update: Supplemental
Details to the Satellite-Based Component of the “Implementation Plan for the Global Observing System for
Climate in Support of the UNFCCC (2010 Update)”. Available online: [Link]
lvl=notice_display&id=12907 (accessed on 13 December 2018).
9. Dong, Y.Y.; Zhao, C.J.; Yang, G.J.; Chen, L.P.; Wang, J.H.; Feng, H.K. Integrating a very fast simulated
annealing optimization algorithm for crop leaf area index variational assimilation. Math. Comput. Model.
2013, 58, 871–879. [CrossRef]
10. Jego, G.; Pattey, E.; Liu, J.G. Using Leaf Area Index, retrieved from optical imagery, in the STICS crop model
for predicting yield and biomass of field crops. Field Crop. Res. 2012, 131, 63–74. [CrossRef]
11. Baez-Gonzalez, A.D.; Kiniry, J.R.; Maas, S.J.; Tiscareno, M.; Macias, J.; Mendoza, J.L.; Richardson, C.W.;
Salinas, J.; Manjarrez, J.R. Large-area maize yield forecasting using leaf area index based yield model.
Agron. J. 2005, 97, 418–425. [CrossRef]
12. Liang, S.L. Recent developments in estimating land surface biogeophysical variables from optical remote
sensing. Prog. Phys. Geogr. 2007, 31, 501–516. [CrossRef]
Appl. Sci. 2019, 9, 1459 22 of 25
13. Verrelst, J.; Camps-Valls, G.; Munoz-Mari, J.; Rivera, J.P.; Veroustraete, F.; Clevers, J.; Moreno, J. Optical
remote sensing and the retrieval of terrestrial vegetation bio-geophysical properties—A review. ISPRS-J.
Photogramm. Remote Sens. 2015, 108, 273–290. [CrossRef]
14. Verrelst, J.; Malenovský, Z.; Van der Tol, C.; Camps-Valls, G.; Gastellu-Etchegorry, J.-P.; Lewis, P.; North, P.;
Moreno, J. Quantifying vegetation biophysical variables from imaging spectroscopy data: A review on
retrieval methods. Surv. Geophys. 2018, 1–41. [CrossRef]
15. Campos-Taberner, M.; Garcia-Haro, F.J.; Busetto, L.; Ranghetti, L.; Martinez, B.; Gilabert, M.A.;
Camps-Valls, G.; Camacho, F.; Boschetti, M. A critical comparison of remote sensing Leaf Area Index
estimates over rice-cultivated areas: From Sentinel-2 and Landsat-7/8 to MODIS, GEOV1 and EUMETSAT
polar system. Remote Sens. 2018, 10, 763. [CrossRef]
16. Baret, F.; Buis, S. Estimating canopy characteristics from remote sensing observations: Review of methods
and associated problems. In Advances in Land Remote Sensing: System, Modeling, Inversion and Application;
Liang, S., Ed.; Springer: Dordrecht, The Netherlands, 2008; pp. 173–201. ISBN 978-1-4020-6449-4.
17. Maxwell, A.E.; Warner, T.A.; Fang, F. Implementation of machine-learning classification in remote sensing:
An applied review. Int. J. Remote Sens. 2018, 39, 2784–2817. [CrossRef]
18. Lary, D.J.; Alavi, A.H.; Gandomi, A.H.; Walker, A.L. Machine learning in geosciences and remote sensing.
Geosci. Front. 2016, 7, 3–10. [CrossRef]
19. Wang, T.T.; Xiao, Z.Q.; Liu, Z.G. Performance evaluation of machine learning methods for Leaf Area Index
retrieval from time-series MODIS reflectance data. Sensors 2017, 17, 81. [CrossRef]
20. Noi, P.T.; Kappas, M. Comparison of random forest, k-nearest neighbor, and support vector machine
classifiers for land cover classification using Sentinel-2 imagery. Sensors 2018, 18, 18. [CrossRef]
21. Chen, G.B.; Li, S.S.; Knibbs, L.D.; Hamm, N.A.S.; Cao, W.; Li, T.T.; Guo, J.P.; Ren, H.Y.; Abramson, M.J.;
Guo, Y.M. A machine learning method to estimate PM2.5 concentrations across China with remote sensing,
meteorological and land use information. Sci. Total Environ. 2018, 636, 52–60. [CrossRef]
22. Lary, D.J.; Zewdie, G.K.; Liu, X.; Wu, D.; Levetin, E.; Allee, R.J.; Malakar, N.; Walker, A.; Mussa, H.;
Mannino, A. Machine learning applications for earth observation. In Earth Observation Open Science
and Innovation; Mathieu, P.-P., Aubrecht, C., Eds.; Springer: Cham, Switzerland, 2018; pp. 165–218.
ISBN 978-3-319-65633-5.
23. Kwon, S.K.; Jung, H.S.; Baek, W.K.; Kim, D. Classification of forest vertical structure in South Korea from
aerial orthophoto and lidar data using an artificial neural network. Appl. Sci. 2017, 7, 1046. [CrossRef]
24. Mohri, M.; Talwalkar, A.; Rostamizadeh, A. Foundations of Machine Learning; Dietterich, T., Bishop, C.,
Heckerman, D., Jordan, M., Kearns, M., Eds.; MIT Press: Cambridge, MA, USA, 2012; ISBN 978-0-262-01825-8.
25. Durbha, S.S.; King, R.L.; Younan, N.H. Support vector machines regression for retrieval of leaf area index
from multiangle imaging spectroradiometer. Remote Sens. Environ. 2007, 107, 348–361. [CrossRef]
26. Karimi, S.; Sadraddini, A.A.; Nazemi, A.H.; Xu, T.R.; Fard, A.F. Generalizability of gene expression
programming and random forest methodologies in estimating cropland and grassland leaf area index.
Comput. Electron. Agric. 2018, 144, 232–240. [CrossRef]
27. Verrelst, J.; Alonso, L.; Camps-Valls, G.; Delegido, J.; Moreno, J. Retrieval of vegetation biophysical
parameters using Gaussian process techniques. IEEE Trans. Geosci. Remote Sens. 2012, 50, 1832–1843.
[CrossRef]
28. Bacour, C.; Baret, F.; Beal, D.; Weiss, M.; Pavageau, K. Neural network estimation of LAI, fAPAR, fCover
and LAIxC(ab), from top of canopy MERIS reflectance data: Principles and validation. Remote Sens. Environ.
2006, 105, 313–325. [CrossRef]
29. Li, X.; Bai, R.B. Freight Vehicle travel time prediction using gradient boosting regression tree. In Proceedings
of the 2016 15th IEEE International Conference on Machine Learning and Applications, Anaheim, CA, USA,
18–20 December 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 1010–1015.
30. Guneralp, I.; Filippi, A.M.; Randall, J. Estimation of floodplain aboveground biomass using multispectral
remote sensing and nonparametric modeling. Int. J. Appl. Earth Obs. Geoinf. 2014, 33, 119–126. [CrossRef]
31. Xiao, Z.B.; Wang, Y.; Fu, K.; Wu, F. Identifying different transportation modes from trajectory data using
tree-based ensemble classifiers. ISPRS Int. Geo-Inf. 2017, 6, 57. [CrossRef]
32. Martinez, B.; Garcia-Haro, F.J.; Camacho-de Coca, F. Derivation of high-resolution leaf area index maps
in support of validation activities: Application to the cropland Barrax site. Agric. For. Meteorol. 2009, 149,
130–145. [CrossRef]
Appl. Sci. 2019, 9, 1459 23 of 25
33. Traoré, F. Assessing the impact of China net imports on the world cotton price. Appl. Econ. Lett. 2014, 21,
1031–1035. [CrossRef]
34. Wang, H.D.; Wu, L.F.; Cheng, M.H.; Fan, J.L.; Zhang, F.C.; Zou, Y.F.; Chau, H.W.; Gao, Z.J.; Wang, X.K.
Coupling effects of water and fertilizer on yield, water and fertilizer use efficiency of drip-fertigated cotton
in northern Xinjiang, China. Field Crop. Res. 2018, 219, 169–179. [CrossRef]
35. ESA. GMES Sentinel-2 Mission Requirements Document, Technical Report issue 2 revision 1. Available
online: [Link] (accessed on 30 March 2019).
36. Jaramaz, D.; Perović, V.; Belanović, S.; Saljnikov, E.; Čakmak, D.; Mrvić, V.; Životić, L. The ESA Sentinel-2
Mission Vegetation Variables for Remote Sensing of Plant Monitoring. In Proceedings of the 2nd International
Conference on Regional Development, Spatial Planning and Strategic Governance (RESPAG 2013), Belgrade,
Serbia, 22–25 May 2013; Vujošević, M., Milijić, S., Eds.; Institute of Architecture and Urban & Spatial Planning
of Serbia (IAUS): Belgrade, Serbia, 2013; pp. 950–961.
37. Delegido, J.; Verrelst, J.; Meza, C.M.; Rivera, J.P.; Alonso, L.; Moreno, J. A red-edge spectral index for remote
sensing estimation of green LAI over agroecosystems. Eur. J. Agron. 2013, 46, 42–52. [CrossRef]
38. Gong, P.; Pu, R.L.; Biging, G.S.; Larrieu, M.R. Estimation of forest leaf area index using vegetation indices
derived from Hyperion hyperspectral data. IEEE Trans. Geosci. Remote Sens. 2003, 41, 1355–1362. [CrossRef]
39. Twele, A.; Erasmi, S.; Kappas, M. Spatially explicit estimation of leaf area index using EO-1 hyperion and
landsat ETM+ data: Implications of spectral bandwidth and shortwave infrared data on prediction accuracy
in a tropical montane environment. GISci. Remote Sens. 2008, 45, 229–248. [CrossRef]
40. ESA. Copernicus Open Access Hub. Available online: [Link]
(accessed on 6 April 2019).
41. Louis, J.; Debaecker, V.; Pflug, B.; Main-Knorn, M.; Bieniarz, J.; Müller-Wilm, U.; Cadau, E.; Gascon, F.
SENTINEL-2 SEN2COR: L2A processor for users. In Proceedings of the Living Planet Symposium 2016,
Prague, Czech Republic, 9–13 May 2016.
42. Müller-Wilm, U.; Louis, J.; Richter, R.; Gascon, F.; Niezette, M. Sentinel-2 Level-2A prototype processor:
Architecture, algorithms and first results. In Proceedings of the ESA Living Planet Symposium 2013,
Edinburgh, UK, 9–13 September 2013.
43. Fang, H.L.; Jiang, C.Y.; Li, W.J.; Wei, S.S.; Baret, F.; Chen, J.M.; Garcia-Haro, J.; Liang, S.L.; Liu, R.G.;
Myneni, R.B.; et al. Characterization and intercomparison of global moderate resolution leaf area index
(LAI) products: Analysis of climatologies and theoretical uncertainties. J. Geophys. Res.-Biogeosci. 2013, 118,
529–548. [CrossRef]
44. NASA. LAADS DAAC. Available online: [Link] (accessed on 6
April 2019).
45. ESA. Copernicus Global Land Service. Available online: [Link] (accessed on 6
April 2019).
46. Myneni, R.; Knyazikhin, Y.; Park, T. MCD15A3H MODIS/Terra+Aqua Leaf Area Index/FPAR 4-day L4
Global 500m SIN Grid V006. NASA EOSDIS Land Processes DAAC. 2015. Available online: [Link]
10.5067/MODIS/MCD15A3H.006 (accessed on 6 April 2019).
47. Baret, F.; Weiss, M.; Lacaze, R.; Camacho, F.; Makhmara, H.; Pacholcyzk, P.; Smets, B. GEOV1: LAI and
FAPAR essential climate variables and FCOVER global time series capitalizing over existing products. Part1:
Principles of development and production. Remote Sens. Environ. 2013, 137, 299–309. [CrossRef]
48. Baret, F.; Weiss, M.; Verger, A.; Smets, B. ATBD FOR LAI, FAPAR AND FCOVER FROM PROBA-V PRODUCTS
AT 300M RESOLUTION (GEOV3). Available online: [Link]
files/products/ImagineS_RP2.1_ATBD-LAI300m_I1.[Link] (accessed on 30 March 2019).
49. Scikit-Learn Developers. Scikit-learn. Available online: [Link] (accessed
on 6 April 2019).
50. Python Software Foundation. Python. Available online: [Link] (accessed on 6 April 2019).
51. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.;
Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 2011, 12,
2825–2830.
52. Adelabu, S.; Mutanga, O.; Adam, E. Testing the reliability and stability of the internal accuracy assessment
of random forest for classifying tree defoliation levels using different validation methods. Geocarto Int. 2015,
30, 810–821. [CrossRef]
Appl. Sci. 2019, 9, 1459 24 of 25
53. Omer, G.; Mutanga, O.; Abdel-Rahman, E.M.; Adam, E. Performance of support vector machines and
artificial neural network for mapping endangered tree species using WorldView-2 data in Dukuduku Forest,
South Africa. IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens. 2015, 8, 4825–4840. [CrossRef]
54. Dube, T.; Mutanga, O.; Adam, E.; Ismail, R. Intra-and-inter species biomass prediction in a plantation forest:
Testing the utility of high spatial resolution spaceborne multispectral rapideye sensor and advanced machine
learning algorithms. Sensors 2014, 14, 15348–15370. [CrossRef]
55. Verrelst, J.; Munoz, J.; Alonso, L.; Delegido, J.; Rivera, J.P.; Camps-Valls, G.; Moreno, J. Machine learning
regression algorithms for biophysical parameter retrieval: Opportunities for Sentinel-2 and -3. Remote Sens.
Environ. 2012, 118, 127–139. [CrossRef]
56. van Gerven, M. Computational foundations of natural intelligence. Front. Comput. Neurosci. 2017, 11, 7–30.
[CrossRef]
57. Waske, B.; Fauvel, M.; Benediktsson, J.A.; Chanussot, J. Machine learning techniques in remote sensing data
analysis. In Kernel Methods for Remote Sensing Data Analysis; Camps-Valls, G., Bruzzone, L., Eds.; John Wiley
& Sons: Hoboken, NJ, USA, 2009; pp. 3–24. ISBN 978-470-72211-4.
58. Rumelhart, D.E.; Hinton, G.E.; Williams, R.J. Learning representations by back-propagating errors. Nature
1986, 323, 533–536. [CrossRef]
59. Hornik, K.; Stinchcombe, M.; White, H. Multilayer feedforward networks are universal approximators.
Neural Netw. 1989, 2, 359–366. [CrossRef]
60. Sonmez, H.; Gokceoglu, C.; Nefeslioglu, H.A.; Kayabasi, A. Estimation of rock modulus: For intact rocks
with an artificial neural network and for rock masses with a new empirical equation. Int. J. Rock Mech.
Min. Sci. 2006, 43, 224–235. [CrossRef]
61. Madhiarasan, M.; Deepa, S.N. Comparative analysis on hidden neurons estimation in multi layer perceptron
neural networks for wind speed forecasting. Artif. Intell. Rev. 2017, 48, 449–471. [CrossRef]
62. Huang, G.B. Learning capability and storage capacity of two-hidden-layer feedforward networks. IEEE Trans.
Neural Netw. 2003, 14, 274–281. [CrossRef]
63. Cortes, C.; Vapnik, V. Support-vector networks. Mach. Learn. 1995, 20, 273–297. [CrossRef]
64. Vapnik, V.; Golowich, S.E.; Smola, A. Support vector method for function approximation, regression
estimation, and signal processing. In Proceedings of the 9th International Conference on Neural Information
Processing Systems, Denver, CO, USA, 3–5 December 1996; Mozer, M.C., Jordan, M.I., Petsche, T., Eds.;
MIT Press: Cambridge, MA, USA, 1996; pp. 281–287.
65. Mountrakis, G.; Im, J.; Ogole, C. Support vector machines in remote sensing: A review. ISPRS-J. Photogramm.
Remote Sens. 2011, 66, 247–259. [CrossRef]
66. Basak, D.; Pal, S.; Patranabis, D.C. Support vector regression. Neural Inf. Process. Lett. Rev. 2007, 11, 203–224.
67. Awad, M.; Khanna, R. Efficient Learning Machines: Theories, Concepts, and Applications for Engineers and System
Designers; Pepper, J., Weiss, S., Hauke, P., Eds.; Apress: Berkeley, CA, USA, 2015; ISBN 978-1-4302-5990-9.
68. Ramedani, Z.; Omid, M.; Keyhani, A.; Shamshirband, S.; Khoshnevisan, B. Potential of radial basis function
based support vector regression for global solar radiation prediction. Renew. Sust. Energ. Rev. 2014, 39,
1005–1011. [CrossRef]
69. Li, M.; Liu, Y.H. Learning interaction force model for endodontic shaping with support vector regression.
In Proceedings of the 2006 IEEE International Conference on Robotics and Automation, Orlando, FL, USA,
15–19 May 2006; IEEE: Piscataway, NJ, USA, 2006; pp. 3642–3647.
70. Rasmussen, C.E.; Williams, C.K. Gaussian Process for Machine Learning; Dietterich, T., Bishop, C.,
Heckerman, D., Jordan, M., Kearns, M., Eds.; MIT Press: Cambridge, MA, USA, 2006; ISBN 0-262-18253-X.
71. Scornet, E. Random forests and Kernel methods. IEEE Trans. Inf. Theory 2016, 62, 1485–1500. [CrossRef]
72. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [CrossRef]
73. Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 29, 1189–1232.
[CrossRef]
74. Natekin, A.; Knoll, A. Gradient boosting machines, a tutorial. Front. Neurorobot. 2013, 7, 21. [CrossRef]
[PubMed]
75. Fischer, P.; Etienne, C.; Tian, J.J.; Krauss, T. Prediction of wind speeds based on digital elevation MODELS
using boosted regression trees. In Proceedings of the International Conference on Sensors & Models in
Remote Sensing & Photogrammetry, Kish Island, Iran, 23–25 November 2015; Arefi, H., Motagh, M., Eds.;
Copernicus Gesellschaft Mbh: Göttingen, Germany, 2015; pp. 197–202.
Appl. Sci. 2019, 9, 1459 25 of 25
76. Kanungo, T.; Orr, D. Predicting the readability of short web summaries. In Proceedings of the Second
ACM International Conference on Web Search and Data Mining, Barcelona, Spain, 9–12 February 2009;
Baeza-Yates, R., Boldi, P., Ribeiro-Neto, B., Cambazoglu, B.B., Eds.; ACM: New York, NY, USA, 2009;
pp. 202–211.
77. Szekely, G.J.; Rizzo, M.L.; Bakirov, N.K. Measuring and testing dependence by correlation of distances.
Ann. Stat. 2007, 35, 2769–2794. [CrossRef]
78. Li, R.Z.; Zhong, W.; Zhu, L.P. Feature screening via distance correlation learning. J. Am. Stat. Assoc. 2012,
107, 1129–1139. [CrossRef]
79. Zhong, W.; Zhu, L.P. An iterative approach to distance correlation-based sure independence screening. J. Stat.
Comput. Simul. 2015, 85, 2331–2345. [CrossRef]
80. Kundu, P.P.; Mitra, S. Feature selection through message passing. IEEE Trans. Cybern. 2017, 47, 4356–4366.
[CrossRef] [PubMed]
81. Guyon, I.; Elisseeff, A. An introduction to variable and feature selection. J. Mach. Learn. Res. 2003, 3,
1157–1182. [CrossRef]
82. Hong, X.; Mitchell, R.J. Backward elimination model construction for regression and classification using
leave-one-out criteria. Int. J. Syst. Sci. 2007, 38, 101–113. [CrossRef]
83. Yuan, H.H.; Yang, G.J.; Li, C.C.; Wang, Y.J.; Liu, J.G.; Yu, H.Y.; Feng, H.K.; Xu, B.; Zhao, X.Q.; Yang, X.D.
Retrieving soybean Leaf Area Index from unmanned aerial vehicle hyperspectral remote sensing: Analysis
of RF, ANN, and SVM regression models. Remote Sens. 2017, 9, 309. [CrossRef]
84. Siegmann, B.; Jarmer, T. Comparison of different regression models and validation techniques for the
assessment of wheat leaf area index from hyperspectral data. Int. J. Remote Sens. 2015, 36, 4519–4534.
[CrossRef]
85. LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [CrossRef] [PubMed]
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license ([Link]