Tree Species Classification from Airborne Hyperspectral Images Using Spatial–Spectral Network

Hou, Chengchao; Liu, Zhengjun; Chen, Yiming; Wang, Shuo; Liu, Aixia

doi:10.3390/rs15245679

Open AccessArticle

Tree Species Classification from Airborne Hyperspectral Images Using Spatial–Spectral Network

by

Chengchao Hou

¹,

Zhengjun Liu

¹

,

Yiming Chen

^1,*,

Shuo Wang

¹

and

Aixia Liu

²

¹

Institute of Photogrammetry and Remote Sensing, Chinese Academy of Surveying and Map**, Bei**g 100036, China

²

Land Satellite Remote Sensing Application Center, Ministry of Natural Resources, Bei**g 100048, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2023, 15(24), 5679; https://doi.org/10.3390/rs15245679

Submission received: 19 October 2023 / Revised: 27 November 2023 / Accepted: 6 December 2023 / Published: 10 December 2023

(This article belongs to the Special Issue Advances and Challenges on Multisource Remote Sensing Image Fusion: Datasets, New Technologies, and Applications)

Download

Browse Figures

Versions Notes

Abstract

:

Tree species identification is a critical component of forest resource monitoring, and timely and accurate acquisition of tree species information is the basis for sustainable forest management and resource assessment. Airborne hyperspectral images have rich spectral and spatial information and can detect subtle differences among tree species. To fully utilize the advantages of hyperspectral images, we propose a double-branch spatial–spectral joint network based on the SimAM attention mechanism for tree species classification. This method achieved high classification accuracy on three tree species datasets (93.31% OA value obtained in the TEF dataset, 95.7% in the Tiegang Reservoir dataset, and 98.82% in the **ongan New Area dataset). The network consists of three parts: spectral branch, spatial branch, and feature fusion, and both branches make full use of the spatial–spectral information of pixels to avoid the loss of information. In addition, the SimAM attention mechanism is added to the feature fusion part of the network to refine the features to extract more critical features for high-precision tree species classification. To validate the robustness of the proposed method, we compared this method with other advanced classification methods through a series of experiments. The results show that: (1) Compared with traditional machine learning methods (SVM, RF) and other state-of-the-art deep learning methods, the proposed method achieved the highest classification accuracy in all three tree datasets. (2) Combining spatial and spectral information and incorporating the SimAM attention mechanism into the network can improve the classification accuracy of tree species, and the classification performance of the double-branch network is better than that of the single-branch network. (3) The proposed method obtains the highest accuracy under different training sample proportions, and does not change significantly with different training sample proportions, which are stable. This study demonstrates that high-precision tree species classification can be achieved using airborne hyperspectral images and the methods proposed in this study, which have great potential in investigating and monitoring forest resources.

Keywords:

tree species classification; hyperspectral images; deep learning; spatial–spectral information; attention mechanism

1. Introduction

Forests are the mainstay of terrestrial ecosystems and are essential in maintaining ecological security and balance [1]. Conducting forest resource surveys and monitoring is important for formulating forestry guidelines and policies, protecting and utilizing planned forests, and constructing a sound ecological environment [2]. Among these, tree species identification is one of the basic and key components of forest resources monitoring, which plays a vital role in forest fire prevention [3], the monitoring of forest pests and diseases [4], the extraction of forest change information [5], and the protection of biodiversity [6]. Traditional tree species identification mainly relies on manual field surveys to identify tree species based on the external morphology of trees. Although this method has high accuracy, it has low accessibility, involves a difficult investigation, and involves high danger for plots without traffic conditions [7]. Secondly, field survey is costly and time-consuming, which makes it challenging to identify large-scale tree species in a short time.

The rapid development of remote sensing technology makes up for the deficiency of manual survey methods, which can obtain large-area image data without touching trees and realize the classification and identification of tree species in large regional scale areas without causing damage to the forest ecological environment. In particular, the hyperspectral sensor can simultaneously image the target region in tens to hundreds of continuous and subdivided spectral bands, obtaining the spatial information of the surface image as well as its spectral information, achieving the combination of spectra and image. Compared with RGB and multispectral images, hyperspectral images have rich spectral information and can detect subtle differences in the spectra of different vegetation, which has significant advantages in forest tree species classification.

In recent years, deep learning methods based on neural network have become popular with the development of computer hardware and algorithms. As an emerging research direction in the field of machine learning, it utilizes deep neural network structures that can automatically learn high-level abstract features and combine these features layer by layer to achieve efficient and accurate data classification and prediction [8,9]. Compared with traditional machine learning methods, deep learning has more robust self-adaptive and generalization capabilities, can better handle large-scale complex data, and has achieved great success in computer vision, natural language processing, speech recognition, and other fields. In the field of remote sensing, deep learning technology has attracted extensive attention from scholars, and many experts have utilized deep learning methods for tree species classification and achieved good classification results [10,11,12]. Among them, the convolutional neural network (CNN) has achieved remarkable results in computer vision, such as image classification [13], object detection [14], and semantic segmentation [15]. Due to its powerful feature extraction capability, the convolutional neural network has become the most commonly used neural network in hyperspectral tree species classification [16,17,18]. The hyperspectral image classification methods based on CNN can be mainly divided into three classes:

Classification methods based on spectral features [19,20]. This method utilizes 1D-CNN to extract features from the raw spectral information of pixels to complete classification. ** of the spatiotemporal distribution and characteristics of tree species over wide areas. Many researchers have successfully utilized remote sensing technology for tree species classification studies [32,33,34]. Park et al. [35] combined high-resolution RGB images (spatial resolution of 7 cm) acquired using UAV with machine learning algorithms to monitor trees and leaf phenology in Panama’s tropical forests. Grabska et al. [36] created nine different subsets of variables from multi-temporal Sentinel-2 data and environmental terrain data (elevation, slope, and slope direction) using a Random Forest-based variable importance selection algorithm (VSURF) and Recursive Feature Elimination (RFE). They classified the tree species using Random Forest, Support Vector Machine, and XGBoost algorithms, respectively. The results showed that the Support Vector Machine classifier outperforms the other two classifiers, obtaining the highest accuracy of 86.9%. Although RGB and multispectral remote sensing data have been widely used in tree species classification, the characteristics between some tree species (especially those of the same genus) are very similar, making it difficult to classify them finely with these two data sources.

2.2. Classification Methods Based on Hyperspectral Images

While RGB and multispectral data were reported to have potential for tree species map**, the continuous spectral information contained in hyperspectral data seems even more suitable to differentiate tree species with similar spectral properties. In previous studies, tree species classification using hyperspectral data mainly adopted traditional machine learning methods [37,38,39], such as Support Vector Machine, Random Forest, BP neural network, etc. For example, Dalponte et al. [40] used hyperspectral data and three classifiers (SVM, RF, and Maximum Likelihood method) to evaluate the accuracy of boreal forest species classification at the pixel level and crown level, respectively. However, traditional machine learning methods need to process and transform the raw data and manually extract features with distinction, such as important bands, vegetation indices, and texture features. The performance results of the methods largely depend on whether the selected features are reasonable or not. However, feature selection often relies on experience and is somewhat blind. In addition, the selected feature type depends on the specific task and dataset, which needs to be decided according to the actual situation, resulting in poor generalization ability. Wei et al. [41] proposed a fine classification method based on multi-feature fusion and deep learning. In their research, the morphological profiles, GLCM texture and endmember abundance features were leveraged to exploit the spatial information of the hyperspectral imagery. Then, the spatial information was fused with the original spectral information to generate classification results by using the deep neural network with a conditional random field (DNN + CRF) model. Although this method can yield good classification results, the spatial features are manually extracted from the raw data, which consumes time.

2.3. Attention Mechanism

As is known to all, the importance of every spectral channel and the area of the input patch is different when the network extracts features. The attention mechanisms can focus on the most informative part and decrease the weight of other regions. Many researchers have introduced an attention mechanism into hyperspectral image classification. Ma et al. [42] introduced the Convolutional Block Attention Module (CBAM) into hyperspectral images classification and proposed a Double-Branch Multi-Attention mechanism network (DBMA) for HSI classification. The experimental results demonstrated the effectiveness of the attention mechanism in hyperspectral images classification. However, current attention mechanisms often use additional sub-networks to generate attention weights [43,44,45], increasing the number of parameters in the model. The hyperspectral images have a massive amount of data compared to other remote sensing data sources. Accordingly, the number of parameters in the network model is also huge. The parameter-free attention mechanism does not introduce additional parameters to the network in generating weights, which is more suitable for hyperspectral images classification.

Code	Scientific Name	Abbreviation	Train Samples	Test Samples
1	Abies concolor	abco	2323	593
2	Abies magnifica	abma	742	113
3	Calocedrus decurrens	cade	1452	403
4	Pinus jeffreyi	pije	3654	924
5	Pinus lambertiana	pila	2205	583
6	Quercus kelloggii	quke	96	14
7	Pinus contorta	pico	741	154
8	Dead tree	dead	2745	796
Total			13,958	3580

Code	Scientific Name	Abbreviation	Train Samples	Test Samples
1	Glyptostrobus pensilis	glpe	10,013	6581
2	Cinnamomum camphora	cica	53,577	49,912
3	Eucalyptus robusta Smith	euro	14006	7868
4	Ficus altissima	fial	1783	3990
5	Platycladus orientalis	plor	19,287	10,021
6	Ficus microcarpa	fimi	10,226	6997
7	Castanopsis hystrix	cahy	9627	14,518
Total			118,519	99,887

Code	Scientific Name	Abbreviation	Train Samples	Test Samples
1	Acer negundo	acne	1128	11282
2	Salix babylonica	saba	903	9038
3	Ulmus pumila	ulpu	76	767
4	Sophora japonica	soja	2377	23,779
5	Fraxinus chinensis	frch	846	8467
6	Koelreuteria paniculata	kopa	116	1165
7	Robinia pseudoacacia	rops	28	280
8	Pyrus sorotina	pyso	5132	51,325
9	Populus simonii	posi	455	4553
10	Amygdalus persica	ampe	327	3275
11	Other	other	6987	69,915
Total			18,375	183,846

Species	SVM	RF	3D-CNN	DBMA	DBDA	ConvNeXt	SSFTT	Our
abco	39.12%	66.12%	73.33%	79.03%	87.07%	78.83%	82.20%	88.40%
abma	7.11%	14.91%	25.29%	51.26%	85.24%	64.90%	76.14%	88.08%
cade	20.02%	60.09%	78.38%	86.46%	92.30%	89.42%	90.21%	93.76%
pije	48.04%	82.34%	84.73%	94.08%	96.13%	95.15%	96.05%	97.40%
pila	34.23%	83.10%	85.65%	93.07%	99.43%	93.63%	95.26%	98.87%
quke	12.68%	42.11%	53.12%	48.62%	76.72%	84.63%	68.51%	80.17%
pico	14.64%	47.07%	70.58%	82.19%	89.47%	87.96%	89.34%	90.94%
dead	83.30%	86.63%	85.33%	85.01%	87.59%	86.71%	85.40%	89.53%
OA	44.65%	73.18%	78.89%	86.18%	92.16%	88.13%	89.52%	93.31%
AA	32.39%	60.30%	69.55%	77.47%	89.24%	85.15%	85.39%	90.89%
Kappa	0.3158	0.6675	0.7418	0.8307	0.9043	0.8551	0.8721	0.9183

Species	SVM	RF	3D-CNN	DBMA	DBDA	ConvNeXt	SSFTT	Our
glpe	25.59%	76.77%	77.06%	83.39%	98.32%	91.08%	99.21%	98.54%
cica	83.56%	94.27%	96.07%	95.34%	97.99%	98.22%	97.72%	98.34%
euro	65.06%	89.10%	96.88%	96.90%	96.92%	97.18%	96.84%	97.65%
fial	2.48%	4.30%	14.25%	11.92%	19.09%	4.41%	19.14%	24.40%
plor	63.29%	86.59%	96.02%	94.66%	98.79%	92.45%	98.08%	99.58%
fimi	35.20%	66.10%	89.54%	91.07%	98.05%	84.05%	97.86%	98.97%
cahy	50.16%	86.82%	96.49%	98.60%	98.73%	98.01%	98.44%	99.64%
OA	64.77%	85.29%	91.21%	91.45%	94.97%	92.32%	94.76%	95.7%
AA	46.48%	71.99%	80.9%	81.70%	86.84%	80.77%	86.75%	88.16%
Kappa	0.4709	0.7874	0.875	0.8791	0.9284	0.8900	0.9255	0.9389

Article Menu

Tree Species Classification from Airborne Hyperspectral Images Using Spatial–Spectral Network

Abstract

1. Introduction

2.2. Classification Methods Based on Hyperspectral Images

2.3. Attention Mechanism

3. Materials and Methodology

3.1. Dataset Introduction

3.1.1. TEF Dataset

3.1.2. Tiegang Reservoir Dataset

3.1.3. **ongan New Area Dataset

3.2. Methodology

3.2.1. Spectral Branch

3.2.2. Spatial Branch

3.2.3. Feature Fusion

3.3. Comparison Methods

4. Experiments

4.1. The Classification Results of the TEF Dataset

4.2. The Classification Results of the Tiegang Reservoir Dataset

4.3. The Classification Results for the **ongan New Area Dataset

5. Discussion

5.1. The Importance of Joint Spatial–Spectral Features

5.2. The Effectiveness of the SimAM Attention Mechanism

5.3. The Influence of Shallow Features on Tree Species Classification

5.4. T-SNE Visualization

5.5. Robustness Assessment

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Species	SVM	RF	3D-CNN	DBMA	DBDA	ConvNeXt	SSFTT	Our
acne	55.27%	62.28%	76.54%	87.58%	84.32%	90.13%	96.06%	98.64%
saba	53.58%	66.58%	74.93%	93.40%	93.86%	94.19%	98.02%	99.04%
ulpu	17.58%	45.44%	75.83%	89.31%	78.59%	94.26%	97.26%	99.43%
soja	62.91%	61.12%	80.93%	94.91%	91.10%	94.34%	99.04%	98.99%
frch	45.29%	41.97%	78.05%	94.45%	97.99%	93.81%	98.81%	99.49%
kopa	63.77%	76.99%	82.63%	95.93%	95.45%	93.39%	99.83%	99.88%
rops	1.30%	0.00%	1.43%	27.57%	0.00%	12.86%	66.43%	95.29%
pyso	72.37%	84.98%	85.80%	93.25%	95.84%	95.64%	98.39%	98.86%
posi	30.66%	43.27%	61.52%	81.66%	86.51%	81.75%	89.46%	93.33%
ampe	26.83%	35.35%	53.42%	84.06%	81.40%	87.85%	95.66%	96.43%
other	79.04%	85.61%	90.55%	95.22%	95.07%	96.73%	98.19%	99.11%
OA	68.22%	75.59%	84.15%	93.38%	93.52%	94.76%	97.94%	98.82%
AA	46.24%	54.87%	69.24%	85.21%	81.83%	84.99%	94.29%	98.04%
Kappa	0.5768	0.6699	0.7886	0.9119	0.9137	0.9301	0.9727	0.9843

Dataset Name	Method	OA Value	AA Value	Kappa Value
TEF dataset	No Attention	92.02%	88.6%	0.9025
TEF dataset	Attention	93.31%	90.89%	0.9183
Tiegang Reservoir dataset	No Attention	95.1%	87.35%	0.9303
Tiegang Reservoir dataset	Attention	95.7%	88.16%	0.9389
**ongan New Area dataset	No Attention	97.78%	94.31%	0.9705
**ongan New Area dataset	Attention	98.82%	98.04%	0.9843