The purpose of ship classification is to identify various types of ships as accurately as possible, which is of great significance for safeguarding maritime traffic rights and interests and improving coastal defense early warning. With the improvement of imaging technologies of all kinds, image-based ship classification has become the mainstream approach to ship target classification and recognition. According to the data source, ship images can be roughly divided into radar images, satellite remote-sensing images, infrared images and visible light images. The most widely used radar imaging technology is synthetic aperture radar (SAR). The advantages of SAR imaging are a wide monitoring range, a short observation period and all-weather operation. On the other hand, the price of using radar is vulnerability to other electromagnetic interference. Moreover, the captured ship targets occupy only a small part of the whole image, so classification methods for radar images are only suitable for larger targets, and the classification effect for small boats at long distances is poor. Optical remote-sensing satellite imaging is easily affected by changes in ocean weather and illumination, which makes long-term real-time monitoring difficult. Infrared imaging can provide rich information on targets and their backgrounds at night or under insufficient light, and it has a strong anti-jamming ability. However, infrared imaging is affected by weather, temperature and other factors; on the sea surface, interference from waves, clouds and the like greatly reduces image accuracy, and infrared imaging cannot provide rich color information when the image quality is low. The visible light image contains gray-level information over multiple bands, and its quality has improved steadily, which makes target features easier to find and extract.
For the problem of ship classification, the actual system can obtain a variety of images. This can be addressed using fusion methods that produce high-resolution multispectral images from a high-resolution panchromatic image and low-resolution multispectral images [1,2].
Several traditional algorithms were suggested by Rainey et al. [3] for feature extraction and identification in ship images, including LBP, HOG and SIFT, together with classifiers such as the nearest-neighbor algorithm and SVM. Arguedas [4] used LBP features to extract texture features from ship images for ship classification. Parameswaran et al. [5] applied the bag-of-words model, originally used in text classification, to ship classification and proposed a two-stage ship recognition technique based on structural features; the method can effectively distinguish ship types such as cargo ships from ship images. Leclerc et al. [6] proposed a commercial ship classification algorithm based on structural feature analysis which distinguishes features such as density estimates, the position of the ship’s integral principal axis and the proportions of the integral quantities of the left, middle and right parts. A synchronous experiment in the East China Sea experimental area showed that the average classification accuracy of the COSMO-SkyMed image quotient method was 89.94%. Liang et al. [7] suggested using a BP neural network to classify six kinds of infrared images. After pre-processing the images, the Hu invariant moments, the edge image and the perimeter-area ratio were selected as features, and the accuracy of a four-layer BP neural network was about 84%. Traditional ship image classification methods are based on expert systems, which recognize ships according to ship type and lack good generalization performance; therefore, ship classification accuracy needs to be enhanced. With the rapid development of deep learning, convolutional neural networks have become a research hotspot in the field of image classification. Rainey et al. [8] trained a convolutional neural network to recognize ships in satellite images and achieved good results. Liu et al. [9] proposed an improved residual network to detect and classify remote-sensing ship images, which is prone to overfitting due to the small dataset. Khellal et al. [10] proposed an extreme learning network to recognize ships in infrared images; the method is suitable for infrared recognition systems, but after extracting extreme learning features it still requires an ensemble of extreme learning machines for classification. A CNN model with multi-resolution input has also been proposed; its performance was evaluated on TerraSAR-X images covering five maritime categories. The classification effect differed across resolutions, but how the change in image resolution affects the internal activations of the CNN remained unclear from the test. To address these issues, the residual network [18] was introduced, the structure of which is shown in Figure 1.
In the residual structure shown in Figure 1, if the input is x, each weight layer is a 3 × 3 convolutional layer and the mapping learned by the parameterized layers in the structure is f(x), then the output of the residual structure is f(x) + x. In the network, assuming that the mth through Mth layers are composed of multiple such consecutive residual structures, the forward propagation process of this part of the network is shown in Equation (1):

x_M = x_m + \sum_{i=m}^{M-1} f(x_i, W_i),   (1)

where x_M is the output of these continuous residual structures, x_m is the input of the first of these layers, W_i is the parameter of the ith layer (from the mth layer to the Mth layer) and x_i is the input of the ith layer.
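As a concrete illustration, the forward pass described above can be sketched in NumPy. This is a minimal sketch, not the paper's implementation: small dense weight layers stand in for the 3 × 3 convolutions, and the dimensions and weight scales are illustrative assumptions.

```python
import numpy as np

def residual_branch(x, w1, w2):
    """The learned mapping f(x, W): two weight layers with a ReLU between.
    Dense layers stand in for 3x3 convolutions to keep the sketch short."""
    return np.maximum(0.0, x @ w1) @ w2

rng = np.random.default_rng(0)
d = 8                          # feature dimension (illustrative)
x_m = rng.normal(size=d)       # input of the first (mth) residual unit

x = x_m.copy()
residual_sum = np.zeros(d)
for _ in range(4):             # layers m..M as consecutive residual units
    w1 = rng.normal(scale=0.1, size=(d, d))
    w2 = rng.normal(scale=0.1, size=(d, d))
    f = residual_branch(x, w1, w2)
    x = x + f                  # identity shortcut: x_{i+1} = x_i + f(x_i, W_i)
    residual_sum += f

# Equation (1): the output equals the input plus the summed residual branches.
assert np.allclose(x, x_m + residual_sum)
```

The assertion at the end simply restates Equation (1): because each unit adds its residual branch onto an unmodified identity path, the output of the stack is the first input plus the sum of all learned residuals.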
When performing backpropagation, according to the chain rule, the calculation of the gradient of the first of these layers is shown in Equation (2):

\frac{\partial loss}{\partial x_m} = \frac{\partial loss}{\partial x_M} \cdot \frac{\partial x_M}{\partial x_m} = \frac{\partial loss}{\partial x_M} \left( 1 + \frac{\partial}{\partial x_m} \sum_{i=m}^{M-1} f(x_i, W_i) \right).   (2)

It can be seen from Equation (2) that the gradient of the mth layer contains a term, ∂loss/∂x_M, propagated directly from the error at the Mth layer. Even if the gradient contributed by the intermediate layers is extremely small, the gradient at this layer will not vanish.
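This effect can be checked numerically. The sketch below is a simplified linear model with assumed dimensions and weight scales: backpropagating through a plain chain multiplies the upstream gradient by a small weight matrix at every layer, so it vanishes, while the identity term contributed by the shortcut connections preserves it.

```python
import numpy as np

rng = np.random.default_rng(1)
d, depth = 6, 30
# Small linear "weight layers"; weights this small shrink the gradient
# at every step of a plain (non-residual) chain.
Ws = [rng.normal(scale=0.05, size=(d, d)) for _ in range(depth)]

g = np.ones(d)                            # upstream gradient at layer M
plain = g.copy()
residual = g.copy()
for W in reversed(Ws):
    plain = W.T @ plain                   # plain net: grad_m = W^T grad_{m+1}
    residual = residual + W.T @ residual  # residual net: (I + W)^T grad_{m+1}

# The plain gradient has effectively vanished after 30 layers,
# while the identity path keeps the residual gradient many orders
# of magnitude larger.
assert np.linalg.norm(plain) < 1e-6
assert np.linalg.norm(residual) > 1e4 * np.linalg.norm(plain)
```

The `1` in Equation (2) corresponds to the `residual + ...` term in the loop: the upstream gradient always passes through unchanged, regardless of how small the weight-dependent term becomes.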
Google’s Inception series of models, from V1 to V3, starts from the width of the model instead of its depth. The design assumes that objects of different sizes require convolution kernels of different sizes, so parallel convolution kernels are adopted. At the same time, the Inception networks also perform well in terms of model size and computational efficiency. For example, using two stacked 3 × 3 convolution kernels instead of one 5 × 5 convolution kernel reduces the number of parameters without weakening the expressive ability.
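The parameter saving from this factorization is easy to verify. The count below assumes C input and C output channels and ignores biases; the channel count is illustrative, not taken from the original text.

```python
# Two stacked 3x3 convolutions cover the same 5x5 receptive field as a
# single 5x5 convolution but need fewer parameters (18*C*C vs 25*C*C).
C = 64                                   # channel count (illustrative)
params_5x5 = 5 * 5 * C * C               # one 5x5 conv, C -> C channels
params_two_3x3 = 2 * (3 * 3 * C * C)     # two 3x3 convs, C -> C channels
assert params_two_3x3 < params_5x5       # 73,728 < 102,400

# Receptive field of two stacked 3x3 kernels: 3 + (3 - 1) = 5 pixels.
assert 3 + (3 - 1) == 5
```

The saving is a factor of 18/25 = 0.72 in this layer, and the extra nonlinearity between the two 3 × 3 layers is part of why expressive ability is not weakened.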
GoogLeNet is a 22-layer deep convolutional neural network in the Inception family, developed by researchers at Google. It was introduced to make classification and detection more efficient and is still widely used in classification tasks.