Article

A Deep Learning-Based Automatic Segmentation and 3D Visualization Technique for Intracranial Hemorrhage Detection Using Computed Tomography Images

by Muntakim Mahmud Khan 1, Muhammad E. H. Chowdhury 2,*, A. S. M. Shamsul Arefin 1, Kanchon Kanti Podder 1, Md. Sakib Abrar Hossain 1, Abdulrahman Alqahtani 3,4, M. Murugappan 5,6,7,*, Amith Khandakar 2, Adam Mushtak 8 and Md. Nahiduzzaman 2,9

1 Department of Biomedical Physics and Technology, University of Dhaka, Dhaka 1000, Bangladesh
2 Department of Electrical Engineering, Qatar University, Doha 2713, Qatar
3 Department of Medical Equipment Technology, College of Applied Medical Science, Majmaah University, Majmaah City 11952, Saudi Arabia
4 Department of Biomedical Technology, College of Applied Medical Sciences in Al-Kharj, Prince Sattam Bin Abdulaziz University, Al-Kharj 11942, Saudi Arabia
5 Intelligent Signal Processing (ISP) Research Lab, Department of Electronics and Communication Engineering, Kuwait College of Science and Technology, Block 4, Doha 13133, Kuwait
6 Department of Electronics and Communication Engineering, School of Engineering, Vels Institute of Sciences, Technology, and Advanced Studies, Chennai 600117, India
7 Center of Excellence for Unmanned Aerial Systems (CoEUAS), Universiti Malaysia Perlis, Perlis 02600, Malaysia
8 Clinical Imaging Department, Hamad Medical Corporation, Doha 3050, Qatar
9 Department of Electrical & Computer Engineering, Rajshahi University of Engineering & Technology, Rajshahi 6204, Bangladesh

* Authors to whom correspondence should be addressed.
Diagnostics 2023, 13(15), 2537; https://doi.org/10.3390/diagnostics13152537
Submission received: 18 May 2023 / Revised: 20 July 2023 / Accepted: 24 July 2023 / Published: 31 July 2023
(This article belongs to the Special Issue Application of Deep Learning in the Diagnosis of Brain Diseases)

Abstract

Intracranial hemorrhage (ICH) occurs when blood leaks inside the skull as a result of trauma to the skull or due to medical conditions. ICH usually requires immediate medical and surgical attention because the disease has a high mortality rate, long-term disability potential, and other potentially life-threatening complications. ICHs span a wide range of severity levels, sizes, and morphologies, making accurate identification challenging. Small hemorrhages are more likely to be missed, particularly in healthcare systems that experience a high turnover of computed tomography (CT) investigations. Although many neuroimaging modalities have been developed, CT remains the standard for diagnosing trauma and hemorrhage (including non-traumatic ones). Because CT scans can be obtained rapidly, a CT-based diagnosis can enable time-critical, urgent ICH surgery that could save lives. The purpose of this study is to develop a machine-learning algorithm that can detect intracranial hemorrhage based on plain CT images taken from 75 patients. CT images were preprocessed using brain windowing, skull-stripping, and image inversion techniques. Hemorrhage segmentation was performed on the preprocessed CT images using multiple pre-trained models previously used in many other medical applications. A U-Net model with a DenseNet201 pre-trained encoder outperformed the other U-Net, U-Net++, and FPN (Feature Pyramid Network) models, achieving the highest Dice similarity coefficient (DSC) and intersection over union (IoU) scores. We present a three-dimensional brain model highlighting hemorrhages from ground truth and predicted masks. The hemorrhage was measured volumetrically to determine the size of the hematoma. This study is essential in examining ICH for diagnostic purposes in clinical practice by comparing the predicted 3D model with the ground truth.

1. Introduction

Object detection is currently used in a number of biomedical domains, and these domains often involve a number of imaging modalities. Nowadays, machine learning is one of the hottest trends in image processing. A primary objective of image processing is to qualify images by incorporating multiple attributes, and biomedical image recognition is a natural application of it. Deep learning algorithms perform object detection not only with relatively simple models but also with outstanding results for recognizing images [1,2]. Deep learning applications have been used to classify COVID-19, MERS, and SARS [3] and to detect COVID-19 using an operational segmentation network [4]. Machine learning and deep learning techniques have also been applied to liver tumor segmentation [5] and lung severity classification [6]. Consequently, image segmentation and object detection can be used to detect intracranial hemorrhage.
To diagnose ICH, several imaging modalities are commonly used, including computed tomography (CT), magnetic resonance imaging (MRI), and positron emission tomography-computed tomography (PET-CT). CT is commonly used to perform an instantaneous scan of the brain to detect and localize hemorrhages. Compared to MRI scanning, CT scanning provides more comprehensive volumetric and anatomical information about a patient. Interpreting CT images, however, is quite challenging, particularly when examining parenchymal, ventricular, and meningeal brain tissue. Consequently, segmenting intracranial hemorrhage using machine learning algorithms can be an indispensable component of CT image analysis. CT scans assess intracranial hemorrhages by observing density changes over time, representing clot formation, clot lysis, clot retraction, and tissue loss [7]. As delay increases morbidity and mortality among patients with severe traumatic injuries or hemorrhagic strokes, every second counts.
Recent literature reports the segmentation of objects using deep learning techniques [8], and different methodologies and efficacies of ICH segmentation from CT scans have been reported. Yuhang et al. [9] investigated semantic segmentation techniques that can quantify blood loss and distinguish between six categories of cerebral hemorrhage. Medical image segmentation was hampered by a lack of data, as data collection and annotation were challenging; to address this issue, the authors proposed a pre-trained, fine-tuned U-Net model. On the test set, the highest accuracy was 94.1%, which was 10.5% better than the model trained from scratch. Segmenting a complex dataset with a small amount of data demonstrates the advantages of the proposed segmentation technique. Using a series of CT scans of the brain, Mingjie et al. [10] developed a novel three-dimensional (3D) method for segmenting hemorrhage regions. This method coupled a super-voxel approach for initial segmentation with three-dimensional graph cuts to divide data more precisely. Adapting the 2D segmentation of cerebral hemorrhage to a 3D approach, making greater use of the intra-frame information of CT scans, was the primary innovation of that work. Kwon et al. [11] proposed a Siamese U-Net to segment the abnormal regions of ICH more precisely from patient CT images. To emphasize the convolutional properties of the abnormal ICH regions, they incorporated the differences between the bleeding regions and a healthy template into the long skip-connection of the U-Net design. Significant enhancements were observed in the Hausdorff distance (6.81%), Dice similarity coefficient (DSC) score (9.07%), and volume percentage error (40.32%) when comparing the proposed model to the conventional U-Net model. Another method for detecting and segmenting hemorrhagic lesions was presented in [2] using deep learning algorithms. The authors proposed a U-Net-based deep learning backbone to detect and segment hemorrhagic strokes in CT images automatically; in the pre-processing phase, CT slices with symmetrical constraints of brain images were introduced into their model. They achieved a detection accuracy of 98.59 percent, a DSC score of 80.33 percent, and an intersection over union (IoU) of 69.19 percent. Anupama et al. [12] proposed the combination of GrabCut-based segmentation and synergistic deep learning (SDL) as a GC-SDL model and achieved better ICH identification performance. The authors used Gabor filtration to remove noise from the image, thereby improving its quality; the SDL model was then utilized in the feature extraction procedure, and finally the SoftMax layer classified the various types of ICH. The GC-SDL model obtained a sensitivity, specificity, precision, and accuracy of 94.0%, 97.78%, 95.79%, and 95.73%, respectively. Vamsi et al. [13] proposed a lightweight convolution model using the VGG-16 architecture and a Random Forest algorithm and attained a DSC and accuracy of 72.92% and 97.88%, respectively. Wang et al. [14] proposed a semi-supervised model for segmenting the ICH using an inverse-sigmoid-based learning strategy, which utilized 80% of the data for training and obtained a DSC score of 0.67. White Matter Fuzzy c-Means (WMFCM) clustering was used by Gautam et al. [15] to remove components such as the cranium; with this methodology, they achieved an average DSC score of 0.82, even though the segmentation was not based on deep learning.
Apart from CT image segmentation, several research works have been conducted on brain tumor segmentation using MRI, employing different state-of-the-art models. For instance, Balwant [16] presented a review article that explored multiple CNN architectures for brain tumor segmentation. Rehman et al. [17] introduced BrainSeg-Net, which focuses on segmenting three sub-regions from MRI images: Enhancing Core (EC), Whole Tumor (WT), and Tumor Core (TC). The proposed BrainSeg-Net architecture demonstrates promising improvements compared to existing baseline and state-of-the-art techniques. A similar approach was adopted in RAAGR2-Net [18]. Additionally, Wu et al. [19] proposed the DE-ResUnet model for brain tissue detection. The DE-ResUnet incorporates dual encoders for both T1-weighted images and texture features to uncover hidden supplementary information. Furthermore, they developed a strengthening module that enhances the initial segmentation by specifically emphasizing brain tissue regions, guided by prior knowledge. Zhao et al. [20] employed a 4D atlas-based segmentation method evaluated with the Wilcoxon signed-rank test; the objective of their study was to develop an automatic fetal brain segmentation method using deep learning, offering improved accuracy and reliability compared to atlas-based methods.
The objective of this work is to detect ICH using deep learning techniques. Hemorrhage segmentation algorithms based on supervised, semi-supervised, and image-processing approaches have already been proposed in the literature, yet obtaining a high DSC score for ICH segmentation from brain CT images remains quite challenging. A multi-stage approach has been implemented here to segment the ICH from brain CT images in 2D. Even with a good DSC, it is difficult for clinicians to make proper clinical decisions without a good visualization tool. To enhance the presentation of the results, several quantitative metrics and visual representations were used in the present work. This study makes the following contributions to the body of knowledge:
  • This work highlights the CT image and annotation pre-processing steps in order to obtain better segmentation performance.
  • Several state-of-the-art DNNs such as U-Net, U-Net++, and FPN (Feature Pyramid Network)-based deep learning models were investigated to identify the best model for ICH segmentation from the CT images.
  • This work presented a 3D visualization tool to show the hemorrhage in the CT volume predicted using the proposed technique and compare it with a ground truth hemorrhage annotated by the experts.
  • The volume of the hemorrhage was also calculated from the generated mask of images.

2. Methodology

A diagram illustrating the study’s overview is shown in Figure 1. As only the hemorrhage is segmented from the images, the task can be considered instance segmentation. This study used a U-Net (encoder-decoder)-like convolutional neural network (CNN) to segment the area of hemorrhage present in a CT image, since this architecture performed better than other deep learning networks [21]. Several pre-trained models trained on the ImageNet dataset served as encoders in the segmentation tasks [22].
A pre-processing step was first performed on the raw data before different variants of the U-Net model were trained to predict ICH masks. Prior to training the deep learning models, other preparatory procedures were carried out, such as dataset evaluation, data preparation, k-fold cross-validation, and data augmentation.

2.1. Dataset Description

The dataset used in this research is a publicly available dataset published in the PhysioNet database [22]. The dataset contained 82 CT scans, in which 36 CT scans represented the five types of ICH (Epidural, Subdural, Intraventricular, Subarachnoid, and Intraparenchymal) while 46 CT volumes did not have any hemorrhage (Control). The slice number for the different CT volumes was not the same and the total number of slices from all volumes was 2814; among them, 397 were slices with ICH. The hemorrhage and skull-fracture information were recorded by two expert radiologists who delineated the intracranial hemorrhage region in each slice. The data collection maintained a multistep protocol.
The dataset provided each patient’s demographic information (.csv format) as metadata, including the age, sex, and slice number of each patient. Most importantly, the types of hemorrhage and fracture were also listed for each slice (e.g., “1” if the hemorrhage is present; otherwise, “0”). Figure 2 presents an overview of the entire dataset.

2.2. Pre-Processing Steps

Pre-processing is crucial in deep learning tasks, and the appropriate steps depend on the nature of the data. It is often found that pre-processing improves the quality of images, which leads to better performance of deep learning (DL) models. Several techniques were used in this study to preprocess the dataset before training the models. The pre-processing steps for the given raw data are illustrated in Figure 3 and described below:
Window level (W.L) and width (W.D) adjustment: Windowing manipulates the grayscale component of a CT image to highlight specific anatomical structures. The brightness of an image can be changed by adjusting the window level (W.L), and the contrast can be adjusted via the window width (W.D) [23]. A greater window width shows a wider range of CT values [24].
As a result, compared with a window of limited width, the transition from dark to light objects takes place over a larger range. The middle point of the displayed range of CT numbers is known as the window level, also called the window center. CT images appear brighter when window levels are reduced, and vice versa. In our investigations, the window width and window level were set at 180 and 80, respectively, after trial and error, to make the ICH more evident in each slice. The setup was consistent across all images. With these parameters, intracranial hemorrhages were clearly differentiated from other tissues in the images. The window width and window level, however, were readjusted to 90 and −50, respectively, during image inversion, as in this window the ICH was most apparent in the inverted slice.
Skull-stripping: In our study, skull-stripping contributed to reducing the complexity of CT image analysis. This procedure removes the extracranial portion of the head and the bone portion of the skull. The input images first had to be converted to DICOM: the raw dataset was converted from Neuroimaging Informatics Technology Initiative (NIfTI) format to DICOM images using the Python library NiBabel 5.1.0. The skull-stripping method was adopted from Najm et al. [25]. This pre-processing was conducted using a MATLAB R2021a tool with the essential functions available on GitHub (https://github.com/WuChanada/StripSkullCT (accessed on 19 September 2022)).
Image inversion: Negative transformation (image inversion) plays an important role in medical image processing. Because it is quite challenging to detect hemorrhages in a low-contrast CT image, inverting the image may help. The image inversion was performed with the “ImageJ 1.53s” software.
Figure 3 illustrates the above three steps applied to a CT image. After inversion, areas of the brain tissue containing bleeding become darker, highlighting the hemorrhage clearly. Furthermore, changing the background from black to white makes it easier to focus on the hemorrhage. The inverted images were also adjusted to a window level and width of 50 and 80, respectively, using the ImageJ software. The performance of hemorrhage recognition was significantly enhanced using these methods. To compare the performance of the framework with and without inversion, the same segmentation models were trained on both inverted and non-inverted datasets.
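As a rough illustration of the windowing and inversion steps (a sketch only, not the authors’ actual pipeline, and with a stand-in slice instead of the real NIfTI/DICOM data), the following Python snippet applies a window level/width to a slice in Hounsfield units and then inverts the 8-bit result:

```python
import numpy as np

# Stand-in CT slice in Hounsfield units (a real slice would be loaded
# from the NIfTI/DICOM files described above).
ct_slice_hu = np.random.randint(-1000, 1000, (512, 512)).astype(np.float32)

def apply_window(hu_slice, level, width):
    """Map a CT slice from Hounsfield units to an 8-bit display range
    using the given window level (center) and window width."""
    lo, hi = level - width / 2.0, level + width / 2.0
    windowed = np.clip(hu_slice, lo, hi)
    return ((windowed - lo) / (hi - lo) * 255.0).astype(np.uint8)

# Windowing used before inversion (W.L = 80, W.D = 180, as in the text)
display = apply_window(ct_slice_hu, level=80, width=180)

# Negative transformation of the re-windowed slice
# (W.L = -50, W.D = 90 during inversion, as in the text)
inverted = 255 - apply_window(ct_slice_hu, level=-50, width=90)
```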

2.3. K-Fold Cross-Validation

It is important to separate training, validation, and test samples when evaluating a deep learning model so that an unbiased evaluation may be made over the entire dataset. K-fold cross-validation was utilized in this research to evaluate the DL models across the entire dataset. In this work, we used k = 5 to obtain 5-fold cross-validation, which helps appraise how accurately the models perform [26,27].
A custom-built MATLAB script was used to distribute the images and ground truth mask of different subjects among the five different folds. In each fold, 80% of the data were used as a training set and 20% of the data were used for the test set, while 20% of the training set was used for validation.
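The folds were created per subject with a custom MATLAB script; a comparable subject-wise split can be sketched in Python with scikit-learn’s GroupKFold (the variable names and the random patient grouping below are illustrative stand-ins, not the actual dataset indices):

```python
import numpy as np
from sklearn.model_selection import GroupKFold, train_test_split

slice_ids = np.arange(2814)                        # one entry per CT slice
subject_of_slice = np.random.randint(0, 82, 2814)  # stand-in patient IDs

gkf = GroupKFold(n_splits=5)
for fold, (train_idx, test_idx) in enumerate(
        gkf.split(slice_ids, groups=subject_of_slice)):
    # 20% of the training slices are held out for validation, as in the text
    train_idx, val_idx = train_test_split(
        train_idx, test_size=0.2, random_state=fold)
    print(f"fold {fold}: {len(train_idx)} train, "
          f"{len(val_idx)} val, {len(test_idx)} test")
```

Grouping by subject ensures that slices from the same patient never appear in both the training and test sets of a fold.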

2.4. Data Augmentation

The numbers of images with and without hemorrhage are not equal in the dataset, and the number of slices per case is also imbalanced. We therefore augmented the dataset to make the classes equally representative in the training set. It should be noted that the validation and test sets were not augmented. The validation set was employed to reduce over-fitting [6,28,29]. In this study, two specific geometric transformations were applied for the augmentation process: rotation with multiple angles and translation. The rotation operation can be denoted by an affine matrix:
$$A_R = \begin{bmatrix} \cos\alpha & -\sin\alpha & 0 \\ \sin\alpha & \cos\alpha & 0 \\ 0 & 0 & 1 \end{bmatrix}$$
Here the values of angle (α) were defined by the following:
α = {15°, 30°, 60°, 90°, 180°, 270°}
The translation operation can be denoted by the following matrix:
$$A_T = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ T_x & T_y & 1 \end{bmatrix}$$
Here the values of $T_x$ and $T_y$ were defined by the following:
$$(T_x, T_y) = \{(0.1, 0.1),\ (0.1, -0.1),\ (-0.1, 0.1),\ (-0.1, -0.1)\}$$
In our training dataset, there were 9720 images generated by performing data augmentation in each training fold. Figure 4 shows samples of augmented images for different values of rotation and translation.
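A minimal sketch of these two augmentations, assuming SciPy and applying the same transform to a (stand-in) image and its mask — nearest-neighbor interpolation is used for the mask so it stays binary:

```python
import numpy as np
from scipy import ndimage

image = np.random.rand(512, 512)         # stand-in CT slice
mask = (image > 0.99).astype(np.uint8)   # stand-in binary ICH mask

def augment(img, msk, angle_deg, tx_frac, ty_frac):
    """Rotate by angle_deg and translate by a fraction of the image size,
    applying identical transforms to the image and its mask."""
    h, w = img.shape
    shift = (ty_frac * h, tx_frac * w)
    img_aug = ndimage.shift(
        ndimage.rotate(img, angle_deg, reshape=False, order=1), shift, order=1)
    msk_aug = ndimage.shift(
        ndimage.rotate(msk, angle_deg, reshape=False, order=0), shift, order=0)
    return img_aug, msk_aug

# The angle and translation sets used in this study
angles = [15, 30, 60, 90, 180, 270]
translations = [(0.1, 0.1), (0.1, -0.1), (-0.1, 0.1), (-0.1, -0.1)]
augmented = [augment(image, mask, a, 0, 0) for a in angles] + \
            [augment(image, mask, 0, tx, ty) for tx, ty in translations]
```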

2.5. Different Image Segmentation Models

Several segmentation models with U-Net, U-Net++, and FPN architectures were used in this research. The different architectures are illustrated in Figure 5 and briefly introduced below:
  • U-Net: In deep learning, the U-Net architecture is commonly used for biomedical image segmentation [30,31,32,33]. This architecture resembles the geometry of the letter “U” in its structure. Its two primary elements are the encoder (contracting path) and the decoder (expansion path), which follow symmetric paths. Normalization and activation functions are applied after every convolutional operation. Upsampling commences with the transposed convolutions of the decoder, and this mechanism is in charge of producing masks.
  • U-Net++: U-Net++, an updated architecture of U-Net, is frequently employed for more precise image segmentation. U-Net++ is composed of U-Nets of varying depths whose decoders are densely linked at the same resolution via newly designed skip pathways. U-Net++ offers two significant enhancements over U-Net:
    • Redesigned skip connections, which reduce the semantic gap for ease of optimization.
    • A new type of skip connection called the Dense Skip Connection [34].
  • FPN (Feature Pyramid Network): As with the previously discussed architectures, this one consists of an encoder-decoder path built from convolutional operations. However, FPN [35] extracts a feature map from the decoder at each level, and these extractions contain convolutional parameters as well. The first extracted feature map has the lowest resolution, and the resolution progressively increases as further maps are generated, so the stack of feature maps resembles a pyramid.

Pre-Trained Backbone

Transfer learning is a popular method for applying the learned parameters of a model trained on a large dataset to a moderate or custom dataset. Residual Networks (ResNet), Inception networks, and DenseNet are prominent classification architectures for medical image classification and were all trained on the same large benchmark ImageNet dataset [36]. In our investigation, we evaluated the three segmentation architectures with eleven combinations of architecture and pre-trained encoder: a vanilla U-Net; U-Net with DenseNet121, DenseNet161, DenseNet201, ResNet18, ResNet152, InceptionV4, and InceptionResNetV2 pre-trained encoders; U-Net++ with DenseNet201 and ResNet18 pre-trained encoders; and FPN with a DenseNet201 pre-trained encoder.
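The paper does not name the implementation library, but the segmentation-models-pytorch package provides exactly these architecture/encoder combinations; a sketch of instantiating the best-performing one, under that assumption:

```python
import segmentation_models_pytorch as smp

# U-Net with a DenseNet201 encoder pre-trained on ImageNet
# (the best-performing combination reported in this study)
model = smp.Unet(
    encoder_name="densenet201",
    encoder_weights="imagenet",
    in_channels=3,   # assumption: grayscale slices replicated to 3 channels
    classes=1,       # single-channel binary hemorrhage mask
)

# The other investigated variants follow the same pattern, e.g.:
# smp.UnetPlusPlus(encoder_name="resnet18", encoder_weights="imagenet", classes=1)
# smp.FPN(encoder_name="densenet201", encoder_weights="imagenet", classes=1)
```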

2.6. Loss Function

The loss function is an essential factor for adapting the model to the collected data [37]. Forward propagation refers to the process of applying a dataset to a model during training. At the end of each epoch, the model produces a mask that is compared to the ground truth mask; the error between the predicted mask and the ground truth mask is calculated using the loss function and optimized using a technique called backpropagation. Backpropagation is thus responsible for reducing the dissimilarity between the mask generated by forward propagation and the desired output. Accordingly, the model’s weights and biases are adjusted to generate new output that resembles the ground truth more closely.
The concept of the loss function can be explained from an artificial neural network. If $W$ is the weight vector for the inputs $x_1, x_2, \ldots, x_L$ and $\theta$ represents the bias, the output can be expressed as:
$$Y_L = f(W^T x + \theta)$$
Now, if $\hat{Y}_L$ represents the predicted output of the activation function for $x$ and $Y_L$ denotes the targeted output, we can define the loss function as:
$$g(W^T, \theta) = \frac{1}{m} \sum_{i=1}^{m} L(\hat{Y}_L, Y_L)$$
where $g$ (the average loss) is minimized during training.
Several types of loss functions are used depending on the investigation, such as cross-entropy loss [38], binary cross-entropy, dice loss, etc. In our study, binary cross-entropy (BCE) and DICE loss were used. The concept of BCE comes from classification, where the loss function is calculated at the pixel level [39,40]. The BCE loss can be expressed as:
$$L(\mathrm{BCE}) = -\frac{1}{m} \sum_{i=1}^{m} \left[ Y_i \log(\hat{Y}_i) + (1 - Y_i) \log(1 - \hat{Y}_i) \right]$$
where $Y_i$ denotes the $i$th pixel of the ground truth mask and $\hat{Y}_i$ denotes the $i$th pixel of the predicted mask. The term $m$ denotes the total number of pixels in the image. The DICE loss, on the other hand, is evaluated for segmentation and can be expressed through a similarity index between the predicted and ground truth masks. In the expression below, 1 is added to both the denominator and the numerator to prevent the loss function from producing a zero value [40]. The dice loss $L(\mathrm{DICE})$ can be expressed as:
$$L(\mathrm{DICE}) = 1 - \frac{2 \sum_{i=1}^{m} (Y_i \cdot \hat{Y}_i) + 1}{\sum_{i=1}^{m} Y_i + \sum_{i=1}^{m} \hat{Y}_i + 1}$$
Both loss functions were studied at the outset; however, because the BCE loss performed better than the DICE loss in the initial inquiry, the detailed analysis was conducted with BCE loss.
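A short PyTorch sketch of these two losses, matching the expressions above (the +1 smoothing appears in the dice term); the tensors here are random stand-ins:

```python
import torch
import torch.nn.functional as F

def dice_loss(pred_probs, target, smooth=1.0):
    """Dice loss with +1 added to numerator and denominator,
    as in the L(DICE) expression above."""
    intersection = (pred_probs * target).sum()
    return 1 - (2 * intersection + smooth) / (
        pred_probs.sum() + target.sum() + smooth)

def bce_loss(pred_probs, target):
    """Pixel-wise binary cross-entropy, as in the L(BCE) expression above."""
    return F.binary_cross_entropy(pred_probs, target)

# Stand-in predicted probabilities (after sigmoid) and ground truth mask
pred = torch.sigmoid(torch.randn(1, 1, 256, 256))
gt = (torch.rand(1, 1, 256, 256) > 0.95).float()
print(bce_loss(pred, gt).item(), dice_loss(pred, gt).item())
```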

2.7. Evaluation Metrics

In this study, several parameters, such as the Dice similarity coefficient (DSC) (i.e., the F1 score), accuracy, and intersection over union (IoU), were evaluated to determine how well the models can predict. The accuracy, IoU, and DSC are defined as follows:
$$\mathrm{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN}$$
$$\mathrm{Intersection\ over\ Union\ (IoU)} = \frac{TP}{TP + FP + FN}$$
$$\mathrm{Dice\ Similarity\ Coefficient\ (DSC)} = \frac{2TP}{2TP + FP + FN}$$
where TP, TN, FP, and FN represent true positives, true negatives, false positives, and false negatives, respectively.
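These metrics can be computed directly from binary masks; a small NumPy sketch using the TP/TN/FP/FN definitions above (the mask arrays are random stand-ins):

```python
import numpy as np

def segmentation_metrics(pred, gt):
    """Accuracy, IoU, and DSC from binary masks, per the formulas above."""
    pred, gt = pred.astype(bool), gt.astype(bool)
    tp = np.sum(pred & gt)
    tn = np.sum(~pred & ~gt)
    fp = np.sum(pred & ~gt)
    fn = np.sum(~pred & gt)
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    iou = tp / (tp + fp + fn)
    dsc = 2 * tp / (2 * tp + fp + fn)
    return accuracy, iou, dsc

pred = np.random.rand(512, 512) > 0.99   # stand-in predicted mask
gt = np.random.rand(512, 512) > 0.99     # stand-in ground truth mask
print(segmentation_metrics(pred, gt))
```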

2.8. Volumetric Representation of ICH

For comparison, the slices of the ground truth mask for an individual head CT scan were reconstructed three-dimensionally, and the predicted and ground truth masks were compared in this volumetric representation. Three major techniques are employed to construct and display 3D models from 2D series of images: i. volume rendering (VR), ii. multi-planar rendering (MPR), and iii. surface rendering (SR) [41].
Initial experiments showed that volume rendering gave the best visual output; therefore, volume rendering outputs are reported in the rest of the article. The studies relied heavily on ImageJ, a powerful open-source medical image-processing software [42]. Showing the predicted mask from the segmentation model as a 3D model allows the computation of the hemorrhage volume from a volumetric perspective and will help clinicians understand the 3D morphology of the bleed inside the brain. Figure 6 demonstrates the volume rendering technique from CT images of the head before and after skull-stripping.
The predicted and ground truth masks were each overlaid with the corresponding images. Composite-1 was created by merging channel 1 and channel 2 (highlighted in red), while composite-2 was created by merging channel 1 and channel 3 (highlighted in red), as shown in Figure 7. The overlaid images then formed a stack from which the 3D models were constructed.

2.9. Measurement of the Volume of Hemorrhage

This study proposes the measurement of the bleed detected in CT images, comparing the volume obtained from the predicted mask to that from the ground truth mask. The dataset contains binary masks indicating intracranial hemorrhage, and the hemorrhage part of each mask can be considered the region of interest (ROI). Each mask has a specific ROI, which can be combined across slices to calculate the volume of hemorrhage, and the total ROI area can be compared between the ground truth and predicted masks. Let the area contribution of pixel $(x_i, y_i)$ within the ROI of a single ground truth mask be $\Delta(x_i, y_i)$ and that of the predicted mask be $\Delta'(x_i, y_i)$; the ROI areas are then:
$$A(x, y) = \sum_{i=1}^{n} \Delta(x_i, y_i)$$
$$A'(x, y) = \sum_{i=1}^{n} \Delta'(x_i, y_i)$$
Finally, the volume was calculated for both areas by multiplying them by the CT slice thickness (5 mm [22] for this study). If the slice thickness is denoted as $\mu$, the volumes can be expressed as follows:
$$V(x, y) = \mu \cdot A(x, y) = \mu \sum_{i=1}^{n} \Delta(x_i, y_i)$$
$$V'(x, y) = \mu \cdot A'(x, y) = \mu \sum_{i=1}^{n} \Delta'(x_i, y_i)$$
The series of masks for a particular patient, in which the individual ROIs were marked and computed, was created using the wand (tracing) tool in the ImageJ software [42] (https://imagej.nih.gov/ij/docs/guide/146-19.html (accessed on 21 August 2022)). It traces the sharp edge of the mask and measures the mean, maximum, and area of the ROI. The physical dimensions of our images were kept in ‘mm × mm’. The pixel size can be computed from the field of view (FOV) as:
$$\mathrm{Pixel\ Size} = \frac{\mathrm{FOV\ (mm)}}{\mathrm{Matrix\ size}}$$
In this study, the FOV was 256 mm × 256 mm, and the matrix size (resolution) was 512 × 512. Thus, the pixel size was 0.5 mm, which lies within the range mentioned in [43]. Table 1 presents the prerequisite parameters for measuring intracranial hematoma.
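Putting these pieces together, a small NumPy sketch of the volume computation from a stack of binary masks (the mask stack below is a hypothetical stand-in, not patient data):

```python
import numpy as np

# Stand-in stack of binary masks for one patient: (slices, height, width)
mask_stack = np.zeros((30, 512, 512), dtype=np.uint8)
mask_stack[10:15, 200:260, 220:300] = 1      # hypothetical hemorrhage ROI

pixel_size_mm = 256 / 512     # FOV / matrix size = 0.5 mm (Table 1)
slice_thickness_mm = 5.0      # µ in the equations above

# Per-slice ROI area in mm^2, then total volume in mm^3
area_per_slice_mm2 = mask_stack.sum(axis=(1, 2)) * pixel_size_mm ** 2
volume_mm3 = area_per_slice_mm2.sum() * slice_thickness_mm
print(f"hematoma volume: {volume_mm3:.1f} mm^3")
```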

2.10. Experimental Setup

2.10.1. ICH Segmentation Model

In our investigation, the training, validation, and testing of the models were conducted in Python using the PyTorch deep learning library (version 2.0). The deep learning models were trained on a Tesla T4 GPU, available in Google Colab. In our experiments, we used Adam as the optimizer with a learning rate of 0.0001 and a batch size of 8, as shown in Table 2.
We trained the models for 20 epochs with an early stopping criterion (training stops when there is no improvement in validation loss for 5 consecutive epochs).
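A minimal, self-contained sketch of this training configuration (the stand-in model, data, and loss below are illustrative; only the hyperparameters come from Table 2):

```python
import torch
from torch import nn

model = nn.Conv2d(1, 1, 3, padding=1)          # stand-in for the U-Net
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)  # Adam, lr 0.0001
loss_fn = nn.BCEWithLogitsLoss()
x = torch.randn(8, 1, 64, 64)                  # one stand-in batch (size 8)
y = (torch.rand(8, 1, 64, 64) > 0.95).float()

best_val, patience, bad_epochs = float("inf"), 5, 0
for epoch in range(20):                        # at most 20 epochs
    optimizer.zero_grad()
    loss_fn(model(x), y).backward()
    optimizer.step()
    with torch.no_grad():
        val_loss = loss_fn(model(x), y).item() # stand-in validation pass
    if val_loss < best_val:
        best_val, bad_epochs = val_loss, 0
    else:
        bad_epochs += 1
        if bad_epochs >= patience:             # early stopping after 5 epochs
            break
```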

2.10.2. 3D Construction of Brain and ICH

Many assistive tools can generate 3D images from a 2D series of images. The ImageJ software contains several plugins that can pre-process, generate, and quantify 3D models from 2D image series. Table 3 summarizes the experimental setup for the 3D construction of ICH using ImageJ; these settings were chosen to visualize a proper 3D model. Trilinear interpolation was used in this study, a technique applied in several works in the literature [44,45,46] for constructing 3D models from CT scans. The sampling value was set to 1.0 to obtain a faster simulation. The investigations showed optimal 3D visualization with the parameters listed in Table 3.

3. Results and Discussion

In this study, the segmentation of ICH from the 2D CT slices was carried out using three different segmentation architectures (U-Net, U-Net++, and FPN) and eleven different segmentation models with different variants of ResNet, DenseNet, and Inception encoders. These models were evaluated by DSC score, IoU, accuracy, and loss coefficient. In the following sub-section, the results of the different models are reported, followed by sub-sections introducing the volumetric representation of ICH and, finally, the volume measurement of intracranial hematoma.

3.1. Intracranial Hemorrhage Segmentation

The detection procedure was accomplished by segmenting the mask for ICH. This approach was employed to detect the hemorrhage portion of the images using 2D segmentation pipelines for the several pre-trained models mentioned earlier. Table 4 presents the performance metrics (loss, accuracy, DSC, and IoU) of the different segmentation models. The highest DSC score, 85.76%, was achieved by the Densenet201_U-Net model over 5-fold cross-validation. Its IoU of 84.3% was also the highest among the models investigated in this study, and its DSC was 1.407% higher than that of the second-best model, Inceptionv4_U-Net. This suggests the superiority of the Densenet201_U-Net model in hemorrhage segmentation.
Since multiple pre-trained models were employed to investigate segmentation performance, a visual comparison among the masks predicted by each model was used as a qualitative evaluation. Figure 8 shows the original CT images along with the ground truth masks and the hemorrhages predicted by three variants of the U-Net architecture. The predicted region is highlighted in solid red on the images.

3.2. Reconstruction of 3D Models of ICH

The results of 2D segmentation generated a stack of masks from each fold of the test set, which were utilized for visualization as a volumetric model. The generated 3D models followed the same projection (Figure 9), where the xy, yz, and zx planes represent the axial, sagittal, and coronal planes, respectively, for the full brain CT images.
The entire structure of the ICH for a single patient was taken from the test fold. The predicted masks kept exactly the same slice order as the corresponding ground truth masks and CT images. Thus, it was possible to build an entire 3D model of intracranial hemorrhage for a single patient. Figure 9 presents a volumetric view of ICH for patient no. 51.
Our investigation also examined a full brain model after merging the CT scans with their corresponding masks. This was accomplished by overlaying the images with the different corresponding masks (ground truth and predicted) and rendering the result with the Volume Viewer plugin of ImageJ. For constructing the brain models, the same parameters were applied as for the ICH modeling. Figure 10 shows four different models produced with the Volume Viewer of ImageJ, where the ground truth is shown at the top left corner of the figure. A visual comparison is made with the other three best-performing models.

3.3. Volume Measurement of Intracranial Hematoma

This investigation measured the volume of ICH from two different masks: the ground truth mask and the mask predicted by the segmentation module. In this experiment, only the best-performing model, Densenet201_U-Net, was considered for the volume measurement of the intracranial hematoma. Before measuring the volume, the dimensions, scales, and bit type were adjusted for all binary masks.
The volume measurement of intracranial hematoma is a sensitive task, as it contains vital information about the bleed within the skull. Table 5 presents a statistical analysis of the ground truth and predicted masks for the volume measurement of the bleed in the skull for a single CT scan of an example patient (#49) from the dataset. The average area of the hematoma was found to be 190.105 mm² in the ground truth and 228.9 mm² in the predicted mask. The volume, median of the ROI, and standard deviation were also calculated for both masks. As Table 5 shows, an error of 20.04% was found in the predicted masks of the Densenet201_U-Net model.
The performance of our proposed investigation was compared with the existing literature. For instance, Wang et al. [14] used 254 training samples of CT images, collected from the Radiological Society of North America (RSNA), and obtained a 67% DSC score. Vamsi et al. [13] included 578 brain CT images, 463 of which were stroke images, and obtained a DSC score of 72.92%. Another comparative study performed by Li et al. [2] achieved a DSC score and IoU of 80.03% and 69.19%, respectively, using a U-Net-based deep learning network for the automatic detection and segmentation of ICH from CT images. Gautam et al. [15] reported 82% average DSC on 20 brain CT volumes using White Matter Fuzzy c-Means (WMFCM) clustering and wavelet-based thresholding. Another study was conducted by Bhadauria et al. [47] using fuzzy c-means clustering and a region-based contour method to segment ICH; although their approach did not use deep learning, they achieved an average DSC score of 87.4%. In this study, our proposed model achieved a DSC score of 85.757% along with an IoU of 84.3%, outperforming the previous literature [14,13,2,15] on the basis of DSC score by 18.757%, 12.837%, 5.727%, and 3.757%, respectively. Table 6 summarizes the comparative performance of the proposed method against recent works in the literature.

3.4. Limitations of the Proposed Model and Future Works

We discovered a couple of limitations during our investigation. First, although our proposed DenseNet201_U-Net demonstrated superior performance over the other investigated networks reported in the literature, it was computationally intensive and had a lengthy runtime; training often took several hours to complete. Additionally, the model and its intermediate variables require a large amount of memory to store and process.
In addition, the study’s 3D construction method involved additional software and manual work, which made it time-consuming.
Future research will explore improvements and variations to address these limitations and further enhance the performance of these state-of-the-art segmentation models. The first priority will be to reduce the computation complexity for dense models. As part of our study, we are also planning to analyze a large dataset and propose a new algorithm for classifying the five subtypes of intracranial hemorrhage. Furthermore, we will apply a new approach to 3D modeling that uses predicted masks for automatic construction. Finally, we plan to develop a cloud-based application to segment hemorrhages from CT slices in real time.

4. Conclusions

In summary, deep learning algorithms face challenges when segmenting intracranial hemorrhages in CT scans because of low resolution and high variability in stroke location, contrast, and shape. Through our investigations into deep learning algorithms for the segmentation of intracranial hemorrhages, we have gained crucial insight. An algorithm was designed to predict hemorrhage from an image and then generate a binary mask that was compared with the actual mask. Our training, validation, and testing datasets were created using three pre-processing techniques (windowing, skull-stripping, and inversion). The study compared 11 state-of-the-art segmentation models based on U-Net, U-Net++, and FPN architectures in terms of DSC and IoU scores. Our proposed model, DenseNet201_U-Net, achieved a DSC of 85.757% and an IoU of 84.3%. Using the best-performing model, we can produce accurate ICH and brain 3D models from CT imaging that enable the visualization of bleeding inside the brain. Additionally, intracranial hematoma volumes can be measured from CT images to calculate bleed volumes, and this information can be used in longitudinal studies to track a patient’s condition over time. The results of this study demonstrate the capability of deep learning algorithms to accurately segment intracranial hemorrhages, providing clinicians with valuable tools for quantifying and visualizing hemorrhage volumes in CT images.

Author Contributions

Conceptualization, M.M.K., M.E.H.C. and A.S.M.S.A.; data curation, M.M.K., M.S.A.H., A.M., K.K.P., M.N. and A.K.; formal analysis, M.M.K., A.M., M.N., K.K.P. and M.S.A.H.; funding acquisition M.E.H.C.; investigation, M.M.K., A.M., K.K.P., M.S.A.H. and A.K.; methodology, M.E.H.C., M.M.K., A.S.M.S.A., A.A., K.K.P. and M.M.; software, A.K., A.M., M.N. and M.M.; project administration, A.S.M.S.A. and M.E.H.C.; resources, M.E.H.C. and K.K.P.; supervision, A.S.M.S.A., M.E.H.C., A.A. and M.M.; validation A.S.M.S.A., A.M., A.A., M.M. and M.E.H.C. All authors equally contributed to the original paper drafting and reviewing and editing of the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Qatar University student grant QUST-1-CENG-2023-796 and was also supported via funding from Prince Sattam Bin Abdulaziz University, project number PSAU/2023/R/1444. The statements made herein are solely the responsibility of the authors. Part of the open access publication cost was covered by the Qatar National Library.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The dataset used in this study can be made available upon a reasonable request to the corresponding authors.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Pak, M.; Kim, S. A review of deep learning in image recognition. In Proceedings of the 2017 IEEE 4th International Conference on Computer Applications and Information Processing Technology (CAIPT), Kuta Bali, Indonesia, 8–10 August 2017; pp. 1–3.
  2. Li, L.; Wei, M.; Liu, B.; Atchaneeyasakul, K.; Zhou, F.; Pan, Z.; Kumar, S.A.; Zhang, J.Y.; Pu, Y.; Liebeskind, D.S. Deep learning for hemorrhagic lesion detection and segmentation on brain CT images. IEEE J. Biomed. Health Inform. 2020, 25, 1646–1659.
  3. Tahir, A.M.; Qiblawey, Y.; Khandakar, A.; Rahman, T.; Khurshid, U.; Musharavati, F.; Islam, M.; Kiranyaz, S.; Al-Maadeed, S.; Chowdhury, M.E. Deep learning for reliable classification of COVID-19, MERS, and SARS from chest X-ray images. Cogn. Comput. 2022, 14, 1752–1772.
  4. Degerli, A.; Kiranyaz, S.; Chowdhury, M.E.; Gabbouj, M. OSegNet: Operational segmentation network for COVID-19 detection using chest X-ray images. In Proceedings of the 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 16–19 October 2022; pp. 2306–2310.
  5. Gul, S.; Khan, M.S.; Bibi, A.; Khandakar, A.; Ayari, M.A.; Chowdhury, M.E. Deep learning techniques for liver and liver tumor segmentation: A review. Comput. Biol. Med. 2022, 147, 105620.
  6. Qiblawey, Y.; Tahir, A.; Chowdhury, M.E.; Khandakar, A.; Kiranyaz, S.; Rahman, T.; Ibtehaz, N.; Mahmud, S.; Maadeed, S.A.; Musharavati, F. Detection and severity classification of COVID-19 in CT images using deep learning. Diagnostics 2021, 11, 893.
  7. Parizel, P.; Makkat, S.; Van Miert, E.; Van Goethem, J.; Van den Hauwe, L.; De Schepper, A. Intracranial hemorrhage: Principles of CT and MRI interpretation. Eur. Radiol. 2001, 11, 1770–1783.
  8. Zhu, G.; Chen, H.; Jiang, B.; Chen, F.; Xie, Y.; Wintermark, M. Application of deep learning to ischemic and hemorrhagic stroke computed tomography and magnetic resonance imaging. In Seminars in Ultrasound, CT and MRI; Elsevier: Amsterdam, The Netherlands, 2022; Volume 43, pp. 147–152.
  9. Qiu, Y.; Chang, C.S.; Yan, J.L.; Ko, L.; Chang, T.S. Semantic segmentation of intracranial hemorrhages in head CT scans. In Proceedings of the 2019 IEEE 10th International Conference on Software Engineering and Service Science (ICSESS), Beijing, China, 18–20 October 2019; pp. 112–115.
  10. Goncharov, M.; Pisov, M.; Shevtsov, A.; Shirokikh, B.; Kurmukov, A.; Blokhin, I.; Chernina, V.; Solovev, A.; Gombolevskiy, V.; Morozov, S. CT-based COVID-19 triage: Deep multitask learning improves joint identification and severity quantification. Med. Image Anal. 2021, 71, 102054.
  11. Kwon, D.; Ahn, J.; Kim, J.; Choi, I.; Jeong, S.; Lee, Y.-S.; Park, J.; Lee, M. Siamese U-Net with healthy template for accurate segmentation of intracranial hemorrhage. In Proceedings of the Medical Image Computing and Computer Assisted Intervention–MICCAI 2019: 22nd International Conference, Shenzhen, China, 13–17 October 2019; pp. 848–855.
  12. Anupama, C.; Sivaram, M.; Lydia, E.L.; Gupta, D.; Shankar, K. Synergic deep learning model–based automated detection and classification of brain intracranial hemorrhage images in wearable networks. Pers. Ubiquitous Comput. 2022, 26, 1–10.
  13. Vamsi, B.; Bhattacharyya, D.; Midhunchakkravarthy, D.; Kim, J.-Y. Early detection of hemorrhagic stroke using a lightweight deep learning neural network model. Trait. Du Signal 2021, 38, 1727–1736.
  14. Wang, J.L.; Farooq, H.; Zhuang, H.; Ibrahim, A.K. Segmentation of intracranial hemorrhage using semi-supervised multi-task attention-based U-Net. Appl. Sci. 2020, 10, 3297.
  15. Gautam, A.; Raman, B. Automatic segmentation of intracerebral hemorrhage from brain CT images. In Machine Intelligence and Signal Analysis; Springer: Berlin/Heidelberg, Germany, 2019; pp. 753–764.
  16. Balwant, M. A review on convolutional neural networks for brain tumor segmentation: Methods, datasets, libraries, and future directions. IRBM 2022, 43, 521–537.
  17. Rehman, M.U.; Cho, S.; Kim, J.; Chong, K.T. BrainSeg-Net: Brain tumor MR image segmentation via enhanced encoder–decoder network. Diagnostics 2021, 11, 169.
  18. Rehman, M.U.; Ryu, J.; Nizami, I.F.; Chong, K.T. RAAGR2-Net: A brain tumor segmentation network using parallel processing of multiple spatial frames. Comput. Biol. Med. 2023, 152, 106426.
  19. Wu, L.; Hu, S.; Liu, C. MR brain segmentation based on DE-ResUnet combining texture features and background knowledge. Biomed. Signal Process. Control 2022, 75, 103541.
  20. Zhao, L.; Asis-Cruz, J.; Feng, X.; Wu, Y.; Kapse, K.; Largent, A.; Quistorff, J.; Lopez, C.; Wu, D.; Qing, K. Automated 3D fetal brain segmentation using an optimized deep learning approach. Am. J. Neuroradiol. 2022, 43, 448–454.
  21. Salman Khan, M.; Ullah, A.; Khan, K.N.; Riaz, H.; Yousafzai, Y.M.; Rahman, T.; Chowdhury, M.E.; Abul Kashem, S.B. Deep learning assisted automated assessment of thalassaemia from haemoglobin electrophoresis images. Diagnostics 2022, 12, 2405.
  22. Hssayeni, M.; Croock, M.; Salman, A.; Al-khafaji, H.; Yahya, Z.; Ghoraani, B. Intracranial hemorrhage segmentation using a deep convolutional model. Data 2020, 5, 14.
  23. Murphy, A.; Baba, Y. Windowing (CT). Reference article, Radiopaedia.org. 2022. Available online: https://radiopaedia.org/articles/windowing-ct?lang=us (accessed on 17 May 2023).
  24. Davidson, L.E.; Kuk, J.L.; Church, T.S.; Ross, R.; Sakkas, G.K.; Karatzaferi, C.; Zintzaras, E.; Giannaki, C.D.; Liakopoulos, V.; Lavdas, E.; et al. Protocol for measurement of liver fat by computed tomography. J. Appl. Physiol. 2006, 100, 864–868.
  25. Najm, M.; Kuang, H.; Federico, A.; Jogiat, U.; Goyal, M.; Hill, M.D.; Demchuk, A.; Menon, B.K.; Qiu, W. Automated brain extraction from head CT and CTA images using convex optimization with shape propagation. Comput. Methods Programs Biomed. 2019, 176, 1–8.
  26. Srinivasan, K.; Cherukuri, A.; Vincent, P.R.; Garg, A.; Chen, B.-Y. An efficient implementation of artificial neural networks with K-fold cross-validation for process optimization. J. Internet Technol. 2019, 20, 1213–1225.
  27. Chowdhury, M.E.; Rahman, T.; Khandakar, A.; Mahmud, S. Classification of viral, bacterial, and COVID-19 pneumonia using deep learning framework from chest X-ray images. Comput. Sci. Inf. Technol. 2022, 12, 1–21.
  28. Khandakar, A.; Chowdhury, M.E.H.; Reaz, M.B.I.; Ali, S.H.M.; Kiranyaz, S.; Rahman, T.; Chowdhury, M.H.; Ayari, M.A.; Alfkey, R.; Bakar, A.A.A.; et al. A novel machine learning approach for severity classification of diabetic foot complications using thermogram images. Sensors 2022, 22, 4249.
  29. Rahman, T.; Khandakar, A.; Islam, K.R.; Soliman, M.M.; Islam, M.T.; Elsayed, A.; Qiblawey, Y.; Mahmud, S.; Rahman, A.; Musharavati, F. HipXNet: Deep learning approaches to detect aseptic loosening of hip implants using X-ray images. IEEE Access 2022, 10, 53359–53373.
  30. Weng, Y.; Zhou, T.; Li, Y.; Qiu, X. NAS-Unet: Neural architecture search for medical image segmentation. IEEE Access 2019, 7, 44247–44257.
  31. Tran, S.-T.; Cheng, C.-H.; Nguyen, T.-T.; Le, M.-H.; Liu, D.-G. TMD-Unet: Triple-Unet with multi-scale input features and dense skip connection for medical image segmentation. Healthcare 2021, 9, 54.
  32. Peeples, J.K.; Jameson, J.F.; Kotta, N.M.; Grasman, J.M.; Stoppel, W.L.; Zare, A. Jointly optimized spatial histogram UNET architecture (JOSHUA) for adipose tissue segmentation. BME Front. 2022, 2022, 9854084.
  33. Rahman, T.; Chowdhury, M.E.; Khandakar, A.; Mahbub, Z.B.; Hossain, M.S.A.; Alhatou, A.; Abdalla, E.; Muthiyal, S.; Islam, K.F.; Kashem, S.B.A. BIO-CXRNET: A robust multimodal stacking machine learning technique for mortality risk prediction of COVID-19 patients using chest X-ray images and clinical data. Neural Comput. Appl. 2023, 35, 17461–17483.
  34. Zhou, Z.; Siddiquee, M.M.R.; Tajbakhsh, N.; Liang, J. UNet++: Redesigning skip connections to exploit multiscale features in image segmentation. IEEE Trans. Med. Imaging 2019, 39, 1856–1867.
  35. Lin, T.-Y.; Dollár, P.; Girshick, R.; He, K.; Hariharan, B.; Belongie, S. Feature pyramid networks for object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 2117–2125.
  36. Deng, J.; Dong, W.; Socher, R.; Li, L.-J.; Li, K.; Li, F. ImageNet: A large-scale hierarchical image database. In Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA, 20–25 June 2009; pp. 248–255.
  37. Xu, C.; Lu, C.; Liang, X.; Gao, J.; Zheng, W.; Wang, T.; Yan, S. Multi-loss regularized deep neural network. IEEE Trans. Circuits Syst. Video Technol. 2015, 26, 2273–2283.
  38. Tahir, A.M.; Chowdhury, M.E.; Khandakar, A.; Rahman, T.; Qiblawey, Y.; Khurshid, U.; Kiranyaz, S.; Ibtehaz, N.; Rahman, M.S.; Al-Maadeed, S. COVID-19 infection localization and severity grading from chest X-ray images. Comput. Biol. Med. 2021, 139, 105002.
  39. Yi-de, M.; Qing, L.; Zhi-Bai, Q. Automated image segmentation using improved PCNN model based on cross-entropy. In Proceedings of the 2004 International Symposium on Intelligent Multimedia, Video and Speech Processing, Hong Kong, China, 20–22 October 2004; pp. 743–746.
  40. Jadon, S. A survey of loss functions for semantic segmentation. In Proceedings of the 2020 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), Vina del Mar, Chile, 27–29 October 2020; pp. 1–7.
  41. Kumar, T.; Vijai, A. 3D reconstruction of face from 2D CT scan images. Procedia Eng. 2012, 30, 970–977.
  42. Abràmoff, M.D.; Magalhães, P.J.; Ram, S.J. Image processing with ImageJ. Biophotonics Int. 2004, 11, 36–42.
  43. Bernstein, M.A.; Fain, S.B.; Riederer, S.J. Effect of windowing and zero-filled reconstruction of MRI data on spatial resolution and acquisition strategy. J. Magn. Reson. Imaging 2001, 14, 270–280.
  44. Shi, L.; Liu, B.; Yu, H.; Wei, C.; Wei, L.; Zeng, L.; Wang, G. Review of CT image reconstruction open source toolkits. J. X-ray Sci. Technol. 2020, 28, 619–639.
  45. Makela, T.; Clarysse, P.; Sipila, O.; Pauna, N.; Pham, Q.C.; Katila, T.; Magnin, I. A review of cardiac image registration methods. IEEE Trans. Med. Imaging 2002, 21, 1011–1021.
  46. Bockisch, A.; Beyer, T.; Antoch, G.; Freudenberg, L.S.; Kühl, H.; Debatin, J.F.; Müller, S.P. Positron emission tomography/computed tomography–imaging protocols, artifacts, and pitfalls. Mol. Imaging Biol. 2004, 6, 188–199.
  47. Bhadauria, H.S.; Dewal, M.L. Intracranial hemorrhage detection using spatial fuzzy c-mean and region-based active contour on brain CT imaging. Signal Image Video Process. 2014, 8, 357–364.
Figure 1. Schematic diagram of ICH detection using deep learning-driven segmentation.
Figure 2. Overview of the entire dataset. (a) Age-gender distribution of the entire dataset, (b) gender distribution of the entire dataset, (c) sub-categorical (ICH class) image distribution for the image dataset used in this research.
Figure 3. (a) Summary of data pre-processing, (b) CT image before and after the inversion method.
Figure 4. Visualization of image augmentation with different values of rotation and translation. (a) Rotation augmentations performed at 90°, 180°, and 270° counterclockwise. (b) Rotation augmentations performed at 15°, 30°, and 60° counterclockwise. (c) Translation.
Figure 5. Schematic diagram of the different segmentation model architectures. The yellow and blue blocks represent convolutional layers. The skip connections between the encoder and decoder are highlighted with solid and dotted arrows.
Figure 6. Volumetric rendering: (a) head slices including skull and brain and (b) after skull-stripping of the same patient’s CT images (the intracranial hemorrhage is highlighted in red).
Figure 7. Development of 3D brain models from two different composites.
Figure 8. Sample CT images from the test set overlaid with the ground truth ICH mask (second row) and the masks predicted with Densenet201_U-Net, Resnet18_U-Net, and Inceptionv4_U-Net (third to fifth rows), respectively.
Figure 9. Volumetric view of ICH from the ground truth mask and the masks from three different prediction models.
Figure 10. Volumetric view of the full brain model overlaid with CT scans, with ground truth masks and predicted masks from the three best-performing models.
Table 1. Prerequisite parameters for measuring intracranial hematoma.

Parameters | Value
Resolution | 512 × 512
Physical dimension (mm × mm) | 256 × 256
Method | Wand tracing
Slice thickness (mm) | 5
Table 2. Details of ICH segmentation model training parameters.

Parameters | Value
Batch size | 8
Learning rate | 0.0001
Optimizer function | Adam
Number of epochs | 20
Loss type | Dice loss
Number of folds | 5
Learning factor | 0.2
Early stopping epochs | 5
Table 3. Experimental setup for 3D construction of ICH using ImageJ software.

Parameters | Value
Feature | Volume Viewer 2.0
Method of interpolation | Trilinear
Z-aspect | 3
Sampling value | 1
Transfer function | Linear
Global alpha offset | 50% ± 3%
Object color (ICH) | Red
Ambient, diffuse, specular, and shine | 50% ± 5%
Table 4. Performance metrics comparing different state-of-the-art segmentation models for 2D segmentation.

Network | Loss | Acc. (%) | IoU (%) | DSC (%)
U-Net | 0.73385 | 99.9 | 79.87 | 81.275
DenseNet201_U-Net | 0.41253 | 99.91 | 84.3 | 85.76
DenseNet161_U-Net | 0.003 | 99.92 | 81.48 | 82.91
DenseNet121_U-Net | 0.4276 | 99.89 | 76.69 | 78.3
ResNet18_U-Net | 0.0027 | 99.93 | 81.04 | 82.1
ResNet152_U-Net | 0.0321 | 99.64 | 81.69 | 79.52
InceptionV4_U-Net | 0.0027 | 99.93 | 83.03 | 84.35
InceptionV2-ResNet_U-Net | 0.0029 | 99.93 | 78.99 | 80.37
DenseNet201_U-Net++ | 0.7036 | 99.77 | 68.01 | 70.14
ResNet18_U-Net++ | 0.614 | 99.88 | 79.9 | 81.47
DenseNet201_FPN | 0.8018 | 99.93 | 84.09 | 84.09
Table 5. Statistical analysis of hematoma contained in the ground truth mask and predicted mask.

Parameters | Ground Truth Mask (GT) | Predicted Mask (PM) | % of Error (|PM − GT|/GT × 100)
Total area (mm²) | 1901.05 | 2289 | 20.04%
Volume (mm³) | 9505.25 | 11,445 |
Median of ROI (mm²) | 161.9505 | 204.5 |
Standard deviation | 86.85599677 | 149.566373 |
Average area of hematoma (mm²) | 190.105 | 228.9 |
Table 6. Comparison of the proposed method with the existing literature.

Authors | Methodology and Approach | Metric (%)
Wang et al. [14] | Semi-supervised multi-task attention-based U-Net | DSC = 67
Vamsi et al. [13] | Lightweight deep learning-based neural network | DSC = 72.92
Li et al. [2] | U-Net-based deep learning for hemorrhage detection and segmentation | DSC = 80.03, IoU = 69.19
Gautam et al. [15] | Automatic segmentation using WMFCM clustering | Average DSC = 82
Bhadauria et al. [47] | ICH detection using fuzzy c-means and region-based contour method | Average DSC = 87.4
Proposed | Automatic ICH segmentation using DenseNet201_U-Net | DSC = 85.757, IoU = 84.3
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
