Fully Automatic Segmentation and Objective Assessment of ... - arXiv

3 downloads 0 Views 3MB Size Report
Xujiong Ye 5, Greg Slabaugh 6, Raad Mohiaddin 1,2, Tom Wong 1, Jennifer ...... Darrow, R., Slavin, G.S., Fung, M.M., Dando, J., Foley, L., Dumoulin, C.L., Reddy ...
Fully Automatic Segmentation and Objective Assessment of Atrial Scars for Longstanding Persistent Atrial Fibrillation Patients Using Late Gadolinium-Enhanced MRI Guang Yang 1,2†, Xiahai Zhuang 3†, Habib Khan 1, Shouvik Haldar 1, Eva Nyktari 1, Lei Li 4, Rick Wage 1, Xujiong Ye 5, Greg Slabaugh 6, Raad Mohiaddin 1,2, Tom Wong 1, Jennifer Keegan 1,2‡, David Firmin 1,2‡ 1

Cardiovascular Biomedical Research Unit, Royal Brompton Hospital, SW3 6NP, London, U.K.

2

National Heart and Lung Institute, Imperial College London, London, SW7 2AZ, U.K.

3

School of Data Science, Fudan University, Shanghai, 201203, China.

4

Dept. of Eng. Mechanics, School of NAOCE, Shanghai Jiao Tong University, Shanghai, 200240, China.

5

School of Computer Science, University of Lincoln, LN6 7TS, Lincoln, U.K.

6

Department of Computer Science, City University London, EC1V 0HB, London, U.K.

Running Head: Fully Automatic Scar Segmentation and Assessment for Longstanding AF Patients (78 Characters) Main Body Word Count: 5000 Abstract Word Count: 207 Figures: 8 Tables: 2



Corresponding authors: Guang Yang and Xiahai Zhuang

Emails: [email protected] and [email protected]

Joint senior authors

Phone: (0044) 020 7352 8121; Fax: (0044) 020 7351 8699 Address: Cardiovascular Research Centre, Royal Brompton Hospital, SW3 6NP, London, U.K.

1

ABSTRACT Purpose: Atrial fibrillation (AF) is the most common cardiac arrhythmia and is correlated with increased morbidity and mortality. It is associated with atrial fibrosis, which may be assessed non-invasively using late gadolinium-enhanced (LGE) magnetic resonance imaging (MRI) where scar tissue is visualised as a region of signal enhancement. In this study, we proposed a novel fully automatic pipeline to achieve an accurate and objective atrial scarring segmentation and assessment of LGE MRI scans for the AF patients. Methods: Our fully automatic pipeline uniquely combined: (1) a multi-atlas based whole heart segmentation (MA-WHS) to determine the cardiac anatomy from an MRI Roadmap acquisition which is then mapped to LGE MRI, and (2) a super-pixel and supervised learning based approach to delineate the distribution and extent of atrial scarring in LGE MRI. Results: Both our MA-WHS and atrial scarring segmentation showed accurate delineations of cardiac anatomy (mean Dice = 89%) and atrial scarring (mean Dice =79%) respectively compared to the established ground truth from manual segmentation. Compared with previously studied methods with manual interventions, our innovative pipeline demonstrated comparable results, but was computed fully automatically. Conclusion: The proposed segmentation methods allow LGE MRI to be used as an objective assessment tool for localisation, visualisation and quantification of atrial scarring. KEYWORDS: Late Gadolinium-Enhanced MRI; Cardiovascular Magnetic Resonance Imaging; Medical Image Segmentation; Whole Heart Segmentation; Atrial Fibrillation. ABBREVIATIONS: AF: Atrial Fibrillation; LA: Left Atrium; PV: Pulmonary Veins; SVM: Support Vector Machine; CV: Cross-Validation; ROI: Region of Interest.

2

I.

INTRODUCTION

A. Background Atrial fibrillation (AF) is the most common arrhythmia of clinical significance. It occurs when chaotic and disorganised electrical activity develops in the atria, causing muscle cells to contract irregularly and rapidly. It is associated with structural remodelling, including fibrotic changes in the left atrium [1] and can cause increased morbidity, especially stroke and heart failure. It also results in poor mental health, dementia, and increased mortality [2–4]. The electrical impulses that trigger AF frequently originate in the pulmonary veins (PV). Radio frequency ablation treatment aims to eliminate AF by electrically isolating the PV. However, the success rate for a single catheter ablation procedure is just 30-50% at 5 years follow-up [5, 6] and multiple ablations are frequently required. The current clinical gold standard for assessment of atrial scarring is electro-anatomical mapping (EAM), performed during an electrophysiological (EP) study [7]. However, this is an invasive technique which uses ionising radiation the accuracy is suboptimal, with reported errors of up to 10 mm in the localisation of scar tissue [8, 9]. Late gadolinium enhancement (LGE) magnetic resonance imaging (MRI) is an established non-invasive technique for detecting myocardial scar tissue [10]. With this technique, healthy and scar tissues are differentiated by their altered wash-in and wash-out contrast agent kinetics, which result in scar tissue being seen as a region of enhanced or high signal intensity while healthy tissue is ‘nulled’. While 2D breath-hold LGE MRI is well-established for ventricular imaging, there is a growing interest in imaging the thinner walled atria for identification of native and ablation scarring in AF patients [11–14]. This requires higher spatial resolution and contiguous coverage and data are best acquired as a 3D volume during free-breathing with diaphragmatic respiratory-gating. Atrial 3D LGE imaging has been used to: (1) assess patient suitability for AF ablation by identifying potential non-responders [12, 15–20], and (2) define the most appropriate ablation approach [16, 17, 21]. In addition, visualisation and quantification of native and post-ablation atrial scarring derived from LGE MRI has been used to guide initial and follow-up ablation procedures [17, 18, 22–25]. Histopathological studies in pigs have validated LGE MRI for the characterisation of AF ablation-induced wall injury [26]. 3

Visualisation and quantification of atrial scarring requires objective, robust and accurate segmentation of the enhanced scar regions from the LGE MRI images. Essentially, there are two segmentations required: one showing the cardiac anatomy (geometry), particularly the LA wall and PV, the other delineating the enhanced scar regions. The former segmentation is required to rule out confounding enhanced tissues from other parts of the heart, e.g., the mitral valve and aorta, or the enhancement from non-heart structures while the latter is a prerequisite for visualisation and quantification. Segmentation of the atrial scarring from LGE MRI images is a very challenging problem. Firstly, the LA wall is very thin and scarring is hard to distinguish even by experienced expert cardiologists specialised in cardiac MRI. Secondly, residual respiratory motion, heart rate variability, low signal-to-noise ratio (SNR), and contrast agent wash-out during the long acquisition (current scanning time 10mins) frequently result in image quality being poor. Moreover, artifactual enhanced signal from surrounding tissues may result in a large number of false positives.

B. Related Work Oakes et al. [12] quantified the enhanced atrial scarring by analysing the intensity histogram of the manually segmented LA wall while Perry et al. [27] applied k-means clustering. A grand challenge for evaluation and benchmarking of various atrial scarring segmentation methods showed promising results [28]. Despite interest in these developed segmentation techniques, most of them have relied on manual segmentation of the LA wall and PV. This has several drawbacks: (1) it is a time-consuming task; (2) there are intra- and inter-observer variations; and (3) it is less reproducible for a multi-centre and multi-scanner study. Moreover, a number of studies have assumed a fixed thickness of the LA wall although there is no evidence that this is the case. Depending on the actual wall thickness, subsequent reorientation and interpolation of the MR images results in varying partial volume effects, which affect the apparent thickness of the LA wall. Inaccurate manual segmentation of the LA wall and PV can further complicate the delineation of the atrial scarring and its quantification can be error-prone. This could be one of the major reasons that there are currently on-going concerns regarding the correlation between atrial scarring identified by LGE MRI (enhanced regions) and ‘gold standard’ EAM (low voltage regions) [29]. 4

The LA and PV would ideally be segmented from the cardiac and respiratory-gated LGE MRI dataset. However, this is difficult as normal tissue is ‘nulled’ and only scar tissue is seen with high signal. Other options are to segment them from a separately acquired breath-hold magnetic resonance angiogram (MRA) study [19, 30, 31] or from a respiratory and cardiac and respiratory gated 3D Roadmap acquisition, i.e., using a balanced steady state free precession (b-SSFP) sequence [32]. While MRA shows the LA and PV with high contrast, these acquisitions are generally un-gated and acquired in an inspiratory breath-hold. The anatomy extracted from MRA therefore can be highly deformed compared to that in the LGE MRI study. Although the 3D Roadmap acquisition takes longer to acquire, it is in the same respiratory phase as the LGE MRI and the extracted anatomy can be better matched. Cardiac anatomy has previously been defined by atlas based segmentation of MRA [31] and by using a statistical shape model [32] on 3D Roadmap data. Table 1 provides a summary of previously published methods on atrial scarring segmentation using LGE MRI.

C. Our Contributions In this paper, we present a novel fully automatic segmentation and objective assessment of atrial scarring for longstanding persistent AF patients scanned by LGE MRI. The LA chamber and PV are defined using a multi-atlas based whole heart segmentation (MA-WHS) method on Roadmap MRI images, which are acquired using a respiratory and cardiac gated 3D b-SSFP sequence. LA and PV geometry is resolved by mapping the segmented Roadmap anatomy to LGE MRI using the DICOM header data and is further refined by affine and nonrigid registration steps. The LGE MRI images are over-segmented by a novel Simple Linear Iterative Clustering (SLIC) based super-pixels method [33]. Then a support vector machines (SVM) based supervised classification is applied to segment the atrial scarring within the segmented LA and PV geometry. In this study, two validation steps have been performed—one for the LA chamber and PV segmentation and one for the atrial scarring segmentation—both against established ground truth from manual segmentations by experienced expert cardiologists specialised in cardiac MRI.

5

II.

MATERIALS AND METHODS

A. Data Acquisition Cardiac MR data were acquired on a Siemens Magnetom Avanto 1.5T scanner (Siemens Medical Systems, Erlangen, Germany). Transverse navigator-gated 3D LGE MRI [11, 12, 34] was performed using an inversion prepared segmented gradient echo sequence (TE/TR 2.2ms/5.2ms) 15 minutes after gadolinium (Gd) administration when a transient steady-state of Gd wash-in and wash-out of normal myocardium had been reached [35]. LGE MRI images were scanned with a field-of-view 380×380mm2 and reconstructed to 60–68 slices at 0.75×0.75×2mm3. Coronal navigator-gated 3D b-SSFP (TE/TR 1ms/2.3ms) Roadmap data were acquired with the following parameters: 80 slices at 1.6×1.6×3.2mm3, reconstructed to 160 slices at 0.8×0.8×1.6mm3, field-of-view 380×380mm2, acceleration factor of 2 using GRAPPA, partial Fourier 6/8, acquisition window 125ms positioned within the subject-specific rest period, single R-wave gating, chemical shift fat suppression, flip angle 70°. Off resonant blood from the lungs arriving in the LA and PV can result in signal loss [36], which in our application, is minimised by using the shortest TE/TR possible. This was achieved by using non-selective RF excitation [37]. Both 3D LGE MRI and Roadmap data were acquired during free-breathing using a crossedpairs navigator positioned over the dome of the right hemi-diaphragm with navigator acceptance window size of 5mm and CLAWS respiratory motion control [38]. The nominal acquisition duration was 204–232 cardiac cycles for 3D LGE MRI and 241 cardiac cycles for Roadmap assuming 100% respiratory efficiency.

6

B. Patients In agreement with the local regional ethics committee, cardiac MRI was performed in longstanding persistent AF patients between 2011–2013. The image quality of each dataset was scored by a senior cardiac MRI physicist on a Likert-type scale—0 (non-diagnostic), 1 (poor), 2 (fair), 3 (good) and 4 (very good)—depending on the level of SNR, appropriate TI, and the existence of navigator beam and ghost artefacts. Thirty seven cases with image quality greater or equal to 2 have been retrospectively entered into this study including 11 pre-ablation (included ~65% of pre-ablation cases) and 26 post-ablation scans (included ~92% of post-ablation cases).

C. Multi-Atlas Whole Heart Segmentation (MA-WHS) MA-WHS Method A multi-atlas approach [39, 40] was developed to derive the whole heart segmentation of the Roadmap acquisition and then mapped to LGE MRI (Figure 1 (a)). First we obtained 30 MRI Roadmap studies from the Left Atrium Segmentation Grand Challenge organised by King’s College London [41] together with manual segmentations of the left atrium, pulmonary veins and appendages. In these, we further labelled the right and left ventricles, the right atrium, the aorta and the pulmonary artery, to generate 30 whole heart atlases. These 30 MRI Roadmap studies were employed only for building an independent multiatlas dataset, which will then be used for segmenting our Roadmap studies that linked with the LGE MRI scans for the AF patients. Let be the target image to be segmented, ,

and

be the set of atlases, where

are respectively the intensity image and corresponding segmentation label

image of the -th atlas. For each atlas, MA-WHS performs an atlas-to-target registration, by maximizing the similarity between the images, to derive the set of warped atlases, 1) in which

, and is the resulting transformation of the registration and

{

, are

respectively the warped atlas intensity image and corresponding segmentation result. Here, we employ the hierarchical registration for segmentation propagation, which was specifically designed for the whole heart MRI images and consists of three steps, namely the global affine 7

registration for localisation of the whole heart, the local affine registration for the initialisation of the substructures, and the fully deformable registration for local detail refinement [42]. Image similarity metrics evaluate how similar the atlas and target image are. In this work we propose to use the spatially encoded mutual information (SEMI) method, which has been shown to be robust against intensity non-uniformity and different intensity contrast [43], that is 2) where

are the SEMI and computed based on the spatially encoded joint histogram, ∑

3) Here,

(

) and

(

(

)

(

)

) are Parzen window estimation and

is a weighting

function to encode the spatial information [43]. After the multi-atlas propagation, a label fusion algorithm is required to generate one final segmentation of the LA from the 30 propagated results, 4) The recent literatures have many new methods [44–51] on improving multi-atlas segmentation using sophisticatedly designed algorithms, which generally need to evaluate local similarity between patches from the atlases and the target image for local weighted label fusion, ∑

5) in which local weight the atlas.

and

indicate the labels of the background and left atrium, respectively, and the is determined by the local similarity

between the target image and

is the Kronecker delta function which returns 1 when

and returns 0

otherwise. For the LA segmentation, we propose to use the multi-scale patch based label fusion (MSPLF). The multi-scale space theory can handle different level information within a small patch and has been applied to feature extraction/detection and image matching [40, 51–57]. The patches we compute from different scale spaces can represent the different levels of structural information, with low scale capturing local fine structure and high scale suppressing fine structure but providing global structural information of the image. To avoid increasing the computational complexity, we adopt the multi-resolution implementation and couple it with the 8

MSP where the high-scale patch can be efficiently computed using a low-resolution image space. The local similarity between two images using the MSP measure is computed, as follows, ∑

6) where

is the target image from

scale-space which is computed from

the convolution of the target image with Gaussian kernel function with scale . Here, we compute the local similarity in multi-scale image using the conditional probability of the images, 7)

(

where

and

)

and the conditional image probability is obtained from the

joint and marginal image probability which can be calculated using the Parzen window estimation [58]. For each patient, the Roadmap dataset was then registered to the LGE MRI dataset using the DICOM header data, and then refined by affine and nonrigid registration steps [43]. The resulting transformation was applied to the MA-WHS derived cardiac anatomy to define the endocardial LA boundary and PV on the LGE MRI dataset for each patient.

9

D. Atrial Scarring Segmentation Over-Segmentation by Simple Linear Iterative Clustering (SLIC) Based Super-Pixels We used a Simple Linear Iterative Clustering (SLIC) based super-pixel method [33] to oversegment LGE MRI images in order to separate potential enhanced atrial scarring regions from other tissues (Figure 1 (b)). Super-pixel algorithms group pixels into perceptually meaningful patches with similar size, which can be used to replace the regular pixel grid. In this study, we used a SLIC based super-pixel method, which has been successfully applied to solve various medical image analysis problems, e.g., [59, 60]. It has also demonstrated better segmentation accuracy and superior adherence to object boundaries, and it is faster and more memory efficient compared to other state-of-the-art super-pixels methods [33]. Based on local k-means clustering, the SLIC method iteratively groups pixels into super-pixels. The clustering proximity is estimated in both intensity and spatial domains that is √

8)

in which √(

√( )

( )

,

) measures the pixel intensity difference of a gray scale image and (

) describes the spatial distance between each pixel and the

geometric centre of the super-pixel. SLIC is initialised by sampling the target slice of the LGE MRI image into a regular grid space with grid interval of SLIC limits the size of search region of similar pixels to (namely local k-means clustering). In addition, parameter intensity similarity

and spatial proximity

pixels. To speed up the iteration, around the super-pixel centre balances the weighting between

.

Support Vector Machines (SVM) Based Classification After SLIC segmentation, we proposed to use Support Vector Machines (SVM) to classify the over-segmented super-pixels into enhanced atrial scarring regions and non-enhanced tissues. SVM provide a powerful technique for supervised binary classification [61]. In order to train the SVM classifier, we built a training dataset containing enhanced and nonenhanced super-pixel patches (refer to Supporting Material Section A1). This has been done by (1) experienced expert cardiologists specialised in cardiac MRI performing manual mouse clicks 10

to select the enhanced scar regions; (2) combining the mouse clicks and SLIC segmentation to label the enhanced super-pixels (refer to Supporting Material Figure S1); (3) applying morphological dilation to the segmented endocardial LA boundary and PV from MA-WHS to extract the LA wall and PV; (4) finding the overlapped regions of the LA wall and PV and the labelled enhanced super-pixels and (5) labelling the other super-pixels overlapped with LA wall and PV as non-enhancement (refer to Supporting Material Figure S2). Instead of extracting texture or shape features of these labelled super-pixels, we computed the pixel-intensity based features to feed to the SVM classifier. Feature selection was done using minimum redundancy and maximum relevance method [62]. In this study, we applied the mutual information quotient scheme [62]. The selected features will be presented in the Results section and will be used for the further SVM based classification procedure. The parameters of the SVM with a RBF kernel were optimised using cross-validation with a grid search scheme [63] (refer to Supporting Material Section A4).

E. Results Evaluation and Validation Evaluation and Validation of the MA-WHS One experienced cardiologist (>5 years’ experience and specialised in cardiac MRI) manually segmented the endocardial LA boundary and labelled the PV slice-by-slice in the LGE MRI images for all the patients. A second senior cardiologist (>25 years’ experience and specialised in cardiac MRI) confirmed the manual segmentation. The evaluation and validation of our MAWHS has been done against this manual segmentation, which is assumed to be the ground truth. We used five metrics: DICE, JACCARD, PRECISION,

distance [64] and Average

Surface Distance (ASD) [65] (refer to Supporting Material Table S1).

11

Ground Truth Definition of the Atrial Scarring We formed the ground truth of the enhanced atrial scarring on the LGE MRI images using the following steps: (1)

Steps (1)–(4) as listed in the ‘SVM Based Classification’ section.

(2)

Once we extracted the enhanced super-pixels, they were combined to create a binary image for each slice, i.e., 1 for enhanced super-pixels and 0 for unenhanced.

(3)

The binary image was overlaid on the original LGE MRI images and our cardiologists performed manual corrections to create the final boundaries (ground truth) of the enhanced atrial scarring. In so doing, we minimised the bias towards a better performance of the segmentation using classified super-pixels.

Intra- and Inter-Observer Variances of the Manual Atrial Scarring Segmentation In this study, two cardiologists performed the manual mouse clicks based ground truth construction procedures in order to account for the inter-observer variance. Eight randomly selected persistent AF patients (4 pre- and 4 post-ablation cases) were entered for this test. In addition, one cardiologist performed the manual mouse clicks twice at two different time points (1 month in between) to estimate the intra-observer variance. The DICE metric was used to measure the intra- and inter-observer variances of the ground truth construction.

Evaluation and Validation of the Fully-Automated Atrial Scarring Segmentation We evaluated our SVM based classification by: (i) leave-one-patient-out cross-validation (LOO CV), which provides an unbiased predictor and is capable of creating sufficient training data for studies with small sample size [66]; (ii) the cross validated classification accuracy, sensitivity, specificity, and average area under the receiver operating characteristic (ROC) curve (AUC) and (iii) the balanced error rate (BER) [67]. We also applied 10-fold CV to evaluate the robustness of our method when there are fewer manual labelled training datasets. Lastly, we divided our data into (a) a training/CV dataset (25 patients) and (b) an independent testing dataset (12 patients). For the final atrial scarring segmentation we also performed results evaluation using DICE, JACCARD, PRECISION and NPV measurements.

and

metrics were not

applied because we have multiple discrete regions of the enhanced atrial scarring for each LGE 12

MRI volume. In addition, we also calculated the percentage of fibrosis extent [68, 19] (refer to Supporting Material Section A6).

Comparison Study In order to demonstrate the efficacy of our method, we also compared it with two standard methods published in previous studies: (a) Simple thresholding based method (Thr) [34]. The threshold value for each LGE MRI volume was chosen via empirical evaluation. (b) Conventional standard deviation (SD) [12] based method (2, 4 and 6 SDs were tested). These methods were selected as they have minimum parameter tuning and could be most accurately reproduced. We compared the atrial scarring segmentation from each method against the ground truth using the LA and PV boundaries derived from our fully automatic MA-WHS segmentation. This was then repeated using the manually delineated LA and PV boundaries. The image intensities have also been normalised with respect to the intensities of the blood pool regions [28], which were extracted by a morphological erosion from the endocardial LA boundary.

13

III.

RESULTS

A. Whole Heart Segmentation Results Firstly, we have performed a study on various label fusion strategies. Figure 2 (a) shows the comparison results of the Dice scores using the different label fusion schemes, e.g., by majority vote (MV), by local weighted voting (LWV) [44], by joint label fusion (JLF) [50], by patch fusion one scale (PF), and by our proposed MSP. Our MSP was statistically significantly better than the other label fusion schemes (p < 0.05). Secondly, Figure 2 (b) shows the quantitative results of our MA-WHS method compared to the ground truth. Figure 3 shows the reconstructed 3D surface mesh of the endocardial LA boundary and PV extracted using our MA-WHS method. We can observe that for most of the LA and the proximal PV, the segmentation accuracy is high; however, the segmentation in the more distal PV is more error-prone. In general, manual delineation always has more detailed segmentations of the PV structures and the extent of PV was under-estimated using our MA-WHS. However, for AF patients, it is only the very proximal regions that are of importance. B. Fully-Automated Atrial Scarring Segmentation Results Intra- and Inter-Observer Variances of the Ground Truth Construction Figure 4 demonstrates the intra- and inter-observer variances of the manual atrial scarring delineation. Results show a very good agreement (mean DICE scores ranging from 86%–92%) between the performances of the cardiologist (>5 years’ experience) who did the manual mouse clicks twice (1 month between the two time-points). Compared to the performance of our second cardiologist (>25 years’ experience), mean DICE scores range from 83%–91%. In summary, results of the DICE show that our atrial scarring ground truth construction (combining mouse clicks and super-pixels) has low intra- and inter-observer variance and that it is therefore valid for evaluation of the atrial scarring segmentation algorithms.

Evaluation and Validation Results of the Fully-Automated Atrial Scarring Segmentation After minimum redundancy and maximum relevance based feature selection, 3 out of 16 features (minimum, mean and standard deviation) were selected and used in building the SVM model. Table 2 tabulates the SVM classification results of distinguishing enhanced atrial scarring 14

regions from non-enhanced tissues. Using LOO CV, we obtained 88% accuracy and 0.16 BER. ROC analysis shows an AUC of 0.91 (refer to Supporting Material Figure S3). In terms of the final segmentation accuracy, we achieved mean DICE of 79%. We have also validated the proposed pipeline using 10-fold cross-validation that obtained similar accuracy. For the validation on the separated independent testing datasets, the fixed SVM model was blindly built. Compared to LOO CV on 37 datasets we achieved similar accuracy (86%) and sensitivity (92%), but lower specificity (64%) and mean DICE (71%) were obtained. This validation using separate testing datasets showed that our method can still perform well while fixing the classification model and using it to segment the new input data. For the pre-ablation cases, using our fully automatic pipeline, the measured native fibrosis associated with AF was 26.9±11.2% (Figure 5 (a)). Compared to the pre-ablation cases, the fibrosis extent ratio was found to be 32.8±6.4% for the post-ablation cases (Figure 5 (a)). There was no significant difference found for the fibrosis extent derived using the ground truth segmentation of the atrial scarring (23.4±7.3% for the pre-ablation cases and 30.9±6.1% for the post-ablation cases). Comparing the fibrosis extent in our pre-ablation and post-ablation cases, both our fully automatic pipeline and the ground truth segmentation found significant differences in between (Figure 5 (a)). Bland-Altman analysis demonstrated that the fibrosis extent derived by our fully automatic segmentation pipeline has good agreement with the one derived using the ground truth segmentation (Figure 5 (b) and (c)). Figure 6 shows the comparison results with simple thresholding and conventional standard deviation methods. Figure 6 (red bars) show that our method worked equally well in pre-ablation and post-ablation studies (median DICE score 80% for the post-ablation cases vs. median DICE score 76% for the pre-ablation cases and overall no significant difference by Wilcoxon rank-sum test, p=0.087). Overall, the atrial scarring segmentation results obtained using our method outperformed the simple thresholding and conventional standard deviation methods significantly (Figure 6). Figure 7 shows our final atrial scarring segmentation results compared to the ground truth. The segmentation results have been derived from the LOO CV (i.e., training on 36 datasets and making prediction on the one dataset that has been left). For the first exemplar pre-ablation case (Figure 7 (a-c)), we can observe underestimated enhancement segmentation. Segmentation of the second pre-ablation case (Figure 7 (d-f)) shows clear accordance compared to the ground truth 15

despite some overestimated region near the right inferior pulmonary vein. Segmentation of both post-ablation cases exhibit good agreement with the ground truth (Figure 7 (i) vs. (h) and Figure 7 (l) vs. (k)).

16

IV.

DISCUSSION

In this study, we developed a novel fully automatic segmentation pipeline to detect enhanced atrial scarring in LGE MRI images. Overall, results of this study offer compelling evidence that our fully automatic pipeline is capable of detecting enhanced atrial scarring from LGE MRI images acquired from a longstanding persistent AF cohort. Segmentation of the atrial scarring from LGE MRI images is very challenging. This is not only because the atrial scarring is difficult to distinguish in the thin LA wall but also because the image quality can be poor due to motion artefacts, noise contamination and contrast agent washout during the long acquisition. Moreover, the enhancement from the surrounding tissues and enhanced blood flow are confounding issues for atrial scarring segmentation and result in increased false positives. However, most of these confounding enhancement regions can be distinguished subject to accurate heart anatomy delineation using our MA-WHS (Figure 7 (c), (f), (i) and (l)). Due to the subjective understanding of the LGE MRI images, our cardiologists may miss labelling some enhanced regions (double green arrows in Figure 7 (g)), but they can still be found using our supervised learning based fully automatic segmentation pipeline. In our study, the size of the super-pixels was restricted by the LA wall thickness and initialised to 4 pixels. We chose

was

based on visual inspections of the over-segmentation

results. In order to evaluate the effect of

we calculated the Dice scores between the ground

truth LA and PV segmentations and the super-pixel derived LA and PV regions (Figure 8 (a)). Results showed that when

we achieved the highest mean Dice score and therefore the best

adherence, yet no significant differences were found by setting

. This can

be attributed to the fact that we have a relatively small size of the super-pixels and the compactness term

has less effect on the segmentation results.

Interestingly, the minimum redundancy and maximum relevance method has selected simple but effective features for our further SVM classification on SLIC segmented super-pixels. Three features were selected for the SVM classification, i.e., the mean, the standard deviation and the min of the super-pixels. The feature ‘mean’ corresponded to a simple thresholding on the superpixel intensity values. The feature ‘standard deviation’ was selected because it quantified local intensity variations in the scar and healthy regions. The feature ‘min’ was selected as a strong discriminator due to the fact that in the enhanced atrial scarring regions the ‘min’ intensity values 17

are much higher than the ‘min’ intensity values of the normal regions. However, the feature ‘max’ was not selected as a discriminator as for some labelled normal regions, there may be relatively high pixel intensities that could be false positives. There are limitations of the current work. The fast and irregular heart rate in patients prior to ablation resulted in only 11 pre-ablation studies having good enough quality to be included in this study. Together with 26 post-ablation studies, our total number of datasets was limited to 37. To tackle the problem of having limited patient data, LOO CV was performed to achieve an unbiased predictor for limited datasets. The comparison study reported here is also limited. A number of advanced techniques have been proposed such as unsupervised learning based clustering and graph-cuts based methods [28]. However, implementation of these is difficult as the fine-tuned hyper-parameters used are not always clearly described and the methodologies cannot be reproduced exactly. Moreover, our patient cohort is different from that in which these algorithms were optimised and tested. In this manuscript, we have therefore only compared our technique against the simple thresholding and conventional standard deviation based methods as these have fully-standard implementations. When compared to manual segmentation (ground truth) in post-ablation scans, these standard techniques gave median DICE of 38%–48% while our fully automatic technique achieved a median DICE of 80%. The results that we obtained here with the standard techniques are similar to those reported with these same techniques in the benchmarking study described in [28] while the latter score is similar to the best-performing methods reported in that same study. Moreover, we observed relatively large variance of the SD number above the mean signal intensity of the atrial blood pool regions (3.8±1.2 SDs derived using our fully automatic pipeline). This reflects not only on differences in gadolinium uptake in the scarred regions but also on correct setting of the inversion time. This might be one of the reasons that the method of using a fixed SD performed less well for our datasets. Of note is that in the benchmarking study the variances of all of the techniques tested are large while in our manuscript, the results are more consistent with a relatively small variance (boxplot in red as seen in Figure 6). This may be due to our patient cohort being more tightly defined while in the previous study, datasets were analysed from patients at multiple institutions using a variety of imaging protocols.

18

V.

CONCLUSIONS

To the best of our knowledge, this is the first study that developed a fully automatic segmentation pipeline for atrial scarring segmentation with quantitative validation for LGE MRI scans. The proposed pipeline has demonstrated an effective and efficient way to objectively segment and assess the atrial scarring. Our validation results have shown that both our MA-WHS and super-pixel classification based atrial scarring segmentation have obtained satisfactory accuracy. The current study was performed using real clinical data, and we can envisage an integration of our pipeline to clinical routines. In so doing, a patient-specific LA and PV geometry model and an objective atrial scarring segmentation can be obtained rapidly for individual AF patient without manual processing.

19

ACKNOWLEDGEMENTS This study was funded by the British Heart Foundation Project Grant (Project Number: PG/16/78/32402), the NIHR Cardiovascular Biomedical Research Unit, Royal Brompton Hospital & Harefield NHS Foundation Trust and Imperial College London, and the Chinese NSFC research fund (81301283) and the NSFC-RS fund (81511130090). Data were obtained during the NIHR Efficacy and Mechanism Evaluation Programme (Project Number: 12/127/127).

20

REFERENCES

1.

Allessie, M.: Electrical, contractile and structural remodeling during atrial fibrillation. Cardiovasc. Res. 54, 230–246 (2002).

2.

Stewart, S., Hart, C.L., Hole, D.J., McMurray, J.J. V: A population-based study of the long-term risks associated with atrial fibrillation: 20-Year follow-up of the Renfrew/Paisley study. Am. J. Med. 113, 359–364 (2002).

3.

Chugh, S.S., Havmoeller, R., Narayanan, K., Singh, D., Rienstra, M., Benjamin, E.J., Gillum, R.F., Kim, Y.-H., McAnulty, J.H., Zheng, Z.-J., Forouzanfar, M.H., Naghavi, M., Mensah, G.A., Ezzati, M., Murray, C.J.L.: Worldwide Epidemiology of Atrial Fibrillation: A Global Burden of Disease 2010 Study. Circulation. 129, 837–847 (2014).

4.

January, C.T., Wann, L.S., Alpert, J.S., Calkins, H., Cigarroa, J.E., Cleveland, J.C., Conti, J.B., Ellinor, P.T., Ezekowitz, M.D., Field, M.E., Murray, K.T., Sacco, R.L., Stevenson, W.G., Tchou, P.J., Tracy, C.M., Yancy, C.W.: 2014 AHA/ACC/HRS Guideline for the Management of Patients With Atrial Fibrillation: Executive Summary. J. Am. Coll. Cardiol. 64, 2246–2280 (2014).

5.

Ouyang, F., Tilz, R., Chun, J., Schmidt, B., Wissner, E., Zerm, T., Neven, K., Köktürk, B., Konstantinidou, M., Metzner, A., Fuernkranz, A., Kuck, K.H.: Long-term results of catheter ablation in paroxysmal atrial fibrillation: Lessons from a 5-year follow-up. Circulation. 122, 2368–2377 (2010).

6.

Weerasooriya, R., Khairy, P., Litalien, J., MacLe, L., Hocini, M., Sacher, F., Lellouche, N., Knecht, S., Wright, M., Nault, I., Miyazaki, S., Scavee, C., Clementy, J., Haissaguerre, M., Jais, P.: Catheter ablation for atrial fibrillation: Are results maintained at 5 years of follow-up? J. Am. Coll. Cardiol. 57, 160–166 (2011).

7.

Calkins, H., Kuck, K.H., Cappato, R., Brugada, J., Camm, A.J., Chen, S.-A., Crijns, H.J.G., Damiano, R.J., Davies, D.W., DiMarco, J., Edgerton, J., Ellenbogen, K., Ezekowitz, M.D., Haines, D.E., Haissaguerre, M., Hindricks, G., Iesaka, Y., Jackman, W., Jalife, J., Jais, P., Kalman, J., Keane, D., Kim, Y.-H., Kirchhof, P., Klein, G., Kottkamp, H., Kumagai, K., Lindsay, B.D., Mansour, M., Marchlinski, F.E., McCarthy, P.M., Mont, J.L., Morady, F., Nademanee, K., Nakagawa, H., Natale, A., Nattel, S., Packer, D.L., 21

Pappone, C., Prystowsky, E., Raviele, A., Reddy, V., Ruskin, J.N., Shemin, R.J., Tsao, H.M., Wilber, D., Heart Rhythm Society Task Force on Catheter and Surgical Ablation of Atrial Fibrillation: 2012 HRS/EHRA/ECAS expert consensus statement on catheter and surgical ablation of atrial fibrillation: recommendations for patient selection, procedural techniques, patient management and follow-up, definitions, endpoints, and research trial design. Heart Rhythm. 9, 632–696.e21 (2012). 8.

Zhong, H., Lacomis, J.M., Schwartzman, D.: On the accuracy of CartoMerge for guiding posterior left atrial ablation in man. Heart Rhythm. 4, 595–602 (2007).

9.

Schmidt, E.J., Mallozzi, R.P., Thiagalingam, A., Holmvang, G., D’Avila, A., Guhde, R., Darrow, R., Slavin, G.S., Fung, M.M., Dando, J., Foley, L., Dumoulin, C.L., Reddy, V.Y.: Electroanatomic mapping and radiofrequency ablation of porcine left atria and atrioventricular nodes using magnetic resonance catheter tracking. Circ. Arrhythm. Electrophysiol. 2, 695–704 (2009).

10.

Simonetti, O.P., Kim, R.J., Fieno, D.S., Hillenbrand, H.B., Wu, E., Bundy, J.M., Finn, J.P., Judd, R.M.: An improved MR imaging technique for the visualization of myocardial infarction. Radiology. 218, 215–23 (2001).

11.

Peters, D.C., Wylie, J. V, Hauser, T.H., Kissinger, K. V, Botnar, R.M., Essebag, V., Josephson, M.E., Manning, W.J.: Detection of pulmonary vein and left atrial scar after catheter ablation with three-dimensional navigator-gated delayed enhancement MR imaging: initial experience. Radiology. 243, 690–695 (2007).

12.

Oakes, R.S., Badger, T.J., Kholmovski, E.G., Akoum, N., Burgon, N.S., Fish, E.N., Blauer, J.J.E., Rao, S.N., Dibella, E.V.R., Segerson, N.M., Daccarett, M., Windfelder, J., McGann, C.J., Parker, D., MacLeod, R.S., Marrouche, N.F.: Detection and quantification of left atrial structural remodeling with delayed-enhancement magnetic resonance imaging in patients with atrial fibrillation. Circulation. 119, 1758–1767 (2009).

13.

Akçakaya, M., Rayatzadeh, H., Basha, T., Hong, S.N., Chan, R.H., Kissinger, K. V, Hauser, T.H., Josephson, M.E., Manning, W.J.: Accelerated late gadolinium enhancement cardiac mr imaging with isotropic spatial resolution using compressed sensing: Initial experience. Radiology. 264, 691–699 (2012).

14.

Vergara, G.R., Vijayakumar, S., Kholmovski, E.G., Blauer, J.J.E., Guttman, M. a., Gloschat, C., Payne, G., Vij, K., Akoum, N.W., Daccarett, M., McGann, C.J., MacLeod, 22

R.S., Marrouche, N.F.: Real-time magnetic resonance imagingguided radiofrequency atrial ablation and visualization of lesion formation at 3 Tesla. Hear. Rhythm. 8, 295–303 (2011). 15.

Marrouche, N.F., Wilber, D., Hindricks, G., Jais, P., Akoum, N., Marchlinski, F., Kholmovski, E., Burgon, N., Hu, N., Mont, L., Deneke, T., Duytschaever, M., Neumann, T., Mansour, M., Mahnkopf, C., Herweg, B., Daoud, E., Wissner, E., Bansmann, P., Brachmann, J.: Association of atrial tissue fibrosis identified by delayed enhancement MRI and atrial fibrillation catheter ablation: the DECAAF study. JAMA. 311, 498–506 (2014).

16.

Akoum, N., Daccarett, M., McGann, C., Segerson, N., Vergara, G., Kuppahally, S., Badger, T., Burgon, N., Haslam, T., Kholmovski, E., MacLeod, R., Marrouche, N.: Atrial fibrosis helps select the appropriate patient and strategy in catheter ablation of atrial fibrillation: A DE-MRI guided approach. J. Cardiovasc. Electrophysiol. 22, 16–22 (2011).

17.

Vergara, G.R., Marrouche, N.F.: Tailored management of atrial fibrillation using a LGEMRI based model: From the clinic to the electrophysiology laboratory. J. Cardiovasc. Electrophysiol. 22, 481–487 (2011).

18.

Malcolme-Lawes, L.C., Juli, C., Karim, R., Bai, W., Quest, R., Lim, P.B., Jamil-Copley, S., Kojodjojo, P., Ariff, B., Davies, D.W., Rueckert, D., Francis, D.P., Hunter, R., Jones, D., Boubertakh, R., Petersen, S.E., Schilling, R., Kanagaratnam, P., Peters, N.S.: Automated analysis of atrial late gadolinium enhancement imaging that correlates with endocardial voltage and clinical outcomes: A 2-center study. Hear. Rhythm. 10, 1184– 1189 (2013).

19.

Ravanelli, D., Dal Piaz, E.C., Centonze, M., Casagranda, G., Marini, M., Del Greco, M., Karim, R., Rhode, K., Valentini, A.: A novel skeleton based quantification and 3-D volumetric visualization of left atrium fibrosis using late gadolinium enhancement magnetic resonance imaging. IEEE Trans. Med. Imaging. 33, 566–576 (2014).

20.

McGann, C., Akoum, N., Patel, A., Kholmovski, E., Revelo, P., Damal, K., Wilson, B., Cates, J., Harrison, A., Ranjan, R., Burgon, N.S., Greene, T., Kim, D., DiBella, E.V.R., Parker, D., MacLeod, R.S., Marrouche, N.F.: Atrial fibrillation ablation outcome is predicted by left atrial remodeling on MRI. Circ. Arrhythmia Electrophysiol. 7, 23–30 (2014). 23

21.

Jadidi, A.S., Cochet, H., Shah, A.J., Kim, S.J., Duncan, E., Miyazaki, S., Sermesant, M., Lehrmann, H., Lederlin, M., Linton, N., Forclaz, A., Nault, I., Rivard, L., Wright, M., Liu, X., Scherr, D., Wilton, S.B., Roten, L., Pascale, P., Derval, N., Sacher, F., Knecht, S., Keyl, C., Hocini, M., Montaudon, M., Laurent, F., Haïssaguerre, M., Jaïs, P.: Inverse relationship between fractionated electrograms and atrial fibrosis in persistent atrial fibrillation: Combined magnetic resonance imaging and high-density mapping. J. Am. Coll. Cardiol. 62, 802–812 (2013).

22.

Reddy, V.Y., Schmidt, E.J., Holmvang, G., Fung, M.: Arrhythmia recurrence after atrial fibrillation ablation: Can magnetic resonance imaging identify gaps in atrial ablation lines? J. Cardiovasc. Electrophysiol. 19, 434–437 (2008).

23.

Badger, T.J., Daccarett, M., Akoum, N.W., Adjei-Poku, Y. a., Burgon, N.S., Haslam, T.S., Kalvaitis, S., Kuppahally, S., Vergara, G., McMullen, L., Anderson, P. a., Kholmovski, E., MacLeod, R.S., Marrouche, N.F.: Evaluation of left atrial lesions after initial and repeat atrial fibrillation ablation; Lessons learned from delayed-enhancement MRI in repeat ablation procedures. Circ. Arrhythmia Electrophysiol. 3, 249–259 (2010).

24.

Kapa, S., Desjardins, B., Callans, D.J., Marchlinski, F.E., Dixit, S.: Contact electroanatomic mapping derived voltage criteria for characterizing left atrial scar in patients undergoing ablation for atrial fibrillation. J. Cardiovasc. Electrophysiol. 25, 1044–52 (2014).

25.

Bisbal, F., Guiu, E., Cabanas-Grandío, P., Berruezo, A., Prat-Gonzalez, S., Vidal, B., Garrido, C., Andreu, D., Fernandez-Armenta, J., Tolosana, J.M., Arbelo, E., De Caralt, T.M., Perea, R.J., Brugada, J., Mont, L.: CMR-guided approach to localize and ablate gaps in repeat AF ablation procedure. JACC Cardiovasc. Imaging. 7, 653–663 (2014).

26.

Harrison, J.L., Jensen, H.K., Peel, S. a., Chiribiri, A., Grondal, A.K., Bloch, L.O., Pedersen, S.F., Bentzon, J.F., Kolbitsch, C., Karim, R., Williams, S.E., Linton, N.W., Rhode, K.S., Gill, J., Cooklin, M., Rinaldi, C. a., Wright, M., Kim, W.Y., Schaeffter, T., Razavi, R.S., O’Neill, M.D.: Cardiac magnetic resonance and electroanatomical mapping of acute and chronic atrial ablation injury: A histological validation study. Eur. Heart J. 35, 1486–1495 (2014).

27.

Perry, D., Morris, A., Burgon, N., McGann, C., MacLeod, R., Cates, J.: Automatic classification of scar tissue in late gadolinium enhancement cardiac MRI for the 24

assessment of left-atrial wall injury after radiofrequency ablation. In: van Ginneken, B. and Novak, C.L. (eds.) In SPIE Medical Imaging. p. 83151D (2012). 28.

Karim, R., Housden, R.J., Balasubramaniam, M., Chen, Z., Perry, D., Uddin, A., AlBeyatti, Y., Palkhi, E., Acheampong, P., Obom, S., Hennemuth, A., Lu, Y., Bai, W., Shi, W., Gao, Y., Peitgen, H.-O., Radau, P., Razavi, R., Tannenbaum, A., Rueckert, D., Cates, J., Schaeffter, T., Peters, D., MacLeod, R., Rhode, K.: Evaluation of current algorithms for segmentation of scar tissue from late gadolinium enhancement cardiovascular magnetic resonance of the left atrium: an open-access grand challenge. J. Cardiovasc. Magn. Reson. 15, 105–122 (2013).

29.

Harrison, J.L., Sohns, C., Linton, N.W., Karim, R., Williams, S.E., Rhode, K.S., Gill, J., Cooklin, M., Rinaldi, C. a., Wright, M., Schaeffter, T., Razavi, R.S., O’Neill, M.D.: Repeat Left Atrial Catheter Ablation: Cardiac Magnetic Resonance Prediction of Endocardial Voltage and Gaps in Ablation Lesion Sets. Circ. Arrhythmia Electrophysiol. 8, 270–278 (2015).

30.

Knowles, B.R., Caulfield, D., Cooklin, M., Rinaldi, C.A., Gill, J., Bostock, J., Razavi, R., Schaeffter, T., Rhode, K.S.: 3-D visualization of acute RF ablation lesions using MRI for the simultaneous determination of the patterns of necrosis and edema. IEEE Trans. Biomed. Eng. 57, 1467–1475 (2010).

31.

Tao, Q., Ipek, E.G., Shahzad, R., Berendsen, F.F., Nazarian, S., van der Geest, R.J.: Fully automatic segmentation of left atrium and pulmonary veins in late gadolinium-enhanced MRI: Towards objective atrial scar assessment. J. Magn. Reson. Imaging. 44, 346–354 (2016).

32.

Karim, R., Arujuna, A., Housden, R.J., Gill, J., Cliffe, H., Matharu, K., Rinaldi, C. a, O’Neill, M., Rueckert, D., Razavi, R., Schaeffter, T., Rhode, K.: A Method to Standardize Quantification of Left Atrial Scar from Delayed-Enhancement MR Images. Transl. Eng. Heal. Med. IEEE J. 2, 1–15 (2014).

33.

Achanta, R., Shaji, A., Smith, K., Lucchi, A.: SLIC superpixels compared to state-of-theart superpixel methods. IEEE Trans. Pattern Anal. Mach. Intell. 34, 2274–2281 (2012).

34.

Peters, D.C., Wylie, J. V., Hauser, T.H., Nezafat, R., Han, Y., Woo, J.J., Taclas, J., Kissinger, K. V., Goddu, B., Josephson, M.E., Manning, W.J.: Recurrence of Atrial Fibrillation Correlates With the Extent of Post-Procedural Late Gadolinium Enhancement. 25

A Pilot Study. JACC Cardiovasc. Imaging. 2, 308–316 (2009). 35.

Keegan, J., Jhooti, P., Babu-Narayan, S. V., Drivas, P., Ernst, S., Firmin, D.N.: Improved respiratory efficiency of 3D late gadolinium enhancement imaging using the continuously adaptive windowing strategy (CLAWS). Magn. Reson. Med. 71, 1064–1074 (2014).

36.

Moghari, M.H., Peters, D.C., Smink, J., Goepfert, L., Kissinger, K. V, Goddu, B., Hauser, T.H., Josephson, M.E., Manning, W.J., Nezafat, R.: Pulmonary vein inflow artifact reduction for free-breathing left atrium late gadolinium enhancement. Magn. Reson. Med. 66, 180–6 (2011).

37.

Krishnam, M.S., Tomasian, A., Deshpande, V., Tran, L., Laub, G., Finn, J.P., Ruehm, S.G.: Noncontrast 3D steady-state free-precession magnetic resonance angiography of the whole chest using nonselective radiofrequency excitation over a large field of view: comparison with single-phase 3D contrast-enhanced magnetic resonance angiography. Invest. Radiol. 43, 411–20 (2008).

38.

Keegan, J., Drivas, P., Firmin, D.N.: Navigator artifact reduction in three-dimensional late gadolinium enhancement imaging of the atria. Magn. Reson. Med. 785, 779–785 (2013).

39.

Zhuang, X., Bai, W., Song, J., Zhan, S., Qian, X., Shi, W., Lian, Y., Rueckert, D.: Multiatlas whole heart segmentation of CT data using conditional entropy for atlas ranking and selection. Med. Phys. 42, 3822–33 (2015).

40.

Zhuang, X., Shen, J.: Multi-scale patch and multi-modality atlases for whole heart segmentation of MRI. Med. Image Anal. 31, 77–87 (2016).

41.

Tobon-Gomez, C., Geers, A., Peters, J., Weese, J., Pinto, K., Karim, R., Schaeffter, T., Razavi, R., Rhode, K.: Benchmark for algorithms segmenting the left atrium from 3D CT and MRI datasets. IEEE Trans. Med. Imaging. 34, 1460–1473 (2015).

42.

Zhuang, X., Rhode, K.S., Razavi, R.S., Hawkes, D.J., Ourselin, S.: A registration-based propagation framework for automatic whole heart segmentation of cardiac MRI. IEEE Trans. Med. Imaging. 29, 1612–1625 (2010).

43.

Zhuang, X., Arridge, S., Hawkes, D.J., Ourselin, S.: A nonrigid registration framework using spatially encoded mutual information and free-form deformations. IEEE Trans. Med. Imaging. 30, 1819–28 (2011).

44.

Artaechevarria, X., Munoz-Barrutia, A., Ortiz-de-Solorzano, C.: Combination Strategies in Multi-Atlas Image Segmentation: Application to Brain MR Data. IEEE Trans. Med. 26

Imaging. 28, 1266–1277 (2009). 45.

Bai, W., Shi, W., Ledig, C., Rueckert, D.: Multi-atlas segmentation with augmented features for cardiac MR images. Med. Image Anal. 19, 98–109 (2015).

46.

Coupé, P., Manjón, J. V., Fonov, V., Pruessner, J., Robles, M., Collins, D.L.: Nonlocal Patch-Based Label Fusion for Hippocampus Segmentation. In: Medical Image Computing and Computer-Assisted Intervention. pp. 129–136 (2010).

47.

Eskildsen, S.F., Coupé, P., Fonov, V., Manjón, J. V., Leung, K.K., Guizard, N., Wassef, S.N., Østergaard, L.R., Collins, D.L.: BEaST: Brain extraction based on nonlocal segmentation technique. Neuroimage. 59, 2362–2373 (2012).

48.

Hu, S., Coupé, P., Pruessner, J.C., Collins, D.L.: Nonlocal regularization for active appearance model: Application to medial temporal lobe segmentation. Hum. Brain Mapp. 35, 377–395 (2014).

49.

Sabuncu, M.R., Yeo, B.T.T., Van Leemput, K., Fischl, B., Golland, P.: A generative model for image segmentation based on label fusion. IEEE Trans. Med. Imaging. 29, 1714–29 (2010).

50.

Wang, H., Suh, J.W., Das, S.R., Pluta, J.B., Craige, C., Yushkevich, P.A.: Multi-Atlas Segmentation with Joint Label Fusion. IEEE Trans. Pattern Anal. Mach. Intell. 35, 611– 623 (2013).

51.

Wu, G., Kim, M., Sanroma, G., Wang, Q., Munsell, B.C., Shen, D., Alzheimer’s Disease Neuroimaging Initiative: Hierarchical multi-atlas label fusion with multi-scale feature representation and label-specific patch partition. Neuroimage. 106, 34–46 (2015).

52.

Lindeberg, T.: Feature detection with automatic scale selection. Int. J. Comput. Vis. 30, 79–116 (1998).

53.

Lindeberg, T.: Generalized Gaussian Scale-Space Axiomatics Comprising Linear ScaleSpace, Affine Scale-Space and Spatio-Temporal Scale-Space. J. Math. Imaging Vis. 40, 36–81 (2011).

54.

Holden, M., Griffin, L.D., Saeed, N., Hill, D.L.G.: Multi-channel Mutual Information Using Scale Space. In: Medical Image Computing and Computer-Assisted Intervention. pp. 797–804 (2004).

55.

Lowe, D.G.: Distinctive Image Features from Scale-Invariant Keypoints. Int. J. Comput. Vis. 60, 91–110 (2004). 27

56.

Leutenegger, S., Chli, M., Siegwart, R.Y.: BRISK: Binary Robust invariant scalable keypoints. In: 2011 International Conference on Computer Vision. pp. 2548–2555. IEEE (2011).

57.

Wu, Y., Liu, G., Huang, M., Guo, J., Jiang, J., Yang, W., Chen, W., Feng, Q.: Prostate Segmentation Based on Variant Scale Patch and Local Independent Projection. IEEE Trans. Med. Imaging. 33, 1290–1303 (2014).

58.

Thévenaz, P., Unser, M.: Optimization of mutual information for multiresolution image registration. IEEE Trans. Image Process. 9, 2083–99 (2000).

59.

Soltaninejad, M., Yang, G., Lambrou, T., Allinson, N., Jones, T.L., Barrick, T.R., Howe, F.A., Ye, X.: Automated brain tumour detection and segmentation using superpixel-based extremely randomized trees in FLAIR MRI. Int. J. Comput. Assist. Radiol. Surg. (2016).

60.

Zhao, Y., Zheng, Y., Liu, Y., Yang, J., Zhao, Y., Chen, D., Wang, Y.: Intensity and Compactness Enabled Saliency Estimation for Leakage Detection in Diabetic and Malarial Retinopathy. IEEE Trans. Med. Imaging. 36, 51–63 (2017).

61.

Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines and Other Kernel-based Learning Methods. Cambridge University Press (2000).

62.

Peng, H., Long, F., Ding, C.: Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. Pattern Anal. Mach. Intell. IEEE Trans. 27, 1226–1238 (2005).

63.

Hsu, C., Chang, C., Lin, C.: A practical guide to support vector classification. (2003).

64.

Huttenlocher, D.P., Klanderman, G.A., Rucklidge, W.J.: Comparing images using the Hausdorff distance. IEEE Trans. Pattern Anal. Mach. Intell. 15, 850–863 (1993).

65.

Zhuang, X.: Challenges and methodologies of fully automatic whole heart segmentation: a review. J. Healthc. Eng. 4, 371–408 (2013).

66.

Efron, B.: Estimating the Error Rate of a Prediction Rule: Improvement on CrossValidation. J. Am. Stat. Assoc. 78, 316 (1983).

67.

Chen, Y., Lin, C.: Combining SVMs with various feature selection strategies. In: Guyon, I., Nikravesh, M., Gunn, S., and Zadeh, L.A. (eds.) Feature Extraction: Foundations and Applications. pp. 315–324. Springer Berlin Heidelberg (2006).

68.

McGann, C.J., Kholmovski, E.G., Oakes, R.S., Blauer, J.J.E., Daccarett, M., Segerson, N., Airey, K.J., Akoum, N., Fish, E., Badger, T.J., DiBella, E.V.R., Parker, D., MacLeod, 28

R.S., Marrouche, N.F.: New Magnetic Resonance Imaging-Based Method for Defining the Extent of Left Atrial Wall Injury After the Ablation of Atrial Fibrillation. J. Am. Coll. Cardiol. 52, 1263–1271 (2008).

29

Table Captions Table 1: Summary of the previously published methods for atrial scarring segmentation and ours. Table 2: Quantitative evaluation of the atrial scarring segmentation. Three validation schemes were used (i.e., LOO CV, 10-fold CV, and training/CV with separate testing). The SVM based classification was evaluated using accuracy, sensitivity, specificity, BER and AUC. The final segmentation was evaluated against the ground truth using (Precision, NPV, Jaccard index and Dice score). Abbreviations: LOO–leave-one-patient-out; CV–cross-validation; BER–balanced error rate; AUC–area under curve; NPV–negative predictive value.

30

Figure Captions Figure 1: (a) Flowchart of the LA+PV segmentation via MA-WHS and its validation. (b) Flowchart of the fully automatic atrial scarring segmentation including atrial scarring ground truth construction, super-pixel and SVM classification based segmentation and leave-one-patient-out cross-validation. Abbreviations: LA+PV–left atrium and pulmonary veins; MAS–multiatlas propagation based segmentation; MSP-LF–multi-scale patch based label fusion; WHS–whole heart segmentation. Figure 2: (a) Comparison results (Dice scores of the WHS) of using different label fusion algorithms. (‘*’ = p