Electrocardiographic biomarkers to predict atrial fibrillation in sinus rhythm electrocardiograms

Ancor Sanz-García; Alberto Cecconi; Alberto Vera; Juan Miguel Camarasaltas; Fernando Alfonso; Guillermo Jose Ortega; Jesus Jimenez-Borreguero

doi:10.1136/heartjnl-2021-319120

Cardiac risk factors and prevention

Original research

Ancor Sanz-García¹,
Alberto Cecconi²,
Alberto Vera² (http://orcid.org/0000-0003-2181-0961),
Juan Miguel Camarasaltas³,
Fernando Alfonso²,
Guillermo Jose Ortega¹,⁴ (http://orcid.org/0000-0002-7840-6145),
Jesus Jimenez-Borreguero²

¹ Data Analysis Unit, Hospital Universitario de la Princesa, Madrid, Spain
² Cardiology Department, Hospital Universitario de la Princesa, Madrid, Spain
³ Informatics Department, Hospital Universitario de la Princesa, Madrid, Spain
⁴ CONICET; Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina

Correspondence to Dr Guillermo Jose Ortega, Hospital Universitario de la Princesa, 28006 Madrid, Spain; guillermojose.ortega{at}salud.madrid.org

Abstract

Objective Early prediction of atrial fibrillation (AF) development would improve patient outcomes. We propose a simple and cheap ECG-based score to predict AF development.

Methods A cohort of 16 316 patients was analysed. ECG measures provided by the computer-assisted ECG software were used to identify patients. A first group included patients in sinus rhythm who showed an ECG with AF at any time later (n=505). A second group included patients with all their ECGs in sinus rhythm (n=15 811). By using a training set (75% of the cohort) the initial sinus rhythm ECGs of both groups were analysed and a predictive risk score based on a multivariate logistic model was constructed.

Results A multivariate regression model was constructed with 32 variables showing a predictive value characterised by an area under the curve (AUC) of 0.776 (95% CI: 0.738 to 0.814). The subsequent risk score included the following variables: age, duration of P-wave in aVF, V4 and V5; duration of T-wave in V3, mean QT interval adjusted for heart rate, transverse P-wave clockwise rotation, transverse P-wave terminal angle and transverse QRS complex terminal vector magnitude. Risk score values ranged from 0 (no risk) to 5 (high risk). The predictive validity of the score reached an AUC of 0.764 (95% CI: 0.722 to 0.806) with a global specificity of 61% and a sensitivity of 55%.

Conclusions The automatic assessment of ECG biomarkers from ECGs in sinus rhythm is able to predict the risk of AF, providing a low-cost screening strategy for early detection of this pathology.

atrial fibrillation
biomarkers

Data availability statement

No data are available.

Introduction

Atrial fibrillation (AF) is the most prevalent arrhythmia1 and may be associated with several life-threatening complications such as embolic stroke, heart failure and dementia.2 3 Approximately 33 million people worldwide have AF.4 AF prevalence increases with age, being three to four times higher in patients aged >80 years than in those in the 60-year to 70-year range.5 In Spain, a country with high life expectancy, AF prevalence reaches 17% in the elderly (>80 years) population, making it a major source of concern for the country’s health system.6 Since silent AF is present in one-third of patients with this arrhythmia, a cardiovascular complication is frequently the first clinical manifestation of the disease. Accordingly, several studies suggest that current AF prevalence may be largely underestimated,4 7 reinforcing the need for effective, low-cost and fast screening interventions capable of estimating the risk of developing AF.

Current big data techniques allow massive analysis of clinical data in cardiology.8 In parallel, quantitative methods to detect ECG biomarkers have been increasingly used in recent years,9 leading to the development of highly sophisticated ECG software capable of identifying and quantifying hundreds of ECG measures from standard 10 s 12-lead recordings, thus providing the physician with reliable interpretations.

Nowadays, most health centres around the world store huge amounts of already quantified and interpreted ECGs, which, after a proper analysis,10 could be used to investigate new biomarkers, for example, to predict the onset of AF.

The aim of the present work was to establish a global prediction model and a risk score for AF development using biomarkers extracted by automatic ECG assessment of a large cohort of ECGs. Patients with several ECGs over time were classified in two groups: patients with an ECG showing AF preceded by one in sinus rhythm (SR) and patients with SR in all their ECGs. The comparison between these two groups allowed developing a global model from which we obtained ECG biomarkers for predicting AF risk, which were later used to establish the Atrial Fibrillation Automatic Assessment (AFAA) risk score.

Methods

Data and study population

A retrospective cohort study was conducted at the University Hospital La Princesa (Madrid, Spain) using ECGs recorded between 5 May 2010 and 4 February 2019. A total of 132 772 patients (329 670 ECG recordings) were analysed. ECGs were originally requested by several units, including outpatient medical centres and the emergency room, among others (see online supplemental figure 1 for further details).

All data needed for the analysis—ECG measures and interpretations—were obtained from the ECG files (in XML format) and stored for further analysis. The only additional data available in the ECG files were age and sex.

ECG recordings

All analysed ECGs came from routine 10 s 12-lead measurements, processed, quantified and interpreted by the Philips DXL Algorithm11 and stored in XML format. In addition to heart rate and rhythms, the software algorithm provides a quantified analysis of amplitude, duration, area and shape for every P-wave, QRS complex, ST segment and T-wave in each lead, resulting in 566 variables for every 10 s ECG recording.

Data cleaning

Several ECGs and their corresponding patients were first discarded: ECGs with low quality and artefacts (25 958 ECGs, 6961 patients), ECGs from patients with unknown age (42 643 ECGs; 6405 patients) and ECGs from patients with only one ECG (71 384 ECGs; 71 384 patients). After excluding these data, a cohort of 48 022 patients (189 685 ECGs) was obtained.
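
The exclusion arithmetic above can be checked with a short sketch. The counts are those stated in the text; the variable names are illustrative and do not come from the authors' code.

```python
# Cohort flow check using the counts reported in the text.
# Variable names are illustrative, not taken from the authors' code.
initial = {"patients": 132_772, "ecgs": 329_670}

# (reason, ECGs removed, patients removed)
exclusions = [
    ("low quality or artefacts", 25_958, 6_961),
    ("unknown age", 42_643, 6_405),
    ("only one ECG", 71_384, 71_384),
]

remaining_ecgs = initial["ecgs"] - sum(e for _, e, _ in exclusions)
remaining_patients = initial["patients"] - sum(p for _, _, p in exclusions)

print(remaining_patients, remaining_ecgs)
```

The result reproduces the 48 022 patients and 189 685 ECGs reported after cleaning.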

Studied cohort

After an exhaustive selection of ECGs from patients with and without AF, including a cardiologist’s assessment (see figure 1 and online supplemental text S1 for a detailed description of the selection process), a cohort of 16 316 patients’ ECGs in SR was obtained. In this cohort, 505 ECGs in SR corresponded to patients who showed an ECG with AF at any later time; this group was named SR–AF. The other group, the SR–SR group, included ECGs from patients whose subsequent ECGs were all in SR (n=15 811). This selection process is sketched in figure 2 for two representative patients.

Figure 1

Patient selection flowchart. AF, atrial fibrillation; SR, sinus rhythm.

Figure 2

Sketch of ECG selection in patients representative of both groups, SR–SR and SR–AF. The last two ECGs of patients belonging to the SR–SR group are selected if the interval between them is in the range between 1 week and 2 years (thick blue line). For the SR–AF group, the first ECG in AF and the preceding one in SR are selected if the interval between them is in the range between 1 week and 2 years (thick red line). The bluish rectangle shows the actual ECGs selected for analysis from both groups. AF, atrial fibrillation; SR, sinus rhythm.

Data analysis

Training and test sets

The studied cohort was split into training and test datasets: 75% of the patients were randomly assigned to the training set and the remaining 25% to the test set. The split was made at the patient level, so that all ECGs from a given patient belonged to only one of the two datasets. Both sets maintained the proportion of SR–AF and SR–SR ECGs of the original cohort.
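
A patient-level, group-preserving split of this kind could be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name, the seed and the synthetic patient IDs are assumptions; only the group sizes (505 and 15 811) and the 75% fraction come from the text.

```python
import random

def stratified_split(patient_groups, train_fraction=0.75, seed=0):
    """Split patient IDs into train/test, preserving each group's proportion.

    patient_groups maps a patient ID to its group label ("SR-AF" or "SR-SR").
    Splitting on patient IDs keeps every patient in exactly one set.
    """
    rng = random.Random(seed)
    train, test = set(), set()
    for label in sorted(set(patient_groups.values())):
        ids = sorted(pid for pid, g in patient_groups.items() if g == label)
        rng.shuffle(ids)
        cut = round(len(ids) * train_fraction)
        train.update(ids[:cut])
        test.update(ids[cut:])
    return train, test

# Toy cohort with the group sizes reported in the text (IDs are synthetic).
cohort = {f"af{i}": "SR-AF" for i in range(505)}
cohort.update({f"sr{i}": "SR-SR" for i in range(15_811)})
train_ids, test_ids = stratified_split(cohort)
```

Shuffling within each group before cutting is what keeps the SR–AF to SR–SR ratio approximately equal in both sets.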

Variable selection

By using solely the training set, a preliminary selection of variables was conducted with univariate logistic regression to study the association between each ECG measure and the outcome (SR–AF or SR–SR group). Only variables with p<0.05 (adjusted by the Bonferroni correction) were considered, resulting in 228 significant variables. Subsequently, we applied a variance inflation factor (VIF) test to measure the inflation in the variances of the parameter estimates caused by collinearity, and retained the predictors that fulfilled the criterion of VIF <4 as a control of non-collinearity.12 Only 47 variables passed this last step, which is explained by the fact that 37 variables were measured in each lead, producing a high level of collinearity between them. Finally, variables with not available (NA) data in more than 1% of the cases were removed (instead of being imputed, as explained in the online supplemental text S2), resulting in 32 variables, namely: age, sex, distance between ECGs, aVF pdur, V3 pdur, V4 pdur, V5 pdur, V3 pppparea, II qamp, II ramp, V1 ramp, I rdur, aVL rdur, V1 rdur, V3 rdur, aVR samp, V1 samp, V4 samp, V3 sdur, aVL.qrsdur, aVF.qrsdur, V3 tptpdur, printstddev, stfrontaxis, tfrontaxis, meanqtc, transpcwrot, transptermangle, transqrsinitmag, transqrstermmag, frontqrsinitangle, sagpcwrot. The definition of these terms according to the Philips nomenclature is explained in online supplemental table S1.
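
The two filters described above can be illustrated with a small sketch. It assumes that the Bonferroni correction divides the significance level by the number of variables tested (the text does not state the exact number of tests) and it takes the R² of each predictor regressed on the others as given. The function names and the p values are hypothetical.

```python
def bonferroni_select(p_values, alpha=0.05):
    """Keep variables whose raw p value is below alpha / number of tests."""
    threshold = alpha / len(p_values)
    return [name for name, p in p_values.items() if p < threshold]

def vif_from_r_squared(r_squared):
    """VIF of a predictor, given R^2 from regressing it on the other predictors."""
    return 1.0 / (1.0 - r_squared)

# Hypothetical p values for three ECG measures (not results from the study).
p_values = {"aVF pdur": 1e-6, "V3 pdur": 0.02, "meanqtc": 1e-4}
selected = bonferroni_select(p_values)

# VIF < 4 corresponds to R^2 < 0.75 against the remaining predictors.
passes_vif = vif_from_r_squared(0.70) < 4
```

Under this reading, the VIF <4 criterion keeps a predictor only when less than 75% of its variance is explained by the other predictors.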

Global model

Before fitting the multivariate model, ECGs with missing values (NA) in any of the variables selected in the training set were removed from both the training and the test sets (n=18 in the SR–AF group and n=198 in the SR–SR group). This procedure was chosen instead of imputing values, as explained in online supplemental text S2. Thereafter, a multivariate logistic model was constructed by using the predictors selected in the training cohort. To assess the validity of the model for predicting AF risk, we determined the area under the curve (AUC) of the receiver operating characteristic (ROC) curve of the model in the test set.
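
For readers less familiar with the AUC, the following sketch computes it from predicted probabilities using its rank interpretation (the probability that a randomly chosen SR–AF ECG receives a higher prediction than a randomly chosen SR–SR ECG). The function name and the toy data are illustrative; the study itself used R and GNU Octave.

```python
def roc_auc(scores, labels):
    """AUC as the probability that a random positive outranks a random negative.

    Ties between a positive and a negative count as one half.
    """
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    if not positives or not negatives:
        raise ValueError("both classes are required")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))

# Toy predicted probabilities (1 = SR-AF, 0 = SR-SR); not study data.
toy_scores = [0.9, 0.4, 0.35, 0.8, 0.1]
toy_labels = [1, 0, 1, 1, 0]
toy_auc = roc_auc(toy_scores, toy_labels)
```

In the toy data, five of the six positive-negative pairs are ordered correctly, so the AUC is 5/6.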

AFAA risk score

In order to translate the previous model into a clinically meaningful score capable of predicting AF risk, we built a risk score by categorising the continuous variables (n=31) and combining them with the variable that was already categorical (n=1).

Continuous variables were categorised by considering their relationship with the outcome variable: we determined the range of values corresponding to the lowest AF incidence and then used the length of that range to construct as many categories as it allowed. Subsequently, all the new categorical variables and the one that was already categorical were fitted in a multivariate logistic regression. The model was then used to determine the OR for the variables selected by a stepwise algorithm based on Akaike’s information criterion. The estimated coefficients of the resulting significant (p<0.05) variables were used as the weights of those variables in the score. The final score for each patient was calculated as the sum of those weights: for each significant variable, a patient whose value fell in the range found significant in the multivariate model was assigned the points given by the corresponding estimated coefficient.13 As with the global model, the score was trained with the training set and its validity was established by using the AUC of the ROC curve of the test set and the corresponding 95% CI. Comparisons between AUCs were made by using the DeLong test.

All calculations were performed using our own code and base functions in GNU Octave and R, V.3.5.1.
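
The additive scoring step can be sketched as below. All thresholds and point values in this example are hypothetical placeholders chosen for illustration; they are not the published AFAA weights, which are given in table 2. The variable keys are likewise illustrative.

```python
# Each rule: (variable, test on its value, points). All thresholds and points
# below are hypothetical placeholders, NOT the published AFAA weights.
HYPOTHETICAL_RULES = [
    ("age", lambda v: v >= 75, 2.0),
    ("aVF_pdur", lambda v: v > 200, 1.0),
    ("meanqtc", lambda v: v > 450, 0.5),
]

def risk_score(measures, rules=HYPOTHETICAL_RULES):
    """Sum the points of every rule whose risk range contains the patient's value."""
    return sum(points for name, in_range, points in rules
               if name in measures and in_range(measures[name]))

patient = {"age": 82, "aVF_pdur": 110, "meanqtc": 460}
score = risk_score(patient)
```

Here the patient meets the hypothetical age and QTc rules but not the P-wave rule, so the score is 2.0 + 0.5 = 2.5.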

Results

Cohort characteristics

A total of 132 772 patients were considered for eligibility. After cleaning and applying the exclusion criteria, a final cohort of 16 316 patients was selected (505 patients in the SR–AF group and 15 811 in the SR–SR group). The median age was 66 years (25th–75th percentile: 52–79 years), 8340 (51%) were women and the mean elapsed time between the previous-to-last ECG and the last ECG was 9.4±6.4 months. In the SR–AF group, the median age was 82 years (25th–75th percentile: 71–87 years), 267 (53%) patients were women and the mean elapsed time was 10±6.6 months; in the SR–SR group, the median age was 66 years (25th–75th percentile: 52–78 years), 8073 (51%) patients were women and the mean elapsed time between ECG recordings was 9.4±6.4 months. Unlike sex, age and elapsed time between ECGs differed significantly between groups (p<0.001 and p=0.038, respectively).

Global model

A global model was constructed by using the 32 variables obtained in the univariate selection process performed on the training set. After performing multivariate logistic regression using the training dataset, the predictive validity of the model was assessed using the test dataset, obtaining an AUC of 0.776 (95% CI: 0.738 to 0.814) (figure 3). Alternatively, a Lasso regression was also conducted resulting in a similar model to the one obtained by the logistic regression (see online supplemental text S3). In view of that, we kept for the risk score construction those significant variables obtained from the multivariate logistic regression.

Figure 3

ROC curve of the general model for the test cohort. The bold line shows the value of the ROC curve. The values in the centre of the graph represent AUC and the 95% CI. AUC, area under the curve; ROC, receiver operating characteristic

AFAA risk score

Although the predictive power of the global model looked appropriate, its implementation in clinical practice would be complex. Thus, we constructed a risk score. First, we categorised the continuous variables, with which we subsequently performed a logistic regression in the training set (online supplemental table S2), and determined the corresponding ORs (table 1). The estimated coefficients of the statistically significant variables of the global model were used as the weights of the following risk factors: age, duration of P-wave in aVF, V4 and V5; duration of T-wave in V3, mean QT interval adjusted for heart rate, transverse P-wave clockwise rotation, transverse P-wave terminal angle and transverse QRS complex terminal vector magnitude (table 2). Although elapsed time between ECGs presented significant differences, this variable was not used in the score model since its use is not feasible in clinical practice. Score values ranged from 0 to 5, with 0 indicating no risk of AF and 5 a high risk of AF. The probability of AF according to the score is shown in figure 4; it reached 0.7%, 0.8%, 2%, 7%, 9% and 66% for the integer score values 0 to 5, respectively. The performance of the score was estimated from the AUC of the ROC curve generated by applying the score to the test cohort, reaching 0.764 (95% CI: 0.722 to 0.806) (figure 5A). The global specificity was 61% and the sensitivity was 55%; the Youden index gave a threshold of 1.75, with a specificity of 67% and a sensitivity of 75%. Further details of the score validity can be found in online supplemental table S3. The effect of age on the global model is detailed in the online supplemental information.
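
The Youden index mentioned above selects the score threshold that maximises J = sensitivity + specificity − 1. A minimal sketch follows, assuming that a score greater than or equal to the threshold is classified as high risk; the function names and the toy data are illustrative and are not study results.

```python
def sens_spec(scores, labels, threshold):
    """Sensitivity and specificity when score >= threshold is called positive."""
    tp = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= threshold)
    fn = sum(1 for s, y in zip(scores, labels) if y == 1 and s < threshold)
    tn = sum(1 for s, y in zip(scores, labels) if y == 0 and s < threshold)
    fp = sum(1 for s, y in zip(scores, labels) if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)

def youden_threshold(scores, labels):
    """Return the candidate threshold maximising J = sensitivity + specificity - 1."""
    best_j, best_t = -1.0, None
    for t in sorted(set(scores)):
        sens, spec = sens_spec(scores, labels, t)
        j = sens + spec - 1.0
        if j > best_j:
            best_j, best_t = j, t
    return best_t, best_j

# Toy risk scores (1 = developed AF, 0 = stayed in sinus rhythm); not study data.
toy_scores = [0.0, 0.5, 1.0, 2.0, 2.5, 3.0, 1.0, 0.5]
toy_labels = [0, 0, 0, 1, 1, 1, 1, 0]
best_t, best_j = youden_threshold(toy_scores, toy_labels)
```

When two thresholds give the same J, as happens in the toy data, this sketch keeps the lower one; other tie-breaking rules are equally defensible.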

Figure 4

Probability of AF based on risk score values. Bars show the number of patients in the training cohort for each score value (non-AF in grey and AF in black). The trend line shows the estimated probability of AF. The table below represents the percentage of patients in the training cohort for each score value. AF, atrial fibrillation.

Figure 5

(A) ROC curve of the risk score for the test cohort. The bold line shows the value of the ROC curve. The values at the centre of the graph represent AUC and the 95% CI. (B) ROC curves of models analysing different age scenarios. ROC curves of logistic regression models constructed using datasets representing four scenarios: black line, including all patients (global model); red line, excluding those patients under 65 years; blue line, patients from the SR group randomly selected to equalise the age, sex and temporal distance between ECGs; green line, dataset that only considers age in the model. AUC, area under the curve, ROC, receiver operating characteristic; SR, sinus rhythm.

Table 1

OR of selected variables

Table 2

Variable weights of the risk score

Effect of age on the global model

Since the SR–AF group was (on average) 16 years older than the SR–SR group, age could be considered a critical factor able to explain the differences between the groups. To rule out the age effect, we used the same procedure employed to construct the global model in three different scenarios: removing patients under 65 years; matching patients from the SR–SR group by age, sex and interval between ECGs; and, finally, using only age. Comparison of the AUCs for the different scenarios (figure 5B) showed that, judged by the AUCs rather than the p values, the initial model, considering all the variables, outperformed the other scenarios, except for the scenario that excluded patients aged <65 years. Specifically, we found an AUC of 0.776 (95% CI: 0.738 to 0.814) for the initial model, while the AUC was 0.781 (95% CI: 0.744 to 0.8177, p=0.86) for the scenario that excluded patients <65 years, 0.653 (95% CI: 0.593 to 0.714, p<0.001) for the scenario that randomly selected patients from the SR–SR group in order to equalise age, sex and elapsed time between ECGs, and 0.764 (95% CI: 0.723 to 0.804, p=0.36) for the age-only model. Variables selected for each model can be found in online supplemental table S4.
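
The matched scenario could be approximated as in the sketch below. The text states only that SR–SR patients were randomly selected to equalise age, sex and elapsed time between ECGs, so the one-to-one design, the age tolerance and the function name here are assumptions, and matching on elapsed time is omitted for brevity.

```python
import random

def match_controls(cases, controls, age_tolerance=2, seed=0):
    """For each case, draw one unused control of the same sex and similar age.

    cases/controls are lists of dicts with "id", "age" and "sex" keys.
    Cases without an eligible control are left unmatched.
    """
    rng = random.Random(seed)
    available = list(controls)
    pairs = {}
    for case in cases:
        eligible = [c for c in available
                    if c["sex"] == case["sex"]
                    and abs(c["age"] - case["age"]) <= age_tolerance]
        if eligible:
            chosen = rng.choice(eligible)
            available.remove(chosen)  # each control is used at most once
            pairs[case["id"]] = chosen["id"]
    return pairs

# Synthetic patients for illustration only.
cases = [{"id": "af1", "age": 82, "sex": "F"}, {"id": "af2", "age": 70, "sex": "M"}]
controls = [
    {"id": "sr1", "age": 81, "sex": "F"},
    {"id": "sr2", "age": 66, "sex": "M"},
    {"id": "sr3", "age": 71, "sex": "M"},
]
matched = match_controls(cases, controls)
```

Matching removes age and sex as discriminating information, which is consistent with the lower AUC reported for this scenario.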

Discussion

The present work aimed to identify AF predictive biomarkers by using data obtained from automatic ECG measure extraction software. From the comparison of 16 316 SR ECGs and considering 566 variables, we determined a global predictive model and a risk score for AF. The global predictive model presented a discrimination power (AUC) of approximately 0.78 in the test cohort, in line with previous predictive studies.14 15 Recently, an artificial intelligence-oriented study, less based on traditional ECG knowledge, reported better discrimination values16 than the ones presented here but at the expense of disregarding critical aspects such as the age difference between groups. Moreover, since the artificial intelligence approach using convolutional neural networks worked itself as a biomarker, specific ECG features could not be presented as predictors of AF. The approach presented here aimed to define a potential risk model suitable for clinical practice using identifiable critical ECG measures.

The model from which the risk score originates includes several risk factors and, most importantly, thresholds, some of which have already been related to AF prediction. Age is a well-established AF risk factor,5 which in our case presented a similar threshold to that of a recently described risk score.17 As expected, P-wave-related variables were also associated with increased AF risk, in accordance with studies focused on electrophysiological markers of AF.18 In particular, P-wave duration has been reported to have a U-shaped relationship with AF risk, that is, extreme durations are related to higher AF risk. For instance, the Copenhagen ECG study showed that a P-wave duration shorter than 89 ms is an AF risk factor,19 whereas an excessive duration has also been associated with increased risk.20 Similarly, we found here that both short and long P-wave durations were risk factors. Our risk score shows that a positive P-wave duration in lead aVF (aVF.pdur) longer than 200 ms contributes to increasing the risk of developing AF. Although this limit is well beyond normal21 or even pathological P-wave duration values, it should be noted that this value was obtained from what the automatic software quantification declares as a correct value. It is likely that, in some cases, the inclusion of a U-wave or the final part of a biphasic T-wave in the P-wave measurement leads to an overestimation of P-wave duration. This serves as a reminder of the context in which this work should be considered, that is, a risk score based on automatic ECG assessment.

P-wave axis is commonly reported in ECGs, though little attention is given to this measure; however, abnormal values of this parameter are also a marker of AF risk,22 which could be related to our results. Long PR interval has also been described as a risk factor in the Framingham Heart Study risk score.14 Likewise, in our model a higher variability of the PR interval along the 10 s ECG was associated with increased AF risk. Notably, increased prevalence of AF has been related to long QT interval.23 The relationship between long QT and AF might be explained by the fact that electrolyte disorders might produce both long QTc and an increased risk of AF.24 Interestingly, the thresholds of QTc interval reported in the Copenhagen ECG study25 and the study by Perez et al22 (≥420 ms and >450 ms, respectively) are close to the value described here. All the ECG findings reported here may be an expression of structural heart disease or conduction abnormalities, as both are associated with ECG changes and AF development.

AF prevalence in the final cohort (3.1%, 505 of 16 316 patients) was comparable with the prevalence described in Spain,6 thus validating the representativeness of our sample. One of the major concerns about our results was the potential confounding effect of age, since prevalence is directly associated with increasing age5 and the SR–AF group was 16 years older than the SR–SR group. However, we found that the performance of models considering different age scenarios was worse than that of the main model. In addition, the ECG biomarkers were similar, supporting the validity of ECGs.

The present work has several limitations. The cohort was recruited in a single hospital, but its demographic characteristics are similar to those of the general population.6 In addition, the cohort presented a group imbalance (a lower number of AF patients), which is inherent to this type of study and was partially addressed in the analysis carried out using scenarios involving age differences. Another concern is related to the automatic interpretation of AF. However, the exclusion criteria, the previously reported successful use of this algorithm,10 and the interpretation of randomly selected SR ECGs and all the AF ECGs by experienced cardiologists provide sufficient confidence in the correctness of this procedure. The information provided by medical records was not considered in this study since it was inaccessible at the time. We are working on this issue to improve the discriminant power of our risk model.

We have found several biomarkers in SR ECGs that can be integrated in a score model able to predict the risk of developing AF, which would increase the cost-effectiveness of screening strategies for early detection of AF.

Key messages

What is already known on this subject?

Although some studies deal with automatically predicting the appearance of new atrial fibrillation (AF) in patients with ECGs in sinus rhythm, none of them provides specific ECG features as predictors of developing AF.

What might this study add?

Our study provides a new score to identify populations at higher risk of developing AF based on the automatic ECG interpretation. In addition, the detected independent AF predictors could be explained by using rational evidence-based arguments.

How might this impact on clinical practice?

Identifying members of the general population at higher risk of developing AF by using a 10 s 12-lead ECG recording would increase the cost-effectiveness of screening strategies for early detection of this pathology.

Ethics statements

Patient consent for publication

Ethics approval

The Clinical Research Ethics Committee of Hospital de la Princesa approved this study with a waiver of obtaining informed consent from patients.

References

1. Morin DP, Bernard ML, Madias C, et al. The state of the art: atrial fibrillation epidemiology, prevention, and treatment. Mayo Clin Proc 2016;91:1778–810. doi:10.1016/j.mayocp.2016.08.022 pmid:27825618
2. Ding M, Qiu C. Atrial fibrillation, cognitive decline, and dementia: an epidemiologic review. Curr Epidemiol Rep 2018;5:252–61. doi:10.1007/s40471-018-0159-7 pmid:30148041
3. Gómez-Outes A, Lagunar-Ruíz J, Terleira-Fernández A-I, et al. Causes of death in anticoagulated patients with atrial fibrillation. J Am Coll Cardiol 2016;68:2508–21. doi:10.1016/j.jacc.2016.09.944 pmid:27931607
4. Rahman F, Kwan GF, Benjamin EJ. Global epidemiology of atrial fibrillation. Nat Rev Cardiol 2014;11:639–54. doi:10.1038/nrcardio.2014.118 pmid:25113750
5. Zoni-Berisso M, Lercari F, Carazza T, et al. Epidemiology of atrial fibrillation: European perspective. Clin Epidemiol 2014;6:213–20. doi:10.2147/CLEP.S47385 pmid:24966695
6. Gómez-Doblas JJ, Muñiz J, Martin JJA, et al. Prevalence of atrial fibrillation in Spain. OFRECE study results. Rev Esp Cardiol 2014;67:259–69. doi:10.1016/j.rec.2013.07.014 pmid:24774588
7. Turakhia MP, Shafrin J, Bognar K, et al. Estimated prevalence of undiagnosed atrial fibrillation in the United States. PLoS One 2018;13:e0195088. doi:10.1371/journal.pone.0195088 pmid:29649277
8. Rumsfeld JS, Joynt KE, Maddox TM. Big data analytics to improve cardiovascular care: promise and challenges. Nat Rev Cardiol 2016;13:350–9. doi:10.1038/nrcardio.2016.42 pmid:27009423
9. Smulyan H. The computerized ECG: Friend and foe. Am J Med 2019;132:153–60. doi:10.1016/j.amjmed.2018.08.025 pmid:30205084
10. Sanz-García A, Cecconi A, Alday E, et al. Usefulness of computer-assisted ECG analysis in the pre-operative evaluation of noncardiac surgery. Eur J Anaesthesiol 2020;37:1075–7. doi:10.1097/EJA.0000000000001256 pmid:33027228
11. The Philips DXL ECG algorithm physician’s guide. Available: http://incenter.medical.philips.com/doclib/enc/fetch/2000/4504/577242/577243/577246/581601/711562/DXL_ECG_Algorithm_Physician_s_Guide_(ENG)_Ed.2.pdf?nodeid=5955504&vernum=1
12. Kutner M, Nachtsheim C, Neter J. Applied linear statistical models. 4 edn. McGraw-Hill: Irwin, 2004.
13. Zhang Z, Zhang H, Khanal MK. Development of scoring system for risk stratification in clinical medicine: a step-by-step tutorial. Ann Transl Med 2017;5:436. doi:10.21037/atm.2017.08.22 pmid:29201888
14. Schnabel RB, Sullivan LM, Levy D, et al. Development of a risk score for atrial fibrillation (Framingham heart study): a community-based cohort study. Lancet 2009;373:739–45. doi:10.1016/S0140-6736(09)60443-8 pmid:19249635
15. Alonso A, Krijthe BP, Aspelund T, et al. Simple risk model predicts incidence of atrial fibrillation in a racially and geographically diverse population: the CHARGE-AF Consortium. J Am Heart Assoc 2013;2:e000102. doi:10.1161/JAHA.112.000102 pmid:23537808
16. Attia ZI, Noseworthy PA, Lopez-Jimenez F, et al. An artificial intelligence-enabled ECG algorithm for the identification of patients with atrial fibrillation during sinus rhythm: a retrospective analysis of outcome prediction. Lancet 2019;394:861–7. doi:10.1016/S0140-6736(19)31721-0 pmid:31378392
17. Li Y-G, Pastori D, Farcomeni A, et al. A simple clinical risk score (C₂HEST) for predicting incident atrial fibrillation in Asian subjects: derivation in 471,446 Chinese subjects, with internal validation and external application in 451,199 Korean subjects. Chest 2019;155:510–8. doi:10.1016/j.chest.2018.09.011 pmid:30292759
18. Dilaveris PE, Gialafos EJ, Sideris SK, et al. Simple electrocardiographic markers for the prediction of paroxysmal idiopathic atrial fibrillation. Am Heart J 1998;135:733–8. doi:10.1016/S0002-8703(98)70030-4 pmid:9588401
19. Nielsen JB, Kühl JT, Pietersen A, et al. P-wave duration and the risk of atrial fibrillation: results from the Copenhagen ECG study. Heart Rhythm 2015;12:1887–95. doi:10.1016/j.hrthm.2015.04.026 pmid:25916567
20. Soliman EZ, Prineas RJ, Case LD, et al. Ethnic distribution of ECG predictors of atrial fibrillation and its impact on understanding the ethnic distribution of ischemic stroke in the Atherosclerosis risk in communities (ARIC) study. Stroke 2009;40:1204–11. doi:10.1161/STROKEAHA.108.534735 pmid:19213946
21. Palhares DMF, Marcolino MS, Santos TMM, et al. Normal limits of the electrocardiogram derived from a large database of Brazilian primary care patients. BMC Cardiovasc Disord 2017;17:1–23. doi:10.1186/s12872-017-0572-8
22. Perez MV, Dewey FE, Marcus R, et al. Electrocardiographic predictors of atrial fibrillation. Am Heart J 2009;158:622–8. doi:10.1016/j.ahj.2009.08.002 pmid:19781423
23. Johnson JN, Tester DJ, Perry J, et al. Prevalence of early-onset atrial fibrillation in congenital long QT syndrome. Heart Rhythm 2008;5:704–9. doi:10.1016/j.hrthm.2008.02.007 pmid:18452873
24. Khan AM, Lubitz SA, Sullivan LM, et al. Low serum magnesium and the development of atrial fibrillation in the community: the Framingham heart study. Circulation 2013;127:33–8. doi:10.1161/CIRCULATIONAHA.111.082511 pmid:23172839
25. Nielsen JB, Graff C, Pietersen A, et al. J-shaped association between QTc interval duration and the risk of atrial fibrillation: results from the Copenhagen ECG study. J Am Coll Cardiol 2013;61:2557–64. doi:10.1016/j.jacc.2013.03.032 pmid:23583581

Footnotes

AS-G and AC are joint first authors.
GJO and JJ-B are joint senior authors.
GJO and JJ-B contributed equally.
Contributors JJ-B conceived this study. GJO and AS-G carried out the numerical analysis. JMC collected, selected and provided the XML files. JJ-B, AC, JMC and FA evaluated and interpreted the numerical results and all ECGs. AS-G also performed all the statistical analysis and the literature search. All co-authors produced the initial draft of the manuscript and reviewed the final manuscript version. GJO and JJ-B are guarantors of this paper.
Funding The authors received a research grant from the Carlos III Institute of Health under the health strategy action 2020–2022, reference PI20/00792.
Competing interests None declared.
Provenance and peer review Not commissioned; externally peer reviewed.
Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.

Linked Articles

Editorial
Predicting incident atrial fibrillation in sinus rhythm: more than just trusting the ‘black box’

Anthony Kashou, Peter Noseworthy
Heart 2021;107:1770–1771. Published Online First: 07 Sep 2021. doi: 10.1136/heartjnl-2021-319385
