Objective To derive and validate risk algorithms so that the risks of four clinical outcomes associated with statin use can be estimated for individual patients.
Design Prospective open cohort study using routinely collected data from 368 QResearch general practices in England and Wales to develop the scores. The scores were validated using two separate sets of practices—188 separate QResearch practices and 364 practices contributing to the THIN database.
Subjects In the QResearch derivation cohort 225 922 new users of statins and 1 778 770 non-users of statins were studied. In the QResearch validation cohort 118 372 statin users and 877 812 non-users of statins were studied. In the THIN validation cohort, we studied 282 056 statin users and 1 923 840 non-users of statins were studied.
Methods Cox proportional hazards models in the derivation cohort to derive risk equations. Measures of calibration and discrimination in both validation cohorts.
Outcomes 5-Year risk of moderate/serious myopathic events; moderate/serious liver dysfunction; acute renal failure and cataract.
Results The performance of three of the risk prediction algorithms in the THIN cohort was very good. For example, in women, the algorithm for moderate/serious myopathy explained 42.15% of the variation. The corresponding D statistics was 1.75. The acute renal failure algorithm explained 59.62% of the variation (D statistic=2.49). The cataract algorithm explained 59.14% of the variation (D statistic=2.46). The algorithms to predict moderate/severe liver dysfunction only explained 15.55% of the variation (D statistics=0.89). The performance of each algorithm was similar for both sexes when tested on the QResearch validation cohort.
Conclusions The algorithms to predict acute renal failure, moderate/serious myopathy and cataract could be used to identify patients at increased risk of these adverse effects enabling patients to be monitored more closely. Further research is needed to develop a better algorithm to predict liver dysfunction.
Statistics from Altmetric.com
Cardiovascular disease is the leading cause of premature death and a major cause of disability in the UK.1 The better identification and management of high-risk patients is a major priority.2 Validated cardiovascular risk prediction algorithms, such as QRISK2,3 4 are used to identify high-risk patients most likely to benefit from interventions including smoking cessation, weight reduction, blood controls and lipid modification using statins.5
The implementation of the Department of Health Vascular Screening Programme2 could result in an extra 1.8 million patients starting long-term statin treatment.5a While the benefits of statins are relatively well established for patients at high risk of cardiovascular disease, a number of studies have found that patients starting statins are at increased risk of adverse effects, including liver dysfunction,5a 7 myopathy,5a–9 acute renal failure6 7 and cataracts.5a
In order to consent to treatment, patients need information on their baseline cardiovascular risk and how that would be reduced with interventions. They also need information on the risks and benefits of interventions as indicated in both the British National Formulary and NICE guidelines on lipid modification.11 Utilities are therefore needed to individualise risk estimates for patients so that this information can be discussed during the consultation. Clinicians also need to identify those at greatest risk of adverse events so that they can be more closely monitored.
In this paper, we develop and validate four new risk prediction algorithms to quantify an individual's absolute risk of an adverse clinical outcome which is likely to change with statin use. The outcomes included are cataract, moderate/serious myopathy, moderate/serious liver dysfunction and acute renal failure, which we found to be associated with statin use in a companion paper.5a As with previous algorithms,3 12 13 we have used readily available data held within primary care electronic health records as this is both generalisable to the general population and can be integrated back into the same clinical setting for use at the point of care.
Study design and data source
We carried out a prospective cohort study in a large population of primary care patients from an open cohort study using the QResearch database (version 24). We included all practices in England and Wales which had been using their EMIS computer system for at least a year. We randomly allocated two-thirds of practices to the derivation dataset and the remaining third to a validation dataset. We identified an open cohort of patients aged 30–84 years drawn from patients registered with practices between 01 Jan 2002 and 30 June 2008. We excluded patients who did not have a postcode-related Townsend score (about 4% of the population) and those who had been prescribed statins before the study start date. Entry to the cohort was the latest of study start date, 12 months after the patient registered with the practice or date of first statin prescription. Patients were censored at the earliest date of the diagnosis of interest, death, deregistration with the practice, last upload of computerised data or the study end date (31 December 2008).
We used an identical sample of patients drawn from practices contributing to the validated THIN database, although the study end date for this sample was 30 June 2008, as an independent and external validation dataset.
We examined the following outcomes as these outcomes were found to be associated with statin use in our previous paper:5a
Acute renal failure (from relevant Read codes recorded during follow-up).
Moderate or serious liver dysfunction defined as alanine transaminase >120 IU/l (ie, more than three times upper limit of normal) among patients without diagnosed chronic liver disease at baseline as this is the severity at which guidelines recommend that treatment is discontinued.11
Moderate or serious myopathic events,8 9 14–16 which for our study were defined as a diagnosis of myopathy or rhabdomyolysis or raised creatine kinase of four or more times the upper limit of normal (>560 in women and >696 in men) as this represents an event where treatment is likely to be discontinued.
Cataract (from relevant Read codes recorded during follow-up)
Using the trials reported in a meta-analysis5 of statins in patients with risk factors for cardiovascular disease, we combined the results for major coronary events and cerebrovascular events using a random effects model to calculate a summary relative risk (RR) for cardiovascular disease only including trials which reported both outcomes and assuming that individual patients did not have both outcomes. We combined this RR with the QRisk2 cardiovascular disease score3 to estimate the risk of cardiovascular disease for an individual taking statins
Risk factors and statin exposure
As described in detail in the companion paper,5a we identified new users of statins during the study period, and excluded current and past users at baseline, with the remaining patients classified as non-users of statins. We compiled a long list of potential predictor variables which included statin use and established risk factors for each disease from the literature or existing risk prediction scores using similar definitions where possible.
Derivation and validation of the models
We developed and validated the risk prediction algorithms using established methods.3 9 13 17–19 We used multiple imputation to replace missing values for body mass index, and smoking status and used these values in our main analyses.20–23 We carried out five imputations. We used Cox's proportional hazards models to estimate the coefficients for each risk factor for men and women separately, using Rubin's rules to combine the results across the imputed datasets. We used fractional polynomials to model non-linear risk relationships with continuous variables.24 We included statins as a binary variable since our previous paper indicated no substantial difference between risk of most outcomes by type or dose of statin.5a We used backward selection procedures to eliminate variables using a p value of 0.01 to obtain parsimonious models.
We examined interactions between predictor variables and age, and we included significant variables and significant interaction terms in the final models. We took the regression coefficients for each variable from the final models and used these as weights which we combined with the baseline survivor function for each outcome evaluated at 5 years to derive risk equations at 5 years' follow-up.
We used multiple imputation in both validation cohorts to replace missing values for body mass index and smoking. We then applied the algorithms obtained from the derivation cohort to both validation cohorts and calculated measures of discrimination (D statistic25 and area under the receiver operating characteristic curve (ROC statistic)) and calibration (comparing observed with predicted risks by tenth of predicted risk). We applied each risk score to the QResearch validation cohort to define the threshold for the top 10% of patients at risk of each adverse event.
We used the THIN validation sample for our main validation as this is from practices using a different clinical computer system from that of QResearch practices. We used all the available data on each database to maximise the power and also generalisability of the results. We used STATA (version 11) for all analyses.
Numbers needed to treat/harm
As described in the companion paper in the BMJ,5a we also calculated the numbers needed to treat (NNT) or harm (NNH) for each outcome over 5 years for patients at high risk of cardiovascular disease based on a QRISK 2 score of ≥20%, since this high-risk group is eligible for statin treatment. We also calculated the number of additional cases per 10 000 patients treated over 5 years to enable comparisons and supplement data published elsewhere.
Overall study population
Overall, 557 QResearch practices in England and Wales met our inclusion criteria, of which 369 were randomly assigned to the derivation dataset with the remainder assigned to a validation cohort. We identified 2 121 786 patients aged 30–84 years in the QResearch derivation cohort. We excluded 9513 (0.45%) past users of statins and 107 581 (5.07%) current users, leaving 1 778 770 patients (83.83%) who had not been prescribed statins and 225 922 (10.65%) new users for the main analysis. Of the new users of statins, 159 790 (70.73%) were prescribed simvastatin, 50 328 (22.28%) atorvastatin; 8103 (3.59%) pravastatin; 4497 (1.99%) rosuvastatin and 3204 (1.42%) fluvastatin.
For the validation cohorts, we identified 188 practices from the QResearch database, from which we identified 118 372 new users of statins and 877 812 non-users of statins. We also identified a separate set of 382 practices from the THIN database. Of the 2 319 181 patients aged 30–84 years in the THIN database, we excluded 7743 (0.33%) past users of statins and 105 542 (4.55%) current users, leaving 1 923 840 who had not been prescribed statins, and 282 056 (12.16%) new users for the main analysis
The baseline characteristics of the three cohorts were very similar. Table 1 compares characteristics of the QResearch derivation cohort and the THIN validation cohort (characteristics of the QResearch validation cohort are available from the authors).
As in previous studies,3 12 13 the patterns of missing data supported the use of multiple imputation to replace missing values for smoking and body mass index (further details available from the authors). As previously described,5a compared with non-users, new users of statins in each cohort tended to be older, more likely to be male and have comorbidities such as atrial fibrillation, cardiovascular disease, peripheral vascular disease, treated hypertension, diabetes, and chronic kidney disease. They were also more likely to have computer recorded results for liver function tests at baseline and during follow-up. For example, in the derivation cohort, 58.14% of new users of statins had a recorded LFT at baseline compared with 9.12% of non-users. At follow-up the respective figures were 85.69% and 33.44%.
Overall, in the QResearch derivation cohort we identified 1969 incident cases of acute renal failure; 36 541 incident cases of cataract; 15 020 cases of moderate/serious liver dysfunction; 1406 incident cases of moderate/serious myopathy, of which 312 (22.19%) were serious. The corresponding figures for the QResearch validation cohort were 1329 for acute renal failure, 23 441 for cataract and 8546 for abnormal liver dysfunction and 750 for myopathy. The corresponding numbers for the THIN cohort were 2166 for acute renal failure, 27 158 for cataract, 17 430 for abnormal liver dysfunction and 1238 for moderate/serious myopathy.
Table 2 shows the predictor variables selected for the final model for myopathy. We found a threefold increased risk of moderate/serious myopathy in new users of statins for women and almost sixfold increase for men (table 2).
The risk of myopathy varied by ethnic group, with Caribbean and Black African groups having the highest risks. For example, compared with men in the white reference category, Caribbean men had a sixfold increase in risk (adjusted HR 6.57, 95% CI 4.55 to 9.48) and Black African men had nearly eightfold increased risk (adjusted HR 7.87 95% CI 5.30 to 11.68). Risks were also increased among men in the ‘Other ethnic group’.
Men prescribed corticosteroids had a twofold increase in risk and women a threefold increase. Women with type 1 diabetes had a fivefold increased risk. Hypothyroidism, type 1 diabetes, chronic liver disease and treated hypertension were significant risk factors for myopathy in women but not men.
The validation statistics (table 3) for myopathy showed that the risk prediction equation when applied to the THIN database explained 42.15% of the variation in women and 35.98% for men. The ROC statistics were 0.739 for women and 0.717 for men. The D statistic was 1.75 for women and 1.53 for men. The performance was very similar when tested on the QResearch validation sample.
Acute renal failure
Statin use was associated with a 56% increased risk of acute renal failure for women and a 61% increased risk for men (table 4). Both types of diabetes, congestive cardiac failure, corticosteroids, treated hypertension and pre-existing chronic kidney disease were all significant predictors of acute renal failure with highest risks apparent for those with type 1 diabetes and chronic kidney disease. Heavy smokers had a twofold increase in risk of acute renal failure in both men and women. We found no significant differences in risk by ethnic group.
The results of the validation statistics for the algorithms to predict acute renal failure (table 3) showed that the risk prediction equation when tested using the THIN database explained 59.62% of the variation in women and 59.68% for men. The ROC statistics for women were 0.831 and 0.847 for men. The D statistic was high at 2.49 in men and women, suggesting very good discrimination. The performance of the algorithms was very similar when tested on the QResearch validation sample.
Moderate/serious liver dysfunction
The significant risk factors for moderate/serious liver disease are shown in table 5. Statin use was associated with a 53% increased risk in men and women. There was a suggestion of a dose–response effect with smoking, with heavy smokers having the highest risks of liver dysfunction compared with non-smokers. Of the comorbidities, a diagnosis of rheumatoid arthritis was the strongest predictor with an adjusted HR of 2.42 (95% CI 2.13 to 2.76) for women and 1.90 (95% CI 1.53 to 2.36) for men.
The validation statistics for the algorithms to predict moderate/serious liver dysfunction showed that the risk prediction equation when tested using the THIN database only explained 15.55% of the variation in women and 10.83% for men. The ROC statistics values were 0.646 for women and 0.612 for men. The D statistic was low at 0.89 for women and 0.71 for men. The performance of the algorithms was slightly improved when tested on the QResearch validation sample.
Table 6 shows the adjusted HRs for each of the selected risk factors for cataract. Statin use was associated with a 30–32% increased risk in men and women. There were marked ethnicity differences in risk of cataract with all non-white ethnic groups having significantly higher risks than the white reference group despite adjustment for confounding variables. Smoking had a small effect on risk, whereas type 1 diabetes was associated with a more than ninefold increase in risk. Other significant predictors included type 2 diabetes, cardiovascular disease, rheumatoid arthritis (women only) and use of corticosteroids.
The results of the validation statistics for the algorithm to predict cataract (table 3) showed that the risk prediction equation when tested using the THIN database explained 59.14% of the variation in women and 59.42% for men. The ROC statistics for women were 0.878 and 0.869 for men. The D statistics were high at 2.46 for women and 2.48 for men. The performance of the algorithms was very similar when tested on the QResearch validation sample.
In the analysis combining the major coronary events, and, major cerebrovascular events in the seven trials reporting both outcomes presented in a recent meta-analysis,5 we calculated that, overall, these studies gave a RR of cardiovascular disease of 0.76 (95% CI 0.67 to 0.86) comparing patients in the statin intervention groups with the control groups.
Here are two clinical case histories illustrating the way in which 5-year risks of each outcome would change with statin usage taking account of the patient's medical profile (also see http://www.qintervention.org).
A 64-year-old Caribbean man with type 2 diabetes, hypothyroidism and chronic kidney disease, who is a heavy smoker, with a body mass index of 36.7 kg/m2, systolic blood pressure of 200 mm Hg, cholesterol/high-density lipoprotein ratio 4.3 would have the following 5 year risks on and off statins:
Cardiovascular risk 30% risk off statins, 23% risk on statins;
Acute renal failure 1.8% risk off statins, 2.9% risk on statins;
Cataract 12.7% risk off statins, 16.4% risk on statins;
Liver dysfunction 1.8% risk off statins, 2.8% risk on statins;
Moderate/serious myopathy 0.7% risk off statins, 4.4% risk on statins.
A 60-year-old white women, with rheumatoid arthritis, currently prescribed corticosteroids, heavy smoker, with a systolic blood pressure of 190 mm Hg, cholesterol/high-density lipoprotein ratio of 7; body mass index 33.1 kg/m2 would have the following 5-year risks on and off statins:
Cardiovascular risk 21% risk off statins, 16% risk on statins;
Acute renal failure 0.4% risk off statins, 0.7% risk on statins;
Cataract 7.1% risk off statins, 9.1% risk on statins;
Liver dysfunction 5.4% risk off statins, 8.2% risk on statins;
Moderate/serious myopathy 0.2% risk off statins, 0.7% risk on statins.
For these examples, the mean deprivation score has been used. A postcode in a deprived area would increase the risks of cardiovascular disease (men and women) and acute renal failure (men and women) and cataract (men only). Also note that in the example, 5-year risks of cardiovascular disease have been used for consistency, although 10-year risks are also available.
Thresholds and risk stratification
Since these are new algorithms and there are no established thresholds, we used the 90th centile of risk from the full QResearch validation population to define a high-risk group (ie, the top 10% at highest risk) for each outcome for men and women. We then determined the numbers and proportion of incident cases over 5 years in the QResearch validation cohort which fell within the top tenth of risk.
For example, for acute renal failure in women aged 30–84, the 90th centile defined a high-risk group with a 5-year risk score of >0.24%. There were 268 new cases of acute renal failure within this, which accounted for 58.77% of all new cases of acute renal failure in the population. The 90th centile for acute renal failure for men was 0.35%, which accounted for 50.08% of new cases over 5 years. For myopathy in women, then patients with a risk score of >0.14% would be included within the 10% at highest risk. Of all the new cases of myopathy which occurred in women in the QResearch validation cohort over 5 years, 32.24% were in this group. The corresponding figures for myopathy in men were 0.35% for the 90th centile, which accounted for 32.94% of cases in men. For liver dysfunction the 90th centile cut-off point was 1.34% in women and 1.44% in men and this accounted for 19.43% and 17.07% of cases, respectively. For cataract, the 90th centile cut-off point was 9.38% in women and 5.93% in men and this accounted for 50.24% of new cases in women and 57.79% of new cases in men.
Figure 1 shows the mean predicted and observed risk for each outcome across each tenth of predicted risk as a measure of calibration. The algorithms to predict cataract and acute renal failure were well calibrated in men and women as was the algorithm for myopathy in women. There was a small degree of overprediction for the algorithm to predict myopathy in men. The algorithm to predict liver dysfunction was the least well calibrated.
Numbers needed to treat and numbers needed to harm
Table 7 shows the numbers of patients needed to treat and numbers needed to harm for each outcome among patients aged 35–74 years at high risk of cardiovascular disease as defined based on the QRISK2 10-year cardiovascular risk score. The NNT with any statin to prevent one case of cardiovascular disease for women over 5 years was 37 (95% CI 27 to 64) and 33 (95% CI 24 to 57) for men. The NNH for an additional case of acute renal failure in women over 5 years was 434 (95% CI 284 to 783). In women for moderate/severe myopathy the NNH was 259 (95% CI 186 to 375), for moderate to severe liver dysfunction the NNH was 136 (95% CI 109 to 175) and for cataract it was 33 (95% CI 28 to 38). Overall, the NNH/NNTs for men were similar to those for women except for myopathy where the NNH was 91 (95% CI 74 to 112). This is lower than for women mainly owing to the higher HRs in men.
Table 7 also shows the estimated numbers of extra cases or cases prevented per 10 000 patients treated with statins at both thresholds. For example, there would be 271 fewer cases (95% CI 157 to 374) of cardiovascular disease for every 10 000 women using the 20% threshold compared with 228 fewer cases (95% CI 132 to 315) using the 15% threshold.
Summary of key findings
We have developed and externally validated four new risk prediction algorithms designed to predict the absolute risk of clinical outcomes which might change with statin usage in a large representative primary care population. The algorithms to predict acute renal failure, moderate/serious myopathy and cataract performed well in both validation cohorts and could be used to identify patients at increased risk of these clinical outcomes so that patients can be monitored more closely. While the algorithms have specifically been designed to help inform treatment choices on risks and benefits of statins, they can also be applied to the general population. The algorithm to predict risk of liver dysfunction did not perform sufficiently well to justify its use in patient care. However, given the increased risk of liver dysfunction in statin users overall, all statin users should have liver function tests performed according to existing guidelines.
Comparison with previous studies
Our study has good face validity since it has been conducted in the setting where the majority of patients in the UK are assessed, treated and followed up. While other studies have examined adverse effects of statins using meta-analyses of clinical trials6 26 or observational data,7 we think ours is the first to develop algorithms to individualise risk for the patient.7 Our study adds to previous studies by analysis of a much larger population over a longer time period.
We included acute renal failure as an outcome because of concerns published in the Lancet27 and on the FDA website28 together with reports of proteinuria in patients prescribed rosuvastatin.29 Although our definition of adverse outcomes is broadly consistent with that used elsewhere,26 studies vary in their definition of severity. Our definition of liver dysfunction is similar to that used in other guidelines11 and studies26 and is the level at which treatment should be discontinued.11 Our definition of acute renal failure was based on presence of a computer-recorded diagnostic code, whereas other studies7 have also required a computer code to indicate hospital admission. We think our definition is valid since it would be highly unusual for treatment of a patient with acute renal failure to be managed at home. Our definition of moderate/serious myopathy was based on computer-recorded diagnosis of myopathy or a creatine kinase of four or more times the upper limit of normal, which is similar to the value used in some studies,30 31 though other studies have used 10 or more times the upper limit of normal.7 We think our outcome definition is clinically useful since it will better identify a population at risk at a time when closer assessment and interventions might help reduce progression to more severe disease.
Also, we have been able to confirm associations between statin use and adverse outcomes including myopathy,26 30–32 liver dysfunction26 and acute renal failure26 which are consistent with those elsewhere and which further increases face validity. Other studies found inconsistent results for the association between statin use and cataract.10 33 For example, one study of statin use reported a decrease in risk of nuclear cataracts with a tendency for an increased risk of cortical cataracts,10 while another early study of pravastatin found no association but only studied 14 patients.33 Smeeth et al reported an increased risk of cataract of similar magnitude to that obtained in our study which did not vary by type, dose or duration of use.34 This increase was largely explained by an increase in consultation rates, which suggests that the increased risk may be more an ascertainment bias than a true biological effect.
We have also been able to provide some new information on the association between ethnicity and risk of each outcome. We found that Black Caribbean and Black African patients have the greatest risk of myopathy—six- to eightfold higher risks for men and four- to fivefold higher risks for in women compared with the White reference population. This is interesting given recent studies suggesting an association for genetic variants in statin-induced myopathy.31
Strengths and weaknesses
The strengths and weaknesses of using GP databases for the development and validation of clinical risk prediction algorithms has been described in detail elsewhere.3 18 In summary, key strengths include size, duration of follow-up, representativeness, and lack of selection, recall and respondent bias. Limitations include lack of formally adjudicated outcomes, information bias, potential for missing data, unmeasured confounding. While we are reliant on accuracy of information recorded by primary care doctors, we think that the quality of information is likely to be good since previous studies have validated similar outcomes and exposures using questionnaire data, and found levels of completeness and accuracy to be high.7 Nonetheless, it is possible that there is some overlap between some of the outcomes—for example, some cases of acute renal failure might be due to rhabdomyolysis.
Ascertainment bias might account for the increased risk of cataract since patients prescribed statins may be consulting their GP more frequently thereby increasing the opportunity for patients to report visual problems and be examined.34
A key strength of our study is that we have developed the algorithms in one cohort and validated it in two independent cohorts, each of which is representative of the patients likely to be offered statins. The size of our study is particularly important since some of the adverse events we have studied are relatively uncommon and so associations with statin use would not necessarily be detected in a much smaller clinical trial or meta-analysis.
While the performance of three of the algorithms was very good, the liver dysfunction algorithm did not perform sufficiently well to justify its clinical use. This might be due to the non-specific nature of liver dysfunction, which has many potential causes (and so is particularly susceptible to unmeasured confounding). In addition, it might be due to misclassification of the outcome since a large proportion of patients did not have an alanine transaminase value recorded in their computer record. This is despite recommendations that all patients have liver function tests 3 and 12 months after starting statin treatment. The algorithm could be remodelled once the recording of liver function tests is more complete.
Finally, while our study covers a broad age range (up to 84 years), we have not studied the very elderly, who may be prescribed statins and may be more susceptible to adverse effects.
The algorithms to predict risk of cataract, moderate/serious myopathy and acute renal failure could potentially be used within a consultation to assess and discuss the balance of risks and benefits at the start of statin treatment. They can also be used to identify patients at increased risk of adverse effects so that these patients can be monitored more closely. Patients at increased risk of acute renal failure could have lower doses of statins6 and more regular assessment of glomerular filtration rates, especially in the first year after starting treatment.6 Patients at high risk of myopathy could be warned to report any unusual muscular aches and pains and could be asked about these at their follow-up consultations. Clinicians could also be alerted of patients at high risk of myopathy via the computer system and be prompted to measure creatine kinase levels. Although ascertainment bias might account for the increased risk of cataract, nonetheless, such patients appear to be at increased risk and could be advised to report any visual symptoms and be followed-up by an optician if they are not already receiving regular follow-up because of a comorbid condition such as diabetes.
We have developed and validated four new risk prediction algorithms designed to quantify the absolute risk of moderate/serious myopathy, acute kidney failure, cataract and moderate/serious liver dysfunction associated with statin use in a large representative primary care population. Three of the algorithms could be used to identify patients at high risk of these clinical outcomes so that patients can be monitored more closely. Also, the algorithms can be used within a consultation to assess the balance of risks and benefits at the start of statin treatment.
What is already known on the subject
Meta-analyses of primary prevention trials indicate that statins reduce the risk of cardiovascular disease particularly in high-risk patients.
Meta-analyses provide valuable information on effectiveness but tend not to have representative samples or sufficient detail, duration of follow-up or power to fully assess unintended effects.
There is a lack of information on the unintended effects of statins in representative primary care populations, although such drugs are prescribed in large volumes for long periods of time.
What this study adds
We have developed and validated four new risk prediction algorithms that can be used to identify patients at high risk of clinical outcomes which change with statin treatment.
Three algorithms performed well in both validation datasets. These were acute renal failure, moderate/serious myopathy and cataract.
The performance of the algorithm to predict liver dysfunction was poor and more research is needed to produce a better algorithm.
The algorithms to predict acute renal failure, moderate/serious myopathy and cataract could be used to identify patients at high risk of adverse effects so that patients can be monitored more closely. While the algorithms have specifically been designed to help inform treatment choices on risks and benefits of statins, they can also be applied to the general population.
We acknowledge the contribution of EMIS practices who contribute to the QResearch and to David Stables (medical director, EMIS) for his expertise in establishing, developing and supporting the database. We also acknowledge the contribution of practices using InPractice Systems and to EPIC for the provision of THIN data.
Data Sharing statement The patient level data from the QResearch and THIN databases are specifically licensed according to the governance framework for each database. See http://www.qresearch.org for further details.
Algorithms and software The algorithms presented in this paper will be released as OpenSource Software under the GNU lesser GPL v3. The web calculator can be found at http://www.qintervention.org.
Funding The project was undertaken by ClinRisk Ltd. There was no external funding. Other Funders: work undertaken by ClinRisk Ltd.
Competing interests JHC is professor of clinical epidemiology at the University of Nottingham and co-director of QResearch—a not-for-profit organisation, which is a joint partnership between the University of Nottingham and EMIS (leading commercial supplier of IT for 60% of general practices in the UK). JHC is also director of ClinRisk Ltd which produces software to ensure the reliable and updatable implementation of clinical risk algorithms within clinical computer systems to help improve patient care. CC is associate professor of medical statistics at the University of Nottingham and a consultant statistician for ClinRisk Ltd. This work and any views expressed within it are solely those of the co-authors and not of any affiliated bodies or organisations.
Ethics approval This study was conducted with the approval of the Trent Multi-Centre Ethics Committee as part of the QResearch approval process.
Provenance and peer review Not commissioned; externally peer reviewed.
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.