Concomitant with the development of in vitro diagnostic multivariate index assays (IVDMIAs) to improve the diagnostic efficiency of ovarian cancer detection is the need to identify appropriate biostatistical approaches to assess improvements in risk prediction. In this study, we assessed the utility of three different approaches for comparing the diagnostic efficiency of an ovarian cancer multivariate assay in a retrospective case-control phase 2 biomarker trial. The control cohort included both disease-free women and women with benign gynecological conditions to more accurately reflect the target population of symptomatic women.
The study cohort comprised plasma samples from 244 healthy controls, 223 women with benign gynecological conditions, 53 borderline ovarian cancer cases and 222 women with malignant epithelial ovarian cancer. A multivariate classification model was developed that incorporated plasma concentrations of CA125, C-reactive protein (CRP), serum amyloid-A (SAA), interleukin-6 (IL6) and interleukin-8 (IL8), which were measured using in vitro diagnostic assays on medical device approved clinical analysers. The posterior probability values derived from the implemented algorithm were used to compare the diagnostic performance of the multianalyte panel and CA125 using three methods: area under the curve (AUC) of the receiver operating characteristic curve, integrated discrimination improvement (IDI) and net reclassification improvement (NRI).
Each of the biomarkers displayed significantly elevated plasma concentrations in malignant ovarian cancer patients compared with either benign or control subjects. For the discrimination of borderline and malignant ovarian cancer from control and benign subjects, the multivariate classification model showed a significantly greater AUC than that for CA125 alone (88.4% versus 84.3%, respectively, p < 0.001). At a posterior probability threshold of 0.5, the IVDMIA delivered a specificity of 92.3% and a sensitivity of 76.4%. When set at a specificity of 95%, the multimarker diagnostic delivered a sensitivity of 69.5% compared with 62.5% for CA125. Enhanced diagnostic performance of the IVDMIA over the use of CA125 alone was confirmed statistically by alternative comparisons using IDI and NRI.
This study confirms in an independent sample set that a blood-based multianalyte assay has significant advantages over CA125 for distinguishing symptomatic women with borderline and malignant ovarian cancer from controls or those with benign disease.
Keywords: Ovarian cancer; Tumour markers; Multianalyte; Diagnostic
Estimates by the International Agency for Research on Cancer indicate that the number of new cases of ovarian cancer for the year 2008 would reach 225,000 with 140,000 expected deaths from this disease in the same period. In the USA, it is estimated that ovarian cancer will account for 13,850 deaths in 2010, making it the fifth most lethal malignancy in females. Of particular significance is the fact that the distribution of ovarian cancers by stage at the time of diagnosis is dramatically skewed towards late stage disease, with only approximately 30% of ovarian cancers diagnosed when tumours represent localised or regionally contained disease. Overall, the five-year survival rate for patients in the USA diagnosed across all stages of ovarian cancer is 46%. Five-year survival rates among patients diagnosed with localised disease, however, are around 94%, but fall to 73% in patients diagnosed with regional malignancy and are only 28% in patients diagnosed with late stage disease. These data are consistent with the proposal that patients diagnosed with early stage ovarian malignancies have a distinct survival advantage and raise the possibility that improved methods to detect more early stage ovarian malignancies may provide improved clinical outcomes.
Owing to the low incidence of ovarian cancer in most developed populations (1 per 2,500 women per year), it has been suggested that an acceptable ovarian cancer screening test would need to perform with a minimum specificity of 99.6% to achieve a positive predictive value of 10% when screening the general population of post-menopausal women. At present, no screening method has been demonstrated to be sufficiently robust to allow for population-based screening for ovarian cancer.
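The relationship between incidence, specificity and positive predictive value quoted above can be checked with Bayes' rule. The following is a minimal sketch, not taken from the cited work; it assumes a perfectly sensitive test (which gives the upper bound on positive predictive value) and treats the annual incidence as the prevalence in the screened population.

```python
def positive_predictive_value(sensitivity, specificity, prevalence):
    """Bayes' rule: probability of disease given a positive test."""
    true_pos = sensitivity * prevalence
    false_pos = (1.0 - specificity) * (1.0 - prevalence)
    return true_pos / (true_pos + false_pos)


# Incidence of 1 per 2,500 women per year, taken from the text.
prevalence = 1 / 2500

# Assumption: sensitivity of 100%, the most favourable case.
ppv = positive_predictive_value(1.0, 0.996, prevalence)
print(f"PPV at 99.6% specificity: {ppv:.1%}")
```

Under these assumptions the positive predictive value comes out at roughly 9%, in line with the approximate 10% target; any realistic sensitivity below 100% lowers it further.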
CA125, a high molecular weight glycoprotein, remains the most widely used biomarker for confirmation of diagnosis and management of ovarian cancer. Although it is commonly used as an aid in the diagnosis of ovarian malignancy, it has significant limitations in terms of sensitivity and specificity. A review of pre-operative serum CA125 concentrations in ovarian cancer patients by FIGO stage and by histological type showed that CA125 was elevated in only 50% of stage I ovarian cancer cases and in 69% of mucinous ovarian tumours while being far more prominently expressed in patients with late stage serous tumours. Elevated circulating CA125 concentrations have also been documented in benign gynecological conditions, pregnancy and other malignancies, making CA125 less useful as a selective biomarker for the detection of ovarian cancer.
Some improvement in the preoperative diagnosis of ovarian cancer has been achieved by combining serum CA125 concentration, ultrasound score and menopausal status into a risk of malignancy index (RMI) which was shown to outperform the use of CA125 alone to discriminate between a benign and malignant pelvic mass . Furthermore, since as many as 20% of ovarian cancers express little or no CA125, it is likely that additional secreted biomarkers may be able to complement the use of CA125 to improve diagnostic efficiency . Different approaches have been taken to test the use of various multimarker panels that include CA125 to generate a multivariate model to predict the likelihood of ovarian cancer in various patient cohorts. Combinations of biomarkers and multivariate analyses have demonstrated increases in diagnostic efficiency for predicting ovarian malignancy in comparison to using CA125 alone [7-13]. Several of these multimarker tests are aimed at more accurately distinguishing between malignant and benign adnexal masses, thus allowing for more streamlined triage of these patients [10,11,14].
Previously, we reported the results of a retrospective case-control study that assessed the performance of a five biomarker panel (CA125, CRP, SAA, IL6 and IL8) and demonstrated increased diagnostic efficiency of this panel over CA125 alone as assessed by the area under the receiver operating characteristic curve (AUC) using a bootstrapping procedure . This initial multianalyte approach was limited to a comparison of normal healthy controls with confirmed cases of malignant ovarian cancer and demonstrated significant diagnostic advantage over CA125 for detection of both early and late stage ovarian malignancy.
A recent trend in the development of more efficient diagnostic tests has been the use of algorithm-based multivariate index assays. With the development of this new class of diagnostic, the discipline has sought new biostatistical approaches for assessing and quantifying incremental gains in diagnostic efficiency. Traditionally, the AUC has been used as a measure and comparator of diagnostic efficiency. Several investigators have argued that this measure alone may be imperfect and inefficient for comparing the true clinical usefulness of alternative marker panels [15,16]. It was observed that when evaluating improvement in risk assignment of biomarkers, very large odds ratios were often associated with very small increases in the AUC. This feature of receiver operating characteristic curve analysis limits its utility in identifying putative beneficial contributions of new biomarkers to algorithm-based models. As a result, alternative methods for comparison of diagnostic efficiency have been developed and successfully applied, including integrated discrimination improvement (IDI), which assesses improvement in risk discrimination based on the integral of sensitivity and specificity over all possible thresholds, and net reclassification improvement (NRI), which quantifies correct movement between categories in reclassification tables [15,16]. These methods were initially developed for the derivation of prognostic indicators from prospective cohort studies, but have been applied in the context of developing diagnostic indicators from case-control studies [17-19].
To further validate the efficacy of the previously described multianalyte panel and to test the utility of different approaches for assessing diagnostic efficiency, we have evaluated the performance of the IVDMIA on an independent cohort of 742 patient samples that included a significant proportion of benign gynecological pathologies that more accurately defines the target population. The performance of the IVDMIA was compared to CA125 alone using AUC bootstrapping approaches as well as IDI and NRI in order to determine the benefit of the multimarker model in correctly classifying women who present clinically with symptoms of ovarian cancer.
The study population comprised 244 apparently healthy normal women, 223 patients with benign gynecological conditions, 53 patients with borderline ovarian tumours and 222 patients with malignant ovarian tumours (Table 1).
Table 1. Characteristics of the study population
All patients underwent surgical removal of an ovarian mass or cysts, and pathology examination of tissue sections was used to provide a definitive diagnosis. Patients classified as malignant were women with histologically confirmed epithelial ovarian cancer, and patients classified as benign were women diagnosed with a range of common benign gynecological conditions including serous and mucinous cystadenoma, cystadenofibromas, endometriotic cysts, follicular cysts, fibrothecoma and endometriosis.
Plasma samples were collected with consent from healthy women or from patients prior to surgery or treatment and no special conditions were implemented in preparation for blood collection. The overall protocol was approved by the Mercy Hospital for Women Human Research and Ethics Committee (R09/06). The majority of the samples were provided from banked sample collections at the National University of Singapore, Victorian Cancer Biobank and the West Australian Research and Tissue Network from which part of the samples were collected via the Australian Ovarian Cancer Study. Samples obtained from normal apparently healthy women were collected with consent by ARL pathology as part of the study design. A further 234 samples were collected from gynecological oncology patients attending specialist clinics at the Mater Hospital, Brisbane, Australia and the Women's Clinic, Southend University Hospital, Essex, UK under local ethics approvals. Blood samples were collected into EDTA vacutainer tubes and samples stored as 250-1000 μL aliquots at -80°C until required for analysis. Samples were thawed once, dispensed into single use assay aliquots and were re-labelled to create a totally blind set for biomarker analysis. The order of assay for each blinded sample was further randomised to remove any possible assay bias.
The present study used a retrospective case-control design to test the efficacy of a biomarker panel to detect ovarian cancer in symptomatic women, similar to that previously described. The study cohort included confirmed cases of malignant epithelial ovarian cancer and borderline ovarian tumours as well as control patients with benign gynecological pathologies. A group of apparently healthy age-matched normal women was included as part of the control cohort.
Five biomarkers, CA125, CRP, SAA, IL6 and IL8, previously associated with ovarian cancer  were analysed in each sample using the clinical pathology platforms Immulite and BN-II (Siemens Healthcare Diagnostics). CA125 (Siemens OM-MA assay), IL6 and IL8 were analysed on the Immulite while CRP and SAA were analysed on the BN-II platform. Analytes were analysed sequentially on each instrument from a single sample aliquot to avoid multiple freeze-thaw cycles and sample variation. All assays were performed as per the manufacturer's instructions. QC measurements were within the expected ranges and coefficients of variation for the assays performed were less than 5% (CA125, CRP, IL6 and IL8) and 8.2% (SAA). The limit of analytical sensitivity as specified by the manufacturer was 1 U/mL (CA125), 0.15 mg/L (CRP), 0.80 mg/L (SAA), 1 pg/mL (IL6) and 2.5 pg/mL (IL8). In the event that sample determinations delivered values at the limit of analytical sensitivity of the assay, values were entered for analysis as limit of sensitivity/2 so that a definitive lower value could be used in statistical analyses.
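The substitution rule for low readings described above can be sketched as follows. The limits are those quoted in the text; the function name, the sample readings and the choice to treat readings at or below the limit identically are illustrative assumptions, not details given by the authors.

```python
# Manufacturer-specified limits of analytical sensitivity, from the text.
LIMIT_OF_SENSITIVITY = {
    "CA125": 1.0,  # U/mL
    "CRP": 0.15,   # mg/L
    "SAA": 0.80,   # mg/L
    "IL6": 1.0,    # pg/mL
    "IL8": 2.5,    # pg/mL
}


def substitute_low_value(analyte, value):
    """Return limit/2 for a reading at (or, by assumption, below) the limit."""
    limit = LIMIT_OF_SENSITIVITY[analyte]
    return limit / 2 if value <= limit else value


# Hypothetical readings for one sample (not study data).
sample = {"CA125": 35.2, "CRP": 0.15, "SAA": 4.1, "IL6": 1.0, "IL8": 12.0}
cleaned = {name: substitute_low_value(name, v) for name, v in sample.items()}
print(cleaned)
```

Replacing censored readings with half the limit gives every sample a definite numeric value, which rank-based tests and the classification model both require.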
Statistical comparison of multiple groups was assessed using the Kruskal-Wallis test and Dunn's multiple comparison was employed as a post-hoc test to determine differences between groups. For two sample group comparisons, statistical significance was determined using the Mann Whitney test (GraphPad Prism, La Jolla, CA, USA). Comparison of patient age across the groups was performed using one-way ANOVA followed by Tukey's multiple comparison test. In all cases, a p value < 0.05 was considered to be statistically significant.
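For readers unfamiliar with the rank-based multi-group comparison named above, the Kruskal-Wallis H statistic can be computed from pooled ranks as in the sketch below. This is an illustrative pure-Python version with a tie correction, not the GraphPad Prism implementation used in the study; the three small groups are invented values, and Dunn's post-hoc test is not shown.

```python
from collections import Counter


def midranks(values):
    """Rank values from 1..N, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average = (i + j) / 2 + 1  # positions are 0-based, ranks are 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = average
        i = j + 1
    return ranks


def kruskal_wallis_h(groups):
    """Tie-corrected Kruskal-Wallis H statistic for a list of groups."""
    pooled = [v for g in groups for v in g]
    n_total = len(pooled)
    ranks = midranks(pooled)
    weighted = 0.0
    start = 0
    for g in groups:
        rank_sum = sum(ranks[start:start + len(g)])
        weighted += rank_sum ** 2 / len(g)
        start += len(g)
    h = 12.0 / (n_total * (n_total + 1)) * weighted - 3 * (n_total + 1)
    ties = sum(t ** 3 - t for t in Counter(pooled).values())
    return h / (1 - ties / (n_total ** 3 - n_total))


# Hypothetical concentrations for three groups (not study data).
h_stat = kruskal_wallis_h([[1, 2, 3], [4, 5, 6], [7, 8, 9]])
print(h_stat)
```

The p value is then obtained by referring H to a chi-square distribution with (number of groups - 1) degrees of freedom.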
Multivariate model development and statistical comparisons of biomarker models were performed by an independent biostatistician (Emphron Informatics Pty Ltd, Toowong, Qld, Australia). A multivariate classification model that incorporated all five biomarkers was developed using a stochastic gradient boosting model with a logistic loss function as previously described. The implemented classification algorithm reported a posterior probability value (i.e. the likelihood that a sample came from a woman with ovarian cancer) for each patient sample using leave-one-out cross-validation. The cross-validated predicted probabilities were used to generate the ROC curve for the IVDMIA. Comparisons between the diagnostic efficiency of CA125 alone and the IVDMIA were first tested by assessing the AUC as calculated using the Wilcoxon statistic. Because the AUCs for CA125 and for the biomarker panel are based on the same patients and are therefore not statistically independent, the difference in AUC between the diagnostics was statistically assessed using a bootstrap procedure. The number of bootstrap samples used in this analysis was n = 10,000, the estimators considered were the AUC and the difference between the AUCs, and the measures of accuracy were the 95% confidence intervals.
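The AUC comparison described above can be illustrated with a short sketch. It computes the AUC as the Wilcoxon (Mann-Whitney) probability that a case scores higher than a control, and builds a percentile bootstrap interval for the paired AUC difference by resampling whole patients so that the correlation between the two scores is preserved. The function names, the percentile interval, the unstratified resampling and the toy scores are assumptions for illustration; the study's exact bootstrap scheme is not specified beyond the 10,000 resamples.

```python
import random


def auc_wilcoxon(case_scores, control_scores):
    """AUC as the probability that a case outscores a control (ties count 0.5)."""
    wins = 0.0
    for c in case_scores:
        for k in control_scores:
            if c > k:
                wins += 1.0
            elif c == k:
                wins += 0.5
    return wins / (len(case_scores) * len(control_scores))


def bootstrap_auc_difference(labels, score_a, score_b, n_boot=500, seed=0):
    """Percentile 95% CI for AUC(a) - AUC(b), resampling patients jointly."""
    rng = random.Random(seed)
    n = len(labels)
    diffs = []
    while len(diffs) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        cases = [i for i in idx if labels[i] == 1]
        controls = [i for i in idx if labels[i] == 0]
        if not cases or not controls:
            continue  # redraw if a resample lacks one of the classes
        auc_a = auc_wilcoxon([score_a[i] for i in cases],
                             [score_a[i] for i in controls])
        auc_b = auc_wilcoxon([score_b[i] for i in cases],
                             [score_b[i] for i in controls])
        diffs.append(auc_a - auc_b)
    diffs.sort()
    return diffs[int(0.025 * n_boot)], diffs[int(0.975 * n_boot) - 1]


# Toy data (not study data): 6 cases followed by 6 controls.
labels = [1] * 6 + [0] * 6
panel = [0.9, 0.8, 0.7, 0.85, 0.6, 0.75, 0.2, 0.3, 0.1, 0.4, 0.35, 0.25]
marker = [0.9, 0.3, 0.7, 0.2, 0.6, 0.75, 0.2, 0.5, 0.1, 0.4, 0.65, 0.25]
low, high = bootstrap_auc_difference(labels, panel, marker)
print(f"95% CI for the AUC difference: {low:.3f} to {high:.3f}")
```

A confidence interval for the difference that excludes zero indicates a significant difference in AUC. The default of 500 resamples only keeps the sketch fast; the study used 10,000.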
Statistical differences in diagnostic efficiency between CA125 and the IVDMIA were further assessed using: IDI that is based on a measure of separation in predicted probabilities for case and control groups; and NRI that assesses reclassification tables and quantifies the correct movement in categories [15,16]. Although originally developed in the context of assessing the contribution of additional markers to prognostic indicators using prospective cohort studies, these techniques have been adapted for retrospective case control studies [17-19].
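The two indices can be written down compactly. The sketch below follows the standard definitions: IDI is the gain in mean predicted-probability separation between cases and controls, and category-based NRI sums the net proportion of cases moving to a higher risk category and of controls moving to a lower one. The single 0.5 cutoff, the function names and the toy probabilities are assumptions for illustration; the text does not state which risk categories were used, nor how CA125 alone was converted to a probability.

```python
def mean(xs):
    return sum(xs) / len(xs)


def idi(labels, p_old, p_new):
    """Integrated discrimination improvement of p_new over p_old."""
    def separation(p):
        cases = [x for x, y in zip(p, labels) if y == 1]
        controls = [x for x, y in zip(p, labels) if y == 0]
        return mean(cases) - mean(controls)
    return separation(p_new) - separation(p_old)


def nri(labels, p_old, p_new, cutoffs=(0.5,)):
    """Category-based net reclassification improvement."""
    def category(p):
        return sum(p >= c for c in cutoffs)
    n_cases = sum(labels)
    n_controls = len(labels) - n_cases
    case_net = control_net = 0
    for y, old, new in zip(labels, p_old, p_new):
        move = category(new) - category(old)
        direction = (move > 0) - (move < 0)  # +1 up, -1 down, 0 unchanged
        if y == 1:
            case_net += direction      # upward moves are correct for cases
        else:
            control_net -= direction   # downward moves are correct for controls
    return case_net / n_cases + control_net / n_controls


# Toy probabilities (not study data): 3 cases followed by 3 controls.
labels = [1, 1, 1, 0, 0, 0]
p_single = [0.6, 0.4, 0.3, 0.6, 0.2, 0.4]
p_panel = [0.8, 0.7, 0.4, 0.3, 0.1, 0.45]
idi_value = idi(labels, p_single, p_panel)
nri_value = nri(labels, p_single, p_panel)
print(idi_value, nri_value)
```

Positive values of either index favour the new model. Their significance can be assessed by bootstrapping patients, in the same way as for the AUC difference.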
Characteristics of the study population
The age range for the normal control, benign, borderline and malignant ovarian cancer groups was similar (Table 1), however, the mean age of the malignant ovarian cancer group was significantly higher compared with the other groups (51 vs 49 vs 48 vs 58 respectively, p < 0.001). The control, non-malignant cohort was made up of 244 (52.2%) normal control women and 223 (47.8%) patients with benign gynecological lesions with a mean age of 50 ± 14 (SD). The malignant ovarian cancer cohort comprised 130 (58.5%) serous, 19 (8.6%) endometrioid, 16 (7.2%) mucinous, 16 (7.2%) clear cell and 41 (18.5%) of other types that included predominantly mixed forms and adenocarcinomas with no specific histotype recorded (Table 1). A total of 33 (14.9%) of malignant ovarian cancer samples had no staging data reported, 42 (18.9%) were diagnosed with Stage I disease, 27 (12.2%) with Stage II, 106 (47.7%) Stage III and 14 (6.3%) with Stage IV disease (Table 1).
Plasma biomarker concentrations in control, benign, borderline and malignant ovarian cancer patients
The distribution of plasma concentrations of each tumour marker in individual patient samples is shown in Figure 1. Circulating concentrations of all measured biomarkers showed a significant difference (p < 0.05) between normal control samples and patients with malignant disease. Each biomarker also demonstrated a significant difference in plasma concentrations between benign and malignant tumour groups (p < 0.05), indicating that these analytes could discriminate between either normal controls or benign cases and those with malignant epithelial ovarian carcinoma. Concentrations of CRP, SAA and IL6 were indistinguishable between normal controls and patients with benign conditions. Patients with borderline ovarian tumours displayed significantly elevated concentrations of CA125 compared with either normal controls or benign patients, while IL6 and IL8 concentrations were slightly elevated in borderline patients compared with normal controls (Figure 1 and Table 2).
Figure 1. Comparison of individual patient plasma biomarker concentrations across all patient groups. A total of 244 normal controls (Con), 223 patients with benign gynecological conditions (Ben), 53 patients with borderline ovarian tumours (B/L) and 222 cases of malignant epithelial ovarian cancer (Malignant) were analysed. Open circles indicate the individual values of each patient measurement shown on a log scale and horizontal lines show median concentration of each group. *: p < 0.05, compared with Con group; ^: p < 0.05 compared with Ben group (Kruskal-Wallis test followed by Dunn's multiple comparison test)
Table 2. Plasma biomarker concentrations
The median plasma concentration of all biomarkers tested was significantly elevated (p < 0.05) in patients diagnosed with malignant serous ovarian carcinoma as well as ovarian carcinomas of non-serous histotypes compared with either normal controls or patients with benign conditions (Figure 2). Only circulating concentrations of CA125 were significantly different (p < 0.05) between patients with serous ovarian cancer compared with patients with non-serous tumours, with median CA125 concentrations of serous patients being approximately three-times higher than those in non-serous patients (Figure 2). No significant difference was found in concentrations of CRP, SAA, IL6 or IL8 between serous and non-serous ovarian cancer patients (Figure 2), suggesting that unlike CA125, these biomarkers could more effectively discriminate non-serous ovarian cancer cases from control and/or benign patients.
Figure 2. Comparison of individual biomarker concentrations in plasma of patients with serous versus non-serous malignant ovarian cancer. A total of 244 normal controls (Con), 223 patients with benign gynecological conditions (Ben), 130 cases of malignant serous and 92 cases of non-serous ovarian cancer were analysed. Open circles indicate the individual values of each patient measurement shown on a log scale and horizontal lines show median concentration of each group. *: p < 0.05, compared with Con group; ^: p < 0.05 compared with Ben group (Kruskal-Wallis test followed by Dunn's multiple comparison test)
Further analysis demonstrated that each of the biomarkers tested could significantly discriminate between normal controls and either early (Stages I-II) or late (Stages III-IV) stage ovarian cancer patients (Table 2). Similarly, all plasma biomarkers measured were significantly higher in early stage ovarian cancer patients compared with those with benign conditions (Table 2). Only IL6 and IL8 demonstrated a significant elevation in plasma concentration between borderline tumours and early stage (Stage I-II).
Multivariate modelling and comparisons of diagnostic performance
We first compared the AUC-ROC of CA125 with that of the multimarker panel modelled using only normal controls (n = 244) and malignant ovarian cancer patients (n = 222). The AUC of the IVDMIA was significantly greater than that of CA125 alone (94.9% vs 91.9%, p = 0.007).
Comparisons of the AUC of CA125 with that of the IVDMIA for discrimination of control/benign from borderline and malignant epithelial ovarian cancer patients are shown in Table 3. Sensitivities are reported at the defined specificities of 90% and 95%. The use of CA125 as a single biomarker delivered an AUC of 84.3% with a sensitivity of 62.5% at a specificity of either 90% or 95%. The IVDMIA delivered an AUC of 88.4% with a sensitivity of 77.5% at 90% specificity and a sensitivity of 69.5% at a defined specificity of 95%. The AUC was significantly different (p < 0.001) between CA125 and the IVDMIA (Table 4), and the sensitivity of the IVDMIA was 15 percentage points higher than that of CA125 at 90% specificity and 7 percentage points higher at 95% specificity.
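Reporting sensitivity at a defined specificity amounts to choosing the lowest score threshold at which the required fraction of controls test negative and then counting the cases above it. The following is a minimal sketch of that procedure; the function name, the convention that a score strictly above the threshold is positive, and the toy scores are assumptions for illustration.

```python
def sensitivity_at_specificity(case_scores, control_scores, target_specificity):
    """Return (threshold, sensitivity) at the lowest threshold meeting the target.

    A sample is called positive when its score is strictly above the threshold.
    """
    candidates = sorted(set(case_scores) | set(control_scores))
    for threshold in candidates:
        negatives = sum(s <= threshold for s in control_scores)
        if negatives / len(control_scores) >= target_specificity:
            positives = sum(s > threshold for s in case_scores)
            return threshold, positives / len(case_scores)
    raise ValueError("target specificity must not exceed 1.0")


# Toy posterior probabilities (not study data).
controls = [0.05, 0.10, 0.15, 0.20, 0.25, 0.30, 0.35, 0.40, 0.45, 0.80]
cases = [0.30, 0.50, 0.60, 0.70, 0.90]
result_90 = sensitivity_at_specificity(cases, controls, 0.9)
result_100 = sensitivity_at_specificity(cases, controls, 1.0)
print(result_90, result_100)
```

Fixing the specificity in this way lets two tests with different score scales, such as a CA125 concentration and a posterior probability, be compared on equal terms.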
Table 3. ROC-AUC comparison of 5-marker multianalyte test with CA125
Table 4. Statistical comparison of diagnostic efficiency between the 5-marker multianalyte test and CA125
The relationship between predicted posterior probability values for individual patient samples across all groups and within the combined control + benign versus borderline + malignant ovarian cancer groups is shown in Figure 3, which shows an incremental increase in posterior probability values across benign, borderline and malignant ovarian tumour patients. The algorithm-derived posterior probability values differed significantly between control/benign and borderline/malignant epithelial ovarian cancer patients (p < 0.0001).
Figure 3. Association between patient group and predicted posterior probability values. A) Scatter plots showing the distribution of predicted posterior probability values for each patient sample across all groups and B), the predicted posterior probability values for each patient sample within the Control + Benign versus Borderline + Malignant groups. Horizontal bars represent median values for each group. For multi-group comparisons, *: p < 0.001, compared with Con group; ^: p < 0.001 compared with Ben group (Kruskal-Wallis test followed by Dunn's multiple comparison test). For two group comparisons, p < 0.0001, compared with Con + Ben group (Mann Whitney test)
The predictive ability of the IVDMIA was further compared with that of CA125 using two additional statistical approaches (Table 4). Both the bootstrapped IDI comparison (p < 0.001) and NRI analysis (p < 0.001) demonstrated that the diagnostic performance of the IVDMIA was significantly enhanced in comparison to the use of CA125 alone for the discrimination of borderline and malignant ovarian cancer patients from the control/benign group.
Based on using a posterior probability threshold of 0.5, the multianalyte panel delivered a specificity of 92.3% and a sensitivity of 76.4%. Using this defined threshold of 0.5 for discrimination of borderline and malignant cases from the control/benign group, the multimarker algorithm correctly predicted 97.9% of normal controls, 86.1% of patients with benign lesions, 49.1% of borderline cases and 82.9% of malignant ovarian cancers of all stages. Of the ovarian malignancies, 88.5% of all serous tumours, 57.9% of all endometrioid tumours, 62.5% of all mucinous tumours, 81.2% of all clear cell tumours and 85.4% of other epithelial ovarian cancer plasma samples were correctly identified (Table 5). The IVDMIA correctly predicted 91.7% of the late stage (Stages III-IV) samples, and 69.6% of the early stage (Stages I-II) malignant ovarian cancer samples (Table 5).
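The overall specificity and sensitivity at the 0.5 threshold can be reconciled with the per-group classification rates by pooling the groups, weighted by their sizes. The sketch below uses the group sizes and rates reported in the text; rounding each group to whole patients is an assumption made to recover the underlying counts.

```python
# Group sizes and correct-classification rates at threshold 0.5, from the text.
groups = {
    "normal": (244, 0.979),
    "benign": (223, 0.861),
    "borderline": (53, 0.491),
    "malignant": (222, 0.829),
}


def pooled_rate(names):
    """Pool per-group rates into one rate, rounding each group to whole patients."""
    correct = sum(round(groups[n][0] * groups[n][1]) for n in names)
    total = sum(groups[n][0] for n in names)
    return correct / total


specificity = pooled_rate(["normal", "benign"])         # non-cancer groups
sensitivity = pooled_rate(["borderline", "malignant"])  # cancer groups
print(f"specificity {specificity:.1%}, sensitivity {sensitivity:.1%}")
```

The pooled figures agree with the reported specificity of 92.3% and sensitivity of 76.4%, which confirms that these overall values are driven largely by the normal control and malignant groups, with the borderline group lowering the sensitivity.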
Table 5. Proportion of samples correctly classified by the multimarker algorithm
The primary aims of this study were to: (i) further validate the efficacy of an IVDMIA to correctly classify ovarian cancer in symptomatic women; and (ii) establish the utility of three different methods for assessing incremental diagnostic performance. The performance of the IVDMIA was compared with CA125 alone using AUC, IDI and NRI. An independent cohort of 742 patient samples, derived from multiple collection sites and including a substantial proportion of women with benign gynecological conditions and borderline ovarian cancer, was used to address these aims. The diagnostic performance of the IVDMIA was shown to be superior to that of CA125 alone as demonstrated by comparison of AUC and by two new measures of performance, IDI and NRI, that offer incremental information over AUC.
Each of the five biomarkers tested showed significant elevation in malignant epithelial ovarian cancer patient plasma compared with either normal control subjects or subjects with confirmed benign gynecological lesions, suggesting that each marker individually showed some ability to discriminate malignant from non-malignant conditions. Furthermore, only two of the five markers, CA125 and IL8, displayed elevated concentrations in plasma of benign patients compared with normal controls indicating that CRP, SAA and IL6 should be most effective in classifying benign lesions as non-malignant. While all of the five biomarkers examined were significantly elevated in plasma samples taken from patients diagnosed with either epithelial ovarian malignancies of serous and non-serous histotypes, only CA125 concentrations were significantly lower in the non-serous group, consistent with previous studies . The differential expression of CA125 between serous and non-serous ovarian cancer histotypes suggests that additional biomarkers such as CRP, SAA, IL6 and IL8 may complement the diagnostic efficacy of CA125, particularly for non-serous histotypes.
Consistent with our previous study , we confirm that if the multimarker model was constructed using only the control and malignant ovarian cancer groups, the IVDMIA delivered a significant diagnostic advantage over the use of CA125 alone, indicating consistent performance of the panel in an independent sample set. The preferred multimarker model was then constructed using the broader combination of normal control and benign samples versus borderline and malignant ovarian cancer samples. When biomarker data was combined into a multivariate classification model to generate cross-validated posterior probability values to generate a ROC curve as previously described , the resulting AUC for the IVDMIA was shown to be significantly greater than that observed for CA125 alone for the discrimination of control/benign samples from borderline/malignant patients. Comparison of sensitivity of the multimarker panel with CA125 at a fixed specificity of 95% demonstrated enhanced performance of the multivariate index (69.5% vs 62.5%).
The multianalyte test delivered posterior probability values across the subject groups that displayed an incremental increase from benign to borderline to malignant ovarian cancer patients. Although the borderline group were the most difficult to accurately predict, combining borderline and malignant ovarian cancer patients resulted in a highly significant difference in posterior probabilities compared with the control/benign group. Using a posterior probability threshold of 0.5, the IVDMIA delivered a specificity of 92.3% and a sensitivity of 76.4%. The inclusion of patients with benign lesions and borderline tumours into the present sample cohort resulted in slightly lower overall AUC and sensitivity and specificity of both the multimarker panel and CA125 than observed in our previous biomarker trial, however, the statistically significant improvement in AUC and sensitivity and specificity over CA125 in this broader independent study cohort was maintained.
Although the AUC has become the most widely used measure of comparing performance of models for binary outcomes, it has become apparent that with a reasonably efficient marker or model, relatively small changes in AUC may not adequately describe the true clinical incremental contribution of adding new markers or of an alternative model . Such observations have led to the development of new indices of classification improvement, the IDI that is based on the integral of sensitivity and specificity over all possible thresholds and can be used to quantify the increase in separation of case and controls and the NRI that is based on reclassification tables that quantify the correct movement in categories [15,16]. It has been suggested that these new measures offer incremental information over the AUC and should be considered in addition to AUC when assessing the differential performance of new models [15,16]. In order to further validate the overall diagnostic performance of the multimarker panel compared with CA125 for the discrimination of borderline and malignant ovarian cancer patients from control/benign patients, cross validated probabilities from the multimarker model were compared with CA125 by IDI and NRI. Using these alternative approaches to assess improvement in diagnostic performance also demonstrated the significant advantage (p < 0.001) of the IVDMIA over CA125. In summary, this study confirms, using three different statistical methodologies that the five-marker multianalyte panel provides significantly better diagnostic performance than CA125 for the discrimination of borderline and malignant ovarian cancer plasma samples from control and benign subjects.
A variety of approaches have been proposed for designing and testing multianalyte panels as aids for the diagnosis of ovarian cancer. Several multimarker panels have been shown to have considerable predictive advantage over the use of CA125 alone in different patient cohorts and settings as well as in different study designs that include retrospective, longitudinal and prospective studies [7,9-11,13,23]. While some marker panels are aimed at discriminating benign from malignant adnexal masses prior to surgery [10,11,14,24], others are being developed as panels for the early detection of ovarian cancer that may ultimately serve as part of a multi-step screening process. In a prospective study of women undergoing surgery for adnexal mass, Moore et al. reported that the combination of CA125 and HE4 delivered significantly better discrimination of benign disease versus ovarian cancer than did CA125 alone, with a cross-validated sensitivity of 76.4% at 95% specificity. A similar study, using this dual biomarker panel and separate algorithms to assess the risk of epithelial ovarian cancer in premenopausal and postmenopausal women with pelvic mass, demonstrated sensitivities and specificities of 92.3% and 74.7% versus 76.5% and 74.8% in the postmenopausal and premenopausal groups respectively. A more recent study assessed the clinical utility of replacing CA125 with a five-marker multianalyte test in the American College of Obstetricians and Gynecologists referral guidelines for women with pelvic mass. This study evaluated 516 women with ovarian mass and demonstrated that substituting CA125 with the multimarker test led to increased sensitivity (94% vs 77%) but decreased specificity (35% vs 68%) across all patients.
CA125 is still the most widely used biomarker test that shows clinical utility in the monitoring of ovarian malignancy as well as the preoperative diagnosis of suspected ovarian cancer. The present biomarker panel described here is aimed at providing an alternative to CA125 that will deliver higher diagnostic efficiency that can be used as an aid in the early preoperative diagnostic process. It has been recently suggested that a limitation of several studies examining the efficacy of biomarker panels for the early prediction of ovarian cancer is that samples are sourced predominantly from symptomatic rather than asymptomatic women . Furthermore, this study suggested that several previously published multimarker panels that displayed apparently better performance than CA125 failed to do so in a study using prediagnostic samples . While this is a particularly important point of consideration if the intended use of the biomarker panel is for screening of pre-symptomatic women, it is less critical if the intended use is as a diagnostic marker panel to aid assessment of symptomatic patients, as is the case with the current study.
The biomarkers measured in this study represent proteins that are known to be expressed and released from malignant ovarian epithelium as well as proteins that may be induced and released from other sites as part of an acute and/or ongoing inflammatory response or response to injury. It is well established that the CA125 epitope is contained in MUC16, a transmembrane glycoprotein that is expressed in epithelial ovarian cancer cells and subsequently shed into the circulation, thereby providing a measurable blood biomarker. The acute phase protein CRP is produced predominantly by hepatocytes, and its elevated concentration in the serum of ovarian cancer patients has been shown to be independently associated with FIGO stage and overall 5-year survival. Circulating blood concentrations and expression of SAA, IL6 and IL8 have been shown to be correlated with ovarian tumour stage and also with patient survival [29-32]. While the source of circulating SAA, IL6 and IL8 in ovarian cancer patients is not clear, it has been demonstrated that these proteins are expressed and secreted from ovarian cancer cells [30,32,33]. Furthermore, there is evidence that IL6 and IL8 in particular can exert biological actions that influence ovarian cancer cell growth and migration [30,34,35]. Taken together, there is substantial evidence to suggest that although several of these biomarkers are traditionally considered to be systemic acute phase or inflammatory markers, expression of most of these proteins can occur locally in the malignant ovarian epithelium, where they may have biological actions on the growth and development of these tumours.
The study reported here is part of a larger multi-site, multi-national phase 2 biomarker evaluation and serves to validate the enhanced performance of a multianalyte panel over that of CA125 in an independent sample set comprising borderline and malignant epithelial ovarian cancer patients and a control cohort made up of both healthy women and women diagnosed with benign gynecological lesions. We have established a statistically significant increase in the performance of the multimarker test using the traditional and well established comparison of AUC as well as two newer measures of the performance of predictive models, IDI and NRI. The current data demonstrate that the biomarker panel has utility as an improved diagnostic aid for assessing the likelihood of ovarian cancer in clinically presenting symptomatic women.
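For readers less familiar with IDI and NRI, the following Python sketch shows how the two measures can be computed from the predicted probabilities of an old and a new model, following the general definitions of Pencina et al. It is an illustration under stated assumptions and not the analysis code of this study: the function names, the single risk threshold of 0.5 used to define reclassification, and the toy probabilities are invented for the example.

```python
from statistics import mean


def idi(p_old, p_new, y):
    """Integrated discrimination improvement.

    Change in mean predicted probability among events minus the change
    among non-events (y: 1 = event, 0 = non-event).
    """
    events = [i for i, t in enumerate(y) if t == 1]
    nonevents = [i for i, t in enumerate(y) if t == 0]
    gain_events = mean(p_new[i] for i in events) - mean(p_old[i] for i in events)
    gain_nonevents = mean(p_new[i] for i in nonevents) - mean(p_old[i] for i in nonevents)
    return gain_events - gain_nonevents


def nri(p_old, p_new, y, threshold=0.5):
    """Net reclassification improvement with two risk categories.

    The single threshold is an assumption for this sketch; other category
    definitions are possible.
    """
    def net_upward(indices):
        # Proportion moved above the threshold minus proportion moved below it.
        up = sum(1 for i in indices if p_old[i] < threshold <= p_new[i])
        down = sum(1 for i in indices if p_new[i] < threshold <= p_old[i])
        return (up - down) / len(indices)

    events = [i for i, t in enumerate(y) if t == 1]
    nonevents = [i for i, t in enumerate(y) if t == 0]
    # Upward moves are desirable for events and undesirable for non-events.
    return net_upward(events) - net_upward(nonevents)


# Toy data (invented): four events followed by four non-events.
y_toy = [1, 1, 1, 1, 0, 0, 0, 0]
p_single = [0.9, 0.6, 0.4, 0.3, 0.6, 0.4, 0.2, 0.1]  # e.g. one marker alone
p_panel = [0.9, 0.7, 0.6, 0.3, 0.4, 0.3, 0.2, 0.1]   # e.g. a multimarker model

idi_value = idi(p_single, p_panel, y_toy)
nri_value = nri(p_single, p_panel, y_toy)
```

Positive values of either measure indicate that the new model separates events from non-events better than the old one. Unlike a difference in AUC, both measures are computed directly from the change in each subject's predicted probability.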
This study was funded as part of the research and development activities of Healthlinx Ltd. DJA, LR, KK and KB are employees of Healthlinx Ltd and GER is non-executive chairman of Healthlinx Ltd. DJA is an inventor on patent applications that are related to the current study.
DJA was responsible for final study design, collation of all clinical data, preparation of blinded sample lists for assay, statistical analysis, data interpretation and writing of the manuscript. GER was responsible for overall study conception, obtaining human ethics approvals and contributed to data interpretation and critical review of the manuscript. LR, KK and KB each contributed to organising clinical sample logistics, methods for sample handling, assay of all samples and collation of all raw data. Only the authors were responsible for study design, analysis and interpretation of data and writing and submission of the manuscript for publication. All authors read and approved the final manuscript.
We gratefully acknowledge the Victorian Cancer Biobank, Western Australian Research Tissue Network, Australian Ovarian Cancer Study (AOCS) and Professor Mahesh Choolani, Department of Obstetrics and Gynaecology, National University of Singapore, for access to plasma samples and clinical data from their respective ovarian cancer collections. AOCS would like to thank the study nurses and research assistants for their contribution to the study (http://www.aocstudy.org/) and all of the women who participated in the study. The Australian Ovarian Cancer Study was supported by The Cancer Council Victoria, Queensland Cancer Fund, The Cancer Council New South Wales, The Cancer Council South Australia, The Cancer Foundation of Western Australia, the Cancer Council Tasmania and the National Health and Medical Research Council of Australia (NHMRC). This work was supported by a competitive grant from the State Government of Victoria VSA Investment Fund. GER was in receipt of an NHMRC Principal Research Fellowship.
Hum Reprod 1989, 4:1-12.
Jacobs I, Oram D, Fairbanks J, Turner J, Frost C, Grudzinskas JG: A risk of malignancy index incorporating CA 125, ultrasound and menopausal status for the accurate preoperative diagnosis of ovarian cancer.
Gorelik E, Landsittel DP, Marrangoni AM, Modugno F, Velikokhatnaya L, Winans MT, Bigbee WL, Herberman RB, Lokshin AE: Multiplexed immunobead-based cytokine profiling for early detection of ovarian cancer.
Moore RG, Brown AK, Miller MC, Skates S, Allard WJ, Verch T, Steinhoff M, Messerlian G, DiSilvestro P, Granai CO, Bast RC Jr: The use of multiple novel tumor biomarkers for the detection of ovarian carcinoma in patients with a pelvic mass.
Zhang Z, Bast RC Jr, Yu Y, Li J, Sokoll LJ, Rai AJ, Rosenzweig JM, Cameron B, Wang YY, Meng XY, et al.: Three biomarkers identified from serum proteomic analysis for the detection of early stage ovarian cancer.
Ware Miller R, Smith A, Desimone CP, Seamon L, Goodrich S, Podzielinski I, Sokoll L, van Nagell JR Jr, Zhang Z, Ueland FR: Performance of the American College of Obstetricians and Gynecologists' ovarian tumor referral guidelines with a multivariate index assay.
Stat Med 2008, 27:157-172; discussion 207-212.
Cook NR: Comments on 'Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond' by M. J. Pencina et al., Statistics in Medicine (DOI: 10.1002/sim.2929).
Ann Statist 1986, 14:1301-1304.
Moore RG, Jabre-Raughley M, Brown AK, Robison KM, Miller MC, Allard WJ, Kurman RJ, Bast RC, Skates SJ: Comparison of a novel multiple marker assay vs the Risk of Malignancy Index for the prediction of epithelial ovarian cancer in patients with a pelvic mass.
Am J Obstet Gynecol 2010, 203:228.e1-6.
Moore RG, McMeekin DS, Brown AK, DiSilvestro P, Miller MC, Allard WJ, Gajewski W, Kurman R, Bast RC Jr, Skates SJ: A novel multiple marker bioassay utilizing HE4 and CA125 for the prediction of ovarian cancer in patients with a pelvic mass.
Zhu CS, Pinsky PF, Cramer DW, Ransohoff DF, Hartge P, Pfeiffer RM, Urban N, Mor G, Bast RC Jr, Moore LE, et al.: A Framework for Evaluating Biomarkers for Early Detection: Validation of Biomarker Panels for Ovarian Cancer.
Cancer Prev Res (Phila) 2011, 4:375-383.
Hefler LA, Concin N, Hofstetter G, Marth C, Mustea A, Sehouli J, Zeillinger R, Leipold H, Lass H, Grimm C, et al.: Serum C-reactive protein as independent prognostic variable in patients with ovarian cancer.
Am J Obstet Gynecol 1991, 164:1038-1042; discussion 1042-1043.
Merritt WM, Lin YG, Spannuth WA, Fletcher MS, Kamat AA, Han LY, Landen CN, Jennings N, De Geest K, Langley RR, et al.: Effect of interleukin-8 gene silencing with liposome-encapsulated small interfering RNA on ovarian cancer cell growth.
Urieli-Shoval S, Finci-Yeheskel Z, Dishon S, Galinsky D, Linke RP, Ariel I, Levin M, Ben-Shachar I, Prus D: Expression of serum amyloid A in human ovarian epithelial tumors: implication for a role in ovarian tumorigenesis.
Colomiere M, Ward AC, Riley C, Trenerry MK, Cameron-Smith D, Findlay J, Ackland L, Ahmed N: Cross talk of signals between EGFR and IL-6R through JAK2/STAT3 mediate epithelial-mesenchymal transition in ovarian carcinomas.
Shahzad MM, Arevalo JM, Armaiz-Pena GN, Lu C, Stone RL, Moreno-Smith M, Nishimura M, Lee JW, Jennings NB, Bottsford-Miller J, et al.: Stress effects on FosB- and interleukin-8 (IL8)-driven ovarian cancer growth and metastasis.