Research Papers:
Gene network inherent in genomic big data improves the accuracy of prognostic prediction for cancer patients
PDF | HTML | Supplementary Files | How to cite
Metrics: PDF 1614 views | HTML 2776 views | ?
Abstract
Yun Hak Kim1,7, Dae Cheon Jeong2, Kyoungjune Pak3,7, Tae Sik Goh1,4,7, Chi-Seung Lee5, Myoung-Eun Han1, Ji-Young Kim1, Liu Liangwen1, Chi Dae Kim6, Jeon Yeob Jang7,8, Wonjae Cha7,8 and Sae-Ock Oh1
1Department of Anatomy, School of medicine, Pusan National University, Yangsan, 50612, Republic of Korea
2Department of Statistics, Korea University, Seoul 02841, Republic of Korea
3Department of Nuclear Medicine, Pusan National University Hospital, Busan 49241, Republic of Korea
4Department of Orthopaedic Surgery, Pusan National University Hospital, Busan 49241, Republic of Korea
5Biomedical Research Institute, Pusan National University Hospital and School of Medicine, Pusan National University, Busan 49241, Republic of Korea
6Department of Pharmacology, School of medicine, Pusan National University, Yangsan, 50612, Republic of Korea
7BEER, Busan society of Evidence-based mEdicine and Research, Busan 49241, Republic of Korea
8Department of Otorhinolaryngology-Head and Neck Surgery, Pusan National Hospital, Busan 49241, Republic of Korea
Correspondence to:
Sae-Ock Oh, email: [email protected]
Keywords: prognosis, network-regularized high-dimensional Cox-regression (Net), breast cancer, gene network, gene signature
Received: July 03, 2017 Accepted: August 04, 2017 Published: August 24, 2017
ABSTRACT
Accurate prediction of prognosis is critical for therapeutic decisions regarding cancer patients. Many previously developed prognostic scoring systems have limitations in reflecting recent progress in the field of cancer biology such as microarray, next-generation sequencing, and signaling pathways. To develop a new prognostic scoring system for cancer patients, we used mRNA expression and clinical data in various independent breast cancer cohorts (n=1214) from the Molecular Taxonomy of Breast Cancer International Consortium (METABRIC) and Gene Expression Omnibus (GEO). A new prognostic score that reflects gene network inherent in genomic big data was calculated using Network-Regularized high-dimensional Cox-regression (Net-score). We compared its discriminatory power with those of two previously used statistical methods: stepwise variable selection via univariate Cox regression (Uni-score) and Cox regression via Elastic net (Enet-score). The Net scoring system showed better discriminatory power in prediction of disease-specific survival (DSS) than other statistical methods (p=0 in METABRIC training cohort, p=0.000331, 4.58e-06 in two METABRIC validation cohorts) when accuracy was examined by log-rank test. Notably, comparison of C-index and AUC values in receiver operating characteristic analysis at 5 years showed fewer differences between training and validation cohorts with the Net scoring system than other statistical methods, suggesting minimal overfitting. The Net-based scoring system also successfully predicted prognosis in various independent GEO cohorts with high discriminatory power. In conclusion, the Net-based scoring system showed better discriminative power than previous statistical methods in prognostic prediction for breast cancer patients. This new system will mark a new era in prognosis prediction for cancer patients.
All site content, except where otherwise noted, is licensed under a Creative Commons Attribution 4.0 License.
PII: 20548