Abstract
Madalene A. Earp1, Rama Raghavan2, Qian Li2,10, Junqiang Dai2, Stacey J. Winham1, Julie M. Cunningham3, Yanina Natanzon1, Kimberly R. Kalli4, Xiaonan Hou5, S. John Weroha5,6, Paul Haluska5,6, Kate Lawrenson7, Simon A. Gayther8,9, Chen Wang1, Ellen L. Goode1,* and Brooke L. Fridley2,10,*
1Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
2Department of Biostatistics, University of Kansas Medical Center, KS, USA
3Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, USA
4Division of Medical Oncology, Mayo Clinic, Rochester, MN, USA
5Department of Oncology, Mayo Clinic, Rochester, MN, USA
6Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN, USA
7Women’s Cancer Program at the Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
8Center for Cancer Prevention and Translational Genomics, Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
9Department of Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, CA, USA
10Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, Tampa, FL, USA
*These authors contributed equally to this work
Correspondence to:
Brooke L. Fridley, email: [email protected]
Keywords: fusion gene, ovarian cancer, RNA-sequencing, histological subtypes, TCGA
Received: August 08, 2016 Accepted: March 22, 2017 Published: April 01, 2017
ABSTRACT
Gene fusions play a critical role in some cancers and can serve as important clinical targets. In epithelial ovarian cancer (EOC), the contribution of fusions, especially by histological type, is unclear. We therefore screened for recurrent fusions in a histologically diverse panel of 220 EOCs using RNA sequencing. The Pipeline for RNA-Sequencing Data Analysis (PRADA) was used to identify fusions and allow for comparison with The Cancer Genome Atlas (TCGA) tumors. Associations between fusions and clinical prognosis were evaluated using Cox proportional hazards regression models. Nine recurrent fusions, defined as occurring in two or more tumors, were observed. CRHR1-KANSL1 was the most frequently identified fusion, identified in 6 tumors (2.7% of all tumors). This fusion was not associated with survival; other recurrent fusions were too rare to warrant survival analyses. One recurrent in-frame fusion, UBAP1-TGM7, was unique to clear cell (CC) EOC tumors (in 10%, or 2 of 20 CC tumors). We found some evidence that CC tumors harbor more fusions on average than any other EOC histological type, including high-grade serous (HGS) tumors. CC tumors harbored a mean of 7.4 fusions (standard deviation [sd] = 7.4, N = 20), compared to HGS EOC tumors mean of 2.0 fusions (sd = 3.3, N = 141). Few fusion genes were detected in endometrioid tumors (mean = 0.24, sd = 0.74, N = 55) or mucinous tumors (mean = 0.25, sd = 0.5, N = 4) tumors. To conclude, we identify one fusion at 10% frequency in the CC EOC subtype, but find little evidence for common (> 5% frequency) recurrent fusion genes in EOC overall, or in HGS subtype-specific EOC tumors.
INTRODUCTION
A fusion gene, or chimera, is a hybrid gene formed from the aberrant juxtaposition of two distinct genes. Events leading to a fusion gene can occur at the DNA level through translocation, deletion, or inversion, or at the RNA level as a result of read-through transcription or trans-splicing (when a single RNA transcript is processed from multiple separate pre-mRNAs) [1, 2]. Gene fusions play a critical role in some cancers, either by altering expression levels, or functionality, or both [1]. A well-described example is BCR-ABL1, a fusion gene that confers tumor growth factor independence, inhibits apoptosis, and is the defining molecular aberration in chronic myelogenous leukemia (CML) (95% of cases) [3]. BCR-ABL1 is the target for the highly selective drug imatinib (Gleevec®), whose development is largely responsible for nearly doubling the 5-year survival time of CML patients [1, 4]. In solid tumors, recurrent gene fusions have been described in prostate cancer (TMPRSS2-ERG), lung cancer (EML4-ALK), and secretory breast cancer (ETV6-NTRK3) [5–7]. TMPRSS2-ERG has been reported in >50% of tumors, and has been used to stratify patients according to risk [8] and survival time [9]. These examples demonstrate that recurrent gene fusions have the potential for clinical benefit as drug targets and may have diagnostic and prognostic uses.
Epithelial ovarian cancer (EOC) affects 1.3% of women and has poor 5-year survival (~46%) in comparison to other women’s cancers such as breast (~90%), endometrial (~83%), and cervical (68%) [4]. This is in part due to a lack of effective early detection strategies, and a high recurrence rate with only modest activity from standard second-line chemotherapies [10]. Recent advances in the understanding of EOC include the appreciation of histologically and molecularly defined subtypes. With regard to these molecular features, relatively few studies have investigated recurrent fusions in a large panel of EOC tumors by histological subtype. High-grade serous (HGS) EOC, the most common and lethal subtype, has been investigated for fusion genes as part of The Cancer Genome Atlas (TCGA) project [11, 12]. Among 400 tumors studied, very few recurrent fusions were detected, and nearly all of those that were found were in genomics regions with copy number alteration [12]. This led the authors conclude that fusions in HGS EOC arise secondary to the widespread genomic instability characteristic of this subtype [12]. The recurrent fusions reported in TCGA’s EOC tumors were CCDC6-ANK3 (in 4 samples, 1% of tumors), and COL14A1-DEPTOR and KAT6B-ADK (each in 2 samples, 0.5% of tumors). Another recent molecular profiling study of 114 HGS EOC tumors found a similar low frequency of recurrent fusion genes. The only recurrent fusion reported was SLC25A40-ABCB1, detected in two chemotherapy-resistant relapsed tumor samples [13]. Overall, the low frequency of recurrent fusions in HGS EOC limits their potential clinical use.
The role that fusion genes play in the rarer EOC histological subtypes, including endometrioid (END), clear cell (CC), and mucinous (MC) tumors is less clear. Previous studies were limited by small sample size (≤ 8 END, ≤ 16 CC, and ≤ 2 MC) [14–17]. Thus, we screened for recurrent gene fusions using paired-end (PE) RNA sequencing (RNA-seq) of a histologically diverse panel of well characterized EOCs (141 HGS, 55 END, 20 CC, and 4 MC) with clinical outcome data.
RESULTS
Fusion genes by EOC subtype
Following the application of strict filtering criteria, we identified 442 unique fusion genes across 220 EOC tumors. In a given tumor, fusions with multiple predicted breakpoints involving the same gene partners were considered a single event. These 442 fusions were categorized by predicted functional type, shown in Figure 1. These predictions were made by PRADA based on the position of the fusion junction sites (given in human genome build 37). Approximately one third of fusions (30%, N = 134) are predicted to be in-frame, meaning the involved gene partners have a preserved reading frame, and are potentially functional if translated. Another 16% (N = 71) involved exchanges of 5′-untranslated region (UTR). UTR’s contain regulatory elements that can alter expression levels of genes they are positioned next to. Finally, 33% of fusions were out of frame and likely degraded; 21% were not-classified by PRADA. All fusions and their annotations are given in Supplementary Table 1.
Figure 1: Breakdown of 442 fusions by predicted consequence to reading frame.
Fusion genes detected in two or more tumor, i.e. recurrent fusions, are summarized in Table 1 and Figure 2. Two fusions were inter-chromosomal, involving gene partners on different chromosomes (RMND1-BRE and UBAP1-TGM7); the remaining seven involved genes in close proximity (< 1 Mb) on the same chromosome (intra-chromosomal). The fusion UBAP1-TGM7, detected in two CC tumors, was also validated using Sanger Sequencing (Supplementary Figure 1). Fusions involving genes less than 1Mb away on the same strand are likely to reflect read-through transcripts [18]. Five fusions in Table 1 fit this criterion. Many of the fusions in Table 1 are predicted to be in-frame, including both of the inter-chromosomal fusions. The most recurrent fusion was CRHR1-KANSL1 in 6 tumors (2.7% of all samples); all other recurrent fusions were found in two tumors (1% of all samples). UBAP1-TGM7 was unique to CC tumors (in 10%, or 2 of 20, CC tumors). One recurrent fusion, ST7-MET, was found in 1 HGS and 1 CC tumor. All other recurrent fusions were found in HGS tumors only. No recurrent fusions were unique to relapsed or post-NACT tumors, albeit our sample size for tumors with these clinical characteristics was small (6 for each).
Table 1: Recurrent fusions detected in epithelial ovarian cancer tumors
*One tumor exposed to neoadjuvant chemotherapy. #One relapsed tumor. Abbreviations used: UTR, untranslated region; CDS, coding DNA sequence; OV, ovarian; TCGA, The Cancer Genome Atlas. TCGA fusion gene data available at: http://54.84.12.177/PanCanFusV2/.
1Interchromosomal fusions involve gene partners on different chromosomes. Intrachromosomal fusions involve gene partners on the same chromosome. Read-through fusions involve gene partners less than 1 Mb away on the same strand.
2Gene A is 5′ gene; Gene B is 3′ gene.
3As predicted by the fusion caller PRADA based on fusion junction sites (in GRCh37/hg19, see Supplementary Table 1 for positions).
4“Yes” if predicted protein reading frame is in-frame, “No” if reading frame is Out-of-Frame, “Unknown” for all other cases.
5“Yes” if fusion was also detected by the fusion caller SoapFuse; “No” if it was not.
6“Yes” if fusion was also reported in OV-TCGA tumors or any organ site TCGA tumors; “No” if it was not.
Figure 2: Circos plot of recurrent fusion genes detected in 220 EOC tumors. The outer circle shows cytogenetic bands (based on Circos package data UCSC.hg19.chr).
Studies have underscored limitations associated with relying on any one fusion caller to detect fusions [19–21], therefore, we further examined whether the fusions in Table 1 were also detected by the robust fusion caller SoapFuse [22]. Five recurrent fusions detected by PRADA were also detected by SoapFuse (see Table 1). Two of the nine recurrent fusions identified, CCDC6-ANK3 and COL14A1-DEPTOR, have previously been reported in TCGA’s EOC tumors. In our tumor set, the histology-specific frequency of these fusion genes was 1.4% (2 of 141 HGS samples). In TCGA, their frequency was low at 1% (CCDC6-ANK3) and 0.5% (COL14A1-DEPTOR). Examining all tumor types in TCGA, two additional fusions in Table 1 are noted as being previously been reported. CRHR1-KANSL1 fusions were reported in 5% of TCGA’s colorectal tumors (16 of 312 analyzed), and 3.5% of TCGA’s uterine carcinosarcomas (2 of 57 analyzed) [12]. NCOR2-SCARB1 fusions were found in five different TCGA tumor types at a frequency < 2%. In addition to being found in EOC tumors, CCDC6-ANK3 fusions were reported at low frequencies in breast (2 of 1140) and lung (1 of 546) tumors. A COL14A1-DEPTOR fusion was also found in 1 lung tumor in TCGA.
The EOC tumors included in our study harbored a mean of 2 and a median of 0 fusions; 129 (59%) tumors had no fusions detected. The average number of fusions detected in tumors differed by histology (pANOVA = 5.3 × 10−12) (Table 2, Figure 3). Pairwise comparison between the histologies, using t-tests assuming unequal variance with Bonferroni adjustment for multiple comparisons, found significant differences in number of fusions detected between CC and the three other histologies (pHGS-CC = 0.033, pEND-CC = 0.003, pMC-CC = 0.003); additional there was significance differences between HGS and the remaining two histologies (pHGS-END = 3.9 × 10−8, pHGS-MC =0.002). CC tumors had the highest average number of fusions (mean = 7.4, sd = 7.6), followed by HGS tumors (mean = 2.0, sd = 3.3), then END (mean = 0.24, sd = 0.74) and MC (mean = 0.25, sd = 0.5) tumors. It should be noted, however, that these results could be attributed to the batch issues, for which we are unable to untangle this confounding. Therefore, we also looked for histologies differences within given batches of samples. The average number of fusions was compared across histology within RNA-seq batch to further examine histology-specific differences. CC tumors harbored more fusions on average than the other EOC subtypes in RNA-seq Batch 2 (p = 0.002, t-test assuming unequal variance, 12 CC compared to 5 non-CC subtypes), but not Batch 3 (p = 0.16, t-test assuming unequal variance, 7 CC compared to 23 non-CC subtypes). No CC tumors were sequenced in Batch 1 and only 1 CC tumor was in Batch 4 (no fusions were detected in this tumor) (Table 1). No other statistically significant histology specific differences in the number of fusions per tumor were observed.
Table 2: Average No. of fusions detected in tumors by batch and histology
Histology | Batch | Histology Average | |||
---|---|---|---|---|---|
1 | 2 | 3 | 4 | ||
High Grade Serous | 0.11 (62) | 2.00 (2) | 2.00 (2) | 3.59 (76) | 2.03 (141) |
Endometrioid | 0.00 (30) | 0.00 (2) | 0.55 (22) | 0.00 (1) | 0.24 (55) |
Clear Cell | NA (0) | 9.83 (12) | 4.29 (7) | 0.00 (1) | 7.40 (20) |
Mucinous | 0.33 (3) | 0.00 (1) | NA (0) | NA (0) | 0.25 (4) |
Batch Average | 0.09 (95) | 7.18 (17) | 1.47 (30) | 1.47 (78) | 2.0 (220) |
*Bracketed values are the number of tumors in a category. ‘NA’ indicates a category did not contain any tumors.
Figure 3: Histology-specific histograms showing how the number of fusions detected per tumor in: (A) High Grade Serous, (B) Mucinous, (C) Clear Cell, and (D) Endometrioid. RNA-seq batch is also indicated.
Prognostic significance and clinical features of fusion genes by EOC subtype
Fusion gene status, i.e. the presence or absence of any fusion gene, was not significantly associated with overall survival (OS) time (p = 0.34) or progression free survival (PFS) time (p = 0.06). Adjustment for histology and RNA-seq batch did not change this conclusion (OS p = 0.82, PFS p = 0.49). CRHR1-KANSL1, detected in 6 tumors, was not associated with OS (p = 0.59) or PFS (p = 0.85); other recurrent fusions were too rare to warrant survival analyses. We also grouped our detected fusion genes into 4 classes, based on established functional relevance in tumorigenesis [12]: kinases (N = 21 fusions, gene list from [12]); tumor suppressor genes (TSG) (N = 9 fusions, gene list from https://bioinfo.uth.edu/TSGene/); chromatin modifiers (N = 13 fusions); and histone modifiers (N = 25 fusions). The latter two genes lists were from http://epifactors.autosome.ru/. None of these four protein classes were associated with outcome. Tumors fusion gene status was significantly associated with presence of endometriosis (p = 0.008); however other clinical or lifestyle features were not associated fusion status, after adjusting for histology and batch (i.e., grade (p = 0.71), presence of ascites (p = 0.35), peritoneal cytology (p = 0.83), debulking status (p = 0.28), smoking history (p = 0.49), age at first live birth (p = 0.58)).
DISCUSSION
We set out to investigate what role recurrent fusion genes may play in the tumorigenesis of EOC, particularly the rarer EOC histological types. We identified one recurrent fusion (UBAP1-TGM7) unique to CC tumors. UBAP1-TGM7 was present in 2 of 20 (10%) CC tumors, making it relatively common in this histological type. To our knowledge, this fusion gene has not previously been reported in EOC, or any other tumor type [12]. It was also not been observed in over 200 samples spanning 27 different non-neoplastic human tissues [23]. The UBAP1-TGM7 fusion identified here is predicted to fuse the first exon of UBAP1 (coding for 75 amino acids) to the 10th exon of TGM7, preserving the gene partners reading frames. As shown in Figure 4, our RNA-seq data indicates that TGM7 expression is dramatically increased in tumors harboring the UBAP1-TGM7 fusion (17-fold and 87-fold increase above average). UBAP1 is a component of the endosomal sorting complex required for transport I (ESCRT-I), a complex that functions in the sorting of ubiquitinated cargo [24, 25]. Residues in the N-terminus of Ubap1 are responsible for interacting with other proteins in the ESCORT-I complex [24], and are retained in the fusion. TGM7 codes for a transglutaminase, and functions to stabilize protein assemblies through the formation of gamma-glutamyl-epsilon lysine crosslinks. The portion of Tgm7 retained in the UBAP1-TGM7 fusion contains the glutaminase domain, and conceivably this function is preserved. Neither UBAP1 nor TGM7 has a function that, based on known functionally relevant cancer-specific fusions (typically kinases, transcription factors, chromatin modifiers [1, 26]), is readily connected to tumorigenesis, therefore, it is difficult to speculate what role this fusion might serve. UBAP1 has also been reported fused, in-frame, with UBAP2 in one breast cancer tumor, with ADAMTSL1 in one low grade glioma [12]. TGM7 has not been found partnered in-frame with any other genes.
Figure 4: Gene expression correlation between fusion genes partners UBAP1 and TGM7 in 220 EOC tumors studied. Tumors harboring the UBAP-TGM7 fusion are indicated. Gene expression is measured as Reads Per Kilobase of transcript per Million (RPKM) mapped reads.
Beyond this CC-specific fusion, no other fusions were present in more than 3% of the samples analyzed, including histology-specific and histologically combined analyses. The next most recurrent fusion was CRHR1-KANSL1, detected in 6 tumors (2.7% of all samples). Despite its lower frequency, CRHR1-KANSL1 is of interest as it reside at the locus 17q21.31, a structurally complex genomic region containing a 900-kb inversion polymorphism, multiple copy-number variants [27, 28], and markers that associate with ovarian cancer [29], female fertility [28], and female meiotic recombination [28]. Further, CRHR1-KANSL1 fusions have been reported in 5% of TCGA’s colorectal tumors and 3.5% of TCGA’s uterine carcinosarcomas [12]. This fusion identified here (and in TCGA) involved the exchange CRHR1’s 5′-UTR to KANSL1’s 5′-UTR. Thus, the Kansl1 protein is predicted to be intact and functional, but potentially with altered expression. Kansl1 plays a role in chromatin modification, a functional class previously described for genes in fusions associated with tumorigenesis. Fused to CHRH1’s 5′-UTR, altered Kansl1 expression may lead to aberrant histone acetylation and protein expression. CRHR1 has also been reported fused with CENPP, KIA0100, and SPOP, in one breast, cervical, and uterine tumor, respectively [12]. KANSL1 has also been reported fused with ORMDL3 and LAYN in one breast tumor and with UNC45B in one lung tumor. Finally, we found no evidence for the recurrent fusion genes previously reported in the literature, including ESRRA-TEX40 [17], CDKN2D-WDFY2 [16], and BCAM-AKT2 [15]. This is consistent with other reports [11–13, 30].
We found some evidence that CC tumors harbor more fusions on average than any other EOC histological type, including HGS tumors. This is unexpected as CC EOC tumors are typically described as having fewer somatic genetic alterations than HGS tumors [31, 32], and CC tumors from other organ sites, particularly renal clear cell cancer [12], have revealed few fusions per sample. This pattern was observed in one batch of RNA-seq data (Batch 2, p = 0.002); a second batch trended in this direction but was not significant at p < 0.05 (Batch 3, p = 0.16). Not enough CC tumors were sequenced in Batches 1 and 4 to add to this result. We note that despite analyzing within RNA-seq batch, in Batch 2, CC tumors had more mapped reads than non-CC tumors (p = 0.016, t-Test assuming unequal variance) (see Supplementary Table 2 for details). This may have affected the number of fusions detected in the other tumors in this batch, but it is not clear why this occurred. In Batch 3 the number of mapped reads between tumors was not significantly different (p = 0.9, t-Test assuming equal variance). Therefore, we view this CC results with caution, and emphasize the need for replication of this broad finding.
Despite being the largest non-HGS analysis to date, the main limitation of this study was sample size; hence, we focused inference on subtype-specific common recurrent (> 5% frequency). We also note that differences in RNA-seq library preparation methods made it difficult to compare results across batches. Strengths of this study are the inclusion of the rarer subtypes types of EOC, subtype-specific analyses, and the capacity to make direct comparison to those fusion genes detected in all TCGA tumors [12] based on the use of highly similar methods (i.e. applying PRADA). PRADA has been described as a conservative but accurate tool relative to other fusion callers [19]. This means it may fail to detect certain fusions, but those that it does report, validate more often than when using other tools. Thus, we acknowledge that some fusions may have been missed in our analysis. We only had the capacity to validate one fusion, and therefore we chose to validate the CC-specific fusion gene UBAP1-TGM7. The confirmed validation of this fusion lends support to PRADA being an accurate bioinformatics tool for calling gene fusions.
Only two of the recurrent fusions reported in our EOC tumors, CCDC6-ANK3 and COL14A1-DEPTOR, were also found in TCGA HGS EOC tumors. Although recurrent, these were found to be relatively rare here (1.4% of HGS tumors) and in TCGA HGS EOCs (0.5–1% of tumors). To conclude, we identify one in-frame fusion at 10% frequency in the CC EOC subtype, but find little evidence for common (> 5% frequency) recurrent fusion genes in EOC overall, or in HGS tumors.
MATERIALS AND METHODS
Ethics statement
All cases provided written informed consent for use of their tissues and medical records in research; all protocols were approved by the Mayo Clinic Institutional Review Boar (HSC # 09-003270 and 09-008768).
Tumor samples
Patients were those receiving surgery for ovarian, fallopian tube, or primary peritoneal cancer at the Mayo Clinic (Rochester, MN). Clinical diagnoses were confirmed by a gynecologic pathologist, who verified tumor histology, grade, and the presence of 70% tumor content prior to RNA extraction from fresh frozen tissue. RNA was extracted from a total of 220 tumors, including 141 HGS, 55 END, 20 CC, and 4 MC (Supplementary Table 2). Six HGS samples were from relapsed tumors, and six other samples were from primary tumors treated with neo-adjuvant chemotherapy (NACT) (5 HGS, 1 CC). All other samples were from treatment-naive primary tumors. RNA-seq data for 7 normal samples sourced at Mayo Clinic from whole ovary (N = 5), endometrium (N = 1), and omentum (N = 1) was also generated (Supplementary Table 3). RNA-seq data for 10 normal whole ovary samples was downloaded from GTEx (http://www.gtexportal.org/home/) (Supplementary Table 3). Normal samples were analyzed using the same pipeline as tumor samples. Results were used to remove fusion genes that are not cancer-specific (see below). RNA-sequencing of tumor and normal samples was performed in four batches (details in Supplementary Table 4).
Bioinformatics analysis of fusion genes
Pipeline for RnAseq Data Analysis (PRADA) (v1.2) was used to call gene fusions [33]. Combined genome and transcriptome reference files were downloaded from http://bioinformatics.mdanderson.org/Software/PRADA/, including the hg19 assembly, Ensembl GTF Release 64, and dbSNP 135 v37. Fusions events were filtered to those supported by at least two split reads, one ‘perfect-match’ junction spanning read, and low homology between fusion partner genes (retained those with PRADA E-value > 0.001). The 2*junction length (junL) bases parameter was set to 80% of the read length (recommended). Fusion events identified by PRADA in 17 (7 Mayo Clinic, 10 GTEx Portal) normal tissues were removed. A catalog of 11,531 fusion RNAs identified in over 200 RNA sequencing libraries from 27 different non-neoplastic human tissues analyzed using the SoapFuse pipeline was further used to remove fusion genes that are not cancer-specific [22, 23]. To assess robustness of fusion results from PRADA pipeline, we also ran a subset of sample through three other fusion callers (SoapFuse, FusionMap and TopHat Fusion) with results presented in Supplementary Table 1.
Association of fusion genes with clinical outcome
The relationship between fusion gene status (none versus any) and prognosis (OS and PFS) was evaluated by fitting a Cox proportional hazards regression model (subtypes combined, and for each subtype separately). Post-NACT, relapsed, and normal samples were removed from outcome analyses. Progression-free survival time was defined as time from the date of diagnosis to the date that second-line therapy was initiated for a clinically-actionable tumor recurrence. Tests for association between fusion status and time to event clinical endpoints were assessed with Wald tests. The association between fusion gene status (presence vs absence) and clinical features were assessed via logistic regression models and Wald tests, with adjustment for histology and batch. All association p-values reported are unadjusted for multiple-testing.
Fusion validation
The UBAP1-TGM7 fusion (Supplementary Figure 1) was validated using SuperScript® VILO™ cDNA synthesis kit from Thermo Fisher Scientific. cDNA was amplified with TaqGold at 35 cycles and 60 degree annealing temperature for Sanger sequencing using model 3730 × l DNA analyzer. Primers (5′GCTCTCCCTAGGGGCTGTC, 3′TCCCTAAGACCCCCAGACTC) were designed based on sequence in Supplementary Figure 1 with red indicating the fusion site. All validation work was performed in Genome Analysis Core at the Mayo Clinic.
ACKNOWLEDGMENT
The Mayo Clinic Genome Analysis Shared Resource is supported by CA15083 and the Mayo Clinic Center for Individualized Medicine.
CONFLICTS OF INTEREST
None of the authors have conflicts-of-interest or financial disclosures to declare.
GRANT SUPPORT
R01 CA122443, P50 CA136393. K.L. is supported by a K99/R00 Pathway to Independence Award from the NCI (R00CA184415). B.L.F and R.R. were supported by P30 CA168524 and P20 GM103418.
REFERENCES
1. Mertens F, Johansson B, Fioretos T, Mitelman F. The emerging complexity of gene fusions in cancer. Nat Rev Cancer. 2015; 15:371–381.
2. Li H, Wang J, Ma X, Sklar J. Gene fusions and RNA trans-splicing in normal and neoplastic human cells. Cell Cycle. 2009; 8:218–222.
3. Fernandez-Luna JL. Bcr-Abl and inhibition of apoptosis in chronic myelogenous leukemia cells. Apoptosis. 2000; 5:315–318.
4. Howlader N, Noone AM, Krapcho M, Miller D, Bishop K, Altekruse SF, Kosary CL, Yu M, Ruhl J, Tatalovich Z, Mariotto A, Lewis DR, Chen HS, et al. (2016). SEER Cancer Statistics Review, 1975–2013, National Cancer Institute. Bethesda, MD, http://seer.cancer.gov/csr/1975_2013/, based on November 2015 SEER data submission, posted to the SEER web site, April 2016.
5. Mosquera JM, Mehra R, Regan MM, Perner S, Genega EM, Bueti G, Shah RB, Gaston S, Tomlins SA, Wei JT, Kearney MC, Johnson LA, Tang JM, et al. Prevalence of TMPRSS2-ERG fusion prostate cancer among men undergoing prostate biopsy in the United States. Clin Cancer Res. 2009; 15:4706–4711.
6. Dhanasekaran SM, Balbin OA, Chen G, Nadal E, Kalyana-Sundaram S, Pan J, Veeneman B, Cao X, Malik R, Vats P, Wang R, Huang S, Zhong J, et al. Transcriptome meta-analysis of lung cancer reveals recurrent aberrations in NRG1 and Hippo pathway genes. Nat Commun. 2014; 5:5893.
7. Tognon C, Knezevich SR, Huntsman D, Roskelley CD, Melnyk N, Mathers JA, Becker L, Carneiro F, MacPherson N, Horsman D, Poremba C, Sorensen PH. Expression of the ETV6-NTRK3 gene fusion as a primary event in human secretory breast carcinoma. Cancer Cell. 2002; 2:367–376.
8. Merdan S, Tomlins SA, Barnett CL, Morgan TM, Montie JE, Wei JT, Denton BT. Assessment of long-term outcomes associated with urinary prostate cancer antigen 3 and TMPRSS2:ERG gene fusion at repeat biopsy. Cancer. 2015; 121:4071–4079.
9. Hagglof C, Hammarsten P, Stromvall K, Egevad L, Josefsson A, Stattin P, Granfors T, Bergh A. TMPRSS2-ERG expression predicts prostate cancer survival and associates with stromal biomarkers. PLoS One. 2014; 9:e86824.
10. Aletti GD, Gallenberg MM, Cliby WA, Jatoi A, Hartmann LC. Current management strategies for ovarian cancer. Mayo Clin Proc. 2007; 82:751–770.
11. Cancer Genome Atlas Research N. Integrated genomic analyses of ovarian carcinoma. Nature. 2011; 474:609–615.
12. Yoshihara K, Wang Q, Torres-Garcia W, Zheng S, Vegesna R, Kim H, Verhaak RG. The landscape and therapeutic relevance of cancer-associated transcript fusions. Oncogene. 2015; 34:4845–4854.
13. Patch AM, Christie EL, Etemadmoghadam D, Garsed DW, George J, Fereday S, Nones K, Cowin P, Alsop K, Bailey PJ, Kassahn KS, Newell F, Quinn MC, et al. Whole-genome characterization of chemoresistant ovarian cancer. Nature. 2015; 521:489–494.
14. McPherson A, Hormozdiari F, Zayed A, Giuliany R, Ha G, Sun MG, Griffith M, Heravi Moussavi A, Senz J, Melnyk N, Pacheco M, Marra MA, Hirst M, et al. deFuse: an algorithm for gene fusion discovery in tumor RNA-Seq data. PLoS Comput Biol. 2011; 7:e1001138.
15. Kannan K, Coarfa C, Chao PW, Luo L, Wang Y, Brinegar AE, Hawkins SM, Milosavljevic A, Matzuk MM, Yen L. Recurrent BCAM-AKT2 fusion gene leads to a constitutively activated AKT2 fusion kinase in high-grade serous ovarian carcinoma. Proc Natl Acad Sci USA. 2015; 112:E1272–1277.
16. Kannan K, Coarfa C, Rajapakshe K, Hawkins SM, Matzuk MM, Milosavljevic A, Yen L. CDKN2D-WDFY2 is a cancer-specific fusion gene recurrent in high-grade serous ovarian carcinoma. PLoS Genet. 2014; 10:e1004216.
17. Salzman J, Marinelli RJ, Wang PL, Green AE, Nielsen JS, Nelson BH, Drescher CW, Brown PO. ESRRA-C11orf20 is a recurrent gene fusion in serous ovarian carcinoma. PLoS Biol. 2011; 9:e1001156.
18. Akiva P, Toporik A, Edelheit S, Peretz Y, Diber A, Shemesh R, Novik A, Sorek R. Transcription-mediated gene fusion in the human genome. Genome Res. 2006; 16:30–36.
19. Liu S, Tsai WH, Ding Y, Chen R, Fang Z, Huo Z, Kim S, Ma T, Chang TY, Priedigkeit NM, Lee AV, Luo J, Wang HW, et al. Comprehensive evaluation of fusion transcript detection algorithms and a meta-caller to combine top performing methods in paired-end RNA-seq data. Nucleic Acids Res. 2016; 44:e47.
20. Carrara M, Beccuti M, Lazzarato F, Cavallo F, Cordero F, Donatelli S, Calogero RA. State-of-the-art fusion-finder algorithms sensitivity and specificity. Biomed Res Int. 2013; 2013:340620.
21. Jia W, Qiu K, He M, Song P, Zhou Q, Zhou F, Yu Y, Zhu D, Nickerson ML, Wan S, Liao X, Zhu X, Peng S, et al. SOAPfuse: an algorithm for identifying fusion transcripts from paired-end RNA-Seq data. Genome Biol. 2013; 14:R12.
22. Wu J, Zhang W, Huang S, He Z, Cheng Y, Wang J, Lam TW, Peng Z, Yiu SM. SOAPfusion: a robust and effective computational fusion discovery tool for RNA-seq reads. Bioinformatics. 2013; 29:2971–2978.
23. Babiceanu M, Qin F, Xie Z, Jia Y, Lopez K, Janus N, Facemire L, Kumar S, Pang Y, Qi Y, Lazar IM, Li H. Recurrent chimeric fusion RNAs in non-cancer tissues and cells. Nucleic Acids Res. 2016; 44:2859–2872.
24. Agromayor M, Soler N, Caballe A, Kueck T, Freund SM, Allen MD, Bycroft M, Perisic O, Ye Y, McDonald B, Scheel H, Hofmann K, Neil SJ, et al. The UBAP1 subunit of ESCRT-I interacts with ubiquitin via a SOUBA domain. Structure. 2012; 20:414–428.
25. Stefani F, Zhang L, Taylor S, Donovan J, Rollinson S, Doyotte A, Brownhill K, Bennion J, Pickering-Brown S, Woodman P. UBAP1 is a component of an endosome-specific ESCRT-I complex that is essential for MVB sorting. Curr Biol. 2011; 21:1245–1250.
26. Kumar-Sinha C, Kalyana-Sundaram S, Chinnaiyan AM. Landscape of gene fusions in epithelial cancers: seq and ye shall find. Genome Med. 2015; 7:129.
27. Boettger LM, Handsaker RE, Zody MC, McCarroll SA. Structural haplotypes and recent evolution of the human 17q21.31 region. Nat Genet. 2012; 44:881–885.
28. Stefansson H, Helgason A, Thorleifsson G, Steinthorsdottir V, Masson G, Barnard J, Baker A, Jonasdottir A, Ingason A, Gudnadottir VG, Desnica N, Hicks A, Gylfason A, et al. A common inversion under selection in Europeans. Nat Genet. 2005; 37:129–137.
29. Permuth-Wey J, Lawrenson K, Shen HC, Velkova A, Tyrer JP, Chen Z, Lin HY, Chen YA, Tsai YY, Qu X, Ramus SJ, Karevan R, Lee J, et al. Identification and molecular characterization of a new ovarian cancer susceptibility locus at 17q21.31. Nat Commun. 2013; 4:1627.
30. Micci F, Panagopoulos I, Thorsen J, Davidson B, Trope CG, Heim S. Low frequency of ESRRA-C11orf20 fusion gene in ovarian carcinomas. PLoS Biol. 2014; 12:e1001784.
31. Prat J. New insights into ovarian cancer pathology. Ann Oncol. 2012; 23:x111–117.
32. Kurman RJ, Shih Ie M. The Dualistic Model of Ovarian Carcinogenesis: Revisited, Revised, and Expanded. Am J Pathol. 2016; 186:733–747.
33. Torres-Garcia W, Zheng S, Sivachenko A, Vegesna R, Wang Q, Yao R, Berger MF, Weinstein JN, Getz G, Verhaak RG. PRADA: pipeline for RNA sequencing data analysis. Bioinformatics. 2014; 30:2224–2226.