Abstract
Background
Urothelial bladder cancer is a highly heterogeneous disease. Cancer cell lines are useful tools for its study. This is a comprehensive genomic characterization of 40 urothelial bladder carcinoma (UBC) cell lines including information on origin, mutation status of genes implicated in bladder cancer (FGFR3, PIK3CA, TP53, and RAS), copy number alterations assessed using high density SNP arrays, uniparental disomy (UPD) events, and gene expression.
Results
Based on gene mutation patterns and genomic changes we identify lines representative of the FGFR3-driven tumor pathway and of the TP53/RB tumor suppressor-driven pathway. High-density array copy number analysis identified significant focal gains (1q32, 5p13.1-12, 7q11, and 7q33) and losses (i.e. 6p22.1) in regions altered in tumors but not previously described as affected in bladder cell lines. We also identify new evidence for frequent regions of UPD, often coinciding with regions reported to be lost in tumors. Previously undescribed chromosome X losses found in UBC lines also point to potential tumor suppressor genes. Cell lines representative of the FGFR3-driven pathway showed a lower number of UPD events.
Conclusions
Overall, there is a predominance of more aggressive tumor subtypes among the cell lines. We provide a cell line classification that establishes their relatedness to the major molecularly-defined bladder tumor subtypes. The compiled information should serve as a useful reference to the bladder cancer research community and should help to select cell lines appropriate for the functional analysis of bladder cancer genes, for example those being identified through massive parallel sequencing.
Similar content being viewed by others
Background
Urothelial bladder cancer (UBC) has a high incidence, with 133,696 new cases and 51,056 deaths from UBC in Europe in 2011 [1] and a high prevalence due to the fact that it is commonly an indolent disease. UBC has a higher incidence in males than in females (3:1) and it is the fourth most common cancer in men. Age, smoking, chlorination byproducts, and occupational exposures are the major risk factors [2].
UBC displays a high level of clinical and pathological heterogeneity. Morphologically, tumors can show papillary vs. solid growth patterns. A clinically relevant issue is the level of invasion of the bladder wall: tumors are classified as non-muscle invasive (NMIBC, Ta, carcinoma in situ, and T1) or muscle-invasive (MIBC, ≥T2). The majority of patients (ca. 70%-80%) present with papillary NMIBC, most of whom have a good prognosis. Patients with high-grade NMIBC, and those with MIBC, have an aggressive disease that can lead to patient’s death, emphasizing the need to better classify these tumor subgroups.
Approximately 70% of NMIBC harbour activating mutations in FGFR3, the main oncogene involved in UBC [3-5]. PIK3CA mutations occur in 15% of UBC, often in association with FGFR3 mutations [6]. An additional 10% of tumors have mutations in RAS genes, mutually exclusive with FGFR3 mutations [7]. MIBC tend to have a low frequency of mutations in FGFR3 (10%) and develop predominantly through the inactivation of the P53 and RB pathways [4,8,9]. Unlike NMIBC, these tumors are genomically unstable [4,10,11]; several studies have reported the most commonly gained and lost regions [11,12]. TERT promoter mutations occur in >70% UBC, regardless of stage/grade [13].
Tumor cell lines are invaluable research tools. They are readily amenable to experimental manipulation, providing opportunities for functional analyses and contributing to improved knowledge [14]. Cell lines have proven useful in preclinical pharmacological studies [15] and will be very important to characterize the function of new cancer genes identified through massive parallel sequencing. However, cell lines often fail to faithfully reflect the genetic and phenotypic diversity of primary tumors and do not fully recapitulate their complexity because the stromal and inflammatory components are not represented in vitro. In addition, tumor cells may behave differently in vitro due to the lack of interactions with non-neoplastic cells. Therefore, a thorough knowledge of their genotype and phenotype is essential in order to optimize their use while considering their limitations.
Cell lines from primary UBC are commonly used as disease models. It is crucial to identify those lines best suited to answer specific biological questions and to place the studies in the context of patient’s tumors. The genetic make-up of UBC cell lines has been analyzed using array comparative genomic hybridization [16]. High-resolution gene copy number information for 24 UBC lines is published as part of The Cancer Cell Line Encyclopedia [14] but a detailed comparison of mutations, gene copy number changes, and gene expression is not available. Importantly, the NCI-60 panel does not contain any UBC line [15]. Here, we compile high-resolution genomic information on the largest panel of UBC lines analyzed so far and provide a comprehensive overview of their genetic/genomic architecture. In addition, we use the global transcriptomics data to place the cell lines in the context of the recently reported molecular taxonomy of UBC. This will serve as a reference to the bladder cancer research community and will help to select the most adequate cells to answer specific biological questions.
Results
We report here a detailed genomic analysis of a large set of UBC cell lines in order to improve their use as models for the study of this tumor type. Web resources used are listed in Additional file 1: Table S1. Mutations were assessed for 49 lines, copy number changes were analyzed in 42 lines, and global expression profiles were gathered for 48 lines. For 40 of them (UBC-40 panel), the complete set of analyses is provided. A summary of the literature search, and our own results, is shown in Table 1 and Additional file 1: Table S2.
The genomic architecture of UBC lines
Information on 902,103 autosomal probes covering 2,787 Mb of the genome was analyzed using the waviCGH web server. All of these probes were called as altered (copy number lost or gained) in ≥10 cell lines. An average of 1349 Mb (510-2111 Mb) were altered across the panel: 592 Mb were gained (219-1189 Mb) and 757 Mb were lost (180-1291 Mb). The line showing the lowest fraction of the (autosomal) genome altered was MGH-U3 (510 Mb, 18.3% of the covered genome). 639V cells showed the highest fraction of gains/losses: 2111 Mb (75.8%) (Table 1, Figure 1). The remaining lines showed variable fractions of the genome altered over a continuum; no discrete categories could be identified (Figure 1). Losses were more frequent than gains: an average 28% of covered genome was lost as compared to 21% gained (P = 0.0005). Most cell lines showed loss or gain of multiple whole chromosomes (Figure 1).
Alterations in oncogenes and tumor suppressors
Table 1 shows the mutational status of UBC-relevant oncogenes and tumor suppressors. Figure 2A and Additional file 1: Table S3 summarize these results and compare mutation prevalence in cell lines and in primary UBC using information retrieved from the COSMIC database. FGFR3 mutations were significantly less frequent in cell lines than in tumors (20% vs. 46%, P = 1.9x10-4). RT112 and RT4 cells exhibited amplification of a 75 and 79 Mb region, respectively, encompassing FGFR3 and part of the neighboring TACC3. FGFR3 mRNA expression was higher in FGFR3-mutant lines (P = 0.09) (Figure 2C). These two lines, and SW-780, have recently been shown to harbour activating translocations involving FGFR3 [17]. PIK3CA mutation frequency was similar in lines and UBC tissues (24% vs. 19%, P = 0.3). Five of 45 lines (11%) harbored a mutation in both FGFR3 and PIK3CA, comparable with the frequency in COSMIC UBC tissues (16%, P = 0.6). Mutations in HRAS (7%), KRAS (8%), NRAS (5%), and AKT1 (5%) were less frequent (Table 1, Figure 2A, and Additional file 1: Table S3). UM-UC-7 demonstrated amplification of a 7.4 Mb region including KRAS. There were no amplifications in PIK3CA, HRAS, or NRAS.
To assess the status of key tumor suppressors (INK4A, PTEN, and TP53) both mutations and genomic losses were considered (Table 1 and Figures 2A and B). VM-CUB-1 was the only line harboring a point mutation in INK4A; gene losses were present in 63% of cell lines, including both loss of heterozygosity (LOH) (n = 7) and homozygous deletions (HD) (n = 20). INK4A mRNA expression was significantly lower in lines with LOH (defined as gene copy number loss) or HD than in wild type lines (Figure 2D). As of PTEN, 23% and 19% of the lines harboured mutations or LOH. J82 and UM-UC-3 had a PTEN mutation and a partial HD. 639V, T24, and UM-UC-9 harboured a missense mutation and retained a wild type allele whereas 5637, RT4, and SW-780 were wild type and showed LOH. Cell lines with LOH or mutant PTEN had a significantly lower expression of PTEN mRNA than wild type lines (Figure 2E). PTEN mutations were also significantly more frequent in cell lines than in tumor tissues (23% vs. 4%, P = 1.04x10-4). Regarding TP53, mutations were significantly more frequent in cell lines than in tumors (66% vs. 31%, P = 2.7x10-6). LOH was found in 47% of the lines.
Figure 2B compares the frequency of tumor suppressor gene losses in cell lines and tissues analyzed using the same assay platform (n = 49), categorized as non-aggressive vs. aggressive. Gene loss in cell lines and aggressive tumors was comparable (P = 0.64). However, non-aggressive tumors showed a lower frequency of alterations as compared to cell lines (P = 0.02). The frequency of INK4A and PTEN loss was similar in cell lines and tumors (P = 0.3) but the frequency of TP53 LOH was higher in cell lines (47% vs. 28%, P = 0.06).
Original tumor grade, oncogene/tumor suppressor status, and genomic instability
The grade of the original tumor from which 27 lines were isolated was available (Additional file 1: Table S2). Genomic instability, assessed as the size of the genome with copy number alterations, was compared in samples harbouring - or not - mutations in UBC oncogenes and tumor suppressor genes.
In agreement with the genomic analyses of tumors, FGFR3 mutant lines showed lower genomic instability (genome altered: 1024 ± 461 Mb vs. 1402 ± 349 M, P = 0.06, Wilcoxon). By contrast, TP53 mutant lines showed higher genomic instability (genome altered: 1381 ± 366 Mb vs. 1023 ± 433 Mb, P = 0.04) (Additional file 2: Figure S1 and Additional file 1: Table S4). Cell lines isolated from low-grade tumors (G1/G2) tended to be more stable than those isolated from high-grade tumors (G3/G4) (Additional file 2: Figure S1). Similar tendencies were observed when using 3 different metrics to assess genomic instability (total size of the genome altered, fraction of probes altered, or number of altered segments identified; see methods section). FGFR3 mutant lines tended to fall within the genomically stable group whereas TP53 mutant and high-grade lines tended to fall within the genomically unstable-high group (Additional file 1: Table S5).
Copy number changes involving whole chromosomes/whole chromosome arms
Because distinct mechanisms lead to alterations in whole chromosomes or chromosome arms and to interstitial changes, these were assessed separately. Most cell lines showed losses and gains of multiple whole chromosomes/whole chromosome arms (Figure 1, Table 2, and Additional file 1: Table S6). Chromosomes most frequently gained were chr.20 (41%), chr.7 (23%), chr.21 (20%), and chr.5 (11%). The chromosome arms most frequently gained included 5p (45%), 8q (39%), 3q (34%), 7p (18%), 9q (18%), 1q (18%), 20q (16%), 20p (14%), and 9p (11%). Chromosomes most frequently lost were chr.4 (34%), chr.1 (27%), chr.21 (25%), chr.15 (20%), chr.22 (20%), chr.13 (16%), and chr.16 (16%). The most common arm losses included 8p (52%), 18q (25%), 3p (25%), 9p (23%), 17p (20%), and 2q (18%).
Recurrent focal copy number alterations across cell lines
Figure 3 shows copy number calls of individual probes for each line; Table 3 shows statistically significant minimal common regions (MCRs) identified using waviCGH, a permutation-based method [18]. Altogether, 21 statistically significant (FDR <0.05) MCRs were identified (11 gained and 10 lost), ranging from 1.6 kb-156 Mb in size (gains: 211 Kb-56 Mb; losses: 1.6 Kb-156 Mb). Six MCRs almost entirely covering chromosome 4 were identified; some MCRs overlapped with whole chromosome or chromosome arm changes, such as gains in 1q, 3q, 5p, 7, and 21 and losses in 3p, 4, and 15. Other MCRs included gains at 11p15 and losses at 6p22.1-6p22.2, 10q23.33, and 13q33.3 (Tables 2 and 3, Figure 3, and Additional file 1: Table S6). All recurrent focal losses were hemizygous.
Eight regions were amplified in ≥3 cell lines (81 Kb-73 Mb) (Additional file 2: Figure S2 and Table 4), mostly in chromosomes or chromosome arms lacking high frequency alterations. Six of them have previously been described as gained/amplified in UBC tissues but not in cell lines; another region at 12p11.22-12q13.13 is novel to both tumors and cell lines.
X chromosome analysis
Data regarding the X chromosome could be evaluated in 37 lines (9 female and 28 male). Large structural alterations were rare: 6 lines showed complete loss of Xp whereas 3 lines showed almost complete gain of Xq. No significant MCRs were identified although peak gains were seen at Xp22.2, Xp11.4, Xp11.23, Xq11.2, Xq12, and Xq25 and peak losses at Xq21.31 and Xp21.3-21.1 (Additional file 2: Figure S3).
Uniparental disomies (UPD)
Autosomes were analyzed for the presence of UPD (Figure 4 and Additional file 1: Table S7), defined as copy number neutral or amplified regions showing LOH. Examples of different categories of UPD events are shown in Additional file 2: Figure S4A-F. Overall, 195 UPD events were identified in 40 lines: UPDs were absent from BC61, RT4, and MGH-U3. Interestingly, these lines are among those showing lower fraction of the genome altered (Figure 1). All autosomes displayed ≥1 UPD event in ≥2 lines. The median number of UPD events per line was 4; cell line 97-24 showed 22 UPD events. Focal UPDs were the most common event (n = 91), ranging in size from 2-129 Mb. There were 51 UPDs involving whole chromosomes and 39 UPDs of a whole chromosome arm. UPDs involving whole chromosomes were most common in chr. 9, 17, and 22.
Many UPD events occurred in regions that are lost in other lines, supporting the occurrence of tumor suppressors therein. For example, region 3p21.1-3p14.2, lost in 46% of lines, coincides with a recurrent UPD. In addition, 4 lines show UPD of chr.3 (639V, 253J, and UM-UC-13) or 3p arm (96-1). The number of UPD and their total genome size correlated with the genome size affected by copy number alterations (p = 0.08 and p = 0.09) (Additional file 2: Figure S5). FGFR3 mutant cell lines had significantly fewer UPD events and overall size of the genome affected by UPD than wild type lines, supporting that UPD associates with aggressiveness (Additional file 2: Figure S6).
Gene-level analysis of copy number alterations
Gene copy number data was newly generated from 19 low-risk and 30 high-risk primary UBC. Some gains (5p, 8q, 17q, and whole chr.20) and losses (5q, 8p, 17p) occurred with similar frequency in lines and tumors (Figures 1B, and 5, Table 2, and Additional file 1: Table S6).
Tumors typically showed whole chr.9 loss, likely targeting multiple tumor suppressors (i.e. INK4A, PTCH1, and TSC1) whereas the cell lines had a high frequency of both gains and losses of chromosome 9, often in association with UPD affecting either the whole chromosome or its q-arm (Figure 4). Partial chr. 9 UPDs were found in several cell lines although there was no overlap among the regions affected. Chr.19 and chr.22 were more frequently lost in lines whereas they were more often gained in tumors (Figure 5).
Comparison of gene copy number alterations and expression
The complete expression dataset is provided in Additional file 1: Table S8. The 8 regions amplified in ≥3 cell lines (Additional file 1: Table S9) include 825 protein-coding genes with microarray expression information; 396 of them had a higher average expression in lines with gains/amplifications vs. those without them. This difference was statistically significant for 51 genes (Additional file 1: Table S9). Among them are CDKAL1 (CDK5 regulatory subunit associated protein like 1, 6p22.3; 4-fold differential expression, p < 0.05), ASRGL1 (11q13.4), ATP2B4 (1p12), ITGA3, PRPSAP2, and C17orf39 (17p11.2), all with a fold-change difference between amplified and non-amplified tumors of 1.5-1.9 (p < 0.05).
Gene expression information for 334 genes in the 10 lost regions was available; 225 had a lower average expression in cell lines with loss vs. those without loss. Of them, 28 showed statistically significant differential expression, including ANXA10 (4q32.3; 4.8-fold), ARAP2 (4p14; 2-fold), CDS1 (4q21.23; 2.2-fold), and PTPRG (3p14.2; 1.6-fold) (Additional file 1: Table S9).
Genomic analyses of new genes involved in UBC identified through exome sequencing
We analyzed copy number, UPD status, and expression of new driver UBC genes (Additional file 1: Tables S10 and S11) identified through exome sequencing [19-24]. Several of them are in genomic regions with either whole chromosome/chromosome arm gain/loss. LOH, and gains were more common than UPD (average 10 vs. 4 events per gene). PDZD2 and CSMD3 were often gained (41%, 64% and 52% respectively) whereas ANK2, FAT4 and MLL were often lost (55%, 55% and 50% respectively); MLL is on chromosome 11 - which is not frequently altered in UBC - and is significantly under-expressed in cell lines with LOH (Additional file 1: Table S11). TSC1 showed both gains (52%) and UPD (29%); a similar pattern was observed for other tumor suppressor genes. TP53 and EP300 were affected by both LOH and UPD.
UBC cell lines represent molecularly defined bladder cancer subtypes
We applied the UBC molecular classifier based on gene expression defined by Sjodahl et al [12] to identify lines most representative of the taxonomical groups proposed. Figure 6A shows that cell lines could be adscribed to the “Urobasal A”, “Urobasal B”, and “SCC-like” classifiers (Additional file 1: Table S12). The “Genomically Unstable” group was most commonly represented among the lines. Rebouissou et al. have recently reported on a 40-gene basal-like signature [25]. We have applied their 40-gene classifier to the cell line dataset and identify 4 major groups: lines with a predominant enrichment in the “Basal-like” signature; lines with a predominant enrichment in the “Non basal-like” signature; lines with enrichment of both signatures; and lines in which none of the signatures is enriched (Figure 6B and Additional file 1: Table S13). In agreement with the gene mutation/copy number change data indicating that cell lines are biased towards a more aggressive type, the “Non Basal/Luminal” phenotype is less represented among the available established cell lines.
Discussion
This is the most comprehensive analysis of the genomic landscape of UBC lines reported to date, including mutation, copy number, and expression data for a panel of UBC lines. As we have complete copy number and gene expression data for 40 of them, we have named this dataset UBC-40.
We provide detailed information on the source of the cell lines used in order to avoid “mistaken identity”. When surveying the literature, there are conflicting reports regarding the mutation status of some of the genes studied possibly due to cell contamination or mislabeling; similar problems have been reported with lines from the NCI-60 panel [26]. For example, MGH-U1/EJ and some subcultures of J82 are derived from T24 [27] and KU7 was cross-contaminated with HeLa [28]. Cell lines extensively cultured in different laboratories may have evolved independently and ultimately acquired mutations or genomic changes absent from the original line [29]. Nevertheless, fundamental features of these cell lines show stability and have allowed their extensive use as models of UBC in a wide variety of studies; this is the case of FGFR3-dependent RT112 cells [30]. The clustering shown in Figure 6 allows to propose, using two independent classifiers, prototype cell lines of papillary/luminal tumor phenotypes (i.e. RT4, UM-UC-7), and Basal-like tumors (well represented by several cell lines). Interestingly, a few cell lines display enrichment in both Basal and Non-Basal gene signatures (i.e. RT112 and MGH-U3, both of which are known to be FGFR3-driven). This information will be useful for selecting the best models to address specific functional studies.
Most of the lines analyzed here were generated many years ago, have been extensively passaged, and do not have matched normal tissue - or lymphoblastoid B cells - from the same patient available. Therefore, it is not possible to define the somatic variants they carry, this being the reason why we have not conducted exome sequencing. Consequently, and because the lines available only represent incompletely the spectrum of UBC as it relates to low-grade tumors, renewed efforts should be placed in the establishment of new cell lines and xenografts to facilitate preclinical studies. Improved sample processing, matrigel embedding, and orthotopic implantation, as well as more reliable systems for primary culture and passage [31,32], should contribute to improve efficiency. Furthermore, efforts should be made to biobank non-neoplastic material from the same patients.
The summary data reported here provides an overview and the detailed datasets should serve as a resource to the research community in order to identify which - among these lines - serve best as disease models for specific tumor subtypes. The recent discovery of new driver genes involved in UBC through massive parallel sequencing [19-24] and the information provided here should be useful to select lines appropriate for their functional analysis and for preclinical studies.
Conclusions
- TP53-mutant lines show high genomic instability whereas FGFR3- or PIK3CA-mutant lines are more genomically stable.
- We have identified for the first time UPD events in UBC lines, pointing to new regions containing putative tumor suppressors.
- We provide novel information on chr. X losses, where new important tumor suppressor genes have been identified (i.e. KDM6A and STAG2).
- We identify novel regions deserving research as they are frequently altered in UCB lines as well as in primary tumors.
- Some cell lines are more representative of the FGFR3-driven tumor pathway (RT112, MGH-U3, 97-7, BC61, RT4, SW-780, and UM-UC-6) whereas others are more representative of the tumor suppressor-driven pathways (5637, 92-1, 96-1, 97-18, 97-24, HT1376, SW-1710, UM-UC-1, UM-UC-13, and VM-CUB-2). We propose that - as has already been done, in part - these cells be used as models of non-aggressive and aggressive UBC, respectively.
- The UBC lines available cover a wide range of tumor genotypes and phenotypes. While they do not fully represent the spectrum of tumors found in patients and are enriched towards a more aggressive genetic architecture, the main genetic pathways involved in UBC are represented in this panel.
- Future efforts should be placed to establish new UBC lines, mainly focusing on less aggressive tumors, as well as collections of patient-derived xenografts.
Methods
Literature and web-based search
A literature search was performed to retrieve information regarding sex, histology, and stage/grade of the original tumor from which the cell line was established, as well as the original reference. The mutational status of the main genes involved in UBC (TERT, FGFR3, PIK3CA, KRAS, HRAS, NRAS, p16/INK4A, PTEN, and TP53) was analyzed and complemented/supported through data obtained from COSMIC, CCLE, and IARC public databases [14,33,34]. Information on mutations in FGFR3, PIK3CA, KRAS, HRAS, NRAS, PTEN, and TP53 in primary UBC tumors with either “papillary” or “carcinoma” histology was retrieved from the COSMIC database. All web resources used in the analysis are listed in Additional file 1: Table S1.
UBC cell lines
JON, MGH-U4, RT4, SCaBER, SW-800, SW-850, SW-1710, T24, VM-CUB-2, 253J, 639V, 5637, and 575A were purchased from the American Type Culture Collection (Rockville, MD, US); J82, MGH-U3, and RT112 cells were kindly provided by F. Radvanyi (Institut Curie, Paris, France); UM-UC-1, UM-UC-3, UM-UC-4, UM-UC-5, UM-UC-6, UM-UC-7, UM-UC-9, UM-UC-10, UM-UC-11, UM-UC-12, UM-UC-13, UM-UC-14, UM-UC-15, UM-UC-17, and UM-UC-18 were provided by H. B. Grossman (MD Anderson Cancer Center, Houston, TX, US) [35]; HT1197, HT1376, HU456, KK47, PSI, SW-780, UM-UC-2, and VM-CUB-1 were provided by D. Theodorescu (University of Colorado, Aurora, CO); 92-1, 96-1, 97-1, 97-7, 97-18, and 97-24 were generated by C. Reznikoff [36] and provided by M. Knowles (University of Leeds, Leeds, UK); TCCSUP was provided by M. Sánchez-Carbayo (CNIO, Madrid, Spain); BC61 was provided by W. Schulz [37]; and LGWO1 G600 was provided by J. Reeder (U. Rochester, NY). Only Mycoplasma-free cultures were used.
DNA and RNA isolation from cell lines and tumors
Cells were cultured in RPMI supplemented with 10-20% FBS and were harvested at 70-90% confluence. DNA from cell lines HT1197, HT1376, HU456, KK47, PSI, UM-UC-2, and VM-CUB-1 was isolated in the laboratory of D. Theodorescu; the remaining cell lines were cultured at CNIO. DNA was isolated using the DNAeasy blood and tissue kit (Qiagen) according to manufacturer’s instructions.
Tumor samples (n = 49) came from UBC cases diagnosed with UBC recruited to the Spanish Bladder Cancer/EPICURO study. Informed consent was obtained from study participants in accordance with the Institutional Review Board of the Ethics Committees of participating hospitals that approved the study (IRB Hospital del Mar, ref. 2008/3296/1). The T/G distribution was as follows: Ta (n = 26), T1 (n = 8), T2 (n = 5), T3 (n = 6), and T4 (n = 4); cases were grouped in two categories, non-aggressive (TaG1 and TaG2) (n = 19) and aggressive (TaG3 and T1-T4) (n = 30). Only samples containing >60% tumor cells were used. DNA was isolated using the Puregene kit A (Qiagen) according to the manufacturer’s instructions.
Mutational analyses
FGFR3, PIK3CA, HRAS, KRAS, and NRAS hotspot mutational analysis was performed using ABI PRISM® SNaPshot® (ABI) as previously described [38].
Analysis of gene copy number alterations using the Illumina 1 M Duo array
DNA (1.5 μg) was quantified using picogreen and used for array hybridization. The Illumina 1 M Duo array includes 902,103 autosomal probes and 39,779 probes from sex chromosomes. A total of 43 different cell lines were hybridized to the arrays (GSE64572) Genotypes and R values were extracted using the beadstudio software (version 3.1.3.0) and R values were normalized using the method described by Pounds and co-workers [39]. Log R ratios were calculated using as reference the average R value from 200 blood leukocyte samples from control subjects included in the EPICURO study [40] with the R program version 2.8 [41]. Copy number calls were obtained using the waviCGH software [18]; segmentation and calling were performed using DNAcopy [42] and the probability-based method (CGHcall) [43], respectively. Gene copy number changes were called as follows: -1 = loss (hemi or homozygous), 0 = copy number neutral, +1 = gain and +2 = amplification (defined as ≥5 copies). Minimal common regions (MCRs) were identified using the permutations method in waviCGH which computes a P‐value based on a permutation test assuming that the alterations found are randomly located in the genome. Consecutive probes with P‐values <0.05 were joined in a common region. Focal copy number alterations are those not involving whole chromosomes or whole chromosome arms.
Gene copy number reproducibility analysis
The experimental reproducibility of the gene copy number analysis was assessed using data from DNA isolated in 2 different laboratories (n = 3) or DNA isolated from different cultures in the same laboratory (n = 5). The absolute call concordance rate was between 79.3 and 96.7%. “Gain/loss” type discordances were very uncommon (0.015-0.03%); most discordances were “gain/no-change” (1.4-7.5%) or “loss/no-change” (1.9-15.8%). The replicate with the highest signal to noise ratio was considered the most accurate and selected for subsequent analysis. A summary of the call concordance rate in replicates is provided in Additional file 1: Table S14.
Copy number analysis of FGFR3, PIK3CA, KRAS, HRAS, NRAS, INK4A, PTEN, and TP53
Gene amplification was determined from the copy call results from CGHcall. LOH and HD were determined by combining copy call, B allele frequency (BAF), and genotyping data. Probes with homozygous calls and BAF of either 0 or 1 and a decline in the logR ratio of were classified as LOH and non-called (NC) probes with an abnormal BAF and a decline in LogR ratio were classified as HD. The number of probes representing each gene was: FGFR3 (n = 22), PIK3CA (n = 42), KRAS (n = 22), HRAS (n = 2), NRAS (n = 6), INK4A (n = 17), PTEN (n = 49), and TP53 (n = 22).
Copy number analysis of genes in X chromosome
The gender of the patient from whom the cell lines were derived was not always available. The presence of a Y chromosome was considered as a reliable indication that it was of male origin. This, combined with the data from the literature, was used to select cell lines for which we could perform X chromosome gene copy number analysis. The LogR ratio was calculated with the average R value normalized to a pool of control male or female blood leukocytes from the EPICURO Study. For probes representing the X chromosome (38,016 probes), or those corresponding to sequences present on both the X and Y chromosomes (395 probes), copy number calls were obtained using the waviCGH software as described for autosomes.
Assessment of genomic instability
Genomic instability was assessed as the fraction of the genome altered (either lost or gained), calculated from call data generated by CGHcall using autosome probes and measured in 3 different ways:
(1) total size of the genome (covered by the Illumina array) altered. The size in base pairs (bp) of the segmented regions altered (lost or gained) was calculated from the start and end position of the segments; (2) fraction of probes altered - proportion of probes showing loss, gain, and amplification; (3) number of altered segments identified - the total number of individual altered segments, lost or gained, was determined from the waviCGH segmented call data.
Cell lines were classified in 3 categories according to fraction of the genome altered (upper, middle, and lower tertile) calculated by the 3 different methods; low/medium/high genomic instability groups were thus identified. We compared the relationship between mutational status in FGFR3, PIK3CA, KRAS, HRAS, NRAS, p16/INK4A, PTEN, and TP53 and the original tumor grade with the fraction of the genome altered, calculated by these approaches. The chi-square test was used to assess the difference between the frequency of mutant vs. wild type genotypes and low vs. high grade cell lines in each of the genome instability groups. The Wilcoxon test was used to assess the difference in the mean of the genome instability variable between mutant/wild type and low/high grade cell lines.
Uniparental disomy (UPD) detection
Log R ratios from hybridization data were analyzed using the zoo package of the R statistical program in order to identify UPDs [40]. Chromosomal regions with LOH, as determined from the BAF, and an average LogR ratio value around 0 indicate a probable segmental UPD. UPD events were classified into 6 different categories: (1) involving the whole chromosome, (2) involving a whole chromosome arm, (3) focal UPD, (4) focal UPD and segmental duplication, (5) UPD and segmental amplification, and (6) UPD involving almost the entire chromosome with a combined focal deletion.
RNA expression analyses
We have used previously reported data corresponding to 28 cell lines (GEO: GSE5845) [25] and have generated expression data for 20 additional cell lines (GSE64279). RNA was isolated with Trizol in both experimental batches. For the new cell lines, RNA (500 ng) was amplified, labeled, and used for array hybridization. The Affymetrix U133A array was used in all experiments. Raw expression data from all experiments were normalized using the R library Frozen Robust Multiarray Analysis (fRMA) method [44]. We applied this method as described by the authors for multiple arrays. We read the raw data (CEL files) and used the Random effect model for preprocessing. This model allows us combining data from different batches using the same microarray platform for analysis. Further, to obtain a matrix of gene-level expression values we used the exprs function with parameters by default.
Comparison of gene copy number and expression data
Gene lists were generated for regions identified as significant MCRs in copy number lost, gained or amplified regions. To determine the relationship between gene copy number alteration and expression, correlation analysis was performed for amplified genes by comparing copy number amplified/gained cases vs. copy number neutral/lost cases. For copy number lost genes, a comparison of copy number lost vs. no copy number loss was performed; the difference in expression between the 2 groups was assessed using the Wilcoxon test. The chi-square test was used to compare the distribution of mutation frequencies in cell lines and tumors.
Gene copy number, UPD analysis, and comparison with expression data of 44 new genes implicated in UBC
Copy number status and UPD of 44 genes recently shown to be involved in UBC were assessed [20,21,23,24]. To determine the relationship between gene copy number alteration and expression, correlation analysis was performed for gained genes by comparing copy number amplified/gained cases vs. copy number neutral/lost cases. For copy number lost genes, a comparison of copy number lost vs. no copy number loss was performed; the difference in expression between the groups was assessed using the Wilcoxon test.
Molecular classification of cell lines according to expression signatures of primary tumors
A molecular classifier consisting of 1038 genes [12] was used for hierarchical clustering; M. Lauss kindly provided the gene expression values for each signature (Additional file 3) and the classification method (Additional file 4). We processed the data to obtain one expression value for each gene present in the Affymetrix U133A array, as different probes for same gene are present in the platform. For this purpose we used the CollapseDataset tool available in the GenePattern webserver with collapse mode set at maximum. Then, we extracted the common genes present in the centroids and our expression file to calculate the pearson correlation for each centroid to each sample (classification script is provided as Additional file 4) using the R software. Furthermore, hierarchical clustering and heatmap plots were generated with the heatmap.2 function included in the gplots library in R software using Pearson correlation values calculated previously for each centroid to each sample (Additional file 1: Table S12). Expression signatures were re-named as “Urobasal A” (MS1a-b), “Genomically unstable” (MS2a1-2),“Urobasal B” (MS2b2.1) and “SCC-like” (MS2b2.2). The “Infiltrated” phenotype was not considered as it is mainly based on a stromal signature.
The same approach was applied using the centroids provided by Rebouissou et al. [25] to classify cell lines as “basal” or “non-basal” with the respective confidence prediction (Additional file 1: Table S13).
Data availability
All genomic data reported in this manuscript have been deposited in GEO. Expression data accession number: GSE64279. SNP array data accession number: GSE64572. Any other information of interest to readers can be requested to the authors.
Abbreviations
- HD:
-
Homozygous deletions
- LOH:
-
Loss of heterozygosity
- MCR:
-
Minimal common regions
- MIBC:
-
Muscle-invasive bladder cancer
- NMIBC:
-
Non-muscle invasive bladder cancer
- UBC:
-
Urothelial bladder cancer
- UPD:
-
Uniparental disomy
References
Ferlay J, Shin HR, Bray F, Forman D, Mathers C, Parkin DM. Estimates of worldwide burden of cancer in 2008: GLOBOCAN 2008. Int J Cancer J Int Cancer. 2010;127(12):2893–917.
Murta-Nascimento C, Schmitz-Drager BJ, Zeegers MP, Steineck G, Kogevinas M, Real FX, et al. Epidemiology of urinary bladder cancer: from tumor development to patient’s death. World J Urol. 2007;25(3):285–95.
Hernandez S, Lopez-Knowles E, Lloreta J, Kogevinas M, Amoros A, Tardon A, et al. Prospective study of FGFR3 mutations as a prognostic factor in nonmuscle invasive urothelial bladder carcinomas. J Clin Oncol. 2006;24(22):3664–71.
Luis NM, Lopez-Knowles E, Real FX. Molecular biology of bladder cancer. Clin Transl Oncol. 2007;9(1):5–12.
Cappellen D, De Oliveira C, Ricol D, de Medina S, Bourdin J, Sastre-Garau X, et al. Frequent activating mutations of FGFR3 in human bladder and cervix carcinomas. Nat Genet. 1999;23(1):18–20.
Lopez-Knowles E, Hernandez S, Malats N, Kogevinas M, Lloreta J, Carrato A, et al. PIK3CA mutations are an early genetic alteration associated with FGFR3 mutations in superficial papillary bladder tumors. Cancer Res. 2006;66(15):7401–4.
Jebar AH, Hurst CD, Tomlinson DC, Johnston C, Taylor CF, Knowles MA. FGFR3 and Ras gene mutations are mutually exclusive genetic events in urothelial cell carcinoma. Oncogene. 2005;24(33):5218–25.
Real FX. p53: it has it all, but will it make it to the clinic as a marker in bladder cancer? J Clin Oncol. 2007;25(34):5341–4.
Schulz WA. Understanding urothelial carcinoma through cancer pathways. Int J Cancer J Int Cancer. 2006;119:1513–8.
Lindgren D, Frigyesi A, Gudjonsson S, Sjodahl G, Hallden C, Chebil G, et al. Combined gene expression and genomic profiling define two intrinsic molecular subtypes of urothelial carcinoma and gene signatures for molecular grading and outcome. Cancer Res. 2010;70(9):3463–72.
Blaveri E, Brewer JL, Roydasgupta R, Fridlyand J, DeVries S, Koppie T, et al. Bladder cancer stage and outcome by array-based comparative genomic hybridization. Clin Cancer Res. 2005;11(19 Pt 1):7012–22.
Sjodahl G, Lauss M, Lovgren K, Chebil G, Gudjonsson S, Veerla S, et al. A molecular taxonomy for urothelial carcinoma. Clin Cancer Res. 2012;18(12):3377–86.
Allory Y, Beukers W, Sagrera A, Flandez M, Marques M, Marquez M, et al. Telomerase reverse transcriptase promoter mutations in bladder cancer: high frequency across stages, detection in urine, and lack of association with outcome. Europ Urol. 2014;65(2):360–6.
Barretina J, Caponigro G, Stransky N, Venkatesan K, Margolin AA, Kim S, et al. The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature. 2012;483(7391):603–7.
Shoemaker RH. The NCI60 human tumour cell line anticancer drug screen. Nat Rev Cancer. 2006;6(10):813–23.
Hurst CD, Fiegler H, Carr P, Williams S, Carter NP, Knowles MA. High-resolution analysis of genomic copy number alterations in bladder cancer by microarray-based comparative genomic hybridization. Oncogene. 2004;23(12):2250–63.
Williams SV, Hurst CD, Knowles MA. Oncogenic FGFR3 gene fusions in bladder cancer. Hum Mol Genet. 2013;22(4):795–803.
Carro A, Rico D, Rueda OM, Diaz-Uriarte R, Pisano DG. waviCGH: a web application for the analysis and visualization of genomic copy number alterations. Nucleic Acids Res. 2010;38(Web Server issue):W182–7.
Balbas-Martinez C, Sagrera A, Carrillo-de-Santa-Pau E, Earl J, Marquez M, Vazquez M, et al. Recurrent inactivation of STAG2 in bladder cancer is not associated with aneuploidy. Nat Genet. 2013;45(12):1464–9.
Gui Y, Guo G, Huang Y, Hu X, Tang A, Gao S, et al. Frequent mutations of chromatin remodeling genes in transitional cell carcinoma of the bladder. Nat Genet. 2011;43(9):875–8.
Guo G, Sun X, Chen C, Wu S, Huang P, Li Z, et al. Whole-genome and whole-exome sequencing of bladder cancer identifies frequent alterations in genes involved in sister chromatid cohesion and segregation. Nature Genet. 2013;45(12):1459–63.
The Cancer Genome Atlas Research Network. Comprehensive molecular characterization of urothelial bladder carcinoma. Nature. 2014;507(7492):315–22.
Balbas-Martinez C, Rodriguez-Pinilla M, Casanova A, Dominguez O, Pisano DG, Gomez G, et al. ARID1A alterations are associated with FGFR3-wild type, poor-prognosis, urothelial bladder tumors. PLoS One. 2013;8(5):e62483.
Solomon DA, Kim JS, Bondaruk J, Shariat SF, Wang ZF, Elkahloun AG, et al. Frequent truncating mutations of STAG2 in bladder cancer. Nature Genet. 2013;45(12):1428–30.
Rebouissou S, Bernard-Pierrot I, de Reynies A, Lepage ML, Krucker C, Chapeaublanc E, et al. EGFR as a potential therapeutic target for a subset of muscle-invasive bladder cancers presenting a basal-like phenotype. Sci Transl Med. 2014;6(244):244ra291.
Lorenzi PL, Reinhold WC, Varma S, Hutchinson AA, Pommier Y, Chanock SJ, et al. DNA fingerprinting of the NCI-60 cell line panel. Mol Cancer Ther. 2009;8(4):713–24.
O’Toole CM, Povey S, Hepburn P, Franks LM. Identity of some human bladder cancer cell lines. Nature. 1983;301(5899):429–30.
Jager W, Horiguchi Y, Shah J, Hayashi T, Awrey S, Gust KM, et al. Hiding in plain view: genetic profiling reveals decades old cross contamination of bladder cancer cell line KU7 with HeLa. J Urol. 2013;190(4):1404–9.
Warenius HM, Jones M, Gorman T, McLeish R, Seabra L, Barraclough R, et al. Combined RAF1 protein expression and p53 mutational status provides a strong predictor of cellular radiosensitivity. Br J Cancer. 2000;83(8):1084–95.
Qing J, Du X, Chen Y, Chan P, Li H, Wu P, et al. Antibody-based targeting of FGFR3 in bladder carcinoma and t(4;14)-positive multiple myeloma in mice. J Clin Invest. 2009;119(5):1216–29.
Rubio-Viqueira B, Hidalgo M. Direct in vivo xenograft tumor model for predicting chemotherapeutic drug response in cancer patients. Clin Pharmacol Ther. 2009;85(2):217–21.
Seifert HH, Meyer A, Cronauer MV, Hatina J, Muller M, Rieder H, et al. A new and reliable culture system for superficial low-grade urothelial carcinoma of the bladder. World J Urol. 2007;25(3):297–302.
Forbes SA, Bindal N, Bamford S, Cole C, Kok CY, Beare D, et al. COSMIC: mining complete cancer genomes in the Catalogue of Somatic Mutations in Cancer. Nucleic Acids Res. 2011;39(Database issue):D945–50.
Petitjean A, Mathe E, Kato S, Ishioka C, Tavtigian SV, Hainaut P, et al. Impact of mutant p53 functional properties on TP53 mutation patterns and tumor phenotype: lessons from recent developments in the IARC TP53 database. Hum Mutat. 2007;28(6):622–9.
Sabichi A, Keyhani A, Tanaka N, Delacerda J, Lee IL, Zou C, et al. Characterization of a panel of cell lines derived from urothelial neoplasms: genetic alterations, growth in vivo and the relationship of adenoviral mediated gene transfer to coxsackie adenovirus receptor expression. J Urol. 2006;175(3 Pt 1):1133–7.
Yeager TR, DeVries S, Jarrard DF, Kao C, Nakada SY, Moon TD, et al. Overcoming cellular senescence in human cancer pathogenesis. Genes Dev. 1998;12(2):163–74.
Hatina J, Huckenbeck W, Rieder H, Seifert HH, Schulz WA. [Bladder carcinoma cell lines as models of the pathobiology of bladder cancer. Review of the literature and establishment of a new progression series]. Der Urologe Ausg A. 2008;47(6):724–34.
Kompier LC, Lurkin I, van der Aa MN, van Rhijn BW, van der Kwast TH, Zwarthoff EC. FGFR3, HRAS, KRAS, NRAS and PIK3CA mutations in bladder cancer and their potential as biomarkers for surveillance and therapy. PLoS One. 2010;5(11):e13821.
Pounds S, Cheng C, Mullighan C, Raimondi SC, Shurtleff S, Downing JR. Reference alignment of SNP microarray signals for copy number analysis of tumors. Bioinformatics. 2009;25(3):315–21.
Rodriguez-Santiago B, Malats N, Rothman N, Armengol L, Garcia-Closas M, Kogevinas M, et al. Mosaic uniparental disomies and aneuploidies as large structural variants of the human genome. Am J Hum Genet. 2010;87(1):129–38.
Team RDC. R: A language and environment for statistical computing. 2008.
Olshen AB, Venkatraman ES, Lucito R, Wigler M. Circular binary segmentation for the analysis of array-based DNA copy number data. Biostatistics. 2004;5(4):557–72.
van de Wiel MA, Kim KI, Vosse SJ, van Wieringen WN, Wilting SM, Ylstra B. CGHcall: calling aberrations for array CGH tumor profiles. Bioinformatics. 2007;23(7):892–4.
McCall MN, Bolstad BM, Irizarry RA. Frozen robust multiarray analysis (fRMA). Biostatistics. 2010;11(2):242–53.
Rieger KM, Little AF, Swart JM, Kastrinakis WV, Fitzgerald JM, Hess DT, et al. Human bladder carcinoma cell lines as indicators of oncogenic change relevant to urothelial neoplastic progression. Br J Cancer. 1995;72(3):683–90.
Sarkar S, Julicher KP, Burger MS, Della Valle V, Larsen CJ, Yeager TR, et al. Different combinations of genetic/epigenetic alterations inactivate the p53 and pRb pathways in invasive human bladder cancers. Cancer Res. 2000;60(14):3862–71.
Chiong E, Lee IL, Dadbin A, Sabichi AL, Harris L, Urbauer D, et al. Effects of mTOR inhibitor everolimus (RAD001) on bladder cancer cells. Clin Cancer Res. 2011;17(9):2863–73.
Koch A, Hatina J, Rieder H, Seifert HH, Huckenbeck W, Jankowiak F, et al. Discovery of TP53 splice variants in two novel papillary urothelial cancer cell lines. Cell Oncol. 2012;35(4):243–57.
Hurst CD, Platt FM, Knowles MA. Comprehensive mutation analysis of the TERT promoter in bladder cancer and detection of mutations in voided urine. Eur Urol. 2014;65(2):367–9.
Acknowledgements
We thank the investigators mentioned in the text that provided cell lines, Núria Malats for valuable discussions, Angel Carro and Eduardo Andrés for technical support. We also thank M. Lauss, G. Sjödahl, and M. Hoglund for providing script and detailed information on the centroids used for their classification which are now included as supplementary information in this manuscript.
Authors’ note
We apologize for not being able to cite a large number of relevant articles due to space constraints. All data generated in the study are available upon request.
Grant support
This work was supported, in part, by grants Consolider ONCOBIO, SAF2011-15934-E, Red Temática de Investigación Cooperativa en Cáncer (RTICC)]; Asociación Española Contra el Cáncer, EU-FP7-201663, and NIH RO-1 (CA089715); and National Institutes of Health grants CA075115 and CA104106 (to D.T.).
Author information
Authors and Affiliations
Corresponding author
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
JE, MPP, HA, DT, and SC generated and analyzed data. DR, ECdSP, BRS, GG, DGP, and LPJ analyzed data. HBG and WAS provided crucial reagents for the study. AC contributed to generate and analyze data. FXR designed and supervised the overall conduct of the study; AV, DT, and SC supervised specific aspects of the study conduct. JE, DR, and FXR wrote the manuscript. All authors reviewed, commented, and approved the manuscript.
An erratum to this article can be found at http://dx.doi.org/10.1186/s12864-016-3179-z.
An erratum to this article can be found at http://dx.doi.org/10.1186/s12864-015-2227-4.
Additional files
Additional file 1: Table S1.
List of web resources used. Table S2. Characteristics and frequency of mutations in key oncogenes and tumor suppressors in UBC lines and in urinary bladder tumors. Table S3. Frequency of mutations in key oncogenes and tumor suppressors in UBC lines and in urinary bladder tumors. Table S4. Relationship between genomic instability, FGFR3, and TP53 mutation status and grade of original tumor. Table S5. Chi square analysis of genomic instability of UBC lines, original tumor grade, and FGFR3 and TP53 mutation status. Table S6. Summary of large gene copy number alterations involving whole chromosomes or whole chromosome arms. Table S7. UPD analysis of UBC cell lines. Table S8. Affymetrix expression array data of UBC lines included in the study. Table S9. Genes in lost and amplified regions and comparison of gene copy number and gene expression data. Table S10. Gene copy number changes and UPD events in new driver genes identified recently through exome/whole genome sequencing analysis of UBC tissues. Table S11. Comparison of copy number data and expression of 44 new genes implicated in UBC. Table S12. Pearson’s correlations according to the gene signature from Sjodahl et al [12]. Table S13. Pearson’s correlations according to the 40-gene signature from Rebouissou et al [25]. Table S14. WaviCGH call all concordance rate in replicate cell lines.
Additional file 2: Figure S1.
Association between genomic instability (measured as fraction of the genome altered) (bp)andFGFR3 mutation status, TP53 mutation status, and Grade of original tumor from which cell lines were established. Figure S2. Genome wide assessment of gene amplifications in UBC lines. Figure S3. Log R ratios of X chromosome probe signal in UBC lines and gender origin. Figure S4. UPD analysis in UBC lines. Figure S5. Relationship between UPD events and copy number alterations. Figure S6. Uniparentaldisomies (UPD) and FGFR3 mutation status. Figure S7. Segmented mean Log ratios of probe signals for UBC lines in the Cancer Cell Line Encyclopedia (http://www.broadinstitute.org/ccle/home).
Additional file 3:
Classification script for molecular subtyping of bladder tumors kindly provided by Lauss M with modifications by Carrillo-de Santa Pau E.
Additional file 4:
Centroids file kindly provided by Sjödahl G, Lauss M and Höglund M generated in Clin Cancer Res. 2012 Jun 15;18(12):3377-86.
Rights and permissions
This article is published under an open access license. Please check the 'Copyright Information' section either on this page or in the PDF for details of this license and what re-use is permitted. If your intended use exceeds what is permitted by the license or if you are unable to locate the licence and re-use information, please contact the Rights and Permissions team.
About this article
Cite this article
Earl, J., Rico, D., Carrillo-de-Santa-Pau, E. et al. The UBC-40 Urothelial Bladder Cancer cell line index: a genomic resource for functional studies. BMC Genomics 16, 403 (2015). https://doi.org/10.1186/s12864-015-1450-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12864-015-1450-3