Abstract
Free full text
Breast Cancer Cell Line Classification and Its Relevance with Breast Tumor Subtyping
Abstract
Breast cancer cell lines have been widely used for breast cancer modelling which encompasses a panel of diseases with distinct phenotypical associations. Though cell lines provide unlimited homogenous materials for tumor studies and are relatively easy to culture, they are known to accumulate mutations duringthe initial establishment and subsequent series of cultivations. Thus, whether breast cancer cell line heterogeneity reflects that of carcinoma remains an important issue to resolve before drawing any reliable conclusion at the tumor level using cell lines. Inconsistent nomenclatures used for breast cancer cell line subtyping and the different number of subtypes grouped for cell lines and tumors make their direct matching elusive. By analyzing the molecular features of 92 breast cancer cell lines as documented by different literatures, we categorize 84 cell lines into 5 groups to be consistent with breast tumor classification. After combing through these cell lines, we summarized the molecular features, genetically and epigenetically, of each subtype, and manually documented 10 cell lines lacking explicit information on subtyping. Nine cell lines, either found inconsistent on their primary molecular features from different studies or being contaminated at the origin, are not suggested as the first choice for experimental use. We conclude that breast tumor cell lines, though having a high mutational frequency with many uncertainties and could not fully capture breast cancer heterogeneity, are feasible but crude models for tumors of the same subtype. New cell lines with enriched interferon regulated genes need to be established to enlarge the coverage of cell lines on tumor heterogeneity.
Introduction
Breast cancer is highly heterogeneous encompassing a group of genetically and epigenetically distinct diseases exhibiting diverse clinical features 1. A large proportion of current knowledge on breast carcinomas are derived from in vivo and in vitro studies performed using breast cancer cell lines, given that they could provide an unlimited source of homogenous self-replicating materials using simple yet standard media and approaches 2. Thus, whether these cell lines well capture the molecular features and reflect the heterogeneity of corresponding tumors remains an important issue to resolve before obtaining any clinically relevant results. Though it is concluded that breast cancer cell lines are representative of breast carcinoma to a large extent, with ER and HER2 being important stratifiers for their classification, continuous evidences have suggested dramatic genetic and epigenetic changes during the initial cell line establishment and subsequent serial passaging, suggesting that the resultant cell lines may have evolved significantly from the primary tumors 3. Also, different studies categorize breast cancer cell lines into different groups 1, 4-8, complicating our understandings towards cell line classification and their relevance with tumors. For example, Birnbaum et al. grouped 27 breast cancer cell lines into luminal, basal and mesenchymal subtypes 7; Riaz et al. characterized 5 subtypes, i. e. , luminal, luminal-HER2+, ER-negative-HER2+, basal, normal-like, among 51 breast cancer cell lines using a panel of 496 genes identified by Perou 1; Lehmann et al. subdivided triple negative cell lines into 7 categories, namely, basal-like 1 (BL1), basal-like 2 (BL2), immunomodulatory (IM), mesenchymal (M), mesenchymal stem-like (MSL), luminal androgen receptor (LAR) and unstable (UNS) 9; and some studies discriminate breast cancer cell lines into luminal-like and basal-like, with the basal class further subdivided into basal-A and basal-B 5, 8. Given the inconsistent nomenclatures, classification and even contradictory molecular characterization in different literatures 1, 4-8, 10, we are overwhelmed with cell lines lacking systematic feature documentation and consistent subtyping. On the other hand, the number of cell lines widely used for breast cancer studies is extremely small, with cell lines such as MCF7, T47D and MDAMB231 accounting for more than two-thirds of cell lines used in the associated studies 2. This raises the issue on how representative these few cell lines are of the vast diverse spectrum of breast tumors with distinct clinical implications. We are thus motivated to sort out the molecular features and corresponding tumor subtype each cell line represents to facilitate breast cancer modeling using appropriate cell lines.
Nomenclature of breast cancer cell lines
Ever since the establishment of the first breast cancer cell line, BT-20, in 1958 11, relatively few cell lines have been obtained due to technical difficulties in extracting viable tumor cells from the surrounding stroma 12, 13 and the bottleneck of long-term propagation during cultivation 12, 14. Most cell lines were established in late 1970s.
Cell line naming does not, in general, reflect its phenotypical association, but rather how they are established regarding, e. g. whether they are derived from the same laboratory, the same patient, isolated by serial subculture from the same initial population, or cultured using the same approach. For instance, 'HCC series' cell lines, as represented by their names, were isolated at Hamon Cancer Centre12; 'MDA series' were established from M. D. Anderson Hospital and Tumor Institute14; '21 series' were established from the same patient diagnosed as having infiltrating and intra-ductal mammary carcinoma, despite their distinct phenotypes and genotypes 15. 'HMT series' underwent a successive subcultivation under various conditions during their establishment, including P53 mutation, MYC amplification, EGF-independence accompanied by tumorigenicity in nude mice, EGFR and HER2 over expression. 'SUM series', though isolated from different tumor specimens, were established using the same selective media 16. As cell lines are typically named by the scientist who derived them, there is no rule on how each cell line is named especially for those do not belong to any series. It is until recently that standardization on cell line nomenclature has been proposed 17, which helps regularize the naming of newly established cell lines and ultimately improvement in cell line annotation and scientific reproducibility 18.
Molecular classification of breast cancer cell lines
Gene expression profiling has been widely applied to catalogue breast cancer cell lines, with diverse number of clusters being matched into various schemes. We characterized 84 cell lines based on the status of three important receptors conventionally used for breast cancer subtyping, i. e. , estrogen receptor (ER), progesterone receptor (PR), and human epithelial receptor 2 (HER2), and classified them using the same nomenclature, i. e. , luminal A, luminal B, HER2 positive, and triple negative subtypes, with triple negative cells being further divided into A and B to capture its heterogeneity and provide an easy link to the widely used names, i. e. , basal A and B (Table (Table11).
Table 1
Cell lines | ER | PR | HER2 | BRCA1 Mutation | Subtype | Series | Basic Medium | Tumor | Source | References |
---|---|---|---|---|---|---|---|---|---|---|
BT483 | + | +/- | - | WT | LA | BT | RPMI | IDC | L | 1, 5, 6, 8, 10 |
CAMA1 | + | +/- | - | WT | LA | NA | DMEM | AC | L | 1, 5, 6, 10 |
EFM19 | + | + | - | ND | LA | EFM | RPMI | IDC | L | 8, 10 |
HCC1428 | + | + | - | ND | LA | HCC | RPMI | AC | L | 5, 8 |
HCC712 | + | +/- | - | ND | LA | HCC | RPMI | DC | L | 8 |
IBEP2 | + | - | - | ND | LA | NA | DMEM | IDC | L | 10 |
KPL1 | + | - | - | ND | LA | NA | RPMI | IDC | L | 10, 61 |
LY2 | + | - | - | ND | LA | NA | DMEM | IDC | L | 5 |
MCF7 | + | + | - | WT | LA | MCF | RPMI, DMEM | IDC | L | 1, 5, 6, 8, 10, 58 |
MDAMB134 | + | - | - | ND | LA | MDA | RPMI | IDC | L | 8 |
MDAMB134VI | + | - | - | WT | LA | MDA | DMEM | IDC | L | 1, 5, 6, 10 |
MDAMB175 | + | - | - | ND | LA | MDA | RPMI | IDC | L | 8 |
MDAMB175VII | + | - | - | WT | LA | MDA | DMEM | IDC | L | 1, 5, 6, 10 |
MDAMB415 | + | +/- | - | WT | LA | MDA | DMEM | AC | L | 5, 6, 10 |
T47D | + | + | - | WT | LA | NA | RPMI | IDC | L | 1, 5, 6, 8, 10 |
ZR751 | + | +/- | - | WT | LA | ZR75 | RPMI | IDC | L | 1, 5, 6, 8, 10 |
ZR75B | + | - | - | ND | LA | ZR75 | RPMI | NA | L | 5 |
BSMZ | + | + | + | ND | LB | NA | RPMI | IDC | L | 10, 19 |
BT474 | + | + | + | WT | LB | BT | RPMI | IDC | L | 1, 5, 6, 8, 10 |
EFM192A | + | + | + | ND | LB | EFM | RPMI | AC | L | 8 |
IBEP1 | - | + | + | ND | LB | IBEP | DMEM | IDC | L | 10 |
IBEP3 | - | + | + | ND | LB | IBEP | DMEM | IDC | L | 10 |
MDAMB330 | + | - | + | WT | LB | MDA | RPMI | ILC | L | 1, 6, 10 |
MDAMB361 | + | +/- | + | WT | LB | MDA | RPMI, DMEM | AC | L | 1, 5, 6, 8, 10 |
UACC812 | + | +/- | + | WT | LB | UACC | RPMI, DMEM | IDC | L | 1, 5, 6, 8, 10 |
ZR7527 | + | - | + | WT | LB | ZR75 | RPMI | IDC | L | 60 |
ZR7530 | + | - | + | WT | LB | ZR75 | RPMI | IDC | L | 1, 5, 6, 8, 10 |
21MT1 | - | +/- | + | ND | H | 21 | α-MEM/DFC1 | IDC | L | 15 |
21MT2 | - | +/- | + | ND | H | 21 | α-MEM/DFC1 | IDC | L | 15 |
21NT | - | +/- | + | ND | H | 21 | α-MEM/DFC1 | IDC | L | 15 |
21PT | - | +/- | + | ND | H | 21 | α-MEM/DFC1 | IDC | L | 15 |
AU565 | - | - | + | WT | H | NA | RPMI | AC | L | 5, 10, 58 |
HCC1008 | - | - | + | ND | H | HCC | RPMI | IDC | L | 12 |
HCC1569 | - | - | + | WT | H | HCC | RPMI | MC | L | 1, 5, 8 |
HCC1954 | - | - | + | WT | H | HCC | RPMI | DC | L | 1, 5, 8 |
HCC202 | - | - | + | WT | H | HCC | RPMI | DC | L | 1, 5, 8 |
HCC2218 | - | - | + | ND | H | HCC | RPMI | DC | L | 8 |
HH315 | - | - | + | ND | H | HH | RPMI | C | L | 10, 59 |
HH375 | - | - | + | ND | H | HH | RPMI | C | L | 10, 59 |
KPL-4 | - | - | + | WT | H | KPL | DMEM | IDC | L | 10, 62 |
MDAMB453 | - | - | + | WT | H | MDA | RPMI, DMEM | AC | L | 1, 5, 6, 8, 10 |
OCUB-F | - | - | + | WT | H | NA | RPMI | NA | L | 1, 6 |
SKBR3 | - | - | + | WT | H | SKBR | RPMI, McCoys | AC | L | 1, 5, 6, 8, 10 |
SKBR5 | - | - | + | WT | H | SKBR | RPMI | AC | L | 1, 6 |
SUM190PT | - | - | + | WT | H | SUM | Ham's F12 | Inf | L | 1, 5, 6, 8 |
SUM225CWN | - | - | + | WT | H | SUM | Ham's F12 | IDC | L | 1, 5, 6 |
UACC893 | - | - | + | WT | H | UACC | RPMI | IDC | L | 1, 6, 8, 10 |
BT20 | - | - | - | WT | TNA | BT | RPMI, DMEM | IDC | L | 1, 5, 6, 8, 10 |
CAL148 | - | - | - | WT | TNA | CAL | DMEM | AC | D | 71 |
DU4475 | - | - | - | WT | TNA | NA | RPMI | IDC | D | 1, 6, 10 |
EMG3 | - | - | - | ND | TNA | NA | DMEM | IDC | L | 25, 48, 57 |
HCC1143 | - | - | - | ND | TNA | HCC | RPMI | DC | L | 1, 5, 8 |
HCC1187 | - | - | - | ND | TNA | HCC | RPMI | DC | L | 5, 8 |
HCC1599 | - | - | - | ND | TNA | HCC | RPMI | DC | L | 8 |
HCC1806 | - | - | - | ND | TNA | HCC | RPMI | SqC | L | 1, 8 |
HCC1937 | - | - | - | MU | TNA | HCC | RPMI | DC | L | 1, 5, 6, 8 |
HCC2157 | - | - | - | ND | TNA | HCC | RPMI | DC | L | 5, 8 |
HCC3153 | - | - | - | MU | TNA | HCC | RPMI | DC | L | 5, 8 |
HCC70 | - | - | - | WT | TNA | HCC | RPMI | DC | L | 1, 5, 8 |
HMT3522 | - | - | - | WT | TNA | HMT | DMEM, F12 | B | L | 49 |
KPL-3C | - | - | - | ND | TNA | KPL | RPMI | IDC | D | 10, 36 |
MA11 | - | - | - | ND | TNA | NA | DMEM | ILC | D | 10, 37, 63 |
MDAMB435 | - | - | - | WT | TNA | MDA | DMEM | AC | L | 1, 5, 6, 10 |
MDAMB436 | - | - | - | MU | TNA | MDA | RPMI, L15 | AC | L | 1, 5, 6, 8, 10 |
MDAMB468 | - | - | - | WT | TNA | MDA | RPMI, L15 | AC | L | 1, 5, 6, 8, 10 |
MFM223 | - | - | - | WT | TNA | NA | MEM | C | D | 71 |
SUM185PE | - | - | - | WT | TNA | SUM | Ham's F12 | DC | L | 1, 5, 6 |
SUM229PE | - | - | - | WT | TNA | SUM | RPMI | DC | L | 1, 6 |
BT549 | - | - | - | WT | TNB | BT | RPMI | IDC | L | 1, 5, 6, 8, 10 |
CAL120 | - | - | - | WT | TNB | CAL | DMEM | AC | D | 71 |
CAL51 | - | - | - | WT | TNB | CAL | DMEM | AC | D | 71 |
CAL851 | - | - | - | WT | TNB | CAL | DMEM | AC | D | 71 |
HCC1395 | - | - | - | ND | TNB | HCC | RPMI | DC | L | 1, 8 |
HCC1739 | - | - | - | ND | TNB | HCC | RPMI | DC | D | 12 |
HCC38 | - | - | - | ND | TNB | HCC | RPMI | DC | L | 1, 5, 8 |
HDQ-P1 | - | - | - | MU | TNB | NA | DMEM | IDC | D | 71 |
Hs578T | - | - | - | WT | TNB | NA | RPMI, DMEM | IDC | L | 1, 5, 6, 8, 10 |
MDAMB157 | - | - | - | WT | TNB | MDA | RPMI, DMEM | MC | L | 1, 5, 6, 8, 10 |
MDAMB231 | - | - | - | WT | TNB | MDA | RPMI, DMEM | AC | L | 1, 5, 6, 8, 10 |
SKBR7 | - | - | - | WT | TNB | SKBR | RPMI | AC | L | 1, 6 |
SUM102PT | - | - | - | WT | TNB | SUM | Ham's F12 | IDC | L | 1, 6, 8 |
SUM1315M02 | - | - | - | MU | TNB | SUM | Ham's F12 | IDC | L | 1, 5, 6 |
SUM149PT | - | - | - | MU | TNB | SUM | Ham's F12 | InfDC | L | 1, 5, 6, 8 |
SUM159PT | - | - | - | WT | TNB | SUM | Ham's F12 | AnC | L | 1, 5, 6 |
The status on ER, PR and HER2 are the primary features used for breast cancer cell line subtyping 1, 5, 6, 8, 10, 12, 15, 19, 25, 36, 37, 48, 49, 57-63. BRCA1 mutation status is annotated for the ease of sporadic core basal tumor modeling, where MU represents cell lines with 'BRCA1 mutation', WT means 'wild type', and ND is short for 'not decided' 4, 6, 64, 65. We categorize these cell lines into luminal A (LA), luminal B (LB), HER2 positive (H), Triple negative A (TNA) and Triple negative B (TNB), according to literature available subtyping suggestions (L) or derived information from the original article where they are established (D). Such information are provided in the 'Subtype' and 'Source' columns, respectively, with references listed in column 'References'. The 'Series' column shows which cell line series each cell line comes from which are conventionally given by the establisher and represents, e. g. , the place where the cell line is created 12, 14, 15, 49, 60, 66-70. 'Basic Medium' shows the common or suggested medium for the cultivation of each cell line, which may need the addition of other elements and does not eliminate the feasibility of other medium 1, 5, 8, 15, 66. The 'Tumor' column shows the clinical property of the tumor where the cell lines are derived, with the following symbolic representations, i. e. , AC is 'adenocarcinoma', AnC is 'anaplastic carcinoma', B is 'benign tumor', C is 'carcinoma', CS is 'carcinosarcoma', DC is 'ductal carcinoma', IDC is 'invasive ductal carcinoma', ILC is 'invasive lobular carcinoma', InfC is 'inflammatory carcinoma', InfDC is 'inflammatory ductal carcinoma', MC is 'medullary carcinoma', SqC is 'squamous carcinoma', and NA means such information is not available 5, 8, 10, 15.
Though, in most studies, luminal cell lines are not further differentiated into subtypes A and B according to their HER2 status as tissue samples, a few reports have classified ER+HER2+ cell lines, such as MDAMB330, as the luminal-ERBB2+ subtype 1, 6. However, the majority of luminal-ERBB2+ cell lines including, e. g. , HCC202, SKBR3, UACC893, SUM190PT, SUM225CWN, are ER-HER2+ at the molecular level, corresponding to HER2 positive tumor patients. While some of these cell lines are referred to as luminal (HCC202, SKBR3, UACC893) 1, 2, 5-8, 12, 19, some are grouped as basal or basal A (SUM190PT, SUM225CWN) 1, 5-7. HER2+ cell lines are given a diverse spectrum of names despite their considerable overlaps, e. g. , luminal (AU565, HCC2218, OCUB-F, SKBR5, HCC202, SKBR3, UACC893) 1, 5, 8, luminal-HRBB2+ (HCC202, SKBR3, UACC893, SUM190PT, SUM225CWN) 1, ER-negative-ERBB2+ (HCC1008, HCC1569, HCC1954) 1, and basal A (SUM190PT, SUM225CWN, HCC1008, HCC1569, HCC1954) 5, 8. Cell lines with triple negative status of ER, PR and HER2 are differentiated as basal A and basal B cell lines in most literatures, with basal A being more luminal-like and basal B more basal-like. For example, while MDAMB468 is classified as basal A in 5, 8, it is described as weakly luminal in 2; Hs578T, MDAMB231 are grouped as basal B in 5, 8 but characterized as claudin-low in 20 or even mesenchymal-like in 2, 7. We mapped the cell line subtyping nomenclatures from different literatures 1, 4, 5, 7, 21 to the most commonly used breast cancer classification structure (Figure (Figure1),1), and summarized the molecular features and morphological characteristics of each subtype (Table (Table2)2) with details provided below.
Table 2
Cell line subtype | mRNA | miRNA 1 | Protein | Morphology | ||||
---|---|---|---|---|---|---|---|---|
Luminal | ER, GATA3, KRT19/KRT8/KRT18, XBP1, PBX1, ZNF278, SPDEF, CRABP2, MUC1, FOXA1, MYB, RET, EGR3, TFF1, HER3, TOB1, TFF31, 5, 7, 8 | hsa-miR-501-5p hsa-miR-202 hsa-miR-760 hsa-miR-626 | ER, GATA3, KRT19 7 | More differentiated; tight cell-cell junctions 5 | ||||
HER2 positive | HER2, GRB7, PERLD1, STARD3, C17ORF377 | hsa-let-7b hsa-miR-640 hsa-miR-200c hsa-miR-378 hsa-miR-141 hsa-miR-196a hsa-miR-29c hsa-miR-18a* | HER2 | Breakdown of cell-cell junctions 10 | ||||
Triple negative | Triple negative A (Basal A) | EGFR CAV1/2 MSN ETS1 5, 7, 8 | KRT4/5/6A/6B/13/14/15/16/17, ITGA6, ITGB4/6, LAMB3, LAMC2, TRIM29, S100A2, SLPI, LYN, ANXA8, COL17A1, BNC1, MET, CD133, GABRK, VTCN1, BST2, FABP7, CD10/14/58/59, 1, 7, 8 | hsa-miR-492 hsa-miR-26b hsa-miR-617 hsa-miR-155 | EGFR CAV1/2 MSN ETS1 7 | KRT5/6 CD10 MET 7 | Core basal-like | |
Triple negative B (Basal B) | VIM, SPARC, FN1, FBN1, HAS2, PRG1, COL3A1, COL6A1/2/3, COL8A1, MMP2/14, TIMP1, CTSC, PLAU, PLAUR, AXL, PLAT, CD24(-), CD44, TGFBR2, SERPINE1/2, TGFB15, 7, 8 | hsa-miR-22 hsa-miR-532-3p hsa-miR-125b hsa-miR-501-5p hsa-miR-155* | CD44 7 | Least differentiated and most stemness; more mesenchymal-like appearance and highly invasive 5 | ||||
Luminal breast cancer cell lines
Luminal breast cancer cell lines are characterized by positive ER and/or PR expression, despite the existence of a few exceptional cases, such as IBEP-1 and IBEP-3 22, where PR positivity drives their luminal phenotype. This type of cell lines exhibits high expression of a panel of luminal feature associated genes/proteins including, e.g., ESR1 (ERα or ER), luminal keratins (KRT8/18/19), and transcription factors such as GATA3 and FOXA1. A systematic study on miRNA expression profiling has unveiled distinctive over-expression of has-miR-501-5p, has-miR-202, has-miR-760 and has-miR-626 in luminal cell lines 1. Luminal cell lines are comparably more differentiated and have less propensity for migration due to tight cell-cell junctions, consistent with that at the tumor level.
Though the majority of the studies do not further stratify luminal cell lines into luminal A and B subtypes according to their HER2 status, we embrace such a differentiation not only for the sake of achieving consistent categorization with tumor subtyping to facilitate easy tumor modelling but also satisfying the requirement of drug response assays hinging on ER and HER2 status. For example, a study using BT474 (ER+HER2+) has revealed the synergistic advantage of tamoxifen and Herceptin in the treatment of breast tumors 23, and MCF7 (ER-HER2) has been conventionally used for testing tamoxifen-induced cell response 24. Luminal B cell lines are, in principle, more invasive and consequently more aggressive than luminal A cells, as HER2 over-expression is shown to be associated with ER down-regulation 2. Thus, luminal B cells participate in the phenotype attenuation of luminal A cells, and could be more appropriately described as weakly luminal 2. Lots of information for luminal B tumors has been successfully deciphered using cell lines of this subtype with translational potential at clinics 25-28. For example, the expression of quiescin-sulfhydryl oxidase 1 is reported to be associated with a poor prognosis in luminal B tumors using a ER+HER2+ cell line, BT474 29, suggesting the usefulness and importance of separating luminal B cell lines from the luminal subtype.
HER2 positive breast cancer cell lines
HER2 positive cell lines, as featured by ER negativity and HER2 positivity, share the same over-represented genomic profile on chromosomal region 17q12 encompassing genes including HER2, GRB7, PERLD1, STARD3 and C17ORF37 30. Micro RNAs including has-let-7b, has-miR-640, has-miR-200c, has-miR-378, has-miR-141, has-miR-196a, has-miR-29c and has-miR-18a* are overtly expressed in these cell lines 1.
HER2 positive cell lines bridge the gap between luminal and basal cell lines, and are heterogeneous encompassing both luminal and basal features. They are grouped as luminal-ERBB2+ and ER-negative-ERBB2+ in 1 according to the expression of luminal and basal markers besides ER and HER2 expression. Consistent with their molecular features, cells of this subtype are more aggressive with respect to cell migration as compared with luminal cells, since HER2 over-expression is associated with the breakdown of cell-cell junctions 2; and more responsive to certain drugs and is an excellent model for the examination of Herceptin response, as a significant correlation is revealed between the molecular signature and biological response by Pearson's correlation test 5. Proteins whose expression are strongly correlated with such a drug response include increased levels of ESR1, MAPK1/3, MEK, TYK2, FASN, and GRB7, which are mostly associated with cell proliferation, and up-regulated expression of SFN, CAV2, GRB2, RB1 and FLNA indicates a drug resistance 5. Thus, by studying cell lines over-represented by HER2 expression, MAPK signaling is shown to predict Herceptin response, and mTOR pathway, Toll-like receptor pathway, N-glycan biosynthesis as well as inositol-phosphate signaling are associated with Herceptin resistance 5.
Triple negative breast cancer cell lines
Triple negative cell lines, as represented by the name, are featured by low or no expression of all three markers, i.e., ER-PR-HER2-. It is the most heterogeneous among all subtypes and referred to as basal A and basal B cell lines in many literatures. Triple negative A (basal A) lines are called basal-like as they are enriched with basal markers including cytokeratins (KRT4/5/6A/6B/13/14/15/16/17), integrins (ITGA6, ITGB4/6), LAMB3, LAMC2, TRIM29, S100A2, SLPI, ANXA8, COL17A1, BNC1, CD10/14/58/59, MET, LYN, CD133, GABRK, VTCN1, BST2, FABP7 1, 7, 8, and resemble the core basal tumor subtype. Triple negative B (basal B) lines, designated the mesenchymal cluster or normal-like/claudin-low, over-express genes associated with tumor invasive and aggressive features such as VIM, MSN, PLAT, TGFB1, TGFBR2, AXL, COL3A1, COL6A1/2/3, COL8A1, MMP2/14, TIMP1, CTSC, PLAU, PLAUR, SERPINE1/2, SPARC, FN1, FBN1, HAS2, PRG1 5, 7, 8; and cancer stemness such as CD44(+) and CD24(-) 5. Collagens (COL3A1, COL6A1/2/3, COL8A1), proteases (MMP2/14, TIMP1, CTSC, PLAU, PLAUR, SERPINE1/2, PLAT), and proteins stabilizing cytoskeletal interactions (VIM, MSN) are important players for extracellular matrix remodelling required for cell migration, and signalling factors (TGFB1, TGFBR2, AXL) are crucial in mediating such aggressive morphology. A selected panel of proteins (EGFR, CAV1/2, MSN, ETS1) are used for triple negative cell line characterization at the translational level as a whole, which characterize triple negative A cells when combined with basal keratins (KRT5/6), CD10, and MET, and identify triple negative B cells when jointly assessed with the stemness marker CD44 7. Distinct microRNAs are detected for triple negative A and B cells. In particular, hsa-miR-492, has-miR-26b, has-miR-617, has-miT-155 are overtly expressed in the triple negative A subtype, and hsa-miR-22, hsa-532-3p, hsa-miR-125b, hsa-miR-501-5p, hsa-miR-155* pop up in the triple negative B subtype 23. Interestingly, hsa-miR-155 (triple negative A) and hsa-miR-155* (triple negative B) are derived from the same precursor, but show an opposing expression pattern in these two triple negative subtypes 1. Phenotypically, triple negative A cells, as a comparatively more differentiated subtype within triple negative cell lines, may have either luminal-like or basal-like morphologies, whereas triple negative B cells has a more mesenchymal-like appearance and are more likely invasive. Thereby, triple negative A lines mostly resemble the core basal tumor subtype as featured by basal markers, and triple negative B cells could be used for claudin-low or metaplastic breast cancer modelling given their enrichment in epithelial mesenchymal transition (EMT) and stem-cell markers 31.
BRCA1 encodes a protein forming the Rap80/Abraxas/Brca1/Brcc36 complex in response to DNA damage 32. Mutation of this gene predisposes hereditary breast cancer, which also resembles sporadic core basal tumors 33-35. Triple negative A lines are characterized by BRCA1 signatures 8, with most of the currently available BRCA1 mutated commercial breast cancer cell lines (HCC1937, MDAMB436, SUM149PT, HCC3153) belonging to this subtype except for SUM1315MO2 (Table (Table11).
There are ten triple negative cell lines, DU4475, KPL-3C, MA11, HCC1739, CAL148, MFM223, CAL120, CAL51, CAL851, HDQ-P1, lacking direct information for further discrimination. We infer DU4475, KPL-3C, MA11, CAL148, MFM223 as triple negative A, and the rest as triple negative B based on the relevant literatures 5, 6, 12, 20, 36, 37, 39-41. DU4475 cells carry a mutation in the gene MAP2K4 whose alteration is characteristic of luminal features 6, agreeing with the fact that triple negative A cells may have either luminal-like or basal-like morphologies 5. Immunohistochemical studies reveal the expression of keratins but not vimentin in KPL-3C 36, fulfilling our classification criteria of triple negative A cells (Table (Table2).2). MA11 is characterized by MUC1 secretion 37, whose expression occurs at a high frequency in early stage basal A triple negative breast cancer 20. CAL148 and MFM223 are classified as the triple negative A cell line given their negative expression on vimentin 39. CC1739 is a poorly differentiated cell line 12, satisfying the stemness feature of triple negative B cells. CAL120, CAL51 40, CAL851 and HDQ-P1 41 are positive on vimentin expression, falling into the triple negative B group. Our categorization of these cell lines is in agreement with other studies where different classification schemes were used. For example, Lehmann et al. categorized CAL148 and MFM223 to the LAR subtype which are heavily enriched with hormonally regulated pathways 9, suggesting their close relationship with hormone responsive cell lines and corroborating our classification of them as triple negative A than triple negative B cells.
Breast cancer cell lines are feasible models of breast cancer
The occurrence of DNA alterations was higher in breast cancer cell lines than in tumors. On average, alterations in cell lines are about two-fold more frequent than those in tumors 2, 21. This might be because that breast cancer cell lines are mostly originated from invasive high-grade tumors, which are more easily to accumulate genomic aberrations during in vitro cultivation. Breast cancer cell lines may exhibit DNA alterations not detected in tumors, which may result from the sequential cultivation or reflect low frequency alteration in tumors that is hard to observe. Such newly gained mutations may cause phenotypical changes to cells. For example, MCF7 cell lines with variable sensitivities to tamoxifen are reported 42. However, CGH analyses suggest that breast cancer cell lines are representative of tumors with regard to major DNA alterations 21. A comprehensive study comparing the HCC series of cell lines with their original tumors reveals that almost all aberrations (except one) presented in a tumor also occur in the corresponding cell line 43. This has been confirmed by another study comparing allelic losses at 18 chromosomal regions frequently deleted in breast tumors using 51 polymorphic micro-satellite markers 20. Also, it seems that DNA alterations affecting specific genes under our interest remain highly consistent between cell lines and tissues. For example, a 75% concordance was seen on the mutation status in exons 5 to 10 of TP53 gene 44. Point mutations identified in tumors were often found in at least one or some cell lines and vice versa 2. At the chromosomal level, breast cancer cell lines, though, have a higher copy number changes, share a similar pattern regarding gains and losses with tumors 2, 21.
Epigenetic alterations are heritable modifications of gene expression that do not involve mutation, which are similar between breast cancer cell lines and tumors. A study has revealed that the CpG island methylation pattern is very close between cell lines and tumors. In their study, most of the genes harboring CpG island hypermethylation in the promoter region are present in both breast cancer cell lines and tumors such as ER, PR, HIC1, APC, ARHI, ASC, BRCA1, CCND2, CDH1, CDH13, CDKN2A, FABP3, FHIT, GIB2, GPC3, GSTP1, HOXA5, HSHIN1, KLK10, NME1, PRDM2, PRKCDBP, RARB, RASSF1, SFN, SYK, TFF1, TIMP3 and WT1 45; and quite a few cases show inconsistency between cell lines and tumors, e. g. , IL6, GSN, PLAU, PRSS8, SLC19A1, SNCG are hypermethylated in tumors but not in cell lines and the hypermethylation of PLAGL1, TGFB3 is observed in cell lines but not tumors 45.
At the molecular level, discriminative markers and associated phenotypic traits observed in breast cancer cell lines are also frequently discriminative features in tumors. Genes correlated to the ER+ or ER- phenotype in breast cancer cell lines are positively or negatively correlated to ER expression in tumors. The mRNA variants of ER containing precise truncations in various exons have been identified in both cell lines and tumors 46.
Breast cancer cell lines are crude models of breast cancer
Despite the considerable roles played by breast cancer cell lines in deciphering the mechanisms underlying tumor initiation and evolution, cell lines are still viewed as crude models of tumors and could not capture all tumor features and heterogeneity.
The clonal population comprising of any single cell line could not well capture the heterogeneity of breast tumors at the intra-tumoral level. Carcinogenesis is a sequential process through many clinical and pathological stages including a typical hyperproliferation, local invasion, invasive carcinomas and ultimately metastatic disease, and is believed to be accompanied with a sequential acquisition of various genetic and epigenetic alterations in a single cell followed by clonal selection and expansion. The establishment of cell lines, on the other hand, may eliminate some types of tumor cells initially present in the tissue samples under the environmental stress created by the cultivation condition. For instance, cells unable to grow properly on plastic or need specific factors secreted by tumor microenvironment may be eliminated during the cultivation process. Thereby, how well a permanent breast cancer cell line could model the tumor intra-heterogeneity becomes a questionable issue.
Success in long-term propagation has been a limiting factor during the establishment of breast cancer cell lines, resulting in less commercially available cell lines than needed to cover the inter-tumoral cancer heterogeneity. Take triple negative breast cancer as an example, it encompasses at least four subtypes, i.e., core basal, claudin-low, metaplastic breast cancer, and interferon-rich, each with distinct molecular features and clinical associations. However, we only have triple negative A and B cell lines. While type A may be well representative of the core basal tumor subtype, and type B is useful for the modelling of claudin-low and/or metaplastic subtype (further differentiation among type B lines is needed), we are left with no suitable cell line for interferon-rich tumor modelling according to their molecular features. Also and importantly, very few breast cancer cell lines (e. g. , MCF7, T-47D, MDAMB231), despite the total number of established ones, have been frequently used for research purpose for the sake of, e. g. , cultivation easiness, rendering the transportability of results obtained from such limited number of cell lines to the evolving tumors even more questionable.
Given the technical difficulties of extracting viable tumor cells from their surrounding stroma, most breast cancer cell lines are originated from invasive carcinoma (Table (Table1),1), raising the questions as to how they are representative of the primary tumors. Also, we could not exclude the possibility that certain breast cancer cell line, either from origin or due to cultivative selection, are not representative of breast cancer cell at all. For example, MDAMB435, previously identified as a breast cancer cell line, has been suggested as being originated from an occult melanoma 47.
Also important is how much tumor microenvironment affects cell signaling, and how much these changes alter cell molecular features and affect the experimental results. A study examining the influences of fibroblasts on cell line morphological changes using EMG3 reveals that fibroblasts could stimulate the expression of luminal keratins in basal cells and/or basal keratins in luminal cells 48. Also reported is that cells developed in an environment with high/low EGFR activity tend to become ER negative/positive 49, suggestive of the selective power imposed by microenvironment on cells. This makes experiments convolving tumor microenvironment difficult to be modeled using cell lines.
Another concern regarding the usage of breast cancer cell lines in tumor modelling is that the same cell line, once cultured in different labs and/or under different conditions, may evolve into distinct populations. We do observe different categorizations of the same cell line into distinct groups according to their varied molecular or morphological descriptions annotated by different laboratories. Take the status of ER and HER2 for instance, HCC1007 is ER+HER2+ in 5, ER+HER2+ in 8, and ER-HER2+ in 12; HCC1419 is ER+HER2+ in 1 but ER-HER2+ in 8, 12; HCC1500 is ER+HER2- in 1, 8, 12 and ER-HER2- in 5; HCC2185 is ER-HER2- in 5, 8 and ER-HER2+ in 12; SUM52PE and SUM44PE are ER+HER2+ in 8 but ER+HER2- in 1, 5, 6; EVSA-T is ER-HER2+ in 6 but ER-HER2- in 1; and MPE600 is ER+HER2- in 5 but ER+HER2+ in 1, 6 (Table (Table33).
Table 3
Cell lines | ER | PR | HER2 | Subtype | Reference |
---|---|---|---|---|---|
HCC1007 | + | - | - | LA | 5 |
HCC1007 | + | - | + | LB | 8 |
HCC1007 | - | - | + | H | 12 |
HCC1419 | + | - | + | LB | 1 |
HCC1419 | - | - | + | H | 8, 12 |
HCC1500 | + | + | - | LA | 1, 8, 12 |
HCC1500 | - | - | - | TNB | 5 |
HCC2185 | - | - | - | TNA | 5, 8 |
HCC2185 | - | - | + | H | 12 |
SUM52PE | + | - | + | LB | 8 |
SUM52PE | + | - | - | LA | 1, 5, 6 |
SUM44PE | + | + | + | LB | 8 |
SUM44PE | + | +/- | - | LA | 1, 5, 6 |
EVSA-T | - | - | + | H | 6 |
EVSA-T | - | + | - | LA | 1 |
MPE600 | + | - | - | LA | 5 |
MPE600 | + | - | + | LB | 1, 6 |
We categorise these cell lines into luminal A (LA), luminal B (LB), HER2 positive (H), Triple negative A (TNA) and Triple negative B (TNB) according to the status of ER, PR and HER2. References where such information is derived are provided in the 'Reference' column.
This renders not only studies hinging on cell lines difficult to be compared with but also our understandings towards these cell lines complicated and confusing at the first hand.
Discussion
By summarizing and comparing the molecular features of breast cancer cell lines currently available to our knowledge with tumors, genetically and epigenetically, we categorize them into five subtypes, i. e. , luminal A, luminal B, HER2 positive, triple negative A and triple negative B. Unlike at the tissue level where luminal tumors dominate (63% 50), cell lines of this subtype take the least percentage (31%) among all, i. e. , 17 are luminal A, 9 are luminal B, 20 HER2 positive, 21 triple negative A, and 17 triple negative B among the lines under study. This is due to the fact that most lines are derived from invasive carcinoma for the sake of easy cultivation, toppling the uneven distribution encountered at the tumor level.
In our nomenclature, luminal A and B cell lines are differentiated from each other and HER2 positive lines are identified as a single subtype to facilitate studies on tissue subtyping and drug response experiments targeting ER and/or HER2 using cell lines. Triple negative lines are identified as two separate groups, which correspond to basal A and B as referred to in most literatures. While triple negative lines share a panel of feature markers such as EGFR which has been used as a conventional marker characterizing triple negative tumors in many studies 51-56, each has its own distinct molecular and phenotypical properties. Triple negative A is characterized by the expression of basal keratins (KRT4/5/6/13/14/15/16/17) which resembles the core basal tumors, and triple negative B is featured by the cancer stem cell pattern such as CD44+CD24- and migration markers such as VIM which could be used for modeling of claudin-low and/or metaplastic breast cancers. Among the collected 92 cell lines here, 74 are unanimously labeled with one subtype as suggested by existing literatures, 10 (DU4475, KPL-3C, MA11, HCC1739, MFM223, CAL148, CAL120, CAL51, CAL851, HDQ-P1) are categorized by their available information based on our subtyping criteria (Table (Table1),1), 8 (Table (Table3)3) have ambiguous and inconsistent categorization among different studies which may have evolved into different cell lines in different labs after serial cultivation, and one (MDAMB435) is found not of breast cancer origin.
Breast cancer cell lines, though having a higher frequency of DNA and/or copy number alterations, share a similar major genetic mutational spectrum and chromosome aberration profiles with tumors, allowing them still feasible for tumor modelling. However, cell lines are crude models for cancer research as they could not well capture the intra- and inter- tumor heterogeneities, and how well they could represent primary tumors and how much they evolve from their initial establishment are still questionable especially for a few cell lines whose characterization on the primary markers are inconsistent across studies. Thereby, it is encouraged to use breast cancer cell lines for initial tumor modelling but the results need to be further validated using tumor samples or mouse models. Also, cell line selection becomes critical, which needs to be representative and the relevant and/or determinant markers need to be pre-tested.
Cultivation medium is crucial for the success of cell line establishment, regarding how well cells maintain their molecular and physiological features in tumor tissue. Currently widely applied medium include RPMI, DMEM, /αMEM , Ham's F12, L15, McCoys etc. , with RPMI, DMEM, αMEM, Ham's F12 providing a full coverage of all the cell lines in Table Table1.1. The amino acid concentration of these medium, ranging from the most enriched to the least are αMEM, DMEM, RPMI and Ham's F12. The observation that DMEM is rather frequently used for the cultivation of luminal cells, Ham's F12 is many times used in triple negative cells and occasionally in HER2 positive cells suggest that the more aggressive and invasive tumor cell lines are the fewer nutrition is needed in the medium.
Concluding Remarks
By summarizing and going through all the molecular features of currently available cell lines, we grouped them using a subtyping system compatible with that in tumors, and clarified some ambiguous information regarding cell molecular and morphological features. This helps us in the proper choice of and widens the selection spectrum of cell lines when conducting relevant studies, which is critical in the guarantee of final successof translating cell line based results to clinics.
Systematic analysis on the molecular differences between cell lines and tumors reveal that breast cell lines, on average, harbour more genetic mutations and approximately the same amount of epigenetic aberrations than the tumors they derive. The key molecular features of cell lines, however, remain the same with tissue tumors, except for a few exceptional cases whose primary markers have evolved into multiple versions. Also, despite the more frequent chromosomal alterations occurred in cell lines, their genomic profiles stay invariant with the tumors. These empower breast cancer cell lines feasible models for tumors of the same subtype. However, they are not accurate models to capture the full heterogeneity of tumors, both at a single line level or the whole cell population. Also, how much they are evolved from the corresponding primary tumors, either due to technical ease at the establishment or during serial cultivation, remain to be elucidated. In addition, without interactions with the microenvironment, whether or not cell lines could reproduce the signaling in situ worth serious considerations before cell lines are chosen for the experiments.
It is worth noting that, our claim that 'cell lines are feasible models for tumors' refers to tumors of the same subtype but not the tumor deriving the cell line. Some cell lines, after passage selection under, e. g. environmental stress, may alter their transcriptional profiles and exhibit the features of another subtype. For instance, EMG3, a triple negative A cell line showing KRT5/14 positivity in the immunocytochemical analysis, is originated from tumors characteristic of luminal markers such as KRT18/19 57.
Triple negative breast cancer, though, do not dominate the patient cases, are very heterogeneous, encompassing at least four subcategories. However, we only have two subgroups within cell lines of this category, leaving no cell line being appropriate for interferon-rich tumor modelling. As a subtype with approximately 10% prevalence among tumors 31, it is suggested to establish triple negative cell lines modeling such tumors which represent a 3rd triple negative subtype and enrich the heterogeneity of current cell line modality.
Acknowledgments
This study is supported by the National Natural Science Foundation of China (Grant No. 31471251), Natural Science Foundation of Jiangsu Province (Grant No. BK20161130), the Six Talent Peaks Project in Jiangsu Province (Grant No. SWYY-128), and the Fundamental Research Funds for the Central Universities (JUSRP11507).
References
Articles from Journal of Cancer are provided here courtesy of Ivyspring International Publisher
Full text links
Read article at publisher's site: https://doi.org/10.7150/jca.18457
Read article for free, from open access legal sources, via Unpaywall: https://www.jcancer.org/v08p3131.pdf
Citations & impact
Impact metrics
Citations of article over time
Alternative metrics
Discover the attention surrounding your research
https://www.altmetric.com/details/137974706
Smart citations by scite.ai
Explore citation contexts and check if this article has been
supported or disputed.
https://scite.ai/reports/10.7150/jca.18457
Article citations
Single-cell chemoproteomics identifies metastatic activity signatures in breast cancer.
Sci Adv, 10(43):eadp2622, 23 Oct 2024
Cited by: 0 articles | PMID: 39441940 | PMCID: PMC11498211
Combination of magnetic hyperthermia and gene therapy for breast cancer.
Apoptosis, 19 Oct 2024
Cited by: 0 articles | PMID: 39427089
Targeted therapy approaches for epithelial-mesenchymal transition in triple negative breast cancer.
Front Oncol, 14:1431418, 10 Oct 2024
Cited by: 0 articles | PMID: 39450256 | PMCID: PMC11499239
Review Free full text in Europe PMC
HSP90 promotes tumor associated macrophage differentiation during triple-negative breast cancer progression.
Sci Rep, 14(1):22541, 29 Sep 2024
Cited by: 0 articles | PMID: 39341960 | PMCID: PMC11438890
A Novel Size-Based Centrifugal Microfluidic Design to Enrich and Magnetically Isolate Circulating Tumor Cells from Blood Cells through Biocompatible Magnetite-Arginine Nanoparticles.
Sensors (Basel), 24(18):6031, 18 Sep 2024
Cited by: 0 articles | PMID: 39338775 | PMCID: PMC11436177
Go to all (503) article citations
Similar Articles
To arrive at the top five similar articles we use a word-weighted algorithm to compare words from the Title and Abstract of each citation.
Molecular characterization of breast cancer cell lines through multiple omic approaches.
Breast Cancer Res, 19(1):65, 05 Jun 2017
Cited by: 49 articles | PMID: 28583138 | PMCID: PMC5460504
Breast cancer intrinsic subtype classification, clinical use and future trends.
Am J Cancer Res, 5(10):2929-2943, 15 Sep 2015
Cited by: 465 articles | PMID: 26693050 | PMCID: PMC4656721
Review Free full text in Europe PMC
Intra-tumor heterogeneity in breast cancer has limited impact on transcriptomic-based molecular profiling.
BMC Cancer, 17(1):802, 29 Nov 2017
Cited by: 7 articles | PMID: 29187174 | PMCID: PMC5708109
Simultaneous discovery of cancer subtypes and subtype features by molecular data integration.
Bioinformatics, 32(17):i445-i454, 01 Sep 2016
Cited by: 11 articles | PMID: 27587661