Show simple item record

dc.contributor.authorAlexe, Gabriela
dc.contributor.authorScanfeld, Daniel
dc.contributor.authorTamayo, Pablo
dc.contributor.authorGanesan, Shridar
dc.contributor.authorDeLisi, Charles
dc.contributor.authorBhanot, Gyan
dc.contributor.authorDalgin, Gul S.
dc.contributor.authorMesirov, Jill P.
dc.date.accessioned2010-10-14T12:28:56Z
dc.date.available2010-10-14T12:28:56Z
dc.date.issued2007-08
dc.date.submitted2007-01
dc.identifier.issn1471-2105
dc.identifier.urihttp://hdl.handle.net/1721.1/59307
dc.description.abstractBackground: Clustering analysis of microarray data is often criticized for giving ambiguous results because of sensitivity to data perturbation or clustering techniques used. In this paper, we describe a new method based on principal component analysis and ensemble consensus clustering that avoids these problems. Results: We illustrate the method on a public microarray dataset from 36 breast cancer patients of whom 31 were diagnosed with at least two of three pathological stages of disease (atypical ductal hyperplasia (ADH), ductal carcinoma in situ (DCIS) and invasive ductal carcinoma (IDC). Our method identifies an optimum set of genes and divides the samples into stable clusters which correlate with clinical classification into Luminal, Basal-like and Her2+ subtypes. Our analysis reveals a hierarchical portrait of breast cancer progression and identifies genes and pathways for each stage, grade and subtype. An intriguing observation is that the disease phenotype is distinguishable in ADH and progresses along distinct pathways for each subtype. The genetic signature for disease heterogeneity across subtypes is greater than the heterogeneity of progression from DCIS to IDC within a subtype, suggesting that the disease subtypes have distinct progression pathways. Our method identifies six disease subtype and one normal clusters. The first split separates the normal samples from the cancer samples. Next, the cancer cluster splits into low grade (pathological grades 1 and 2) and high grade (pathological grades 2 and 3) while the normal cluster is unchanged. Further, the low grade cluster splits into two subclusters and the high grade cluster into four. The final six disease clusters are mapped into one Luminal A, three Luminal B, one Basal-like and one Her2+. Conclusion: We confirm that the cancer phenotype can be identified in early stage because the genes altered in this stage progressively alter further as the disease progresses through DCIS into IDC. We identify six subtypes of disease which have distinct genetic signatures and remain separated in the clustering hierarchy. Our findings suggest that the heterogeneity of disease across subtypes is higher than the heterogeneity of the disease progression within a subtype, indicating that the subtypes are in fact distinct diseases.en_US
dc.publisherBioMed Central Ltden_US
dc.relation.isversionofhttp://dx.doi.org/10.1186/1471-2105-8-291en_US
dc.rightsCreative Commons Attributionen_US
dc.rights.urihttp://creativecommons.org/licenses/by/2.0en_US
dc.sourceBioMed Central Ltden_US
dc.subjectEvaluation studiesen_US
dc.subjectAlgorithmsen_US
dc.subjectArtificial intelligenceen_US
dc.subjectBreast neoplasms, diagnosisen_US
dc.subjectBreast neoplasms, metabolismen_US
dc.subjectCarcinoma, ductal, diagnosisen_US
dc.subjectCarcinoma, ductal, metabolismen_US
dc.subjectDiagnosis, computer-assisted methodsen_US
dc.subjectDisease progressionen_US
dc.subjectFemaleen_US
dc.subjectGene expression profiling, methodsen_US
dc.subjectHumansen_US
dc.subjectNeoplasm proteins, analysisen_US
dc.subjectOligonucleotide array sequence analysis, methodsen_US
dc.subjectPattern recognition, automated methodsen_US
dc.subjectPrincipal component analysisen_US
dc.subjectReproducibility of resultsen_US
dc.subjectSensitivity and specificityen_US
dc.subjectNeoplasm proteinsen_US
dc.subjectTumor markers, biological, analysisen_US
dc.titlePortraits of breast cancer progressionen_US
dc.typeArticleen_US
dc.identifier.citationDalgin, Gul S., et al. (2007). Portraits of breast cancer progression. BMC bioinformatics 8:291/1-16.en_US
dc.contributor.departmentBroad Institute of MIT and Harvarden_US
dc.contributor.departmentKoch Institute for Integrative Cancer Research at MITen_US
dc.contributor.mitauthorAlexe, Gabriela
dc.contributor.mitauthorScanfeld, Daniel
dc.contributor.mitauthorTamayo, Pablo
dc.contributor.mitauthorMesirov, Jill P.
dc.relation.journalBMC Bioinformaticsen_US
dc.eprint.versionFinal published versionen_US
dc.identifier.pmid17683614
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2010-09-03T16:07:05Z
dc.language.rfc3066en
dc.rights.holderDalgin et al.; licensee BioMed Central Ltd.
dspace.orderedauthorsDalgin, Gul S; Alexe, Gabriela; Scanfeld, Daniel; Tamayo, Pablo; Mesirov, Jill P; Ganesan, Shridar; DeLisi, Charles; Bhanot, Gyanen
mit.licensePUBLISHER_CCen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record