Show simple item record

dc.contributor.authorTorrente, Aurora
dc.contributor.authorLukk, Margus
dc.contributor.authorXue, Vincent
dc.contributor.authorParkinson, Helen
dc.contributor.authorRung, Johan
dc.contributor.authorBrazma, Alvis
dc.date.accessioned2017-04-05T20:44:22Z
dc.date.available2017-04-05T20:44:22Z
dc.date.issued2016-06
dc.date.submitted2015-12
dc.identifier.issn1932-6203
dc.identifier.urihttp://hdl.handle.net/1721.1/107889
dc.description.abstractRapid accumulation and availability of gene expression datasets in public repositories have enabled large-scale meta-analyses of combined data. The richness of cross-experiment data has provided new biological insights, including identification of new cancer genes. In this study, we compiled a human gene expression dataset from ∼40,000 publicly available Affymetrix HG-U133Plus2 arrays. After strict quality control and data normalisation the data was quantified in an expression matrix of ∼20,000 genes and ∼28,000 samples. To enable different ways of sample grouping, existing annotations where subjected to systematic ontology assisted categorisation and manual curation. Groups like normal tissues, neoplasmic tissues, cell lines, homoeotic cells and incompletely differentiated cells were created. Unsupervised analysis of the data confirmed global structure of expression consistent with earlier analysis but with more details revealed due to increased resolution. A suitable mixed-effects linear model was used to further investigate gene expression in solid tissue tumours, and to compare these with the respective healthy solid tissues. The analysis identified 1,285 genes with systematic expression change in cancer. The list is significantly enriched with known cancer genes from large, public, peer-reviewed databases, whereas the remaining ones are proposed as new cancer gene candidates. The compiled dataset is publicly available in the ArrayExpress Archive. It contains the most diverse collection of biological samples, making it the largest systematically annotated gene expression dataset of its kind in the public domainen_US
dc.language.isoen_US
dc.publisherPublic Library of Scienceen_US
dc.relation.isversionofhttp://dx.doi.org/10.1371/journal.pone.0157484en_US
dc.rightsCreative Commons Attribution 4.0 International Licenseen_US
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en_US
dc.sourcePLOSen_US
dc.titleIdentification of Cancer Related Genes Using a Comprehensive Map of Human Gene Expressionen_US
dc.typeArticleen_US
dc.identifier.citationTorrente, Aurora et al. “Identification of Cancer Related Genes Using a Comprehensive Map of Human Gene Expression.” Ed. Paolo Provero. PLOS ONE 11.6 (2016): e0157484.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computational and Systems Biology Programen_US
dc.contributor.mitauthorXue, Vincent
dc.relation.journalPLOS ONEen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.orderedauthorsTorrente, Aurora; Lukk, Margus; Xue, Vincent; Parkinson, Helen; Rung, Johan; Brazma, Alvisen_US
dspace.embargo.termsNen_US
dc.identifier.orcidhttps://orcid.org/0000-0003-1199-7689
mit.licensePUBLISHER_CCen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record