Show simple item record

dc.contributor.authorKellis, Manolis
dc.contributor.authorLin, Michael F.
dc.date.accessioned2012-08-15T17:40:09Z
dc.date.available2012-08-15T17:40:09Z
dc.date.issued2009-04
dc.date.submitted2008-12
dc.identifier.issn1088-9051
dc.identifier.issn1088-9051
dc.identifier.urihttp://hdl.handle.net/1721.1/72151
dc.description.abstractEffective use of the human and mouse genomes requires reliable identification of genes and their products. Although multiple public resources provide annotation, different methods are used that can result in similar but not identical representation of genes, transcripts, and proteins. The collaborative consensus coding sequence (CCDS) project tracks identical protein annotations on the reference mouse and human genomes with a stable identifier (CCDS ID), and ensures that they are consistently represented on the NCBI, Ensembl, and UCSC Genome Browsers. Importantly, the project coordinates on manually reviewing inconsistent protein annotations between sites, as well as annotations for which new evidence suggests a revision is needed, to progressively converge on a complete protein-coding set for the human and mouse reference genomes, while maintaining a high standard of reliability and biological accuracy. To date, the project has identified 20,159 human and 17,707 mouse consensus coding regions from 17,052 human and 16,893 mouse genes. Three evaluation methods indicate that the entries in the CCDS set are highly likely to represent real proteins, more so than annotations from contributing groups not included in CCDS. The CCDS database thus centralizes the function of identifying well-supported, identically-annotated, protein-coding regions.en_US
dc.description.sponsorshipNational Human Genome Research Institute (U.S.) (Grant number 1U54HG004555-01)en_US
dc.description.sponsorshipWellcome Trust (London, England) (Grant number WT062023)en_US
dc.description.sponsorshipWellcome Trust (London, England) (Grant number WT077198)en_US
dc.language.isoen_US
dc.publisherCold Spring Harbor Laboratory Pressen_US
dc.relation.isversionofhttp://dx.doi.org/10.1101/gr.080531.108en_US
dc.rightsCreative Commons Attribution-NonCommercial 3.0 Unported Licenseen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc/3.0/en_US
dc.sourceGenome Researchen_US
dc.titleThe Consensus Coding Sequence (Ccds) Project: Identifying a Common Protein-Coding Gene Set for the Human and Mouse Genomesen_US
dc.typeArticleen_US
dc.identifier.citationPruitt, K. D. et al. “The Consensus Coding Sequence (CCDS) Project: Identifying a Common Protein-coding Gene Set for the Human and Mouse Genomes.” Genome Research 19.7 (2009): 1316–1323. Copyright © 2009 by Cold Spring Harbor Laboratory Pressen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.approverKellis, Manolis
dc.contributor.mitauthorKellis, Manolis
dc.contributor.mitauthorLin, Michael F.
dc.relation.journalGenome Researchen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.orderedauthorsPruitt, K. D.; Harrow, J.; Harte, R. A.; Wallin, C.; Diekhans, M.; Maglott, D. R.; Searle, S.; Farrell, C. M.; Loveland, J. E.; Ruef, B. J.; Hart, E.; Suner, M.-M.; Landrum, M. J.; Aken, B.; Ayling, S.; Baertsch, R.; Fernandez-Banet, J.; Cherry, J. L.; Curwen, V.; DiCuccio, M.; Kellis, M.; Lee, J.; Lin, M. F.; Schuster, M.; Shkeda, A.; Amid, C.; Brown, G.; Dukhanina, O.; Frankish, A.; Hart, J.; Maidak, B. L.; Mudge, J.; Murphy, M. R.; Murphy, T.; Rajan, J.; Rajput, B.; Riddick, L. D.; Snow, C.; Steward, C.; Webb, D.; Weber, J. A.; Wilming, L.; Wu, W.; Birney, E.; Haussler, D.; Hubbard, T.; Ostell, J.; Durbin, R.; Lipman, D.en
mit.licensePUBLISHER_CCen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record