MIT Libraries homeMIT Libraries logoDSpace@MIT

MIT
View Item 
  • DSpace@MIT Home
  • MIT Open Access Articles
  • MIT Open Access Articles
  • View Item
  • DSpace@MIT Home
  • MIT Open Access Articles
  • MIT Open Access Articles
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

The GENCODE human gene set

Author(s)
Searle, S.; Frankish, A.; Bignell, A.; Aken, B.; Derrien, Thomas; Diekhans, M.; Harte, R.; Howald, C.; Kokocinski, F.; Lin, M.; Tress, M.; Van Baren, M.; Barnes, I.; Hunt, T.; Carvalho-Silva, D.; Davidson, C.; Donaldson, S.; Gilbert, J.; Kay, M.; Lloyd, D.; Loveland, J.; Mudge, J.; Snow, C.; Vamathevan, J.; Wilming, L.; Brent, M.; Gerstein, Mark B.; Guigo, Roderic; Kellis, Manolis; Reymond, A.; Zadissa, A.; Valencia, A.; Harrow, J.; Hubbard, T.; ... Show more Show less
Thumbnail
DownloadSearle-2010-The GENCODE human gene set.pdf (130.8Kb)
PUBLISHER_CC

Publisher with Creative Commons License

Creative Commons Attribution

Terms of use
Creative Commons Attribution http://creativecommons.org/licenses/by/2.0
Metadata
Show full item record
Abstract
The GENCODE consortium is a sub group of the ENCODE consortium. Its aim is to provide complete annotation of genes in the human genome including protein-coding loci, non-coding loci and pseudogenes, based on experimental evidence. The final aim is for the HAVANA team to manually annotate the complete genome. This is a time-consuming process which will be completed over the course of the ENCODE project. Currently, to provide a set of annotation covering the complete genome, rather than just the regions that have been manually annotated, a merge of manual annotation from HAVANA with automatic annotation from the Ensembl automatically annotated gene set is created. This process also adds unique full-length CDS predictions from the Ensembl protein coding set into manually annotated genes, to provide the most complete up to date annotation of the genome possible. Also included in the set are short and long ncRNA genes predicted by the Ensembl prediction pipelines and a consensus set of pseudogene predictions agreed between Havana, Yale and UCSC. The CCDS set is also fully represented within the GENCODE set. The GENCODE set is the default annotation available in Ensembl and is also available in the UCSC genome browser. All the annotation is tagged as to whether it is produced by manual annotation alone, automatic annotation alone, or by both approaches. We are currently working to provide confidence levels for annotation, based on depth and type of evidence supporting it.
Description
This article is part of the supplement: Beyond the Genome: The true gene count, human evolution and disease genomics, Boston, MA, USA. 11-13 October 2010.
Date issued
2010-10
URI
http://hdl.handle.net/1721.1/69840
Department
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory; Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Journal
Genome Biology
Publisher
BioMed Central Ltd.
Citation
Searle, S et al. “The GENCODE Human Gene Set.” Genome Biology 11.Suppl 1 (2010): P36.
Version: Final published version
ISSN
1465-6906
1474-7596

Collections
  • MIT Open Access Articles

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

Login

Statistics

OA StatisticsStatistics by CountryStatistics by Department
MIT Libraries homeMIT Libraries logo

Find us on

Twitter Facebook Instagram YouTube RSS

MIT Libraries navigation

SearchHours & locationsBorrow & requestResearch supportAbout us
PrivacyPermissionsAccessibility
MIT
Massachusetts Institute of Technology
Content created by the MIT Libraries, CC BY-NC unless otherwise noted. Notify us about copyright concerns.