dc.contributor.author | Gnerre, Sante | |
dc.contributor.author | MacCallum, Iain | |
dc.contributor.author | Przybylski, Dariusz | |
dc.contributor.author | Ribeiro, Felipe J. | |
dc.contributor.author | Burton, Joshua | |
dc.contributor.author | Walker, Bruce J. | |
dc.contributor.author | Sharpe, Ted | |
dc.contributor.author | Hall, Giles | |
dc.contributor.author | Shea, Terrance P. | |
dc.contributor.author | Sykes, Sean | |
dc.contributor.author | Berlin, Aaron M. | |
dc.contributor.author | Aird, Daniel | |
dc.contributor.author | Costello, Maura | |
dc.contributor.author | Daza, Riza | |
dc.contributor.author | Williams, Louise | |
dc.contributor.author | Nicol, Robert | |
dc.contributor.author | Gnirke, Andreas | |
dc.contributor.author | Nusbaum, Chad | |
dc.contributor.author | Jaffe, David B. | |
dc.contributor.author | Lander, Eric Steven | |
dc.date.accessioned | 2011-07-15T16:53:31Z | |
dc.date.available | 2011-07-15T16:53:31Z | |
dc.date.issued | 2010-12 | |
dc.date.submitted | 2010-10 | |
dc.identifier.issn | 0027-8424 | |
dc.identifier.issn | 1091-6490 | |
dc.identifier.uri | http://hdl.handle.net/1721.1/64820 | |
dc.description.abstract | Massively parallel DNA sequencing technologies are revolutionizing genomics by making it possible to generate billions of relatively short (~100-base) sequence reads at very low cost. Whereas such data can be readily used for a wide range of biomedical applications, it has proven difficult to use them to generate high-quality de novo genome assemblies of large, repeat-rich vertebrate genomes. To date, the genome assemblies generated from such data have fallen far short of those obtained with the older (but much more expensive) capillary-based sequencing approach. Here, we report the development of an algorithm for genome assembly, ALLPATHS-LG, and its application to massively parallel DNA sequence data from the human and mouse genomes, generated on the Illumina platform. The resulting draft genome assemblies have good accuracy, short-range contiguity, long-range connectivity, and coverage of the genome. In particular, the base accuracy is high (≥99.95%) and the scaffold sizes (N50 size = 11.5 Mb for human and 7.2 Mb for mouse) approach those obtained with capillary-based sequencing. The combination of improved sequencing technology and improved computational methods should now make it possible to increase dramatically the de novo sequencing of large genomes. The ALLPATHS-LG program is available at http://www.broadinstitute.org/science/programs/genome-biology/crd. | en_US |
dc.description.sponsorship | National Institutes of Health (U.S.) | en_US |
dc.description.sponsorship | National Human Genome Research Institute (U.S.) (Grant U54HG003067) | en_US |
dc.description.sponsorship | National Human Genome Research Institute (U.S.) (Grant R01HG003474) | en_US |
dc.description.sponsorship | National Institute of Allergy and Infectious Diseases (U.S.) (Contract HHSN2722009000018C) | en_US |
dc.language.iso | en_US | |
dc.publisher | National Academy of Sciences | en_US |
dc.relation.isversionof | http://dx.doi.org/10.1073/pnas.1017351108 | en_US |
dc.rights | Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. | en_US |
dc.source | PNAS | en_US |
dc.title | High-quality draft assemblies of mammalian genomes from massively parallel sequence data | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Gnerre, S. et al. “High-quality Draft Assemblies of Mammalian Genomes from Massively Parallel Sequence Data.” Proceedings of the National Academy of Sciences 108.4 (2010) : 1513-1518. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Biology | en_US |
dc.contributor.approver | Lander, Eric S | |
dc.contributor.mitauthor | Lander, Eric S. | |
dc.relation.journal | Proceedings of the National Academy of Sciences of the United States of America | en_US |
dc.eprint.version | Final published version | en_US |
dc.type.uri | http://purl.org/eprint/type/JournalArticle | en_US |
eprint.status | http://purl.org/eprint/status/PeerReviewed | en_US |
dspace.orderedauthors | Gnerre, S.; MacCallum, I.; Przybylski, D.; Ribeiro, F. J.; Burton, J. N.; Walker, B. J.; Sharpe, T.; Hall, G.; Shea, T. P.; Sykes, S.; Berlin, A. M.; Aird, D.; Costello, M.; Daza, R.; Williams, L.; Nicol, R.; Gnirke, A.; Nusbaum, C.; Lander, E. S.; Jaffe, D. B. | en |
mit.license | PUBLISHER_POLICY | en_US |
mit.metadata.status | Complete | |