FlexGP 2.0 : multiple levels of parallelism in distributed machine learning via genetic programming

Sherry, Dylan J. (Dylan Jacob)

dc.contributor.advisor	Kalyan Veeramachaneni and Una-May O'Reilly.	en_US
dc.contributor.author	Sherry, Dylan J. (Dylan Jacob)	en_US
dc.contributor.other	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science.	en_US
dc.date.accessioned	2014-03-06T15:46:22Z
dc.date.available	2014-03-06T15:46:22Z
dc.date.copyright	2013	en_US
dc.date.issued	2013	en_US
dc.identifier.uri	http://hdl.handle.net/1721.1/85498
dc.description	Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2013.	en_US
dc.description	Cataloged from PDF version of thesis.	en_US
dc.description	Includes bibliographical references (pages 105-107).	en_US
dc.description.abstract	This thesis presents FlexGP 2.0, a distributed cloud-backed machine learning system. FlexGP 2.0 features multiple levels of parallelism which provide a significant improvement in accuracy vs. elapsed time. The amount of computational resources in FlexGP 2.0 can be scaled along several dimensions to support large, complex data. FlexGP 2.0's core genetic programming (GP) learner includes multithreaded C++ model evaluation and a multi-objective optimization algorithm which is extensible to pursue any number of objectives simultaneously in parallel. FlexGP 2.0 parallelizes the entire learner to obtain a large distributed population size and leverages communication between learners to increase performance via transferral of search progress between learners. FlexGP 2.0 factors training data to boost performance and enable support for increased data size and complexity. Several experiments are performed which verify the efficacy of FlexGP 2.0's multilevel parallelism. Experiments run on a large dataset from a real-world regression problem. The results demonstrate both less time to achieve the same accuracy and overall increased accuracy, and illustrate the value of FlexGP 2.0 as a platform for machine learning.	en_US
dc.description.statementofresponsibility	by Dylan J. Sherry.	en_US
dc.format.extent	107 pages	en_US
dc.language.iso	eng	en_US
dc.publisher	Massachusetts Institute of Technology	en_US
dc.rights	M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission.	en_US
dc.rights.uri	http://dspace.mit.edu/handle/1721.1/7582	en_US
dc.subject	Electrical Engineering and Computer Science.	en_US
dc.title	FlexGP 2.0 : multiple levels of parallelism in distributed machine learning via genetic programming	en_US
dc.title.alternative	Multiple levels of parallelism in distributed machine learning via genetic programming	en_US
dc.type	Thesis	en_US
dc.description.degree	M. Eng.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.identifier.oclc	871002391	en_US

Files in this item

Name: 871002391-MIT.pdf
Size: 22.89Mb
Format: PDF
Description: Full printable version

This item appears in the following Collection(s)

Graduate Theses
