Show simple item record

dc.contributor.advisorDaniel Sanchez.en_US
dc.contributor.authorNutile, Domenic Jeffrey.en_US
dc.contributor.otherMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science.en_US
dc.date.accessioned2020-09-15T21:59:10Z
dc.date.available2020-09-15T21:59:10Z
dc.date.copyright2020en_US
dc.date.issued2020en_US
dc.identifier.urihttps://hdl.handle.net/1721.1/127464
dc.descriptionThesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, May, 2020en_US
dc.descriptionCataloged from the official PDF of thesis.en_US
dc.descriptionIncludes bibliographical references (pages 37-39).en_US
dc.description.abstractModern processors are reaching hundreds of cores, and the need for highly parallel programs is at an all-time high. Speculative execution is a promising approach to automatically parallelize sequential programs, but even the best speculative parallelizing compilers and hardware architectures fail to unlock all available parallelism on every program. These limitations are often caused by the structure of the original sequential program causing tasks to be unnecessarily dependent, limiting speedups. This thesis presents TProf, a system that dynamically profiles automatically parallelized sequential programs to find parallelism bottlenecks. Our implementation of TProf targets the T4 compiler on the Swarm architecture. T4 leverages Swarm's fine-grain speculation and high scalability to enable novel compiler parallelization techniques. TProf targets the programs created by T4, analyzing the execution of the generated task structures to identify key points of parallel contention at the LLVM intermediate representation level. TProf processes the collected information, relating performance bottlenecks to the source code which helps programmers quickly and easily decide how to enact transformations to unlock maximum parallelism.en_US
dc.description.statementofresponsibilityby Domenic Jeffrey Nutile.en_US
dc.format.extent39 pagesen_US
dc.language.isoengen_US
dc.publisherMassachusetts Institute of Technologyen_US
dc.rightsMIT theses may be protected by copyright. Please reuse MIT thesis content according to the MIT Libraries Permissions Policy, which is available through the URL provided.en_US
dc.rights.urihttp://dspace.mit.edu/handle/1721.1/7582en_US
dc.subjectElectrical Engineering and Computer Science.en_US
dc.titleUsing profiling to improve the performance of automatically parallelized programsen_US
dc.typeThesisen_US
dc.description.degreeM. Eng.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.identifier.oclc1192966905en_US
dc.description.collectionM.Eng. Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Scienceen_US
dspace.imported2020-09-15T21:59:09Zen_US
mit.thesis.degreeMasteren_US
mit.thesis.departmentEECSen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record