dc.contributor.advisor | Cafarella, Michael | |
dc.contributor.author | Yue, Kevin | |
dc.date.accessioned | 2022-08-29T16:28:59Z | |
dc.date.available | 2022-08-29T16:28:59Z | |
dc.date.issued | 2022-05 | |
dc.date.submitted | 2022-05-27T16:19:18.247Z | |
dc.identifier.uri | https://hdl.handle.net/1721.1/145040 | |
dc.description.abstract | Data is an ever-expanding part of life in today’s world. Understanding the origin and the history of data - a concept known as data provenance - can thus be extremely important. In this thesis, we first address the need for a data provenance knowledge graph system, then address the need for being able to recover workflows that exist in such provenance networks, in an unsupervised manner. Along with evaluating the effectiveness of existing unsupervised community and motif detection methods, we also suggest a novel approach that augments standard motif detection. Our research shows weak precision and recall numbers for almost all considered approaches, but provides a promising basis for future experimentation using more multifaceted methods. | |
dc.publisher | Massachusetts Institute of Technology | |
dc.rights | In Copyright - Educational Use Permitted | |
dc.rights | Copyright MIT | |
dc.rights.uri | http://rightsstatements.org/page/InC-EDU/1.0/ | |
dc.title | Unsupervised Workflow Discovery in Provenance Graphs | |
dc.type | Thesis | |
dc.description.degree | M.Eng. | |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | |
mit.thesis.degree | Master | |
thesis.degree.name | Master of Engineering in Electrical Engineering and Computer Science | |