Show simple item record

dc.contributor.advisorThaler, Jesse
dc.contributor.authorKryhin, Serhii
dc.date.accessioned2022-08-29T16:22:25Z
dc.date.available2022-08-29T16:22:25Z
dc.date.issued2022-05
dc.date.submitted2022-07-14T17:21:57.162Z
dc.identifier.urihttps://hdl.handle.net/1721.1/144940
dc.description.abstractWe study quark and gluon jets separately using public collider data from the CMS experiment. Our analysis is based on an Open Data dataset of proton-proton collisions collected at the Large Hadron Collider in 2011. We define two non-overlapping data mixtures via a pseudorapidity cut—central jets with |𝜂| ≤ 0.65 and forward jets with |𝜂| > 0.65—and employ jet topic modeling to extract individual distributions for the maximally separable categories. Under certain assumptions, such as sample independence and mutual irreducibility, the extracted “topic” categories correspond to “quark” and “gluon” distributions. We consider a number of different methods for extracting reducibility factors from the central and forward datasets and determine fractions of quark jets in each sample dataset. We also utilize the extracted fractions to reconstruct the distributions of observables for “quark” and “gluon” components, explore the change of topic fraction with the rapidity spectrum, compute the intrinsic dimensionality for each of the topics, and perform a crosscheck by exploring the tagging performance. The greatest stability and robustness to statistical uncertainties is achieved by a novel method based on parametrizing the endpoints of a receiver operating characteristic (ROC) curve. To mitigate detector effects, which would otherwise induce unphysical differences between central and forward jets, we use the OmniFold method to perform central value unfolding. To our knowledge, this work is the first application of full phase space unfolding to real collider data, and one of the first applications of topic modeling to extract separate quark and gluon distributions at the LHC.
dc.publisherMassachusetts Institute of Technology
dc.rightsIn Copyright - Educational Use Permitted
dc.rightsCopyright MIT
dc.rights.urihttp://rightsstatements.org/page/InC-EDU/1.0/
dc.titleApplication of Unsupervised Machine Learning for Event Classification
dc.typeThesis
dc.description.degreeS.B.
dc.contributor.departmentMassachusetts Institute of Technology. Department of Physics
mit.thesis.degreeBachelor
thesis.degree.nameBachelor of Science in Physics


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record