Show simple item record

dc.contributor.advisorRoy, Deb
dc.contributor.authorSun, Daniel X.
dc.date.accessioned2022-02-07T15:24:38Z
dc.date.available2022-02-07T15:24:38Z
dc.date.issued2021-09
dc.date.submitted2021-11-03T19:25:33.170Z
dc.identifier.urihttps://hdl.handle.net/1721.1/140109
dc.description.abstractTwitter is a popular social media platform where users interact through follows and tweets. This work explores computational methods of analyzing tweets with regards to understanding users and their interests. We consider various embedding models to produce tweet embeddings, which we then use to cluster the tweets, forming groups of semantically similar tweets. We then compare these tweet clusters to users clustered by interest based on accounts they follow. This work introduces techniques on how to effectively cluster tweets by semantic meaning despite the colloquial structure of tweet language. We also discuss how the topics of these tweet clusters align with the interests derived from the follow-based clustering approach, and provide insights into where they do and don’t intersect.
dc.publisherMassachusetts Institute of Technology
dc.rightsIn Copyright - Educational Use Permitted
dc.rightsCopyright MIT
dc.rights.urihttp://rightsstatements.org/page/InC-EDU/1.0/
dc.titleClustering Tweets via Tweet Embeddings
dc.typeThesis
dc.description.degreeM.Eng.
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
mit.thesis.degreeMaster
thesis.degree.nameMaster of Engineering in Electrical Engineering and Computer Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record