dc.contributor.advisor | Roy, Deb | |
dc.contributor.author | Sun, Daniel X. | |
dc.date.accessioned | 2022-02-07T15:24:38Z | |
dc.date.available | 2022-02-07T15:24:38Z | |
dc.date.issued | 2021-09 | |
dc.date.submitted | 2021-11-03T19:25:33.170Z | |
dc.identifier.uri | https://hdl.handle.net/1721.1/140109 | |
dc.description.abstract | Twitter is a popular social media platform where users interact through follows and tweets. This work explores computational methods of analyzing tweets with regards to understanding users and their interests. We consider various embedding models to produce tweet embeddings, which we then use to cluster the tweets, forming groups of semantically similar tweets. We then compare these tweet clusters to users clustered by interest based on accounts they follow. This work introduces techniques on how to effectively cluster tweets by semantic meaning despite the colloquial structure of tweet language. We also discuss how the topics of these tweet clusters align with the interests derived from the follow-based clustering approach, and provide insights into where they do and don’t intersect. | |
dc.publisher | Massachusetts Institute of Technology | |
dc.rights | In Copyright - Educational Use Permitted | |
dc.rights | Copyright MIT | |
dc.rights.uri | http://rightsstatements.org/page/InC-EDU/1.0/ | |
dc.title | Clustering Tweets via Tweet Embeddings | |
dc.type | Thesis | |
dc.description.degree | M.Eng. | |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | |
mit.thesis.degree | Master | |
thesis.degree.name | Master of Engineering in Electrical Engineering and Computer Science | |