Unsupervised summarization of public talk radio

O'Brien, Shayne, S.M. Massachusetts Institute of Technology.

dc.contributor.advisor	Deb Roy.	en_US
dc.contributor.author	O'Brien, Shayne, S.M. Massachusetts Institute of Technology.	en_US
dc.contributor.other	Program in Media Arts and Sciences (Massachusetts Institute of Technology)	en_US
dc.date.accessioned	2020-01-23T17:02:34Z
dc.date.available	2020-01-23T17:02:34Z
dc.date.copyright	2019	en_US
dc.date.issued	2019	en_US
dc.identifier.uri	https://hdl.handle.net/1721.1/123648
dc.description	Thesis: S.M., Massachusetts Institute of Technology, School of Architecture and Planning, Program in Media Arts and Sciences, 2019	en_US
dc.description	Cataloged from PDF version of thesis.	en_US
dc.description	Includes bibliographical references (pages 111-118).	en_US
dc.description.abstract	Talk radio exerts significant influence on the political and social dynamics of the United States, but labor-intensive data collection and curation processes have prevented previous works from analyzing its content at scale. Over the past year, the Laboratory for Social Machines and Cortico have created an ingest system to record and automatically transcribe audio from more than 150 public talk radio stations across the country. Using the outputs from this ingest, I introduce "hierarchical compression" for neural unsupervised summarization of spoken opinion in conversational dialogue. By relying on an unsupervised framework that obviates the need for labeled data, the summarization task becomes largely agnostic to human input beyond necessary decisions regarding model architecture, input data, and output length. Trained models are thus able to automatically identify and summarize opinion in a dynamic fashion, which is noted in relevant literature as one of the most significant obstacles to fully unlocking talk radio as a data source for linguistic, ethnographic, and political analysis. To evaluate model performance, I create a novel spoken opinion summarization dataset consisting of compressed versions of "representative," opinion-containing utterances extracted from a hand-curated and crowd-source-annotated dataset of 275 snippets. I use this evaluation dataset to show that my model quantitatively outperforms strong rule- and graph-based unsupervised baselines on ROUGE and METEOR while qualitatively demonstrating fluency and information retention according to human judges. Additional analyses of model outputs show that many improvements are still yet to be made to this model, thus laying the ground for its use in important future work such as characterizing the linguistic structure of spoken opinion "in the wild."	en_US
dc.description.statementofresponsibility	by Shayne O'Brien.	en_US
dc.format.extent	118 pages	en_US
dc.language.iso	eng	en_US
dc.publisher	Massachusetts Institute of Technology	en_US
dc.rights	MIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission.	en_US
dc.rights.uri	http://dspace.mit.edu/handle/1721.1/7582	en_US
dc.subject	Program in Media Arts and Sciences	en_US
dc.title	Unsupervised summarization of public talk radio	en_US
dc.type	Thesis	en_US
dc.description.degree	S.M.	en_US
dc.contributor.department	Program in Media Arts and Sciences (Massachusetts Institute of Technology)	en_US
dc.identifier.oclc	1136615205	en_US
dc.description.collection	S.M. Massachusetts Institute of Technology, School of Architecture and Planning, Program in Media Arts and Sciences	en_US
dspace.imported	2020-01-23T17:02:33Z	en_US
mit.thesis.degree	Master	en_US
mit.thesis.department	Media	en_US

Files in this item

Name: 1136615205-MIT.pdf
Size: 9.212Mb
Format: PDF

This item appears in the following Collection(s)

Graduate Theses
