Automatically Generating Wikipedia Articles: A Structure-Aware Approach
Author(s)
Sauper, Christina Joan; Barzilay, Regina
DownloadBarzilay_Automatically generating.pdf (99.48Kb)
OPEN_ACCESS_POLICY
Open Access Policy
Creative Commons Attribution-Noncommercial-Share Alike
Terms of use
Metadata
Show full item recordAbstract
In this paper, we investigate an approach for creating a comprehensive textual overview of a subject composed of information drawn from the Internet. We use the high-level structure of human-authored texts to automatically induce a domain-specific template for the topic structure of a new overview. The algorithmic innovation of our work is a method to learn topic-specific extractors for content selection jointly for the entire template. We augment the standard perceptron algorithm with a global integer linear programming formulation to optimize both local fit of information into each topic and global coherence across the entire overview. The results of our evaluation confirm the benefits of incorporating structural information into the content selection process.
Date issued
2009-08Department
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory; Massachusetts Institute of Technology. Department of Electrical Engineering and Computer ScienceJournal
Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP
Publisher
Association for Computational Linguistics
Citation
Sauper, Christina and Regina Barzilay. "Automatically Generating Wikipedia Articles: A Structure-Aware Approach."Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP of the AFNLP, pages 208–216,
Suntec, Singapore, 2-7 August 2009.
Version: Author's final manuscript
ISBN
978-1-932432-45-9