A natural language interface for querying graph databases
Author(s)
Sun, Christina, M. Eng. Massachusetts Institute of Technology
Other Contributors
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science.
Advisor
Sanjeev Mohindra.
Abstract
An increasing amount of knowledge in the world is stored in graph databases. However, most people have limited or no understanding of database schemas and query languages. Providing a tool that translates natural language queries into structured queries allows people without this technical knowledge or specific domain expertise to retrieve information that was previously inaccessible. Many existing natural language interfaces to databases (NLIDB) propose solutions that may not generalize well to multiple domains and may require excessive feature engineering, manual customization, or large amounts of annotated training data. We present a method for constructing subgraph queries which can represent a graph of activities, events, persons, behaviors, and relations, for search against a graph database containing information from a variety of data sources. Our model interprets complex natural language queries by using a pipeline of named entity recognition and binary relation extraction models to identify key entities and relations corresponding to graph components such as nodes, attributes, and edges. This information is combined to create structured graph queries, which may then be applied to graph databases. By breaking down the translation task into a pipeline of several submodules, our model achieves a prediction accuracy of 46.9% with a small training set of only 218 sentences.
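The final step of the pipeline described above, combining recognized entities and binary relations into a structured graph query, might be sketched roughly as follows. This is an illustration only, not the thesis's implementation: the `Entity` and `Relation` types, the `build_query` helper, the example labels, and the choice of a Cypher-style `MATCH` pattern as the output format are all assumptions, and the entity and relation lists are hand-written stand-ins for the output of the two upstream models.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Entity:
    label: str                  # entity type, used here as the node label
    name: Optional[str] = None  # attribute value, if the sentence names one


@dataclass(frozen=True)
class Relation:
    head: int   # index of the source entity in the entity list
    tail: int   # index of the target entity in the entity list
    label: str  # relation type, used here as the edge type


def build_query(entities: List[Entity], relations: List[Relation]) -> str:
    """Assemble a Cypher-style subgraph pattern (assumed output format)."""
    nodes = []
    for i, e in enumerate(entities):
        if e.name is None:
            nodes.append(f"(n{i}:{e.label})")
        else:
            nodes.append(f'(n{i}:{e.label} {{name: "{e.name}"}})')
    edges = [f"(n{r.head})-[:{r.label}]->(n{r.tail})" for r in relations]
    returns = ", ".join(f"n{i}" for i in range(len(entities)))
    return "MATCH " + ", ".join(nodes + edges) + " RETURN " + returns


# Hypothetical extraction results for a question such as
# "Which people attended an event in Boston?"
entities = [Entity("Person"), Entity("Event"), Entity("Location", "Boston")]
relations = [Relation(0, 1, "ATTENDED"), Relation(1, 2, "LOCATED_IN")]

query = build_query(entities, relations)
print(query)
```

In this sketch each entity becomes a node, a named entity contributes an attribute constraint, and each binary relation becomes a directed edge between two previously declared nodes, which mirrors the node, attribute, and edge components mentioned in the abstract.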
Description
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2018. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Cataloged from student-submitted PDF version of thesis. Includes bibliographical references (pages 67-69).
Date issued
2018
Department
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Publisher
Massachusetts Institute of Technology
Keywords
Electrical Engineering and Computer Science.