Learning a Compositional Semantics for Freebase with an Open Predicate Vocabulary
Autor: | Tom M. Mitchell, Jayant Krishnamurthy |
---|---|
Rok vydání: | 2015 |
Předmět: |
Linguistics and Language
Vocabulary Parsing Computer science business.industry Communication media_common.quotation_subject Probabilistic database computer.software_genre Predicate (grammar) Computer Science Applications Human-Computer Interaction Artificial Intelligence Question answering Logical form Artificial intelligence business computer Sentence Natural language processing Natural language media_common |
Zdroj: | Transactions of the Association for Computational Linguistics. 3:257-270 |
ISSN: | 2307-387X |
DOI: | 10.1162/tacl_a_00137 |
Popis: | We present an approach to learning a model-theoretic semantics for natural language tied to Freebase. Crucially, our approach uses an open predicate vocabulary, enabling it to produce denotations for phrases such as “Republican front-runner from Texas” whose semantics cannot be represented using the Freebase schema. Our approach directly converts a sentence’s syntactic CCG parse into a logical form containing predicates derived from the words in the sentence, assigning each word a consistent semantics across sentences. This logical form is evaluated against a learned probabilistic database that defines a distribution over denotations for each textual predicate. A training phase produces this probabilistic database using a corpus of entity-linked text and probabilistic matrix factorization with a novel ranking objective function. We evaluate our approach on a compositional question answering task where it outperforms several competitive baselines. We also compare our approach against manually annotated Freebase queries, finding that our open predicate vocabulary enables us to answer many questions that Freebase cannot. |
Databáze: | OpenAIRE |
Externí odkaz: |