Discontinuous Constituency and BERT: A Case Study of Dutch
Autor: | Kogkalidis, Konstantinos, Wijnholds, Gijs, LS Linguistiek de taalinformatica, ILS LLI |
---|---|
Přispěvatelé: | LS Linguistiek de taalinformatica, ILS LLI |
Jazyk: | angličtina |
Rok vydání: | 2022 |
Předmět: | |
Zdroj: | Findings of the Association for Computational Linguistics: ACL 2022. Association for Computational Linguistics (ACL) Findings of the Association for Computational Linguistics: ACL 2022 |
Popis: | In this paper, we set out to quantify the syntactic capacity of BERT in the evaluation regime of non-context free patterns, as occurring in Dutch. We devise a test suite based on a mildly context-sensitive formalism, from which we derive grammars that capture the linguistic phenomena of control verb nesting and verb raising. The grammars, paired with a small lexicon, provide us with a large collection of naturalistic utterances, annotated with verb-subject pairings, that serve as the evaluation test bed for an attention-based span selection probe. Our results, backed by extensive analysis, suggest that the models investigated fail in the implicit acquisition of the dependencies examined. 8 pages plus references. To appear in Findings of the Association for Computational Linguistics 2022 |
Databáze: | OpenAIRE |
Externí odkaz: |