FrameNet Lexical Database: Presenting a Few Frames Within the Risk Domain
Autor: | Aleksandra Marković, Natalija Tomić, Olivera Kitanović, Ranka Stanković |
---|---|
Rok vydání: | 2021 |
Předmět: |
rudarski korpus
obrada prirodnog jezika Srpski jezik semantika okvira Computer science business.industry FrameNet frame semantics computer.software_genre Lexical database scenario rizika risk scenario Domain (software engineering) Artificial intelligence natural language processing Serbian language business mining corpus computer Natural language processing |
Zdroj: | Infotheca |
ISSN: | 2217-9461 1450-9687 |
DOI: | 10.18485/infotheca.2021.21.1.1 |
Popis: | 1 33 7 21 M50 М53 U radu se daje kratak prikaz teorije semantike okvira, na kojoj je zasnovana leksička baza Frejmnet. Predstavljena je koncepcija ove mreže, kao i mogućnosti njene primene. Predstavljena je i leksička analiza koja se primenjuje u projektu izrade Frejmneta i ukazano na razlike između analize zasnovane na okviru u odnosu na analizu zasnovanu na reči. Zatim je prikazano nekoliko povezanih okvira koje prizivaju reči iz domena rizika. U radu je predstavljena i platforma NLTК pomoću koje se mogu koristiti razni jezički resursi, među njima i Frejmnet. Završno poglavlje pruža analizu imenice rizik na korpusu rudarstva. Predstavljeni su najčešći kolokati ove imenice, skica njene upotrebe, konkordance za pojedine modele, pronalaženje sinonima i povezanih reči u vidu tezaurusa, grafički prikaz frekvencija pojedinih kolokacija, kao i oblaka reči. This paper gives a short overview of the frame semantics theory that forms the theoretical basis of the Berkeley FrameNet project. We present the basic concepts of this database, as well as the possibility of implementing it in Serbian. We also take a close look at the lexical analysis used in the FrameNet development project and point out the differences between the frame-based lexical analysis and its word-based counterpart. This is followed by an illustration of a couple of related frames evoked by words from the risk domain. FrameNet data is also readily available through the Python API included in the NLTК (Natural Language Toolkit) suite, which provides a good natural language processing resource. The last chapter shows a corpus search of the noun risk in a mining-themed corpus. We also present its most common collocates, word sketch, individual pattern concordances, thesaurus entry of its synonyms and related words, collocation frequency graphs. A word cloud for the word risk is also included. |
Databáze: | OpenAIRE |
Externí odkaz: |