A unifying similarity measure for automated identification of national implementations of european union directives
Autor: | Daniel Traykov, Hristo Konstantinov, Francesco Costamagna, Llio Humphreys, Rohan Nanda, Livio Robaldo, Luigi Di Caro, Hristo Hristov, Guido Boella, Michele Romano, Tenyo Tyankov |
---|---|
Přispěvatelé: | Nanda, Rohan, Robaldo, Livio, Romano, Michele, Di Caro, Luigi, Boella, Guido, Konstantinov, Hristo, Tyankov, Tenyo, Traykov, Daniel, Hristov, Hristo, Costamagna, Francesco, Humphreys, Llio |
Rok vydání: | 2017 |
Předmět: |
Artificial intelligence
European law Legal information retrieval Artificial intelligence Matching (statistics) Computer science 02 engineering and technology Similarity measure 0603 philosophy ethics and religion computer.software_genre Legal information retrieval 020204 information systems Similarity (psychology) 0202 electrical engineering electronic engineering information engineering media_common.cataloged_instance European union media_common Computer science [C05] [Engineering computing & technology] business.industry 06 humanities and the arts Approximate string matching Sciences informatiques [C05] [Ingénierie informatique & technologie] 16. Peace & justice Directive European law Identification (information) 060301 applied ethics Data mining business computer Natural language processing semantic similarity information retireval European law |
Zdroj: | Proceedings of the 16th edition of the International Conference on Articial Intelligence and Law-ICAIL 17 Proceedings of the 16th edition of the International Conference on Articial Intelligence and Law -ICAIL '17 ICAIL A Unifying Similarity Measure for Automated Identification of National Implementations of European Union Directives. (2017). STARTPAGE=149;ENDPAGE=158;TITLE=None Proceedings of the 16th edition of the International Conference on Articial Intelligence and Law |
DOI: | 10.1145/3086512.3086527 |
Popis: | This paper presents a unifying text similarity measure (USM) for automated identification of national implementations of European Union (EU) directives. The proposed model retrieves the transposed provisions of national law at a fine-grained level for each article of the directive. USM incorporates methods for matching common words, common sequences of words and approximate string matching. It was used for identifying transpositions on a multilingual corpus of four directives and their corresponding national implementing measures (NIMs) in three different languages : English, French and Italian. We further utilized a corpus of four additional directives and their corresponding NIMs in English language for a thorough test of the USM approach. We evaluated the model by comparing our results with a gold standard consisting of official correlation tables (where available) or correspondences manually identified by domain experts. Our results indicate that USM was able to identify transpositions with average F-score values of 0.808, 0.736 and 0.708 for French, Italian and English Directive-NIM pairs respectively in the multilingual corpus. A comparison with state-of-the-art methods for text similarity illustrates that USM achieves a higher F-score and recall across both the corpora. |
Databáze: | OpenAIRE |
Externí odkaz: |