Who are my ancestors? : retrieving family relationships from historical texts
Author: | Efremova, I., Montes Garcia, A., Bolt Iriondo, A.J., Calders, T.G.K., Braslavski, P., Markov, I., Pardalos, P., Volkovich, Y., Ignatov, D.I., Koltsov, S., Koltsova, O. |
---|---|
Contributors: | Process Science |
Language: | English |
Year of publication: | 2015 |
Subject: |
Family relationship; Training set; Recall; Computer science; Sibling relationship; Support vector machine; Support vector machine classifier; Feature (machine learning); Wife; Artificial intelligence; Data mining; Natural language processing |
Source: | Communications in Computer and Information Science; Information Retrieval: 9th Russian Summer School, RuSSIR 2015, Saint Petersburg, Russia, August 24-28, 2015, Revised Selected Papers, pp. 121-129; ISBN: 9783319417172 |
Description: | This paper presents an approach for automatically retrieving family relationships from a real-world collection of Dutch historical notary acts. We aim to retrieve relationships such as husband-wife, parent-child, and widow of. Our approach comprises person name extraction, reference disambiguation, candidate generation, and family relationship prediction. Because we have a limited amount of training data, we evaluate different feature configurations based on n-gram analysis. The best results were obtained by combining word bi-grams and tri-grams with the distance in words between the two names. We evaluate our results for each relationship type in terms of precision, recall, and F-score. |
Database: | OpenAIRE |
External link: |
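
The feature configuration named in the description (word bi-grams and tri-grams plus the distance in words between two names) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names, the choice to draw n-grams only from the words between the two names, and the English example sentence are all assumptions made for illustration; the record does not specify these details.

```python
from collections import Counter


def word_ngrams(tokens, n):
    """Return all contiguous word n-grams of length n as space-joined strings."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def pair_features(tokens, first_span, second_span):
    """Build a sparse feature dict for one candidate pair of person names.

    tokens: the tokenized text.
    first_span, second_span: (start, end) token indices of the two names,
    end exclusive, with the first name preceding the second.
    Assumption: n-grams are taken from the words between the two names.
    """
    between = [t.lower() for t in tokens[first_span[1]:second_span[0]]]
    features = Counter()
    for n in (2, 3):  # bi-grams and tri-grams of words
        for gram in word_ngrams(between, n):
            features[f"{n}gram={gram}"] += 1
    # Distance in words between the two names.
    features["distance"] = len(between)
    return dict(features)


def precision_recall_f1(true_positives, false_positives, false_negatives):
    """Compute precision, recall, and F1 for one relationship type."""
    predicted = true_positives + false_positives
    actual = true_positives + false_negatives
    precision = true_positives / predicted if predicted else 0.0
    recall = true_positives / actual if actual else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    # Hypothetical example sentence, not taken from the notary acts.
    tokens = "Jan Jansen is the husband of Maria de Vries".split()
    print(pair_features(tokens, (0, 2), (6, 9)))
    print(precision_recall_f1(8, 2, 4))
```

Feature dictionaries of this kind could then be vectorized and passed to a classifier such as the support vector machine listed among the subject keywords, with the metrics computed separately for each relationship type.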