Autor: |
Rane, Sunayana, Nencheva, Mira L., Wang, Zeyu, Lew-Williams, Casey, Russakovsky, Olga, Griffiths, Thomas L. |
Rok vydání: |
2022 |
Předmět: |
|
DOI: |
10.48550/arxiv.2207.09847 |
Popis: |
For human children as well as machine learning systems, a key challenge in learning a word is linking the word to the visual phenomena it describes. By organizing model output into word categories used to analyze child language learning data, we show a correspondence between word learning in children and the performance of image captioning models. Although captioning models are trained only on standard machine learning data, we find that their performance in producing words from a variety of word categories correlates with the age at which children acquire words from each of those categories. To explain why this correspondence exists, we show that the performance of captioning models is correlated with human judgments of the concreteness of words, suggesting that these models are capturing the complex real-world association between words and visual phenomena. |
Databáze: |
OpenAIRE |
Externí odkaz: |
|