Zobrazeno 1 - 10
of 139
pro vyhledávání: '"Yildirim, Ilker"'
Diffusion-based image generation models such as DALL-E 3 and Stable Diffusion-XL demonstrate remarkable capabilities in generating images with realistic and unique compositions. Yet, these models are not robust in precisely reasoning about physical a
Externí odkaz:
http://arxiv.org/abs/2402.09052
Autor:
Peters, Benjamin, DiCarlo, James J., Gureckis, Todd, Haefner, Ralf, Isik, Leyla, Tenenbaum, Joshua, Konkle, Talia, Naselaris, Thomas, Stachenfeld, Kimberly, Tavares, Zenna, Tsao, Doris, Yildirim, Ilker, Kriegeskorte, Nikolaus
Vision is widely understood as an inference problem. However, two contrasting conceptions of the inference process have each been influential in research on biological vision as well as the engineering of machine vision. The first emphasizes bottom-u
Externí odkaz:
http://arxiv.org/abs/2401.06005
Large language models (LLMs) show remarkable capabilities across a variety of tasks. Despite the models only seeing text in training, several recent studies suggest that LLM representations implicitly capture aspects of the underlying grounded concep
Externí odkaz:
http://arxiv.org/abs/2310.14540
Autor:
Yildirim, Ilker, Paul, L. A.
In what sense does a large language model have knowledge? The answer to this question extends beyond the capabilities of a particular AI system, and challenges our assumptions about the nature of knowledge and intelligence. We answer by granting LLMs
Externí odkaz:
http://arxiv.org/abs/2310.04276
Much of what we remember is not due to intentional selection, but simply a by-product of perceiving. This raises a foundational question about the architecture of the mind: How does perception interface with and influence memory? Here, inspired by a
Externí odkaz:
http://arxiv.org/abs/2302.10392
Autor:
Yildirim, Ilker, Siegel, Max H., Soltani, Amir A., Chaudhari, Shraman Ray, Tenenbaum, Joshua B.
Many surface cues support three-dimensional shape perception, but people can sometimes still see shape when these features are missing -- in extreme cases, even when an object is completely occluded, as when covered with a draped cloth. We propose a
Externí odkaz:
http://arxiv.org/abs/2301.03711
Large-scale vision-language models such as CLIP have shown impressive performance on zero-shot image classification and image-to-text retrieval. However, such performance does not realize in tasks that require a finer-grained correspondence between v
Externí odkaz:
http://arxiv.org/abs/2212.12043
Autor:
Yildirim, Ilker, Paul, L.A.
Publikováno v:
In Trends in Cognitive Sciences May 2024 28(5):404-415
Autor:
Yildirim, Ilker, Saeed, Basil, Bennett-Pierre, Grace, Gerstenberg, Tobias, Tenenbaum, Joshua, Gweon, Hyowon
The ability to estimate task difficulty is critical for many real-world decisions such as setting appropriate goals for ourselves or appreciating others' accomplishments. Here we give a computational account of how humans judge the difficulty of a ra
Externí odkaz:
http://arxiv.org/abs/1905.04445
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.