Showing 1 - 10 of 24 for search: '"Tutar, Ismail"'
Author:
Zhong, Wenliang, Wu, Wenyi, Li, Qi, Barton, Rob, Du, Boxin, Sam, Shioulin, Bouyarmane, Karim, Tutar, Ismail, Huang, Junzhou
Multimodal Large Language Models (MLLMs) have achieved SOTA performance in various visual language tasks by fusing the visual representations with LLMs leveraging some visual adapters. In this paper, we first establish that adapters using query-based
External link:
http://arxiv.org/abs/2406.02987
As online shopping is growing, the ability for buyers to virtually visualize products in their settings, a phenomenon we define as "Virtual Try-All", has become crucial. Recent diffusion models inherently contain a world model, rendering them suitable
External link:
http://arxiv.org/abs/2401.13795
We present Catalog Phrase Grounding (CPG), a model that can associate product textual data (title, brands) into corresponding regions of product images (isolated product region, brand logo region) for e-commerce vision-language applications. We use a
External link:
http://arxiv.org/abs/2308.16354
We introduce DreamPaint, a framework to intelligently inpaint any e-commerce product on any user-provided context image. The context image can be, for example, the user's own image for virtual try-on of clothes from the e-commerce catalog on themselv
External link:
http://arxiv.org/abs/2305.01257
Price Per Unit (PPU) is essential information for consumers shopping on e-commerce websites when comparing products. Finding total quantity in a product is required for computing PPU, which is not always provided by the sellers. To predict total q
External link:
http://arxiv.org/abs/2204.05555
Author:
Arici, Tarik, Seyfioglu, Mehmet Saygin, Neiman, Tal, Xu, Yi, Train, Son, Chilimbi, Trishul, Zeng, Belinda, Tutar, Ismail
Vision-and-Language Pre-training (VLP) improves model performance for downstream tasks that require image and text inputs. Current VLP approaches differ on (i) model architecture (especially image embedders), (ii) loss functions, and (iii) masking po
External link:
http://arxiv.org/abs/2109.12178
Author:
Tutar, Ismail B.
Published in:
Connect to this title online; UW restricted.
Thesis (Ph. D.)--University of Washington, 2007.
Vita. Includes bibliographical references (leaves 62-70).
External link:
http://hdl.handle.net/1773/6091
Author:
Tutar, Ismail B., Pathak, Sayan D., Gong, Lixin, Cho, Paul S., Wallner, Kent, Kim, Yongmin (ykim@u.washington.edu)
Published in:
IEEE Transactions on Medical Imaging. Dec 2006, Vol. 25, Issue 12, pp. 1645-1654. 10 pp. 2 charts, 9 graphs.
Author:
Tutar, Ismail B., Narayanan, Sreeram, Lenz, Hila, Nurani, Rizwan, Orio, Peter, Cho, Paul S., Wallner, Kent, Kim, Yongmin
Published in:
Proceedings of SPIE; Nov 2007, Issue 1, pp. 650914-650914-9, 9 pp.
Author:
Gong, Lixin, Ng, Lydia, Pathak, Sayan D., Tutar, Ismail, Cho, Paul S., Haynor, David R., Kim, Yongmin
Published in:
Proceedings of SPIE; Nov 2005, Issue 1, pp. 1648-1657, 10 pp.