Výsledky vyhledávání

Report

AlignedCut: Visual Concepts Discovery on Brain-Guided Universal Feature Space

Autor: Yang, Huzheng, Gee, James, Shi, Jianbo

We study the intriguing connection between visual data, deep networks, and the brain. Our method creates a universal channel alignment by using brain voxel fMRI response prediction as the training objective. We discover that deep networks, trained wi

Externí odkaz: http://arxiv.org/abs/2406.18344

Zobrazit plný text záznamu

Report

Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models

Autor: Xu, Katherine, Zhang, Lingzhi, Shi, Jianbo

Recent advances in text-to-image (T2I) diffusion models have facilitated creative and photorealistic image synthesis. By varying the random seeds, we can generate various images for a fixed text prompt. Technically, the seed controls the initial nois

Externí odkaz: http://arxiv.org/abs/2405.14828

Zobrazit plný text záznamu

Report

Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond

Autor: Xu, Katherine, Zhang, Lingzhi, Shi, Jianbo

Modern text-to-image (T2I) diffusion models can generate images with remarkable realism and creativity. These advancements have sparked research in fake image detection and attribution, yet prior studies have not fully explored the practical and scie

Externí odkaz: http://arxiv.org/abs/2403.19653

Zobrazit plný text záznamu

Report

Amodal Completion via Progressive Mixed Context Diffusion

Autor: Xu, Katherine, Zhang, Lingzhi, Shi, Jianbo

Our brain can effortlessly recognize objects even when partially hidden from view. Seeing the visible of the hidden is called amodal completion; however, this task remains a challenge for generative AI despite rapid progress. We propose to sidestep m

Externí odkaz: http://arxiv.org/abs/2312.15540

Zobrazit plný text záznamu

Report

Brain Decodes Deep Nets

Autor: Yang, Huzheng, Gee, James, Shi, Jianbo

We developed a tool for visualizing and analyzing large pre-trained vision models by mapping them onto the brain, thus exposing their hidden inside. Our innovation arises from a surprising usage of brain encoding: predicting brain fMRI measurements i

Externí odkaz: http://arxiv.org/abs/2312.01280

Zobrazit plný text záznamu

Report

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

Autor: Grauman, Kristen, Westbury, Andrew, Torresani, Lorenzo, Kitani, Kris, Malik, Jitendra, Afouras, Triantafyllos, Ashutosh, Kumar, Baiyya, Vijay, Bansal, Siddhant, Boote, Bikram, Byrne, Eugene, Chavis, Zach, Chen, Joya, Cheng, Feng, Chu, Fu-Jen, Crane, Sean, Dasgupta, Avijit, Dong, Jing, Escobar, Maria, Forigua, Cristhian, Gebreselasie, Abrham, Haresh, Sanjay, Huang, Jing, Islam, Md Mohaiminul, Jain, Suyog, Khirodkar, Rawal, Kukreja, Devansh, Liang, Kevin J, Liu, Jia-Wei, Majumder, Sagnik, Mao, Yongsen, Martin, Miguel, Mavroudi, Effrosyni, Nagarajan, Tushar, Ragusa, Francesco, Ramakrishnan, Santhosh Kumar, Seminara, Luigi, Somayazulu, Arjun, Song, Yale, Su, Shan, Xue, Zihui, Zhang, Edward, Zhang, Jinxu, Castillo, Angela, Chen, Changan, Fu, Xinzhu, Furuta, Ryosuke, Gonzalez, Cristina, Gupta, Prince, Hu, Jiabo, Huang, Yifei, Huang, Yiming, Khoo, Weslie, Kumar, Anush, Kuo, Robert, Lakhavani, Sach, Liu, Miao, Luo, Mi, Luo, Zhengyi, Meredith, Brighid, Miller, Austin, Oguntola, Oluwatumininu, Pan, Xiaqing, Peng, Penny, Pramanick, Shraman, Ramazanova, Merey, Ryan, Fiona, Shan, Wei, Somasundaram, Kiran, Song, Chenan, Southerland, Audrey, Tateno, Masatoshi, Wang, Huiyu, Wang, Yuchen, Yagi, Takuma, Yan, Mingfei, Yang, Xitong, Yu, Zecheng, Zha, Shengxin Cindy, Zhao, Chen, Zhao, Ziwei, Zhu, Zhifan, Zhuo, Jeff, Arbelaez, Pablo, Bertasius, Gedas, Crandall, David, Damen, Dima, Engel, Jakob, Farinella, Giovanni Maria, Furnari, Antonino, Ghanem, Bernard, Hoffman, Judy, Jawahar, C. V., Newcombe, Richard, Park, Hyun Soo, Rehg, James M., Sato, Yoichi, Savva, Manolis, Shi, Jianbo, Shou, Mike Zheng, Wray, Michael

We present Ego-Exo4D, a diverse, large-scale multimodal multiview video dataset and benchmark challenge. Ego-Exo4D centers around simultaneously-captured egocentric and exocentric video of skilled human activities (e.g., sports, music, dance, bike re

Externí odkaz: http://arxiv.org/abs/2311.18259

Zobrazit plný text záznamu

Report

Perceptual Artifacts Localization for Image Synthesis Tasks

Autor: Zhang, Lingzhi, Xu, Zhengjie, Barnes, Connelly, Zhou, Yuqian, Liu, Qing, Zhang, He, Amirghodsi, Sohrab, Lin, Zhe, Shechtman, Eli, Shi, Jianbo

Recent advancements in deep generative models have facilitated the creation of photo-realistic images across various tasks. However, these generated images often exhibit perceptual artifacts in specific regions, necessitating manual correction. In th

Externí odkaz: http://arxiv.org/abs/2310.05590

Zobrazit plný text záznamu

Report

Memory Encoding Model

Autor: Yang, Huzheng, Gee, James, Shi, Jianbo

We explore a new class of brain encoding model by adding memory-related information as input. Memory is an essential brain mechanism that works alongside visual stimuli. During a vision-memory cognitive task, we found the non-visual brain is largely

Externí odkaz: http://arxiv.org/abs/2308.01175

Zobrazit plný text záznamu

Report

Retinotopy Inspired Brain Encoding Model and the All-for-One Training Recipe

Autor: Yang, Huzheng, Shi, Jianbo, Gee, James

Brain encoding models aim to predict brain voxel-wise responses to stimuli images, replicating brain signals captured by neuroimaging techniques. There is a large volume of publicly available data, but training a comprehensive brain encoding model is

Externí odkaz: http://arxiv.org/abs/2307.14021

Zobrazit plný text záznamu

Report

Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis

Autor: Zhang, Renrui, Wang, Liuhui, Guo, Ziyu, Wang, Yali, Gao, Peng, Li, Hongsheng, Shi, Jianbo

We present a Non-parametric Network for 3D point cloud analysis, Point-NN, which consists of purely non-learnable components: farthest point sampling (FPS), k-nearest neighbors (k-NN), and pooling operations, with trigonometric functions. Surprisingl

Externí odkaz: http://arxiv.org/abs/2303.08134

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání