Search results - "Ma, Pingchuan"

Report

ADEPT-Z: Zero-Shot Automated Circuit Topology Search for Pareto-Optimal Photonic Tensor Cores

Authors: Jiang, Ziyang; Ma, Pingchuan; Zhang, Meng; Huang, Rena; Gu, Jiaqi

Photonic tensor cores (PTCs) are essential building blocks for optical artificial intelligence (AI) accelerators based on programmable photonic integrated circuits. Most PTC designs today are manually constructed, with low design efficiency and unsat

External link: http://arxiv.org/abs/2410.01313

Report

WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians

Authors: Kotovenko, Dmytro; Grebenkova, Olga; Sarafianos, Nikolaos; Paliwal, Avinash; Ma, Pingchuan; Poursaeed, Omid; Mohan, Sreyas; Fan, Yuchen; Li, Yilei; Ranjan, Rakesh; Ommer, Björn

While style transfer techniques have been well-developed for 2D image stylization, the extension of these methods to 3D scenes remains relatively unexplored. Existing approaches demonstrate proficiency in transferring colors and textures but often st

External link: http://arxiv.org/abs/2409.17917

Report

Large Language Models Are Strong Audio-Visual Speech Recognition Learners

Authors: Cappellazzo, Umberto; Kim, Minsu; Chen, Honglie; Ma, Pingchuan; Petridis, Stavros; Falavigna, Daniele; Brutti, Alessio; Pantic, Maja

Multimodal large language models (MLLMs) have recently become a focal point of research due to their formidable multimodal understanding capabilities. For example, in the audio and speech domains, an LLM can be equipped with (automatic) speech recogn

External link: http://arxiv.org/abs/2409.12319

Report

KAN 2.0: Kolmogorov-Arnold Networks Meet Science

Authors: Liu, Ziming; Ma, Pingchuan; Wang, Yixuan; Matusik, Wojciech; Tegmark, Max

A major challenge of AI + Science lies in their inherent incompatibility: today's AI is primarily based on connectionism, while science depends on symbolism. To bridge the two worlds, we propose a framework to seamlessly synergize Kolmogorov-Arnold N

External link: http://arxiv.org/abs/2408.10205

Report

Diffusion Models and Representation Learning: A Survey

Authors: Fuest, Michael; Ma, Pingchuan; Gui, Ming; Fischer, Johannes S.; Hu, Vincent Tao; Ommer, Björn

Diffusion Models are popular generative modeling methods in various vision tasks, attracting significant attention. They can be considered a unique instance of self-supervised learning methods due to their independence from label annotation. This sur

External link: http://arxiv.org/abs/2407.00783

Report

Dynamic Data Pruning for Automatic Speech Recognition

Authors: Xiao, Qiao; Ma, Pingchuan; Fernandez-Lopez, Adriana; Wu, Boqian; Yin, Lu; Petridis, Stavros; Pechenizkiy, Mykola; Pantic, Maja; Mocanu, Decebal Constantin; Liu, Shiwei

The recent success of Automatic Speech Recognition (ASR) is largely attributed to the ever-growing amount of training data. However, this trend has made model training prohibitively costly and imposed computational demands. While data pruning has bee

External link: http://arxiv.org/abs/2406.18373

Report

MSRS: Training Multimodal Speech Recognition Models from Scratch with Sparse Mask Optimization

Authors: Fernandez-Lopez, Adriana; Chen, Honglie; Ma, Pingchuan; Yin, Lu; Xiao, Qiao; Petridis, Stavros; Liu, Shiwei; Pantic, Maja

Pre-trained models have been a foundational approach in speech recognition, albeit with associated additional costs. In this study, we propose a regularization technique that facilitates the training of visual and audio-visual speech recognition mode

External link: http://arxiv.org/abs/2406.17614

Report

PIC2O-Sim: A Physics-Inspired Causality-Aware Dynamic Convolutional Neural Operator for Ultra-Fast Photonic Device FDTD Simulation

Authors: Ma, Pingchuan; Yang, Haoyu; Gao, Zhengqi; Boning, Duane S.; Gu, Jiaqi

The finite-difference time-domain (FDTD) method, which is important in photonic hardware design flow, is widely adopted to solve time-domain Maxwell equations. However, FDTD is known for its prohibitive runtime cost, taking minutes to hours to simula

External link: http://arxiv.org/abs/2406.17810

Report

Scalable Differentiable Causal Discovery in the Presence of Latent Confounders with Skeleton Posterior (Extended Version)

Authors: Ma, Pingchuan; Ding, Rui; Fu, Qiang; Zhang, Jiaru; Wang, Shuai; Han, Shi; Zhang, Dongmei

Differentiable causal discovery has made significant advancements in the learning of directed acyclic graphs. However, its application to real-world datasets remains restricted due to the ubiquity of latent confounders and the requirement to learn ma

External link: http://arxiv.org/abs/2406.10537

Report

SelfDefend: LLMs Can Defend Themselves against Jailbreaking in a Practical Manner

Authors: Wang, Xunguang; Wu, Daoyuan; Ji, Zhenlan; Li, Zongjie; Ma, Pingchuan; Wang, Shuai; Li, Yingjiu; Liu, Yang; Liu, Ning; Rahmel, Juergen

Jailbreaking is an emerging adversarial attack that bypasses the safety alignment deployed in off-the-shelf large language models (LLMs) and has evolved into multiple categories: human-based, optimization-based, generation-based, and the recent indir

External link: http://arxiv.org/abs/2406.05498
