Zobrazeno 1 - 10
of 3 552
pro vyhledávání: '"AN, Zicong"'
Mixture-of-Experts (MoE) is an emerging technique for scaling large models with sparse activation. MoE models are typically trained in a distributed manner with an expert parallelism scheme, where experts in each MoE layer are distributed across mult
Externí odkaz:
http://arxiv.org/abs/2411.15419
Autor:
Wang, Yanjing, Wu, Lizhou, Hong, Wentao, Ou, Yang, Wang, Zicong, Gao, Sunfeng, Zhang, Jie, Ma, Sheng, Dong, Dezun, Qi, Xingyun, Lai, Mingche, Xiao, Nong
Compute eXpress Link (CXL) is a pivotal technology for memory disaggregation in future heterogeneous computing systems, enabling on-demand memory expansion and improved resource utilization. Despite its potential, CXL is in its early stages with limi
Externí odkaz:
http://arxiv.org/abs/2411.02282
Blockchain databases have attracted widespread attention but suffer from poor scalability due to underlying non-scalable blockchains. While blockchain sharding is necessary for a scalable blockchain database, it poses a new challenge named on-chain c
Externí odkaz:
http://arxiv.org/abs/2407.03750
The Volterra-type integral operator plays an essential role in modern complex analysis and operator theory. Recently, Chalmoukis \cite{Cn} introduced a generalized integral operator, say $I_{g,a}$, defined by $$I_{g,a}f=I^n(a_0f^{(n-1)}g'+a_1f^{(n-2)
Externí odkaz:
http://arxiv.org/abs/2405.16228
Let $n$ be a positive integer and $\mathbf{g}=(g_0,g_1,\cdots,g_{n-1})$, with $g_k\in H(\mathbb{D})$ for $k=0,1,\cdots,n-1$. Let $I_{\mathbf{g}}^{(n)}$ be the generalized Volterra-type operators on $H(\mathbb{C})$, which is represented as $$ I_{\math
Externí odkaz:
http://arxiv.org/abs/2405.11692
Human hands possess the dexterity to interact with diverse objects such as grasping specific parts of the objects and/or approaching them from desired directions. More importantly, humans can grasp objects of any shape without object-specific skills.
Externí odkaz:
http://arxiv.org/abs/2403.19649
Autor:
Fan, Zicong, Ohkawa, Takehiko, Yang, Linlin, Lin, Nie, Zhou, Zhishan, Zhou, Shihao, Liang, Jiajun, Gao, Zhong, Zhang, Xuanyang, Zhang, Xue, Li, Fei, Liu, Zheng, Lu, Feng, Zeid, Karim Abou, Leibe, Bastian, On, Jeongwan, Baek, Seungryul, Prakash, Aditya, Gupta, Saurabh, He, Kun, Sato, Yoichi, Hilliges, Otmar, Chang, Hyung Jin, Yao, Angela
We interact with the world with our hands and see it through our own (egocentric) perspective. A holistic 3Dunderstanding of such interactions from egocentric views is important for tasks in robotics, AR/VR, action recognition and motion generation.
Externí odkaz:
http://arxiv.org/abs/2403.16428
Steganography is the art of hiding secret data into the cover media for covert communication. In recent years, more and more deep neural network (DNN)-based steganographic schemes are proposed to train steganographic networks for secret embedding and
Externí odkaz:
http://arxiv.org/abs/2402.17210
Transformer model empowered architectures have become a pillar of cloud services that keeps reshaping our society. However, the dynamic query loads and heterogeneous user requirements severely challenge current transformer serving systems, which rely
Externí odkaz:
http://arxiv.org/abs/2401.05031
Autor:
Zhou, Jiahang, Chen, Yanyu, Hong, Zicong, Chen, Wuhui, Yu, Yue, Zhang, Tao, Wang, Hui, Zhang, Chuanfu, Zheng, Zibin
Foundation models (e.g., ChatGPT, DALL-E, PengCheng Mind, PanGu-$\Sigma$) have demonstrated extraordinary performance in key technological areas, such as natural language processing and visual recognition, and have become the mainstream trend of arti
Externí odkaz:
http://arxiv.org/abs/2401.02643