Showing 1 - 10 of 1,876 for search: '"P. Yikang"'
Stereo matching has emerged as a cost-effective solution for road surface 3D reconstruction, garnering significant attention towards improving both computational efficiency and accuracy. This article introduces decisive disparity diffusion (D3Stereo)
External link:
http://arxiv.org/abs/2411.03717
Author:
Mi, Liang, Wang, Weijun, Tu, Wenming, He, Qingfeng, Kong, Rui, Fang, Xinyu, Dong, Yazhu, Zhang, Yikang, Li, Yunchun, Li, Meng, Dai, Haipeng, Chen, Guihai, Liu, Yunxin
Large Multimodal Models (LMMs) have shown significant progress in various complex vision tasks with the solid linguistic and reasoning capacity inherited from large language models (LLMs). Low-rank adaptation (LoRA) offers a promising method to integ
External link:
http://arxiv.org/abs/2411.00915
The self-attention mechanism traditionally relies on the softmax operator, necessitating positional embeddings like RoPE, or position biases to account for token order. But current methods still face length generalisation challenges. We propose
External link:
http://arxiv.org/abs/2410.17980
We propose an importance sampling method for tractable and efficient estimation of counterfactual expressions in general settings, named Exogenous Matching. By minimizing a common upper bound of counterfactual estimators, we transform the variance mi
External link:
http://arxiv.org/abs/2410.13914
Equivariant neural networks incorporate symmetries into their architecture, achieving higher generalization performance. However, constructing equivariant neural networks typically requires prior knowledge of data types and symmetries, which is diffi
External link:
http://arxiv.org/abs/2410.09841
Author:
Zhu, Yikang, Su, Zhaofeng
Quantum coherence is one of the fundamental properties of quantum mechanics and also acts as a valuable resource for a variety of practical applications, including quantum computing and quantum information processing. Evaluating the dilution of
External link:
http://arxiv.org/abs/2409.08876
Dynamic coronary roadmapping is a technology that overlays the vessel maps (the "roadmap") extracted from an offline image sequence of X-ray angiography onto a live stream of X-ray fluoroscopy in real-time. It aims to offer navigational guidance for
External link:
http://arxiv.org/abs/2408.15947
Author:
Shen, Yikang, Stallone, Matthew, Mishra, Mayank, Zhang, Gaoyuan, Tan, Shawn, Prasad, Aditya, Soria, Adriana Meza, Cox, David D., Panda, Rameswar
Finding the optimal learning rate for language model pretraining is a challenging task. This is not only because there is a complicated correlation between learning rate, batch size, number of training tokens, model size, and other hyperparameters bu
External link:
http://arxiv.org/abs/2408.13359
The advent of Large Language Models (LLMs) has significantly transformed the fields of natural and social sciences. Generative Agent-Based Models (GABMs), which utilize large language models in place of real subjects, are gaining increasing public at
External link:
http://arxiv.org/abs/2408.09175
Medical image segmentation is crucial for clinical decision-making, but the scarcity of annotated data presents significant challenges. Few-shot segmentation (FSS) methods show promise but often require retraining on the target domain and struggle to
External link:
http://arxiv.org/abs/2408.08813