Zobrazeno 1 - 10
of 32 329
pro vyhledávání: '"Vision-Transformer"'
Transforming mathematical expressions into LaTeX poses a significant challenge. In this paper, we examine the application of advanced transformer-based architectures to address the task of converting handwritten or digital mathematical expression ima
Externí odkaz:
http://arxiv.org/abs/2412.03853
Rolling bearings play a crucial role in industrial machinery, directly influencing equipment performance, durability, and safety. However, harsh operating conditions, such as high speeds and temperatures, often lead to bearing malfunctions, resulting
Externí odkaz:
http://arxiv.org/abs/2412.00085
Autor:
Gondhalekar, Yash, Moriwaki, Kana
Parameter inference is a crucial task in modern cosmology that requires accurate and fast computational methods to handle the high precision and volume of observational datasets. In this study, we explore a hybrid vision transformer, the Convolution
Externí odkaz:
http://arxiv.org/abs/2411.14392
Autor:
Huang, Feiyang
This paper presents ViTOC (Vision Transformer and Object-aware Captioner), a novel vision-language model for image captioning that addresses the challenges of accuracy and diversity in generated descriptions. Unlike conventional approaches, ViTOC emp
Externí odkaz:
http://arxiv.org/abs/2411.07265
Facial landmark detection is a fundamental problem in computer vision for many downstream applications. This paper introduces a new facial landmark detector based on vision transformers, which consists of two unique designs: Dual Vision Transformer (
Externí odkaz:
http://arxiv.org/abs/2411.07167
Autor:
Liang, Jiajia
Image shadow removal is a crucial task in computer vision. In real-world scenes, shadows alter image color and brightness, posing challenges for perception and texture recognition. Traditional and deep learning methods often overlook the distinct nee
Externí odkaz:
http://arxiv.org/abs/2501.01864
Publikováno v:
Journal of Silk. 2024, Vol. 61 Issue 11, p77-83. 7p.
Autor:
Taylor, Jacob R., Sarma, Sankar Das
1D superconductor-semiconductor nanowires are leading candidates for topological quantum computation due to their ability to host Majorana zero modes (MZMs). However, standard methods for identifying MZMs are often inadequate, particularly in the pre
Externí odkaz:
http://arxiv.org/abs/2412.06768
Semantic communications provide significant performance gains over traditional communications by transmitting task-relevant semantic features through wireless channels. However, most existing studies rely on end-to-end (E2E) training of neural-type e
Externí odkaz:
http://arxiv.org/abs/2412.06038
Publikováno v:
Machine Learning and Knowledge Discovery in Databases.Applied Data Science Track, vol 14950, Springer (2024) 116-132
One of the most promising use-cases for machine learning in industrial manufacturing is the early detection of defective products using a quality control system. Such a system can save costs and reduces human errors due to the monotonous nature of vi
Externí odkaz:
http://arxiv.org/abs/2411.14953