Zobrazeno 1 - 10
of 177
pro vyhledávání: '"I.5.5"'
This paper introduces to a structured application of the One-Class approach and the One-Class-One-Network model for supervised classification tasks, specifically addressing a vowel phonemes classification case study within the Automatic Speech Recogn
Externí odkaz:
http://arxiv.org/abs/2410.05320
Publikováno v:
in Proceedings of the 5th IEEE International Symposium on the Internet of Sounds (IEEE IS2 2024, https://internetofsounds.net/is2_2024/)
This paper explores a structured application of the One-Class approach and the One-Class-One-Network model for supervised classification tasks, focusing on vowel phonemes classification and speakers recognition for the Automatic Speech Recognition (A
Externí odkaz:
http://arxiv.org/abs/2410.04098
Object detection is a main task in computer vision. Template matching is the reference method for detecting objects with arbitrary templates. However, template matching computational complexity depends on the rotation accuracy, being a limiting facto
Externí odkaz:
http://arxiv.org/abs/2408.02398
We propose a novel architecture and method of explainable classification with Concept Bottleneck Models (CBMs). While SOTA approaches to Image Classification task work as a black box, there is a growing demand for models that would provide interprete
Externí odkaz:
http://arxiv.org/abs/2404.03323
Autor:
Neuper, Walther
Publikováno v:
EPTCS 400, 2024, pp. 120-138
The paper presents the second part of a precise description of the prototype that has been developed in the course of the ISAC project over the last two decades. This part describes the "specify-phase", while the first part describing the "solve-phas
Externí odkaz:
http://arxiv.org/abs/2404.05462
Humans have good natural intuition to recognize when another person has something to say. It would be interesting if an AI can also recognize intentions to speak. Especially in scenarios when an AI is guiding a group discussion, this can be a useful
Externí odkaz:
http://arxiv.org/abs/2401.05849
Publikováno v:
Revista Electro, Vol. 45, pp. 184-189, 2023
In emergency situations, the high-speed movement of an ambulance through the city streets can be hindered by vehicular traffic. This work presents a method for detecting emergency vehicle sirens in real time. To obtain the audio fingerprint of a Hi-L
Externí odkaz:
http://arxiv.org/abs/2309.13920
Autor:
Helal, Manal E., Helal, Mohammed E.
Patents provide a rich source of information about design innovations. Patent mining techniques employ various technologies, such as text mining, machine learning, natural language processing, and ontology-building techniques. An automated graph data
Externí odkaz:
http://arxiv.org/abs/2305.00309
Convolution Neural Networks (CNNs) are widely used in medical image analysis, but their performance degrade when the magnification of testing images differ from the training images. The inability of CNNs to generalize across magnification scales can
Externí odkaz:
http://arxiv.org/abs/2302.11488
Stress has a great effect on people's lives that can not be understated. While it can be good, since it helps humans to adapt to new and different situations, it can also be harmful when not dealt with properly, leading to chronic stress. The objecti
Externí odkaz:
http://arxiv.org/abs/2212.14006