Výsledky vyhledávání

Report

LC-Protonets: Multi-label Few-shot learning for world music audio tagging

Autor: Papaioannou, Charilaos, Benetos, Emmanouil, Potamianos, Alexandros

We introduce Label-Combination Prototypical Networks (LC-Protonets) to address the problem of multi-label few-shot classification, where a model must generalize to new classes based on only a few available examples. Extending Prototypical Networks, L

Externí odkaz: http://arxiv.org/abs/2409.11264

Zobrazit plný text záznamu

Report

Acoustic identification of individual animals with hierarchical contrastive learning

Autor: Nolasco, Ines, Moummad, Ilyass, Stowell, Dan, Benetos, Emmanouil

Acoustic identification of individual animals (AIID) is closely related to audio-based species classification but requires a finer level of detail to distinguish between individual animals within the same species. In this work, we frame AIID as a hie

Externí odkaz: http://arxiv.org/abs/2409.08673

Zobrazit plný text záznamu

Report

Domain-Invariant Representation Learning of Bird Sounds

Autor: Moummad, Ilyass, Serizel, Romain, Benetos, Emmanouil, Farrugia, Nicolas

Passive acoustic monitoring (PAM) is crucial for bioacoustic research, enabling non-invasive species tracking and biodiversity monitoring. Citizen science platforms like Xeno-Canto provide large annotated datasets from focal recordings, where the tar

Externí odkaz: http://arxiv.org/abs/2409.08589

Zobrazit plný text záznamu

Report

Foundation Models for Music: A Survey

In recent years, foundation models (FMs) such as large language models (LLMs) and latent diffusion models (LDMs) have profoundly impacted diverse sectors, including music. This comprehensive review examines state-of-the-art (SOTA) pre-trained models

Externí odkaz: http://arxiv.org/abs/2408.14340

Zobrazit plný text záznamu

Report

MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models

Autor: Weck, Benno, Manco, Ilaria, Benetos, Emmanouil, Quinton, Elio, Fazekas, George, Bogdanov, Dmitry

Multimodal models that jointly process audio and language hold great promise in audio understanding and are increasingly being adopted in the music domain. By allowing users to query via text and obtain information about a given audio input, these mo

Externí odkaz: http://arxiv.org/abs/2408.01337

Zobrazit plný text záznamu

Report

Can LLMs 'Reason' in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation

Autor: Zhou, Ziya, Wu, Yuhang, Wu, Zhiyue, Zhang, Xinyue, Yuan, Ruibin, Ma, Yinghao, Wang, Lu, Benetos, Emmanouil, Xue, Wei, Guo, Yike

Symbolic Music, akin to language, can be encoded in discrete symbols. Recent research has extended the application of large language models (LLMs) such as GPT-4 and Llama2 to the symbolic music domain including understanding and generation. Yet scant

Externí odkaz: http://arxiv.org/abs/2407.21531

Zobrazit plný text záznamu

Report

Stochastic branching models for the telomeres dynamics in a model including telomerase activity

Autor: Benetos, Athanase, Fritsch, Coralie, Horton, Emma, Lenotre, Lionel, Toupance, Simon, Villemonais, Denis

Telomeres are repetitive sequences of nucleotides at the end of chromosomes, whose evolution over time is intrinsically related to biological ageing. In most cells, with each cell division, telomeres shorten due to the so-called end replication probl

Externí odkaz: http://arxiv.org/abs/2407.11453

Zobrazit plný text záznamu

Report

YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation

Autor: Chang, Sungkyun, Benetos, Emmanouil, Kirchhoff, Holger, Dixon, Simon

Multi-instrument music transcription aims to convert polyphonic music recordings into musical scores assigned to each instrument. This task is challenging for modeling as it requires simultaneously identifying multiple instruments and transcribing th

Externí odkaz: http://arxiv.org/abs/2407.04822

Zobrazit plný text záznamu

Report

Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model

Autor: Huang, Jiawen, Benetos, Emmanouil

Multilingual automatic lyrics transcription (ALT) is a challenging task due to the limited availability of labelled data and the challenges introduced by singing, compared to multilingual automatic speech recognition. Although some multilingual singi

Externí odkaz: http://arxiv.org/abs/2406.17618

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání