Search results - "Olvera, Michel"

Report

An Eye for an Ear: Zero-shot Audio Description Leveraging an Image Captioner using Audiovisual Distribution Alignment

Authors: Malard, Hugo; Olvera, Michel; Lathuiliere, Stéphane; Essid, Slim

Multimodal large language models have fueled progress in image captioning. These models, fine-tuned on vast image datasets, exhibit a deep understanding of semantic concepts. In this work, we show that this ability can be re-purposed for audio captio

External link: http://arxiv.org/abs/2410.05997

Report

A sound description: Exploring prompt templates and class descriptions to enhance zero-shot audio classification

Authors: Olvera, Michel; Stamatiadis, Paraskevas; Essid, Slim

Audio-text models trained via contrastive learning offer a practical approach to perform audio classification through natural language prompts, such as "this is a sound of" followed by category names. In this work, we explore alternative prompt templ

External link: http://arxiv.org/abs/2409.13676

Report

SALT: Standardized Audio event Label Taxonomy

Authors: Stamatiadis, Paraskevas; Olvera, Michel; Essid, Slim

Published in: DCASE, Oct 2024, Tokyo, Japan

Machine listening systems often rely on fixed taxonomies to organize and label audio data, key for training and evaluating deep neural networks (DNNs) and other supervised algorithms. However, such taxonomies face significant constraints: they are co

External link: http://arxiv.org/abs/2409.11746

Report

On the choice of the optimal temporal support for audio classification with Pre-trained embeddings

Authors: Quelennec, Aurian; Olvera, Michel; Peeters, Geoffroy; Essid, Slim

Current state-of-the-art audio analysis systems rely on pre-trained embedding models, often used off-the-shelf as (frozen) feature extractors. Choosing the best one for a set of tasks is the subject of many recent publications. However, one aspect of

External link: http://arxiv.org/abs/2312.14005

Report

Foreground-Background Ambient Sound Scene Separation

Authors: Olvera, Michel; Vincent, Emmanuel; Serizel, Romain; Gasso, Gilles

Published in: 28th European Signal Processing Conference (EUSIPCO), Jan 2021, Amsterdam, Netherlands

Ambient sound scenes typically comprise multiple short events occurring on top of a somewhat stationary background. We consider the task of separating these events from the background, which we call foreground-background ambient sound scene separatio

External link: http://arxiv.org/abs/2005.07006

Report

Asteroid: the PyTorch-based audio source separation toolkit for researchers

Authors: Pariente, Manuel; Cornell, Samuele; Cosentino, Joris; Sivasankaran, Sunit; Tzinis, Efthymios; Heitkaemper, Jens; Olvera, Michel; Stöter, Fabian-Robert; Hu, Mathieu; Martín-Doñas, Juan M.; Ditter, David; Frank, Ariel; Deleforge, Antoine; Vincent, Emmanuel

This paper describes Asteroid, the PyTorch-based audio source separation toolkit for researchers. Inspired by the most successful neural source separation systems, it provides all neural building blocks required to build such a system. To improve rep

External link: http://arxiv.org/abs/2005.04132

Détection robuste d'événements sonores [Robust sound event detection]

Author: Olvera, Michel

Published in: Computer Science [cs]. Université de Lorraine, 2022. English. ⟨NNT : 2022LORR0324⟩

From industry to general interest applications, computational analysis of sound scenes and events allows us to interpret the continuous flow of everyday sounds. One of the main degradations encountered when moving from lab conditions to the real worl

External links: https://explore.openaire.eu/search/publication?articleId=od_______165::65c39d254fbd8d3e69738665ec3c061d
https://hal.univ-lorraine.fr/tel-04087756

On The Impact of Normalization Strategies in Unsupervised Adversarial Domain Adaptation for Acoustic Scene Classification

Authors: Olvera, Michel; Vincent, Emmanuel; Gasso, Gilles

Published in: ICASSP 2022-IEEE International Conference on Acoustics, Speech and Signal Processing, May 2022, Singapore, Singapore. ⟨10.1109/ICASSP43922.2022.9747540⟩

Acoustic scene classification systems face performance degradation when trained and tested on data recorded by different devices. Unsupervised domain adaptation methods have been studied to reduce the impact of this mismatch.

External links: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::afc3f75c4eb0097c75001041fff12428
https://doi.org/10.1109/icassp43922.2022.9747540

Improving Sound Event Detection with Auxiliary Foreground-Background Classification and Domain Adaptation

Authors: Olvera, Michel; Vincent, Emmanuel; Gasso, Gilles

Published in: DCASE 2021-6th Workshop on Detection and Classification of Acoustic Scenes and Events, Nov 2021, Virtual, Spain

In this paper we provide two methods that improve the detection of sound events in domestic environments. First, motivated by the broad categorization of domestic sounds as foreground or background events according to their sp

External links: https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::9e42052ad5fd710a1a2b50d212a231c0
https://hal.inria.fr/hal-03387778/file/DCASE_Workshop_2021.pdf

Domain-Adversarial Training and Trainable Parallel Front-end for the DCASE 2020 Task 4 Sound Event Detection Challenge

Authors: Cornell, Samuele; Olvera, Michel; Pariente, Manuel; Pepe, Giovanni; Principi, Emanuele; Gabrielli, Leonardo; Squartini, Stefano

Published in: DCASE 2020-5th Workshop on Detection and Classification of Acoustic Scenes and Events, Nov 2020, Virtual, Japan

In this paper, we propose several methods for improving Sound Event Detection systems performance in the context of the Detection and Classification of Acoustic Scenes and Events (DCASE) 2020 Task 4 challenge. Our main contrib

External links: https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::c701f09d2e903c45d613c0e639563a0a
https://hal.inria.fr/hal-02962911
