Showing 1 - 10 of 4,141 for search: '"P. Larcher"'
Author:
Larcher, Theo, Picek, Lukas, Deneu, Benjamin, Lorieul, Titouan, Servajean, Maximilien, Joly, Alexis
This paper describes a deep-SDM framework, MALPOLON. Written in Python and built upon the PyTorch library, this framework aims to facilitate training and inferences of deep species distribution models (deep-SDM) and sharing for users with only genera
External link:
http://arxiv.org/abs/2409.18102
Author:
Tur, Anil Osman, Conti, Alessandro, Beyan, Cigdem, Boscaini, Davide, Larcher, Roberto, Messelodi, Stefano, Poiesi, Fabio, Ricci, Elisa
In smart retail applications, the large number of products and their frequent turnover necessitate reliable zero-shot object classification methods. The zero-shot assumption is essential to avoid the need for re-training the classifier every time a n
External link:
http://arxiv.org/abs/2409.14963
Author:
Picek, Lukas, Botella, Christophe, Servajean, Maximilien, Leblanc, César, Palard, Rémi, Larcher, Théo, Deneu, Benjamin, Marcos, Diego, Bonnet, Pierre, Joly, Alexis
The difficulty of monitoring biodiversity at fine scales and over large areas limits ecological knowledge and conservation efforts. To fill this gap, Species Distribution Models (SDMs) predict species across space from spatially explicit features. Ye
External link:
http://arxiv.org/abs/2408.13928
Biometric recognition systems are security systems based on intrinsic properties of their users, usually encoded in high-dimensional representations called embeddings, whose potential theft would represent a greater threat than a temporary password or
External link:
http://arxiv.org/abs/2408.08918
Published in:
Speaker and Language Recognition Workshop - Odyssey, Jun 2024, Québec, Canada
Speech resynthesis is a generic task in which we want to synthesize audio with another audio as input, which finds applications for media monitors and journalists. Among different tasks addressed by speech resynthesis, voice conversion preserves the
External link:
http://arxiv.org/abs/2408.02712
Speaker Diarization (SD) aims at grouping speech segments that belong to the same speaker. This task is required in many speech-processing applications, such as rich meeting transcription. In this context, distant microphone arrays usually capture th
External link:
http://arxiv.org/abs/2406.03251
Author:
Constum, Thomas, Preel, Lucas, Larcher, Théo, Tranouez, Pierrick, Paquet, Thierry, Brée, Sandra
The EXO-POPP project aims to establish a comprehensive database comprising 300,000 marriage records from Paris and its suburbs, spanning the years 1880 to 1940, which are preserved in over 130,000 scans of double pages. Each marriage record may encom
External link:
http://arxiv.org/abs/2404.19329
Author:
Uro, Rémi, Doukhan, David, Rilliard, Albert, Larcher, Laëtitia, Adgharouamane, Anissa-Claire, Tahon, Marie, Laurent, Antoine
Published in:
Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pages 3271-3280, Marseille, 20-25 June 2022. European Language Resources Association (ELRA)
This paper presents a semi-automatic approach to create a diachronic corpus of voices balanced for speaker's age, gender, and recording period, according to 32 categories (2 genders, 4 age ranges and 4 recording periods). Corpora were selected at Fre
External link:
http://arxiv.org/abs/2404.17552
Voice Activity Detection (VAD) and Overlapped Speech Detection (OSD) are key pre-processing tasks for speaker diarization. In the meeting context, it is often easier to capture speech with a distant device. This consideration, however, leads to severe
External link:
http://arxiv.org/abs/2402.08312
The study of the hardest and easiest fitness landscapes is an active area of research. Recently, Kaufmann, Larcher, Lengler and Zou conjectured that for the self-adjusting $(1,\lambda)$-EA, Adversarial Dynamic BinVal (ADBV) is the hardest dynamic monoton
External link:
http://arxiv.org/abs/2311.07438