Search results

Report

MALPOLON: A Framework for Deep Species Distribution Modeling

Authors: Larcher, Theo; Picek, Lukas; Deneu, Benjamin; Lorieul, Titouan; Servajean, Maximilien; Joly, Alexis

This paper describes a deep-SDM framework, MALPOLON. Written in Python and built upon the PyTorch library, this framework aims to facilitate training and inferences of deep species distribution models (deep-SDM) and sharing for users with only genera

External link: http://arxiv.org/abs/2409.18102

Report

Exploring Fine-grained Retail Product Discrimination with Zero-shot Object Classification Using Vision-Language Models

Authors: Tur, Anil Osman; Conti, Alessandro; Beyan, Cigdem; Boscaini, Davide; Larcher, Roberto; Messelodi, Stefano; Poiesi, Fabio; Ricci, Elisa

In smart retail applications, the large number of products and their frequent turnover necessitate reliable zero-shot object classification methods. The zero-shot assumption is essential to avoid the need for re-training the classifier every time a n

External link: http://arxiv.org/abs/2409.14963

Report

GeoPlant: Spatial Plant Species Prediction Dataset

Authors: Picek, Lukas; Botella, Christophe; Servajean, Maximilien; Leblanc, César; Palard, Rémi; Larcher, Théo; Deneu, Benjamin; Marcos, Diego; Bonnet, Pierre; Joly, Alexis

The difficulty of monitoring biodiversity at fine scales and over large areas limits ecological knowledge and conservation efforts. To fill this gap, Species Distribution Models (SDMs) predict species across space from spatially explicit features. Ye

External link: http://arxiv.org/abs/2408.13928

Report

Supervised and Unsupervised Alignments for Spoofing Behavioral Biometrics

Authors: Thebaud, Thomas; Lan, Gaël Le; Larcher, Anthony

Biometric recognition systems are security systems based on intrinsic properties of their users, usually encoded in high dimension representations called embeddings, which potential theft would represent a greater threat than a temporary password or

External link: http://arxiv.org/abs/2408.08918

Report

Automatic Voice Identification after Speech Resynthesis using PPG

Authors: Gaudier, Thibault; Tahon, Marie; Larcher, Anthony; Estève, Yannick

Published in: Speaker and Language Recognition Workshop - Odyssey, Jun 2024, Québec, Canada

Speech resynthesis is a generic task for which we want to synthesize audio with another audio as input, which finds applications for media monitors and journalists. Among different tasks addressed by speech resynthesis, voice conversion preserves the

External link: http://arxiv.org/abs/2408.02712

Report

ASoBO: Attentive Beamformer Selection for Distant Speaker Diarization in Meetings

Authors: Mariotte, Theo; Larcher, Anthony; Montresor, Silvio; Thomas, Jean-Hugh

Speaker Diarization (SD) aims at grouping speech segments that belong to the same speaker. This task is required in many speech-processing applications, such as rich meeting transcription. In this context, distant microphone arrays usually capture th

External link: http://arxiv.org/abs/2406.03251

Report

End-to-end information extraction in handwritten documents: Understanding Paris marriage records from 1880 to 1940

Authors: Constum, Thomas; Preel, Lucas; Larcher, Théo; Tranouez, Pierrick; Paquet, Thierry; Brée, Sandra

The EXO-POPP project aims to establish a comprehensive database comprising 300,000 marriage records from Paris and its suburbs, spanning the years 1880 to 1940, which are preserved in over 130,000 scans of double pages. Each marriage record may encom

External link: http://arxiv.org/abs/2404.19329

Report

A Semi-Automatic Approach to Create Large Gender- and Age-Balanced Speaker Corpora: Usefulness of Speaker Diarization & Identification

Authors: Uro, Rémi; Doukhan, David; Rilliard, Albert; Larcher, Laëtitia; Adgharouamane, Anissa-Claire; Tahon, Marie; Laurent, Antoine

Published in: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pages 3271-3280, Marseille, 20-25 June 2022. European Language Resources Association (ELRA)

This paper presents a semi-automatic approach to create a diachronic corpus of voices balanced for speaker's age, gender, and recording period, according to 32 categories (2 genders, 4 age ranges and 4 recording periods). Corpora were selected at Fre

External link: http://arxiv.org/abs/2404.17552

Report

Channel-Combination Algorithms for Robust Distant Voice Activity and Overlapped Speech Detection

Authors: Mariotte, Théo; Larcher, Anthony; Montrésor, Silvio; Thomas, Jean-Hugh

Voice Activity Detection (VAD) and Overlapped Speech Detection (OSD) are key pre-processing tasks for speaker diarization. In the meeting context, it is often easier to capture speech with a distant device. This consideration however leads to severe

External link: http://arxiv.org/abs/2402.08312

Report

Hardest Monotone Functions for Evolutionary Algorithms

Authors: Kaufmann, Marc; Larcher, Maxime; Lengler, Johannes; Sieberling, Oliver

The study of hardest and easiest fitness landscapes is an active area of research. Recently, Kaufmann, Larcher, Lengler and Zou conjectured that for the self-adjusting $(1,\lambda)$-EA, Adversarial Dynamic BinVal (ADBV) is the hardest dynamic monoton

External link: http://arxiv.org/abs/2311.07438
