How to design a dataset compliant with an ML-based system ODD?

Authors: Cappi, Cyril; Cohen, Noémie; Ducoffe, Mélanie; Gabreau, Christophe; Gardes, Laurent; Gauffriau, Adrien; Ginestet, Jean-Brice; Mamalet, Franck; Mussot, Vincent; Pagetti, Claire; Vigouroux, David
Year of publication: 2024
Subject:
Document type: Working Paper
Description: This paper focuses on a Vision-based Landing task and presents the design and validation of a dataset that would comply with the Operational Design Domain (ODD) of a Machine-Learning (ML) system. Relying on emerging certification standards, we describe the process for establishing ODDs at both the system and image levels. In the process, we present the translation of high-level system constraints into actionable image-level properties, allowing for the definition of verifiable Data Quality Requirements (DQRs). To illustrate this approach, we use the Landing Approach Runway Detection (LARD) dataset, which combines synthetic imagery and real footage, and we focus on the steps required to verify the DQRs. The replicable framework presented in this paper addresses the challenges of designing a dataset compliant with the stringent needs of ML-based system certification in safety-critical applications.
Comment: 12th European Congress on Embedded Real Time Software and Systems, Jun 2024, Toulouse, France. arXiv admin note: text overlap with arXiv:2304.09938
Database: arXiv