Towards a Taxonomy Machine: A Training Set of 5.6 Million Arthropod Images.

Authors: Steinke, Dirk; Ratnasingham, Sujeevan; Agda, Jireh; Ait Boutou, Hamzah; Box, Isaiah C. H.; Boyle, Mary; Chan, Dean; Feng, Corey; Lowe, Scott C.; McKeown, Jaclyn T. A.; McLeod, Joschka; Sanchez, Alan; Smith, Ian; Walker, Spencer; Wei, Catherine Y.-Y.; Hebert, Paul D. N.
Subject:
Source: Data (2306-5729); Nov 2024, Vol. 9 Issue 11, p122, 8p
Abstract: The taxonomic identification of organisms from images is an active research area within the machine learning community. Current algorithms are very effective for object recognition and discrimination, but they require extensive training datasets to generate reliable assignments. This study releases 5.6 million images with representatives from 10 arthropod classes and 26 insect orders. All images were taken using a Keyence VHX-7000 Digital Microscope system with an automatic stage to permit high-resolution (4K) microphotography. Providing phenotypic data for 324,000 species derived from 48 countries, this release represents, by far, the largest dataset of standardized arthropod images. As such, this dataset is well suited for testing the efficacy of machine learning algorithms for identifying specimens into higher taxonomic categories. [ABSTRACT FROM AUTHOR]
Database: Complementary Index