Learning multi-scale functional representations of proteins from single-cell microscopy data

Autor: Razdaibiedina, Anastasia, Brechalov, Alexander
Rok vydání: 2022
Předmět:
Druh dokumentu: Working Paper
Popis: Protein function is inherently linked to its localization within the cell, and fluorescent microscopy data is an indispensable resource for learning representations of proteins. Despite major developments in molecular representation learning, extracting functional information from biological images remains a non-trivial computational task. Current state-of-the-art approaches use autoencoder models to learn high-quality features by reconstructing images. However, such methods are prone to capturing noise and imaging artifacts. In this work, we revisit deep learning models used for classifying major subcellular localizations, and evaluate representations extracted from their final layers. We show that simple convolutional networks trained on localization classification can learn protein representations that encapsulate diverse functional information, and significantly outperform autoencoder-based models. We also propose a robust evaluation strategy to assess quality of protein representations across different scales of biological function.
Comment: ICLR MLDD 2022
Databáze: arXiv