Polarization- and CGR-based binary representations as identifiers of the nucleotide sequences in bioinformatics

Author: Zimnyakov, Dmitry Aleksandrovich; Alonova, Marina Vasilevna; Skripal, Anatolij Vladimirovich; Inkin, Maksim Glebovich; Zaytsev, Sergey S; Feodorova, Valentina
Language: English, Russian
Year of publication: 2024
Subject:
Source: Известия высших учебных заведений: Прикладная нелинейная динамика, Vol 32, Iss 4, Pp 439-459 (2024)
Document type: article
ISSN: 0869-6632, 2542-1905
DOI: 10.18500/0869-6632-003110
Description: The purpose of this work is a comparative analysis of two approaches to synthesizing two-dimensional binary identifiers of nucleotide sequences obtained by DNA sequencing of biological objects. Methods. The first approach models the polarization-dependent diffraction of a coherent readout beam on a two-dimensional phase-modulating structure (phase screen) associated with the symbolic sequence produced by DNA sequencing. The second approach builds a two-dimensional representation of the symbolic sequence using the chaos game representation (CGR). To obtain a finite-element CGR map, the map is divided into a given number of cells, chosen to ensure acceptable sensitivity of the synthesized binary identifier to structural changes in the displayed sequence. Results. The comparative analysis was carried out on fragments of symbolic sequences corresponding to various strains (Wuhan, Delta, Omicron) of the SARS-CoV-2 virus. Correlation coefficients between the binary identifiers corresponding to the different strains were obtained and compared. Conclusion. Binary identifiers synthesized using the polarization encoding technique show significantly higher sensitivity to structural changes in the analyzed sequences and are smaller in size than CGR binary identifiers.
Database: Directory of Open Access Journals
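
The CGR-based approach summarized in the description can be illustrated with a minimal Python sketch. It is not the authors' implementation: the corner assignment for A, C, G, T, the 8 x 8 cell grid, the median-based binarization rule, and all function names are assumptions made for illustration, since the record does not specify them. The sketch maps a sequence to chaos game points, counts the points per cell, binarizes the counts, and compares two identifiers by their Pearson correlation coefficient.

```python
import numpy as np

# Assumed corner assignment (a common CGR convention, not taken from the paper).
CORNERS = {"A": (0.0, 0.0), "C": (0.0, 1.0), "G": (1.0, 1.0), "T": (1.0, 0.0)}


def cgr_points(seq):
    """Return the chaos game trajectory of a nucleotide sequence in the unit square."""
    x, y = 0.5, 0.5  # start at the centre of the square
    pts = []
    for base in seq.upper():
        if base not in CORNERS:
            continue  # skip ambiguous symbols such as N
        cx, cy = CORNERS[base]
        # Move halfway from the current point towards the corner of this base.
        x, y = (x + cx) / 2.0, (y + cy) / 2.0
        pts.append((x, y))
    return np.array(pts).reshape(-1, 2)


def cgr_binary_identifier(seq, cells=8):
    """Divide the CGR map into cells x cells bins and binarize the point counts."""
    pts = cgr_points(seq)
    counts, _, _ = np.histogram2d(
        pts[:, 0], pts[:, 1], bins=cells, range=[[0.0, 1.0], [0.0, 1.0]]
    )
    # Hypothetical binarization rule: 1 where the count exceeds the median count.
    return (counts > np.median(counts)).astype(np.uint8)


def identifier_correlation(a, b):
    """Pearson correlation coefficient between two flattened binary identifiers."""
    return float(np.corrcoef(a.ravel(), b.ravel())[0, 1])


if __name__ == "__main__":
    # Toy sequences for demonstration only, not real viral fragments.
    ref = cgr_binary_identifier("ACGTTGCAAGGCTTACGATC" * 25)
    alt = cgr_binary_identifier("ACGTTGCAAGGCTTACGTTC" * 25)
    print(identifier_correlation(ref, alt))
```

The number of cells controls the trade-off mentioned in the description: a finer grid makes the identifier more sensitive to local changes in the sequence but also larger. Note that the correlation is undefined (NaN) if either identifier is constant, for example when every cell holds the same count.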