Showing 1 - 10 of 270 for search: '"Paton Norman W."'
Embeddings are now used to underpin a wide variety of data management tasks, including entity resolution, dataset search and semantic type detection. Such applications often involve datasets with numerical columns, but there has been more emphasis pl
External link:
http://arxiv.org/abs/2410.07485
Deep clustering (DC), a fusion of deep representation learning and clustering, has recently demonstrated positive results in data science, particularly text processing and computer vision. However, joint optimization of feature learning and data dist
External link:
http://arxiv.org/abs/2405.17723
Deep Learning (DL) techniques now constitute the state-of-the-art for important problems in areas such as text and image processing, and there have been impactful results that deploy DL in several data management tasks. Deep Clustering (DC) has recen
External link:
http://arxiv.org/abs/2305.13494
Published in:
Journal of Integrative Bioinformatics, Vol 8, Iss 2, Pp 187-203 (2011)
The generation and use of metabolic network reconstructions has increased over recent years. The development of such reconstructions has typically involved a time-consuming, manual process. Recent work has shown that steps undertaken in reconstructin
External link:
https://doaj.org/article/500807ed908343bcaea7718808e0743c
Author:
Alam Intikhab, Cornell Mike, Soanes Darren M., Hedeler Cornelia, Wong Han Min, Rattray Magnus, Hubbard Simon J., Talbot Nicholas J., Oliver Stephen G., Paton Norman W.
Published in:
Journal of Integrative Bioinformatics, Vol 4, Iss 3, Pp 112-122 (2007)
The continuing and rapid increase in the number of fully sequenced genomes is creating new opportunities for comparative studies. However, although many genomic databases store data from multiple organisms, for the most part they provide limited supp
External link:
https://doaj.org/article/b35bfefb75e94b5a9a530aed3484dcde
A data lake is a repository of data with potential for future analysis. However, both discovering what data is in a data lake and exploring related data sets can take significant effort, as a data lake can contain an intimidating amount of heterogene
External link:
http://arxiv.org/abs/2206.03881
Published in:
Expert Systems With Applications, Vol 252, Part B (15 October 2024)
Published in:
2020 IEEE 36th International Conference on Data Engineering (ICDE)
Data analytics stands to benefit from the increasing availability of datasets that are held without their conceptual relationships being explicitly known. When collected, these datasets form a data lake from which, by processes like data wrangling, s
External link:
http://arxiv.org/abs/2011.10427
Published in:
2021 IEEE 37th International Conference on Data Engineering (ICDE)
Accurately identifying different representations of the same real-world entity is an integral part of data cleaning and many methods have been proposed to accomplish it. The challenges of this entity resolution task that demand so much research atten
External link:
http://arxiv.org/abs/2011.10406