Enterprise-wide Machine Learning using Teradata Vantage: An Integrated Analytics Platform

Autor: Awny Al-Omari, Choudur Lakshminarayan, Faraz Ahmad, Sri Raghavan, Khaled Bouaziz, Thiagarajan Ramakrishnan, Prama Agarwal
Rok vydání: 2019
Předmět:
Zdroj: IEEE BigData
Popis: Big data characterized by variety can be divided into 3 principal categories: numeric structured data, semi-structured data, and unstructured multimedia data involving audio, video, and text. Decision making requires multiple analytical engines suitable for each type of data, programming languages, algorithms, visualization tools, and user interfaces. More often than not, industrial analytics is conducted ad hoc by lashing together analytics components such as distributed data sources, analytics engines, and algorithms. This kind of piecemeal approach ignores scale, security, governance, reliability, model management and fault tolerance that are paramount for industrial strength analytics. A unified, versatile, and robust architecture that combines various components in a single integrated platform is the need of the hour. Teradata Vantage (TD Vantage) is such a platform for delivering production quality enterprise analytics at scale. In this paper, we outline the proposed TD Vantage (available in the market and under continuous development) that unifies data, engines, and algorithms operating in a seamless symphony. We will demonstrate its capabilities through three proofs of concept biz: image data using TensorFlow, text data using Spark, and transaction data using Aster (now renamed Machine Learning Engine or MLE), with Teradata orchestrating interactions among the various components.
Databáze: OpenAIRE