Zobrazeno 1 - 10
of 13
pro vyhledávání: '"K. Hima Prasad"'
Autor:
Tanveer A. Faruquie, Snigdha Chaturvedi, L. Venkata Subramaniam, Mukesh K. Mohania, K. Hima Prasad
Publikováno v:
SOLI
Record Linkage is an essential but expensive step in enterprise data management. In most deployments, blocking techniques are employed which can reduce the number of record pair comparisons and hence, the computational complexity of the task. Blockin
Publikováno v:
SOLI
The threats of the 21st century are too complex, difficult and time consuming to discern with traditional intelligence practices that shun advances in information technology and rely heavily on human experts. Good information is fundamental to unders
Autor:
K. Hima Prasad, Tanveer A. Faruquie, Snigdha Chaturvedi, Mukesh K. Mohania, L. Venkata Subramaniam
Publikováno v:
SOLI
Enterprise datasets are often noisy. Several columns can have non-standard, erroneous or missing information. Poor quality data can lead to incorrect reporting and wrong conclusions being drawn. Data cleansing involves standardizing such data to impr
Autor:
K. Hima Prasad, Ullas Nambiar, L. Venkata Subramaniam, Mukesh K. Mohania, Tanveer A. Faruquie
Publikováno v:
IEEE SCC
Businesses are increasingly realizing the value of creating a {\it single view} of its customers and partners by integrating information residing in 'siloed' datasets within and outside the enterprise. However, the task of {\it augmenting} data avail
Autor:
Snigdha Chaturvedi, Sachindra Joshi, K. Hima Prasad, L. Venkata Subramaniam, Mukesh K. Mohania, Tanveer A. Faruquie
Publikováno v:
2011 Annual SRII Global Conference.
Data quality improvement is an important aspect of enterprise data management. Data characteristics can change with customers, with domain and geography making data quality improvement a challenging task. Data quality improvement is often an iterativ
Autor:
Tanveer A. Faruquie, Sriram Padmanabhan, Girish Venkatachaliah, K. Hima Prasad, Snigdha Chaturvedi, L. Venkata Subramaniam
Publikováno v:
2011 Annual SRII Global Conference.
Enterprises today accumulate huge quantities of data which is often noisy and unstructured in nature making data cleansing an important task. Data cleansing refers to standardizing data from different sources to a common format so that data can be be
Autor:
Govind Kothari, L. Venkata Subramaniam, Tanveer A. Faruquie, K. Hima Prasad, Mukesh K. Mohania
Publikováno v:
ICPR
Address Cleansing is very challenging, particularly for geographies with variability in writing addresses. Supervised learners can be easily trained for different data sources. However, training requires labeling large corpora for each data source wh
Autor:
Tanveer A. Faruquie, Mohan N. Dani, K. Hima Prasad, Mukesh K. Mohania, L. Venkata Subramaniam, Rishabh Garg, Govind Kothari, Varsha N. Swamy
Publikováno v:
IEEE SCC
Poor Data Quality is a serious problem affecting enterprises. Enterprise databases are large and manual data cleansing is not feasible. For such large databases it is logical to attempt to cleanse the data in an automated way. This has led to the dev
Autor:
Tanveer A. Faruquie, L. Venkata Subramaniam, Mukesh K. Mohania, Girish Venkatachaliah, K. Hima Prasad
Publikováno v:
IEEE SCC
Data processing on the cloud is increasingly used for offering cost effective services. In this paper, we present a method for resource allocation for data processing services over the cloud taking into account not just the processing power and memor
Autor:
Tanveer A. Faruquie, K. Hima Prasad, L. Venkata Subramaniam, Pramit Basu, Girish Venkatachaliah, Mukesh K. Mohania, Shrinivas Kulkarni
Publikováno v:
ICDE
There is often a transient need within enterprises for data cleansing which can be satisfied by offering data cleansing as a transient service. Every time a data cleansing need arises it should be possible to provision hardware, software and staff fo