Zobrazeno 1 - 10
of 85
pro vyhledávání: '"Sainyam Galhotra"'
Publikováno v:
ACM SIGMOD Record. 51:30-37
Well-functioning data markets match sellers with buyers to allocate data effectively. Although most of today's data markets fall short of this ideal, there is a renewed interest in online data marketplaces that may fulfill the promise of data markets
Publikováno v:
Entropy, Vol 23, Iss 12, p 1571 (2021)
The deployment of machine learning (ML) systems in applications with societal impact has motivated the study of fairness for marginalized groups. Often, the protected attribute is absent from the training dataset for legal reasons. However, datasets
Externí odkaz:
https://doaj.org/article/b75d79d5f2a94e949c2f858a7538ed22
Autor:
Sainyam Galhotra, Udayan Khurana
Publikováno v:
Proceedings of the VLDB Endowment. 15:3562-3565
In data science problems, understanding the data is a crucial first step. However, it can be challenging and time intensive for a data scientist who is not an expert in that domain. Several downstream tasks such as feature engineering and data curati
Autor:
Sainyam Galhotra, Anna Fariha, Raoni Lourenço, Juliana Freire, Alexandra Meliou, Divesh Srivastava
Publikováno v:
Proceedings of the 2022 International Conference on Management of Data.
Publikováno v:
Proceedings of the 2022 International Conference on Management of Data.
Publikováno v:
Proceedings of the 2022 International Conference on Management of Data.
Publikováno v:
Proceedings of the ACM Web Conference 2022.
Autor:
Udayan Khurana, Sainyam Galhotra
Publikováno v:
CIKM
Determining the semantic concepts of columns in tabular data is of use for many applications ranging from data integration, cleaning, search to feature engineering and model building in machine learning. Several prior works have proposed supervised l
Publikováno v:
Entropy
Entropy, Vol 23, Iss 1571, p 1571 (2021)
Entropy; Volume 23; Issue 12; Pages: 1571
Entropy, Vol 23, Iss 1571, p 1571 (2021)
Entropy; Volume 23; Issue 12; Pages: 1571
The deployment of machine learning (ML) systems in applications with societal impact has motivated the study of fairness for marginalized groups. Often, the protected attribute is absent from the training dataset for legal reasons. However, datasets
Publikováno v:
SIGMOD Conference
Creating and collecting labeled data is one of the major bottlenecks in machine learning pipelines and the emergence of automated feature generation techniques such as deep learning, which typically requires a lot of training data, has further exacer