The Problem with Big Data: Operating on Smaller Datasets to Bridge the Implementation Gap

Autor: Richard Mann, Faisal Mushtaq, Alan White, Gabriel Cervantes, Tom Pike, Dalton Coker, Stuart Murdoch, Tim Hiles, Clare Smith, David Berridge, Geoff Hall, Suzanne Hinchliffe, Stephen Smye, Richard McGilchrist Wilkie, Peter Lodge, Mark Mon-Williams
Jazyk: angličtina
Rok vydání: 2016
Předmět:
Zdroj: Frontiers in Public Health, Vol 4 (2016)
Druh dokumentu: article
ISSN: 2296-2565
DOI: 10.3389/fpubh.2016.00248
Popis: Big datasets have the potential to revolutionize public health. However, there is a mismatch between the political and scientific optimism surrounding big data and the public’s perception of its benefit. We suggest a systematic and concerted emphasis on developing models derived from smaller datasets to illustrate to the public how big data can produce tangible benefits in the long-term. In order to highlight the immediate value of a small data approach, we produced a proof-of-concept model predicting hospital length of stay. The results demonstrate that existing small datasets can be used to create models that generate a reasonable prediction, facilitating healthcare delivery. We propose that greater attention (and funding) needs to be directed toward the utilization of existing information resources in parallel with current efforts to create and exploit ‘big data’.
Databáze: Directory of Open Access Journals