SOCR data dashboard: an integrated big data archive mashing medicare, labor, census and econometric information.

TitleSOCR data dashboard: an integrated big data archive mashing medicare, labor, census and econometric information.
Publication TypeJournal Article
Year of Publication2015
AuthorsHusain, Syed S., Kalinin Alexandr, Truong Anh, and Dinov Ivo D.
JournalJ Big Data
Volume2
Date Published2015
Abstract

INTRODUCTION: Intuitive formulation of informative and computationally-efficient queries on big and complex datasets present a number of challenges. As data collection is increasingly streamlined and ubiquitous, data exploration, discovery and analytics get considerably harder. Exploratory querying of heterogeneous and multi-source information is both difficult and necessary to advance our knowledge about the world around us.RESEARCH DESIGN: We developed a mechanism to integrate dispersed multi-source data and service the mashed information via human and machine interfaces in a secure, scalable manner. This process facilitates the exploration of subtle associations between variables, population strata, or clusters of data elements, which may be opaque to standard independent inspection of the individual sources. This a new platform includes a device agnostic tool (Dashboard webapp, http://socr.umich.edu/HTML5/Dashboard/) for graphical querying, navigating and exploring the multivariate associations in complex heterogeneous datasets.RESULTS: The paper illustrates this core functionality and serviceoriented infrastructure using healthcare data (e.g., US data from the 2010 Census, Demographic and Economic surveys, Bureau of Labor Statistics, and Center for Medicare Services) as well as Parkinson's Disease neuroimaging data. Both the back-end data archive and the front-end dashboard interfaces are continuously expanded to include additional data elements and new ways to customize the human and machine interactions.CONCLUSIONS: A client-side data import utility allows for easy and intuitive integration of user-supplied datasets. This completely open-science framework may be used for exploratory analytics, confirmatory analyses, meta-analyses, and education and training purposes in a wide variety of fields.

DOI10.1186/s40537-015-0018-z
Alternate JournalJ Big Data
PubMed ID26236573
PubMed Central IDPMC4520712
Grant ListP20 NR015331 / NR / NINR NIH HHS / United States
P30 DK089503 / DK / NIDDK NIH HHS / United States
P50 NS091856 / NS / NINDS NIH HHS / United States
U54 EB020406 / EB / NIBIB NIH HHS / United States