Pattern recognition of big nutritional data in RCT

  • Authors:
  • Jin Wang;Hua Fang;Honggang Wang;Gin-Fei Olendzki;Chonggang Wang;Yunsheng Ma

  • Affiliations:
  • University of Massachusetts, Dartmouth, MA;University of Massachusetts Medical School, Worcester, MA;University of Massachusetts, Dartmouth, MA;University of Massachusetts Medical School, Worcester, MA;Interdigital, King of Prussia, PA;University of Massachusetts Medical School, Worcester, MA

  • Venue:
  • BodyNets '13 Proceedings of the 8th International Conference on Body Area Networks
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

As technology develops and research environment improves, large volume of data is collected for analyses. Unfortunately, these data are collected but not fully used or untouched. Particularly, such big data from health and medical studies pose significant challenges to the methodological field. This paper presents a new multi-clustering approach for pattern recognition of big data in a randomized controlled trial (RCT) with multi-validation criteria. Specifically, a nutritional dataset was used to demonstrate our approach, which was generated from an NIH-funded RCT for patients with metabolic syndrome. The proposed approach includes a suite of emerging and popular clustering methods: probability-based Gaussian Mixture Model (GMM), Hidden Markov Random Fields(HMRFs), Self-Organizing Map (SOM)-based neural networks, K-means and Agglomerative Hierarchical method. Using our RCT data and multi-validation criteria, our approach identified a most sufficient set of nutritional variables and detected distinct dietary change patterns with a universal agreement among the proposed multi-methods. The trajectory patterns were then generated using the method with the most clustering accuracy which was cross-validated via simulation. These patterns generated new and finer results for outcomes of the RCT. While our approach demonstrated a more accurate and comprehensive clustering only for nutritional data in RCT, it can be generalized to big data in other research fields.