Background: Continuous glucose monitoring (CGM) time-series are often analyzed, retrospectively, to investigate glucose variability (GV), a risk factor for the development of complications in type 1 diabetes (T1D). In the literature, several tens of different indices for GV quantification have been proposed, but many of them carry very similar information. The aim of this article is to select a relatively small subset of GV indices from a wider pool of metrics, to obtain a parsimonious but still comprehensive description of GV in T1D datasets.
Materials and methods: A pool of 25 GV indices was evaluated on two CGM time-series datasets of 17 and 16 T1D subjects, respectively, collected during the European Union Seventh Framework Programme project "Diadvisor" (2008-2012) in two different clinical research centers using the Dexcom(®) (San Diego, CA) SEVEN(®) Plus. After the indices were centered and scaled, the Sparse Principal Component Analysis (SPCA) technique was used to determine a reduced set of metrics that allows preserving a high percentage of the variance of the whole original set. In order to assess whether or not the selected subset of GV indices is dataset-dependent, the analysis was applied to both datasets, as well as to the one obtained by merging them.
Results: SPCA revealed that a subset of up to 10 different GV indices can be sufficient to preserve more than the 60% of the variance originally explained by all the 25 variables. It is remarkable that four of these GV indices (i.e., Index of Glycemic Control, percentage of Glycemic Risk Assessment Diabetes Equation score due to euglycemia, percentage Coefficient of Variation, and Low Blood Glucose Index) were selected for all the considered T1D datasets.
Conclusions: The SPCA methodology appears a suitable candidate to identify, among the large number of literature GV indices, subsets that allow obtaining a parsimonious, but still comprehensive, description of GV.