Dynamic Blood Sugar Prediction Based on LSTM Neural Networks


Abstract

Objective To compare the predictive performance of unidimensional and multidimensional input models based on Long Short-Term Memory (LSTM) neural networks, and of a Back Propagation (BP) neural network, for dynamic blood sugar prediction. Method Blood sugar values were collected from 18 type 2 diabetes patients at Tianjin Medical University Zhu Xianyi Memorial Hospital from June 2021 to January 2022, together with exercise step counts and dietary caloric intake data from one of these patients. Two prediction models based on deep learning LSTM neural networks were established: a unidimensional input model using only blood sugar data and a multidimensional input model using blood sugar, step counts, and caloric intake data. The prediction horizons (time gradients) for future blood sugar were set to 6, 12, and 24 hours, and the Root Mean Square Error (RMSE) between predicted and actual values was calculated to assess the differences between models. A unidimensional prediction model based on BP neural networks was also established for comparison with the LSTM neural network results. Results For predictions at 6, 12, and 24 hours, the RMSE of the unidimensional prediction model based on LSTM neural networks was 0.47, 0.55, and 0.61 respectively; the RMSE of the multidimensional prediction model was 0.31, 0.50, and 0.56 respectively; and the RMSE of the unidimensional prediction model based on BP neural networks was 0.38, 0.59, and 0.63 respectively. Both the unidimensional and multidimensional prediction models based on LSTM neural networks demonstrated high accuracy, with accuracy decreasing as the prediction horizon increased. A comparison on the same patient showed that the multidimensional prediction model had higher accuracy; additionally, the accuracy of the LSTM neural network results was higher than that of the BP neural network results. Conclusion LSTM neural networks can serve as an effective means of dynamic blood sugar prediction.

Si Jiarui¹, Yang Yifei¹, Diliyaer Abudukeremu², Li Jing³

¹Tianjin Medical University, School of Basic Medicine, Tianjin 300070;

²Tianjin Medical University, Second Clinical Medical College, Tianjin 300222;

³Tianjin Medical University Zhu Xianyi Memorial Hospital, Tianjin 300070

Corresponding author: Li Jing, Email: 2003⁃[email protected]

Diabetes is a common chronic disease primarily caused by insufficient insulin secretion or impaired insulin utilization, with typical clinical manifestations including polyuria, polydipsia, polyphagia, weight loss, and abnormal glucose tolerance. Diabetes is divided into type 1 and type 2, with type 2 being more common. The etiology of type 2 diabetes is complex: genetic, environmental, age-related, racial, and personal lifestyle factors can all lead to insufficient insulin secretion or insulin resistance. Blood sugar indicators are crucial for controlling the condition. Because patients with diabetes have impaired insulin secretion or utilization, prolonged high blood sugar can easily disturb carbohydrate, protein, and fat metabolism^[1]. If blood sugar is not controlled in a timely manner, it can lead to severe complications affecting multiple organs such as the heart, kidneys, and eyes^[2], posing a serious threat to patients' health. To avoid the risks of hypoglycemia, hyperglycemia, and subsequent complications, type 2 diabetes patients need strict self-management (diet, exercise, sleep, and so on) to control blood sugar levels. Studies have shown that blood sugar fluctuations in diabetes patients are closely related to their diet and exercise.

In recent years, deep learning technologies have played an increasingly important role in blood sugar prediction for diabetes. Recurrent Neural Networks (RNNs) are a class of neural networks that take sequential data as input and process it recursively along the direction of the sequence, with all nodes connected in a chain^[3]. Long Short-Term Memory (LSTM) neural networks are a specialized type of RNN that performs better on medical data with time series characteristics. However, current research on using LSTM neural networks for blood sugar prediction in diabetes patients remains at a relatively basic stage. Peng Xiuli et al.^[4] compared the hypoglycemia warning capabilities of LSTM networks and Gated Recurrent Units (GRUs) for type 1 and type 2 diabetes patients, but the warning time scale was too short. Martínez-Delgado et al.^[5] used RNN models based on carbohydrate and insulin absorption curves to predict upcoming blood sugar levels in type 1 diabetes patients, but did not consider factors such as exercise and had a small sample size. Rabby et al.^[6] proposed a method for predicting blood sugar levels using a stacked LSTM-based deep recurrent neural network model, with sensor failures corrected by Kalman smoothing, but significant differences remained between predicted and actual values.

This study utilizes LSTM neural networks in deep learning, combined with patients’ multidimensional lifestyle data, to predict blood sugar for the next 6, 12, and 24 hours, allowing for early intervention before adverse blood sugar events occur, thereby effectively helping patients maintain blood sugar levels and prevent various complications of diabetes.

Materials and Methods

1. Experimental Data Collection

From June 2021 to January 2022, 18 type 2 diabetes patients who met the following criteria were selected at Tianjin Medical University Zhu Xianyi Memorial Hospital: aged 40-55 years, with stable prior blood sugar, good self-care abilities, no severe complications, and no need for medication. All 18 patients gave informed consent before participating in the experiment. Continuous blood sugar data were collected using the Abbott FreeStyle Libre glucose monitoring system (model: ART33503-001 Rev.D 08/16). Because patients need a 1-day adaptation period, the first day of data was excluded from model establishment. For 17 patients, the data consisted solely of blood sugar values monitored every 15 minutes, giving 96 data points per day. The data from the remaining patient were more complete: in addition to blood sugar values, this patient collected continuous exercise data for 14 days using a fitness tracker and calculated the caloric intake of each meal from photos of daily meals using the Mint Health APP (Shanghai Mint Health Technology Co., Ltd., version 11.6.3). This study was approved by the Ethics Committee of Tianjin Medical University, with research number TmuhMEC2021006.

2. Statistical Methods

Root Mean Square Error (RMSE) is the square root of the mean squared deviation between predicted and actual values: the squared deviations are summed, divided by the number of observations, and the square root of the result is taken. It measures the deviation between predicted and actual values and is sensitive to outliers in the data.
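The definition above can be sketched in a few lines of Python. This is only an illustration of the formula, not the study's code; the readings are hypothetical values chosen so that a single large miss shows the sensitivity to outliers.

```python
import math

def rmse(predicted, actual):
    """Root mean square error between two equal-length sequences."""
    if len(predicted) != len(actual) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(p - a) ** 2 for p, a in zip(predicted, actual)]
    return math.sqrt(sum(squared) / len(squared))

# Hypothetical readings for illustration only (not data from the study).
actual = [6.0, 6.5, 7.0, 7.5]
predicted = [6.0, 6.5, 7.0, 9.5]  # three exact predictions, one miss of 2.0

print(rmse(predicted, actual))  # 1.0
```

For the same hypothetical data the mean absolute error would be 0.5, so the one large miss weighs twice as heavily under RMSE.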

Standard deviation (SD) measures the dispersion of a dataset around its mean, whereas RMSE measures the deviation of predicted values from actual values. The two describe different things and serve different purposes, but they are calculated in a similar way.
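Written side by side, the similarity is easy to see. The notation here is assumed for illustration: $y_i$ is the actual value, $\hat{y}_i$ the predicted value, $\bar{y}$ the mean of the actual values, and $n$ the number of observations. The population form of SD (dividing by $n$) is shown; the sample form divides by $n-1$.

```latex
\mathrm{RMSE} = \sqrt{\frac{1}{n}\sum_{i=1}^{n}\left(\hat{y}_i - y_i\right)^2},
\qquad
\mathrm{SD} = \sqrt{\frac{1}{n}\sum_{i=1}^{n}\left(y_i - \bar{y}\right)^2}
```

The only structural difference is the reference point: RMSE compares each prediction with the corresponding actual value, while SD compares each value with the mean.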

In this study, RMSE was used to compare the predictive effects of LSTM neural network models and BP neural network models, and to compare the accuracy of multidimensional and unidimensional inputs in the LSTM neural network prediction model.

3. Model Establishment

(1) LSTM Neural Network Unidimensional Input Prediction Model

(1) Since this model only used blood sugar data, both the input and output data dimensions were set to 1. (2) The LSTM layer was set to have 96 hidden units. This study compared results with 48, 96, 192, and 288 hidden units and found that the prediction results were similar, while fewer hidden units required less training time. Since 96 corresponds to the number of blood sugar readings in one day, it was selected as the final value. (3) The solver was set to Adam, and training was conducted for 100 epochs. The Adam solver combines the advantages of the AdaGrad and RMSProp optimization algorithms: it adapts the step size of each parameter using exponentially decaying averages of past gradients and of past squared gradients, which improves performance on sparse gradient problems. Training for 100 epochs also made it possible to compare results across epochs and select the best. (4) To prevent gradient explosion, the gradient threshold was set to 1. The initial learning rate was set to 0.005, and after 50 epochs the learning rate was multiplied by a factor of 0.2 to help the model converge. (5) Model prediction was run on a Graphics Processing Unit (GPU), since prediction on large datasets, long sequences, or large networks is typically faster on GPUs than on Central Processing Units (CPUs). The GPU used in this experiment was an NVIDIA GeForce GTX 1050.
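The learning rate schedule and gradient threshold in item (4) can be illustrated with a short Python sketch. The source does not name the training framework, so this is not the study's code. Two assumptions are made here: the drop takes effect from epoch 51 onward, and the threshold is applied to the L2 norm of the gradient. All function names are hypothetical.

```python
import math

INITIAL_LR = 0.005    # initial learning rate (from the text)
DROP_FACTOR = 0.2     # multiplier applied at the drop (from the text)
DROP_PERIOD = 50      # epochs before the drop (from the text)
MAX_EPOCHS = 100      # total training epochs (from the text)
GRAD_THRESHOLD = 1.0  # gradient threshold (from the text)

def learning_rate(epoch):
    """Learning rate for a 1-indexed epoch under a piecewise-constant schedule."""
    drops = (epoch - 1) // DROP_PERIOD
    return INITIAL_LR * DROP_FACTOR ** drops

def clip_by_l2_norm(gradient, threshold=GRAD_THRESHOLD):
    """Rescale the gradient so its L2 norm does not exceed the threshold.

    Assumption: norm-based clipping; the source only gives the threshold value.
    """
    norm = math.sqrt(sum(g * g for g in gradient))
    if norm <= threshold:
        return list(gradient)
    scale = threshold / norm
    return [g * scale for g in gradient]

schedule = [learning_rate(e) for e in range(1, MAX_EPOCHS + 1)]
clipped = clip_by_l2_norm([3.0, 4.0])  # norm 5.0 is rescaled to norm 1.0
```

Under these assumptions, epochs 1 to 50 train at 0.005 and epochs 51 to 100 at 0.001, and a gradient whose norm exceeds 1 keeps its direction but is shortened.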

(2) LSTM Neural Network Multidimensional Input Prediction Model

(1) Since blood sugar, step counts, and caloric intake were used as inputs, with blood sugar as the output, the input data dimension was set to 3 and the output data dimension to 1. (2) The number of hidden units in the LSTM layer was set to 96. (3) The solver was set to Adam, and training was conducted for 100 epochs. (4) The gradient threshold was set to 1 and the initial learning rate to 0.005; after 50 epochs, the learning rate was multiplied by a factor of 0.2. (5) Model prediction was run on the GPU.
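To make the 3-in, 1-out shape concrete, the sketch below assembles input rows and targets from three aligned series. The source does not describe how the series were aligned or how targets were formed. The sketch assumes that steps and calories have already been resampled onto the same 15-minute grid as blood sugar, and that the target is the next blood sugar reading. The function name and the sample values are hypothetical.

```python
def build_pairs(glucose, steps, calories):
    """Pair each 3-feature input row with the next glucose reading.

    Assumption: all three series share one time grid, and the model is
    trained to predict one step ahead.
    """
    if not (len(glucose) == len(steps) == len(calories)):
        raise ValueError("all series must have the same length")
    if len(glucose) < 2:
        raise ValueError("need at least two time steps")
    rows = [[g, s, c] for g, s, c in zip(glucose, steps, calories)]
    inputs = rows[:-1]           # features at time t
    targets = list(glucose[1:])  # glucose at time t + 1
    return inputs, targets

# Hypothetical values for illustration only.
glucose = [6.0, 6.4, 7.1]
steps = [0, 120, 30]
calories = [0.0, 450.0, 0.0]

inputs, targets = build_pairs(glucose, steps, calories)
```

Each input row has dimension 3 and each target is a single value, matching the dimensions stated in item (1).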

(3) BP Neural Network Unidimensional Input Prediction Model

(1) The number of hidden neurons was set to 5, determined by comparing candidate values of 100, 50, 25, 10, and 5. When the number of hidden neurons was too large, overfitting occurred and prediction accuracy was low, so the larger values were discarded. (2) The maximum number of training epochs was also set to 100. (3) The initial learning rate was set to 0.005.

4. Research Methods

(1) Download the FreeStyle Libre software (Abbott Diabetes Care, UK, version 1.0) to access the blood sugar data collected from the type 2 diabetes patients by the Abbott FreeStyle Libre glucose monitoring system, 18 patients in total. (2) Export the per-minute exercise data collected by the fitness tracker. (3) Use the Mint Health APP to calculate the calories obtained from food at each meal. (4) Divide the data into two groups. For the group of 17 patients with only blood sugar data, establish a unidimensional LSTM neural network prediction model with blood sugar data as input. Because wearing time differed, the amount of data collected varied among patients. For each patient, the data from all days before the last day were used as the training set to learn the blood sugar change patterns and establish the LSTM prediction model; the last day's data served as the test set for dynamic blood sugar prediction over different prediction horizons (6, 12, 24 hours). The training and test sets must first be standardized to achieve better fitting and prevent training divergence. This study employed Z-Score standardization, which is suitable for data with unknown maximum and minimum values and converts different data to the same scale. (5) Compare the predicted values with the patients' actual blood sugar values, using RMSE to evaluate the model. (6) For the other patient, who had more complete information, establish a three-dimensional input LSTM neural network prediction model using blood sugar, exercise, and diet data. Similarly, use the data from all days before the last day as the training set and the last day's data as the test set for dynamic blood sugar prediction over different prediction horizons (6, 12, 24 hours), and evaluate the model using RMSE. (7) Based on the BP neural network, establish a unidimensional prediction model for blood sugar and evaluate the model using RMSE. (8) Compare the predictive effects of the LSTM neural network prediction models and the BP neural network prediction model, and compare the accuracy of multidimensional and unidimensional inputs in the LSTM neural network prediction model.
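The train/test split and Z-Score standardization in step (4) can be sketched as follows. This is a minimal illustration, not the study's code. It assumes 96 readings per day (one every 15 minutes, as stated earlier), a series from which the first adaptation day has already been removed, and standardization of both sets with the mean and standard deviation of the training set. The function names and the synthetic series are hypothetical.

```python
import statistics

SAMPLES_PER_DAY = 96         # one reading every 15 minutes
HORIZONS_HOURS = (6, 12, 24)

def split_last_day(series):
    """Use all days before the last day for training, the last day for testing."""
    if len(series) <= SAMPLES_PER_DAY:
        raise ValueError("need more than one day of data")
    return series[:-SAMPLES_PER_DAY], series[-SAMPLES_PER_DAY:]

def zscore_fit(train):
    """Mean and (population) standard deviation of the training set."""
    mu = statistics.fmean(train)
    sigma = statistics.pstdev(train)
    if sigma == 0:
        raise ValueError("training data has zero variance")
    return mu, sigma

def zscore_apply(values, mu, sigma):
    return [(v - mu) / sigma for v in values]

def zscore_invert(values, mu, sigma):
    """Map standardized values back to the original scale."""
    return [v * sigma + mu for v in values]

def horizon_steps(hours):
    """Number of 15-minute steps in a prediction horizon."""
    return hours * SAMPLES_PER_DAY // 24

# Synthetic 3-day series for illustration only (not data from the study).
series = [5.0 + (i % SAMPLES_PER_DAY) / SAMPLES_PER_DAY for i in range(3 * SAMPLES_PER_DAY)]
train, test = split_last_day(series)
mu, sigma = zscore_fit(train)
train_z = zscore_apply(train, mu, sigma)
test_z = zscore_apply(test, mu, sigma)
steps = [horizon_steps(h) for h in HORIZONS_HOURS]
```

Under these assumptions, the 6, 12, and 24 hour horizons correspond to the first 24, 48, and 96 readings of the test day. If RMSE is to be read on the original blood sugar scale, predictions would presumably be mapped back with `zscore_invert` before comparison.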

Results

1. LSTM Neural Network Unidimensional Input Prediction Model

After screening the blood sugar data of the 17 patients in the unidimensional input prediction model group and removing records that were implausible or had local gaps, data from 12 patients were retained. For each patient, the data from all days before the last day were used as the training set, and the last day's data as the test set. Taking one patient as an example, the comparison of the predicted and actual values for the next 24 hours using the LSTM neural network is shown in Figure 1A and Figure 1B, with local comparisons shown in Figure 1C.