Deferring a person from donating due to low haemoglobin (Hb) can be demotivating for the donor, incurs extra costs to blood establishment and may indicate that a donor has donated blood too frequently causing negative health effects such as anaemia [1]. To mitigate these negative effects, it is beneficial to be able to predict the donor's Hb value at a given date or directly predict whether the Hb will be below the deferral limit.
Previously, Baart et al. used logistic regression with non-linear predictors to predict low Hb deferrals [2, 3]. Subsequently, Nasserinejad [4] and Fokkinga [5] used Bayesian linear mixed models (LMMs) to predict Hb.
In this study, we aim to develop prediction methods for Hb/deferral to improve donor health and reduce costs due to deferrals without damaging the blood supply. We essentially reimplement the methods of Nasserinejad and Fokkinga and run them on our larger datasets with additional variables. To our knowledge, we are the first to use genetic information as explanatory variables and to estimate the blood supply and economic effects of deploying a low-Hb deferral model. We also publish our model implementations to make it easier to build future research on our results.
MATERIALS AND METHODSThe blood donation and blood product information of the Finnish Red Cross Blood Service (FRCBS) until 2020 was collected in the eProgesa database (MAKSYSTEM, Paris, France). Here, the eProgesa dataset contains the donation histories of Finnish blood donors from the last 20 years: 6,414,193 donation attempts from 940,831 donors. These data are collected at every blood donation event, and they contain information about the Hb value (pre-donation point-of-care capillary finger-prick sample) [6], time of day, donation location, type of donation and amount of blood collected.
We pre-processed the raw eProgesa data (Figure 1) to obtain a clean dataset for building models. Outliers, missing values and other problematic cases were handled by dropping instead of imputing them (Figure S2). We also derived several new variables from the raw variables (Figure 1c). After pre-processing, we were left with 2,157,733 donations and 449,008 donors .
The Biobank dataset contains genome-wide SNP genotyping data obtained from the Blood Service Biobank and height, weight and smoking variables from the Biobank enrolment questionnaire of 20,222 donors. The FinDonor [7] dataset contains more information about donation events such as blood counts, iron indices and questionnaire data. This dataset is much smaller than the eProgesa data, having a total of 7994 donation events from 2580 donors.
The variables from the eProgesa, Biobank and FinDonor datasets used for training our models are described in Tables S1–S3, respectively. Later in this paper, we refer to the combinations of eProgesa with the Biobank and FinDonor datasets with just Biobank and FinDonor, respectively. Further discussion about the variables used and pre-processing can be found in Section S1.
As the donation history is a longitudinal dataset, we can apply LMMs (where some parameters can be stochastic instead of being fixed, as in normal linear models) to predict Hb. Our model has the form yit = xit β + ci ϕ + bi + εit, where i refers to a donor and t to a donation time. The donation and donor-specific variables are stored in matrices xit and ci, respectively. The donor-specific intercept bi is the only random effect, and it allows deviation between donors caused by unobserved variables. If the previous Hb is among the predictors, then the model is called a dynamic linear mixed model (DLMM). Stan [8] is used to train these models in a Bayesian setting with weakly informative conjugate priors. To estimate the linear models' capability to predict deferral, the predicted Hb is dichotomized with the deferral limits used in Finland (135 g/L for men and 125 g/L for women).
To test whether the dependence of Hb is non-linear with respect to the predictors, we use a random forest (RF) model [9]. Because deferrals are rare in Finnish donation history (approximately 3.2% of donations), in the RF algorithm, we oversample the deferrals so that the trees are created from samples where 50% of the donors have deferral as their last donation to make it easier to train a classifier for deferral. As an RF cannot directly model time series, we add to each donation event information about the previous Hb and the number of lifetime donations. We use randomForest [10] to train an RF whose hyperparameters were optimized with caret [11] using four-fold cross-validation. Details about the linear and RF models and their implementations can be found in Section S2.
We measured the accuracy of Hb prediction with root mean square error (RMSE) and mean absolute error (MAE) and the performance of the binary classifier of deferral with area under the receiver operating characteristic curve (AUROC), area under the precision-recall curve (AUPR) and F-score (F1) metrics. More details of the performance measures used can be found in Section S2.4.
Personalized donation intervals can be applied either by estimating a truly personal donation interval for each donor or by creating pre-determined donation interval categories and assigning donors to them. In either case, the total adjustment atot in the population of returning donors is given by the mean of adjustments ai. If we extend the donation interval of donor i by 10%, for example, then ai = 1.1. This adjustment has a direct inverse effect on the flow of returning donors, which we find by subtracting the influx of new donors from the total influx of donors. Thus, the total influx after adjustments is given byTo determine, which subset of full data are best suited as the input for fitting LMMs, we performed three experiments: effect of time series length, effect of amount of data and effect of the imbalance of the division of the donations into accepted and deferred classes on Hb prediction. In addition, to determine whether the rules for selecting the input subset generalize, we divided the data into two equal-sized halves: an exploration part and a final model fitting and testing part. The three experiments described below were all performed on the exploration part of the data.
The distribution of time-series length of female donors in eProgesa data is shown in Figure S8. The number of donors decreases exponentially as a function of the time-series length. To model the data and predict the last donation of each time series, the minimum theoretical time series length is three. This requirement already dropped 50% of the donors from further consideration. To test the effect of the time-series length on Hb prediction, we partitioned the female eProgesa data into subsets based on time-series length, with each subset having donors with the same number of donations. We fitted a DLMM on each of these datasets and predicted the Hb of the last donation of the time series. The results are shown in Table S6. On the one hand, the results seemed to improve as the time-series length increased. On the other hand, the data were scarcer with longer time series. As a compromise, we decided to use the data from donors with at least seven donations in our later analyses of eProgesa and Biobank data.
We also experimented with the effect of the amount of data on the prediction. We randomly took three samples from female eProgesa data with 10,000, 30,000 and 50,000 donors. The amount of data did not show any clear effect on the prediction results (see Table S7).
Next, we considered the imbalance between accepted and deferred donation classes. In the pre-processed eProgesa data, only 12% of the donors had at least one deferral and the number of donors with more deferrals decreased rapidly, as shown in Figure S9. Since we want to be able to accurately predict Hb values that are below the accepted threshold, it is vital that there are enough examples of low Hb donations in the training data. We tried to artificially enrich the fraction of deferrals by dropping out donors with no deferrals. This resulted in a subset of female eProgesa data with the fraction of donors with at least one deferral being 50%. We fitted a DLMM on these data, and the prediction results are shown in Table S8. The results worsened after the enrichment.
As a result of the above exploration, we decided not to enrich the training data used for fitting LMMs. In addition, we only included donors who had donated at least seven times. The resulting dataset was already small enough, so fitting a model on these data was feasible in terms of time and memory required. Hence, no further subsetting of the data was needed. The resulting final testing data had 695,658 donations (398,803 female and 296,855 male) from 47,820 donors (29,298 female and 18,522 male). This dataset was used in the final analyses of the eProgesa data and the Biobank data. A different scheme, explained in Section S3.2.4, was used for FinDonor data, as that data had a very small number of donors and donations.
Figure 2 illustrates the effect sizes of variables of Biobank data in DLMM and the importance of eProgesa variables in RF. Other models gave similar results (Figures S10, S11, S20, S26, S31 and S34). There were no large differences in effect sizes between men and women except for the age-related variables. In both models, previous Hb was clearly the most important variable. The SNP rs199598395 on gene RNF43 had a large influence, but a polygenic score of Hb calculated from UK Biobank data had a smaller effect size than whether the donation was given in April–September.
The effect of the “days to previous full blood donation” was so small that varying it and other time-dependent variables accordingly did not affect deferral prediction enough to enable fully personalized donation intervals. Hence, we analysed the effect of donation activity by demographic group on the low Hb deferral rate for all donations in Finland (Figure S6) and the effect of donation on the iron deficiency rate in FinDonor data (Figure S7). Although no clear association existed between deferral rate and donation activity, a fixed deferral of 12 months is likely to reduce deferrals in the most vulnerable group, that is, women younger than 30 (Figure S6). For ferritin levels of the same group, even a 6-month deferral would decrease the number of yearly donations by up to two or three and hence significantly decrease the prevalence of low iron (Figure S7). In general, a 6-month interval, possibly with supplemental iron, has previously been shown to allow ferritin recuperation for most donors [12, 13].
To estimate the economic effects of deferral prediction, we subsequently used these two alternative donation intervals in case a model predicted a donor to be deferred. When we temporarily extended the donation intervals of all true and predicted deferrals to either 6 or 12 months, we obtained a rough but concrete estimate of the cost effect of the model performance.
The deferral prediction results and economic effects are summarized in Figure 3 and Table S14. According to the AUROC metric, RF performed better than the other models except for the male DLMM with Biobank data. However, there were no large differences in the performance of the LMM and RF, and each was only slightly better than our baseline model, that is, logistic regression with previous Hb as the only predictor.
In addition, all female models and the RF model resulted in cost reductions when a 6-month deferral was applied for those predicted to be deferred (Figure 3). For RF, the economic effect was −0.15 euro per donation, that is, an economic savings of 0.15 euro per donation, and the average interval extension was 1.1. Deployment of this model would result in avoiding 51% of the deferrals. In the models that were trained by stratifying by sex, the average cost effect for males was 0.02 euro per donation, whereas, for women, the effect was −0.11 euro per donation. If both male and female models were applied, then the DLMM with FinDonor data gave the second-largest savings at approximately 0.1 euro per donation, with the average donation interval length being 1.12-fold. This model enabled us to avoid 51% of deferrals.
As the baseline model predicts based on previous Hb only, the probability thresholds that were found to provide the largest savings correspond to specific Hb values. For 6-month deferral, these were 147 g/L for men and 135 g/L for women; and for 12-month deferral, 141 g/L for men and 122 g/L for women.
DISCUSSIONOur reimplementation of the LMMs gives equal or slightly weaker results in terms of MAE and RMSE (Table S13) but better results in terms of AUROC than in Fokkinga [5]. This is probably due to larger data and more variables, but these approaches still fail to predict lower Hb values. For example, the female DLMM on Biobank data predicts for all but one donation where Hb is below 125 g/L higher than 125 g/L Hb. However, if the deferral threshold that gives optimal economic effects is used instead, then we can avoid 49% of the deferrals while falsely predicting as deferred only 18% of the viable donations (Figure 4). We expect the prediction results to be more accurate in countries where the deferral rate is higher than in Finland since the ratio of accepted and deferred donations is more balanced. Although the incorporation of genetic information as predictors improves the prediction (Figure 3, Table S14), the effect appears small in relation to the costs of genotyping. The SNP rs199598395 in the RNF43 gene was discovered by the FinnGen project as a lead SNP for iron deficiency anaemia (http://r4.finngen.fi/pheno/D3_ANAEMIA_IRONDEF). Its effect size is large, but the minor allele is only present in ~2% of donors. Overall, in Finns, it is present in ~1% of people, in Europeans (non-Finnish) ~0.01% and it is not found in other populations [14]. This highlights the possibility that further study of population-specific or rare genetic variation could considerably increase the value of genetic predictors.
Our RF model performs similarly to logistic regression with non-linear predictors [3] in predicting deferral but is simpler and easier to train. There is no apparent performance difference between the LMMs and RF in predicting deferral. Importantly, these complicated models seem to have little benefit over a simple one-predictor logistic regression (baseline model).
Due to the low accuracy in Hb prediction and the fact that the effect of the “days to previous full blood donation” variable is small, we were unable to define completely personalized donation intervals. However, our calculations on the blood supply and economic effects indicate that cost reduction is still possible through a fixed deferral (6 months) given to donors (especially female donors) predicted to be deferred. To our knowledge, this is the first report that estimates the blood supply and economic effects of deploying a deferral prediction model. However, our calculations are based on two assumptions: (1) that every euro spent on marketing will result in a proportional number of new donors coming in and (2) that the Hb values recover as a function of time. Although assumption (1) is certainly not universally valid, we believe that it is very likely to be valid for the small adjustments we make here.
In conclusion, our results suggest that pre-donation Hb data could be used much more efficiently to bring savings and health benefits. Furthermore, savings to donors will result in saved time and travel expenses [15], although we did not include them in our estimation. If the pre-donation Hb value is found to be below the threshold for economic effects but above the deferral limit, then the donor can donate but is deferred, for example, for 6 months. We do not find that the more complicated computational predictors could greatly improve on this. However, more predictive data such as ferritin measurements at every donation, more informative genetic data, or iron consumption and menstruation data could bring significant improvements. We have started evaluating the deployment of the threshold-based system at the FRCBS. This includes assessing the effect of varying the cost parameters, risk analysis and possible testing of the procedure at a single donation site.
The source code of the model implementations is available at GitHub (see Supporting Information for details) and Zenodo [16]. Furthermore, a ready-to-use prediction application as a Docker [17] software container is also provided. Its user interface, which runs in a web browser, facilitates easy use for non-programmers (see Figure 5). These resources allow others to test our models with their data and develop them further; see Section S2.3 for more details.
Maike Sweegers and Katja van den Hurk gave feedback for container input data. We also thank Jukka Partanen and Johanna Castrén for their comments on the manuscript. J.T. performed the research and wrote the first draft of the manuscript; Y.K. developed the linear models and performed early analysis; E.T. developed the economic analysis; F.P. helped in developing and testing the prediction methods; P.B.P. computed the polygenic scores for anaemia; M.H. participated in the development and analysis of the machine learning methods; M.A. implemented the RF method, supervised the research and reviewed and edited the manuscript.
CONFLICT OF INTERESTThere are no conflicts identified.
Filename Description vox13223-sup-0001-Supinfo.pdfPDF document, 6.5 MBAppendix S1 Supporting Information
Please note: The publisher is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.
REFERENCES
1Custer B, Chinn A, Hirschler NV, Busch MP, Murphy EL. The consequences of temporary deferral on future whole blood donation. Transfusion. 2007; 47: 1514– 23. 2Baart AM, De Kort WLAM, Moons KGM, Vergouwe Y. Prediction of low haemoglobin levels in whole blood donors. Vox Sang. 2011; 100: 204– 11. 3Baart AM, De Kort WLAM, Atsma F, Moons KGM, Vergouwe Y. Development and validation of a prediction model for low hemoglobin deferral in a large cohort of whole blood donors. Transfusion. 2012; 52: 2559– 69. 4Nasserinejad K. Modeling longitudinal data of blood donors. PhD dissertation. Erasmus University Rotterdam. 2016. 5Fokkinga J. Modelling hemoglobin levels of blood donors. Master's thesis. Erasmus University Rotterdam. 2018. 6Bäckman S, Larjo A, Soikkeli J, Castrén J, Ihalainen J, Syrjälä M. Season and time of day affect capillary blood hemoglobin level and low hemoglobin deferral in blood donors: analysis in a national blood bank. Transfusion. 2016; 56: 1287– 94. 7Lobier M, Niittymäki P, Nikiforow N, Palokangas E, Larjo A, Mattila P, et al. FinDonor 10 000 study: a cohort to identify iron depletion and factors affecting it in Finnish blood donors. Vox Sang. 2020; 1: 36– 46. 8Carpenter B, Gelman A, Hoffman MD, Lee D, Goodrich B, Betancourt M, et al. Stan: a probabilistic programming language. J Stat Softw. 2017; 76: 1– 32. 9Breiman L. Random forests. Mach Learn. 2001; 45: 5– 32. 10Liaw A, Wiener M. Classification and regression by randomForest. R News. 2002; 2: 18– 22. 11Kuhn M caret: classification and Regression Training [Internet]. 2020. [Cited 2021 Oct 5]. Available from: https://cran.r-project.org/package=caret 12Kiss JE, Brambilla D, Glynn SA, Mast AE, Spencer BR, Stone M, et al. Oral iron supplementation after blood donation: a randomized clinical trial. JAMA. 2015; 313: 575– 83. 13Schotten N, Pasker-de Jong PCM, Moretti D, Zimmermann MB, Geurts-Moespot AJ, Swinkels DW, et al. The donation interval of 56 days requires extension to 180 days for whole blood donors to recover from changes in iron metabolism. Blood. 2016; 128: 2185– 8. 14Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alföldi J, Wang Q, et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature. 2020; 581: 434– 43. 15de Kort W, van den Burg P, Geerligs H, Pasker-de Jong P, Marijt-van der Kreek T. Cost-effectiveness of questionnaires in preventing transfusion-transmitted infections. Transfusion. 2014; 54: 879– 88. 16Toivonen J, Koski Y, Arvas M. Software for the article Prediction and impact of personalised donation intervals. 2021. https://doi.org/10.5281/zenodo.5549879#.YVw31tABAhp.mendeley 17 Docker Inc. Docker 2020 [Internet]. [cited 2021 Oct 5]. Available from: https://www.docker.com/
Comments (0)