Background Periodontal disease is a prevalent chronic inflammatory condition associated with systemic health complications. Early identification of high-risk individuals is crucial for targeted preventive and therapeutic interventions. In this study, we aimed to develop and evaluate machine learning models using routine blood tests to predict periodontal disease risk. This will help to develop accessible and cost-effective screening tools for early detection in a non-dental setting.
Methods This study utilized data from the 2013-2014 National Health and Nutrition Examination Survey (NHANES), including full-mouth periodontal examinations, demographic variables, and routine blood tests (complete blood count [CBC], lipid profile, liver and kidney function tests). Periodontitis was defined as a binary outcome based on attachment loss and probing depth at interproximal sites. Individuals meeting any of these criteria were classified as having periodontitis. Seven machine learning models were developed and evaluated using precision, recall, and accuracy metrics.
Results The Random Forest Classifier achieved the highest performance, scoring 0.91 in precision, recall, and accuracy for predicting periodontitis. Key predictive features included smoking status, age, education level, gamma glutamyl transferase, albumin, blood urea nitrogen, glucose, phosphorus, creatinine, basophil count, and total calcium levels.
Conclusion This study underscores the promise of machine learning, especially the Random Forest Classifier, in predicting periodontal disease risk using routine blood tests and demographics. The model accurately identified periodontitis cases, with key features revealing the complex relationship between systemic health and periodontal disease. This suggests the potential for developing machine learning-based screening tools for early periodontal disease detection in non-dental settings.
Competing Interest StatementThe authors have declared no competing interest.
Funding StatementThis research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.
Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
https://www.cdc.gov/nchs/nhanes/index.htm
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
FootnotesAuthor Approval: All authors have seen and approved the manuscript.
Competing Interests Statement: The authors declare no competing interests.
Data Availability Statement: The data that supports the findings of this study are available from the corresponding author upon reasonable request.
Funding Statement: This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.
Data AvailabilityThe data that supports the findings of this study are available from the corresponding author upon reasonable request.
Comments (0)