Application and Prospects of Artificial Intelligence Technology in Early Screening of Chronic Obstructive Pulmonary Disease at Primary Healthcare Institutions in China

Introduction

Chronic Obstructive Pulmonary Disease (COPD) is a heterogeneous lung disease that causes persistent and progressive airflow obstruction, and it has become one of the major threats to global health. Chronic respiratory symptoms caused by COPD, such as difficulty breathing, coughing, and expectoration, not only severely affect the quality of life of patients but also pose an urgent public health issue due to its high mortality rate and economic burden.1 It is estimated that approximately 6% of deaths worldwide are attributed to COPD,2 and the prevalence and mortality rates in China are particularly significant. With the aging population and continuous exposure to risk factors of COPD, the prevalence of COPD in Chinese adults over 40 years old has sharply risen to 13.7%3 in recent years, affecting nearly 100 million adults.4

COPD is a preventable and treatable disease, and early diagnosis is crucial for controlling disease progression and improving prognosis. Screening and early intervention in high-risk groups can delay disease progression and improve outcomes. However, the current awareness and diagnosis rates of COPD remain low. The survey results of the “China Adult Lung Health Study” show that only 10% of the study subjects were aware of the disease name COPD, and less than 10% had undergone lung function tests.4 Primary healthcare institutions play an important role in providing basic healthcare to residents, particularly in the discovery and early diagnosis of COPD among high-risk groups.5 With the development and application of Artificial Intelligence (AI) technology, these institutions are expected to enhance the efficiency and accuracy of COPD screening.

The application of artificial intelligence in medicine, especially within the field of respiratory systems, is gradually becoming a focal point. In this domain, particularly in the diagnosis and management of chronic obstructive pulmonary disease (COPD), the development of digital health technologies and big data science has brought unprecedented opportunities to the medical domain. Artificial intelligence has not only demonstrated high accuracy in interpreting pulmonary function test results,6 but also shown immense potential in medical imaging.7 Radiomics techniques extract a plethora of features from medical images using machine learning methods, providing new avenues for the early diagnosis and risk assessment of COPD. These techniques are capable of detecting early-stage patients that are difficult for the human eye to identify. Furthermore, deep learning techniques, particularly deep convolutional neural networks (CNNs), have exhibited outstanding performance in automatically interpreting lung CT images and diagnosing obstructive lung diseases, offering powerful tools for the diagnosis and management of COPD.8 Additionally, with the continuous evolution of digital health technologies and the development of big data science, an increasing number of studies are focusing on leveraging machine learning and radiomics techniques, combined with large-scale clinical data, to improve early screening, diagnosis, and treatment plans for COPD. The aim is to achieve personalized medicine and enhance patient quality of life.9 In summary, the development of digital health technologies and big data science brings new hope for the diagnosis and management of COPD, while the application of artificial intelligence provides strong support for achieving personalized medicine and improving medical efficiency. Exploring the application of AI in enhancing disease screening and early diagnosis capabilities in primary healthcare institutions is of key significance in addressing this major public health challenge.

Methods

A comprehensive search of Chinese and English databases was conducted, including the CNKI and Wanfang databases within China, as well as the Embase and PubMed databases, to retrieve relevant data. Additionally, potential studies meeting the criteria were identified through manual searches. The research scope was limited to Chinese and English literature published between January 1, 2019, and January 1, 2024. Keywords used included both Chinese and English terms such as Chronic Obstructive Pulmonary Disease (COPD), Early Screening, COPD Screening, Artificial Intelligence, Primary Healthcare Institutions, Healthcare Data Analysis, and Healthcare in China. Following database searches, articles were manually screened to select those that met the criteria for inclusion in this study. Specifically, 12 articles were selected from the CNKI database, 4 articles from the Wanfang database, 19 articles from the English PubMed database, and 15 articles from the Embase database.

Target Population for Screening Strategies

The high-risk group for COPD includes individuals with: a long history of smoking (over 20 pack-years); recurrent chest infections; early life events, such as frequent respiratory infections in earlier years; increasing age, especially those over 40; and household air pollution. In low and middle-income countries, nearly 3 billion people use biomass and coal as the main sources of energy for cooking, heating, and other household needs, thus posing a substantial global risk to a large population.10

Current COPD Screening Methods and Tools Symptom Assessment and Physical Examination

This involves taking a medical history, focusing on chronic cough, expectoration, difficulty breathing, chest tightness, and other symptoms. Additionally, assessing the patient’s quality of life is important to understand the impact of symptoms on daily activities.10 This assessment can predict the occurrence and progression of COPD. Some primary healthcare institutions can screen high-risk groups for COPD based on symptom and quality of life assessments.11 Physical examination is crucial in the care of COPD patients, but in terms of diagnosis, its sensitivity and specificity are relatively low due to the fact that signs of airflow obstruction typically do not appear until lung function is significantly impaired.1

Pulmonary Function Tests

These include spirometry and peak flow meter tests. Spirometry is the gold standard for diagnosing COPD.1 The test measures the volume of air expelled by the patient in a certain amount of time, as well as the total volume of air that can be exhaled. Spirometry can assess airflow limitation, characterized by a reduced ratio of forced expiratory volume in one second (FEV1) to forced vital capacity (FVC).1

PEF Measurement

Peak Expiratory Flow (PEF) measurement assesses the speed of forced exhalation, representing the peak flow rate of air during exhalation. It is used as a screening tool to identify airflow limitation. Peak flow meter tests measure the maximum speed of airflow expelled from the lungs, providing quick information about changes in lung function, but they are less diagnostic compared to spirometry.12

Imaging Studies

Chest X-rays, while not decisive for the diagnosis of COPD, can help rule out other lung diseases such as tuberculosis or lung cancer. High-Resolution Computed Tomography (HRCT) is used in some cases for more detailed assessment of lung structure, especially in patients with atypical X-ray findings.13

Biomarkers and Blood Tests

Although there are no specific biomarkers for the screening of COPD at present, certain blood tests (such as blood gas analysis, C-reactive protein) can provide information about the patient’s inflammatory status and oxygenation.14

Questionnaire Surveys

Questionnaires are an economical and convenient method for early screening of COPD. The GOLD guidelines recommend that primary healthcare institutions use questionnaires for active case finding. Questionnaire surveys are significant for primary healthcare institutions in identifying high-risk groups for COPD. Various COPD screening questionnaires have been clinically developed, such as the CAPTURE questionnaire,15 Chronic Obstructive Pulmonary Disease Screening Questionnaire (COPD-SQ),16 etc. The COPD-SQ was developed by a research team using data collected in 2002 from 19,800 Chinese subjects aged 40, including 7 items: age, smoking pack-years, body mass index, cough, difficulty breathing, family history of respiratory diseases, and exposure to biomass smoke from cooking. The scale has a COPD diagnosis sensitivity of 60.6%, specificity of 85.2%, and an accuracy rate of 82.7%.17 The widely used screening questionnaires for COPD are often based on studies conducted on populations from other countries, and whether they are applicable to China remains to be further explored. In 2023, a small-scale COPD screening questionnaire was applied in community-based primary healthcare institutions in China. The results showed that compared to diagnostic pulmonary function tests, the COPD-SQ scale demonstrated a specificity of 59.69%, accuracy (approximate index of 0.246), and an area under the curve of 0.744 (95% CI 0.650, 0.837).18

Application of AI in COPD Screening Diagnostic Assistance

AI-Assisted Pulmonary Function Diagnosis: Pulmonary function tests play a crucial role in diagnosing COPD. Lung parameters measured by these tests, such as Forced Vital Capacity (FVC) and Forced Expiratory Volume in One Second (FEV1), are the gold standards for confirming COPD. Besides, pulmonary function is also used for staging COPD patients, differentiating other respiratory diseases with similar symptoms like asthma in early diagnosis, and monitoring disease progression in long-term management. Regular pulmonary function tests help in monitoring changes in the condition, assessing future health risks, evaluating treatment effectiveness, and timely adjusting treatment plans. Therefore, standardized and accurate pulmonary function test results are vital for comprehensive COPD management. Artificial Intelligence can assist in interpreting results and diagnosing pulmonary function tests.19 Marko Topalovic and others compared the accuracy of AI software and pulmonologists in interpreting pulmonary functions. The results showed that AI software had a match rate of 100% in interpreting test results and a diagnostic accuracy rate of 82%. In contrast, pulmonologists had a match rate of 74.4% and a diagnostic accuracy rate of 44.6%. This study highlights the potential of AI in improving the accuracy of pulmonary function interpretation and its possibility as a decision-support tool in clinical practice.20

AI Technology in Recognizing Imaging for Early Diagnosis: AI technology has been used to analyze lung CT scans and X-ray images to identify early images of COPD. Through deep learning algorithms, it can accurately pinpoint changes in lung structure, assisting doctors in early and precise detection of COPD.21 The process of imaging omics primarily involves importing imaging data, utilizing software such as ITK-SNAP and 3D Slicer to delineate regions of interest, extracting imaging omics features, and establishing machine learning models based on the selected imaging omics features, such as logistic regression (LR) model, support vector machine (SVM) model, K-nearest neighbor (KNN) algorithm, etc. Finally, the performance of the machine learning model is evaluated.9 For instance, in 2022, Li Z and others developed a Graph Convolutional Network (GCN) for early detection of COPD. This method utilized chest computed tomography image data from the publicly available Danish Lung Cancer Screening Trial database. The GCN model achieved an accuracy of 0.77 and an area under the curve of 0.81.22

AI Technology in Integrating and Analyzing Data for Early Diagnosis

AI can efficiently process and analyze large volumes of electronic health records, mine data, and extract key information to help identify risk factors for COPD and predict high-risk groups, such as those with a history of smoking or occupational exposure. By analyzing patients’ past medical records and vital signs, AI can build predictive models to forecast the development trends of COPD, further assisting doctors in making better personalized screening decisions.23 For example, Clinical Decision Support Systems (CDSS) are defined as electronic systems that directly assist in clinical decision-making, which can help generate specific evaluations or recommendations for patients and present them to clinical doctors for reference. Although the World Health Organization (WHO) has identified the development of health information systems and digital technologies (including CDSS) as one of the priorities for strengthening primary healthcare, the application of CDSS for evaluating patients with respiratory distress in primary healthcare and outpatient services in China has not yet become widespread.24 Another study published in “CHEST” explored the use of AI as a predictive tool to identify high-risk COPD patients.25 Certain AI systems use sensor data collected from wearable devices to assist in diagnosing COPD by analyzing changes in breathing patterns, such as frequency, depth, and rhythm.26

Challenges and Limitations of AI in COPD Screening

While Artificial Intelligence (AI) offers many potential advantages in the screening of Chronic Obstructive Pulmonary Disease (COPD), there are also significant challenges and limitations.

Data Privacy: In terms of protecting sensitive information, safeguarding personal privacy is crucial when handling patients’ medical records and diagnostic data. It is essential to ensure that data collection and processing comply with relevant privacy laws and standards, such as the European Union’s General Data Protection Regulation (GDPR).27 Regarding data security, strong safety measures must be implemented to prevent data breaches and unauthorized access. This includes data encryption, secure data storage and transmission, and strict control over data access. In China, the privacy of medical information data is covered by the Personal Information Protection Law28 implemented in 2021 and the Data Security Law,29 both of which involve the secure processing of data, including medical data. These laws regulate data processing activities to ensure data security and prevent data leakage. Furthermore, the “National Health and Medical Big Data Standards, Security, and Service Management Measures (Trial)”,30 specifically aimed at data security management in the healthcare industry, covers regulations on the collection, storage, transmission, sharing, and destruction of medical data. These laws and regulations constitute the legal framework for the protection of medical information data privacy in China, requiring medical institutions and related companies to strictly adhere to the relevant provisions of privacy protection and data security. As technology develops and the application of data expands, these regulations may be continuously updated and improved to meet new challenges and needs.

Algorithm Accuracy: Data Quality and Representativeness: The accuracy of AI models highly depends on the quality and representativeness of the data used for training. In a study on the diagnostic value of artificial intelligence assisted diagnosis of COPD in China, the sensitivity of the artificial intelligence robot questionnaire for screening COPD was 76.11%, the specificity was 84.76%, the Jordan index was 60.87%, and the area under the ROC curve was 0.858.9 Based on CT imaging, COPD recognition was performed using a multi instance learning logic classifier on a dataset of 100 COPD patients and 100 healthy subjects, resulting in an AUC of 0.742.31 If the data is biased or incomplete, it can lead to decreased performance of the model in real-world applications.32 Different populations (such as different ethnicities, age groups, or geographical locations) may exhibit different phenotypic variations of COPD.33 AI models need to be able to adapt to these differences. For example, in a study, six machine learning classifiers, LR, SVM, KNN, RandomForest, ExtraTrees, and XGBoost, were trained to construct an imaging omics model. The performance of the imaging omics models constructed based on these six machine learning models varies, with XGBoost having the best diagnostic value for COPD. However, the sample size used for the establishment of imaging omics models in China is relatively small, and further expansion of the sample size is needed for validation.9 Moreover, most studies are single center studies, and external independent tests need to be conducted in multiple centers.A 2022 study suggested that creating a specific extension for quality assessment tools for AI research in diagnostic accuracy could help safely translate AI tools into clinical practice.34

Interpretability Issues: Many AI models, especially deep learning models, are considered “black boxes” because their decision-making processes are difficult to explain. This is particularly important in the medical field, where both doctors and patients may need to understand the basis of the model’s decisions.35 When using AI in medical decision-making, it is necessary to clarify issues of responsibility attribution and compliance. For instance, if an AI model’s diagnostic or treatment recommendation is incorrect, it should be clear where the responsibility lies.

Conclusion and Outlook

The application of AI in the screening of COPD at primary healthcare institutions in China is gradually unfolding. AI technology has shown potential in disease screening, condition monitoring, data analysis, and patient education, particularly in handling large volumes of electronic health records, data mining, and extracting key information. AI provides an effective tool for identifying COPD risk factors and predicting high-risk groups. Additionally, AI has significant advantages in enhancing telemedicine services, designing personalized medical plans, and improving the efficiency of public health strategies.

To effectively enhance the application of AI technology in COPD screening at primary healthcare institutions in China, training and education can be implemented to improve medical personnel’s understanding and ability to use AI, ensuring its effective application in COPD screening and management. Secondly, improving the quality of data collection and processing, including ensuring the accuracy and comprehensiveness of collected health data and strengthening data privacy and security, is essential so that AI can be more effectively used for data analysis and decision support. Furthermore, promoting interdisciplinary collaboration between the healthcare sector and fields such as computer science and data analysis is crucial for developing and implementing effective AI solutions. Lastly, the government and related institutions need to strengthen policy support and financial investment to promote the widespread use and application of AI technology in primary healthcare institutions. By implementing these comprehensive measures, the application of AI in COPD screening at primary healthcare institutions in China will be significantly enhanced, better addressing this major public health challenge.

Acknowledgments

Funded by Capital’s Funds for Health Improvement and Research (CFH 2022-4-7014).

Disclosure

The author reports no conflicts of interest in this work.

References

1. Global Initiative for Chronic Obstructive Lung Disease (GOLD). Global strategy for prevention, diagnosis and management of COPD: 2024 Report; 2024. Available from: https://goldcopd.org/2024-gold-report/. Accessed May10, 2024.

2. Meghji J, Mortimer K, Agusti A, et al. Improving lung health in low-income and middle-income countries: from challenges to solutions. Lancet. 2021;397(10277):928–940. PMID: 33631128. doi:10.1016/S0140-6736(21)00458-X

3. Zhong NS, Wang C, Yao WZ, et al. Prevalence of chronic obstructive pulmonary disease in China: a large, population⁃based survey. Am J Respir Crit Care Med. 2007;176(8):753⁃760.

4. Wang C, Xu J, Yang L, et al. Prevalence and risk factors of chronic obstructive pulmonary disease in China (the China Pulmonary Health [CPH] study): a national cross ⁃ sectional study. Lancet. 2018;391(10131):1706⁃1717.

5. Chen M, Ye K, Xu Z, et al. Current situation and prospect of chronic obstructive pulmonary disease community management in China. Chin Gen Pract. 2020;23(3):251–256. doi:10.12114/j.issn.1007-9572.2019.00.756

6. Topalovic M, Das N, Burgel PR, et al. Pulmonary function study investigators; pulmonary function study investigators: Artificial intelligence outperforms pulmonologists in the interpretation of pulmonary function tests. Eur Respir J. 2019;53(4):1801660. PMID: 30765505. doi:10.1183/13993003.01660-2018

7. Yang Y, Li W, Guo Y, et al. Early COPD risk decision for adults aged from 40 to 79 years based on lung radiomics features. Front Med Lausanne. 2022;9:845286. PMID: 35530043; PMCID: PMC9069013. doi:10.3389/fmed.2022.845286

8. Zhang L, Jiang B, Wisselink HJ, et al. COPD identification and grading based on deep learning of lung parenchyma and bronchial wall in chest CT images. Br J Radiol. 2022;95(1133):20210637. PMID: 35143286. doi:10.1259/bjr.20210637

9. Wang WY. To explore the diagnostic value of artificial intelligence robot pre-diagnostic risk assessment and radiomics in chronic obstructive pulmonary disease [Master’s thesis]. Dalian Medical University; 2023.

10. Chinese Medical Association, Chinese Medical Association Press, General Practice Branch of Chinese Medical Association, Pulmonary Function Professional Group of Chinese Medical Association Respiratory Diseases Branch, Editorial Committee of Chinese Journal of General Practitioners, Expert Group for Writing Primary Diagnosis and Treatment Guidelines of Respiratory System Diseases. General guidelines for routine pulmonary function tests (2018). Chin J Gen Practit. 2019;18(6):511–518. doi:10.3760/cma.j.issn.1671-7368.2019.06.003

11. Price DB, Tinkelman DG, Halbert RJ, et al. Symptom-based questionnaire for identifying COPD in smokers. Respiration. 2006;73(3):285–295. doi:10.1159/000090142

12. Martinez F, Mannino D, Leidy NK, et al. A new approach for identifying patients with undiagnosed chronic obstructive pulmonary disease. Am J Respir Crit Care Med. 2017;195(6):748–756. doi:10.1164/rccm.201603-0622OC

13. Schoepf UJ, Bruening RD, Hong C, et al. Multislice helical CT of focal and diffuse lung disease: comprehensive diagnosis with reconstruction of contiguous and high-resolution CT sections from a single thin-collimation scan. AJR Am J Roentgenol. 2001;177(1):179–184. PMID: 11418423. doi:10.2214/ajr.177.1.1770179

14. Agusti A, Sin DD. Biomarkers in COPD. Clin Chest Med. 2014;35(1):131–141.

15. Leidy NK, Martinez FJ, Malley KG, et al. Can CAPTURE be used to identify undiagnosed patients with mild-to-moderate COPD likely to benefit from treatment? Int J Chronic Obstr. 2018;13:1901–1912. doi:10.2147/COPD.S152226

16. Zhou Y, Liu S, Lv J, et al. Design of a survey method for the prevalence of chronic obstructive pulmonary disease in China. Chin J Epidemiol. 2006;27(9):814–818.

17. Zhou YM, Chen SY, Tian J, et al. Development and validation of a chronic obstructive pulmonary disease screening questionnaire in China. Int J Tuberc Lung Dis. 2013;17(12):1645–1651. doi:10.5588/ijtld.12.0995

18. Yang X, Yao M, Yin D, et al. Comparative study on chronic obstructive pulmonary disease screening tools in primary healthcare institutions in Beijing, China. Int J Chron Obstruct Pulmon Dis. 2023;18:1773–1781. PMID: 37608835; PMCID: PMC10441650. doi:10.2147/COPD.S419550

19. Kaplan A, Cao H, FitzGerald JM, et al. Artificial intelligence/machine learning in respiratory medicine and potential role in asthma and COPD diagnosis. J Allergy Clin Immunol Pract. 2021;9(6):2255–2261. PMID: 33618053. doi:10.1016/j.jaip.2021.02.014

20. Topalovic M, Das N, Burgel P-R, et al. Artificial intelligence outperforms pulmonologists in the interpretation of pulmonary function tests. Eur Respir J. 2019;2019:1.

21. Fischer AM, Varga-Szemes A, Van assen M, et al. Comparison of artificial intelligence-based fully automatic chest CT emphysema quantification to pulmonary function testing. AJR Am J Roentgenol. 2020;214(5):1065–1071. PMID: 32130041. doi:10.2214/AJR.19.21572

22. Li Z, Huang K, Liu L, et al. Early detection of COPD based on graph convolutional network and small and weakly labeled data. Med Biol Eng Comput. 2022;60(8):2321–2333. doi:10.1007/s11517-022-02589-x

23. Zafari H, Langlois S, Zulkernine F, Kosowan L, Singer A. AI in predicting COPD in the Canadian population. Biosystems. 2022;211:104585. PMID: 34864143. doi:10.1016/j.biosystems.2021.104585

24. Sunjaya AP, Ansari S, Jenkins CR. A systematic review on the effectiveness and impact of clinical decision support systems for breathlessness. Npj Primary Care Respirat Med. 2024. doi:10.1038/s41533-022-00291-x

25. Deshpande R, Koester C, Singanallur P, et al. Artificial intelligence as a predictive tool to identify patients with COPD at high risk for 30-day readmission. Chest. 2021;160(4):A1409. doi:10.1016/j.chest.2021.07.1289

26. Davies HJ, Bachtiger P, Williams I, et al. Wearable In-Ear PPG: detailed respiratory variations enable classification of COPD. IEEE Trans Biomed Eng. 2022;69(7):2390–2400. PMID: 35077352. doi:10.1109/TBME.2022.3145688

27. Voigt P, Von Dem Bussche A. The eu general data protection regulation (gdpr). A Practical Guide. 2017;10(3152676):10–5555.

28. Cheng Xiao. On the personal information processing rules in China’s personal information protection law. Tsinghua Law Rev. 2021;15(3):55–73.

29. Guifeng L, Bingying R, Qiong L. Strengthening data security protection and enhancing data governance capabilities: an interpretation of the draft ‘data security law of the People’s Republic of China’. J Agricult Lib Informat Sci. 2021;33(4):4–13.

30. Tong R, Jiahui Q, Zhixing Z, et al. Medical data governance - building a high-quality medical big data intelligent analysis data foundation. Big Data. 2019;5(1):12–24.

31. Cheplygina V, Pena IP, Pedersen JH, et al. Transfer learning for multicenter classification of chronic obstructive pulmonary disease. IEEE J Biomed Health Inform. 2018;22(5):1486–1496. doi:10.1109/JBHI.2017.2769800

32. Wang H, Fu T, Du Y, et al. Scientific discovery in the age of artificial intelligence. Nature. 2023;620(7972):47–60. PMID: 37532811. doi:10.1038/s41586-023-06221-2

33. Alcázar-Navarrete B, Trigueros JA, Riesco JA, Campuzano A, Pérez J. Geographic variations of the prevalence and distribution of COPD phenotypes in Spain: ”the ESPIRAL-ES study”. Int J Chron Obstruct Pulmon Dis. 2018;13:1115–1124. PMID: 29692606; PMCID: PMC5901135. doi:10.2147/COPD.S158031

34. Jayakumar S, Sounderajah V, Normahani P, et al. Quality assessment standards in artificial intelligence diagnostic accuracy systematic reviews: a meta-research study. Npj Digital Med. 2022;5(1):11. doi:10.1038/s41746-021-00544-y

35. Chaddad A, Peng J, Xu J, Bouridane A. Survey of Explainable AI Techniques in Healthcare. Sensors. 2023;23(2):634. PMID: 36679430; PMCID: PMC9862413. doi:10.3390/s23020634

Comments (0)

No login
gif