This is the first hands-on application of text mining tools (TMT) in Biomaterials.
•We apply multiple TMTs to extract information from the Biomaterials literature.
•TMTs produce an informative research map of polydioxanone with main topics & trends.
•TMTs also highlight research gaps, missing assets & unresolved obstacles.
•We showcase NER’s potential to extract deep data & drive discoveries in Biomaterials.
AbstractScientific information extraction is fundamental for research and innovation, but is currently mostly a manual, time-consuming process. Text Mining tools (TMTs) enable automated, accurate and quick information extraction from text, but there is little precedent of their use in the biomaterials field. Here, we compare the ability of various TMTs to extract useful information from biomaterials abstracts. Focusing on the biocompatibility of polydioxanone, a biodegradable polymer for which there are relatively few scientific publications, we tested several tools ranging from machine learning approaches and statistical text analysis to MeSH indexing and domain-specific semantic tools for Named Entity Recognition. We also evaluated their output alongside a manual review of systematic reviews and meta-analyses. The findings show that TMTs can be highly efficient and powerful for mapping biomaterials texts and rapidly yield up-to-date information. Here, TMTs enable one to identify dominating themes, see the evolution of specific terms and topics, and learn about key medical applications in biomaterials literature over the years. The analysis also shows that ambiguity around biomaterials nomenclature is a significant challenge in mining biomedical literature that is yet to be tackled. This research showcases the potential value of using Natural Language Processing and domain-specific tools to extract and organize biomaterials data.
AbbreviationsBCTEOBone and Cartilage Tissue Engineering Ontology
CHEBIChemical Entities of Biological Interest
CT/MRIComputed Tomography/Magnetic Resonance Imaging
DEBDevices, Experimental Scaffolds and Biomaterials Ontology
DEBBIEDatabase of Experimental Biomaterials and their Biological Effect
GMDNGlobal Medical Device Nomenclature
hLDAHierarchical Latent Dirichlet Allocation
MEDLINENational Library of Medicine
MeSHMedical Subject Headings
NERNamed Entity Recognition
NLPNatural Language Processing
PMIDPubMed Unique Identifier
RCTRandomized Clinical Trial
RNRegistry Number/EC Number
SGDStochastic Gradient Descent
SR&MAsSystematic Reviews and Meta-Analyses
KeywordsBiomaterials
Text mining
Polydioxanone
Biocompatibility
Information extraction
© 2023 The Authors. Published by Elsevier B.V.
Comments (0)