Using Artificial Intelligence to Detect Risk of Family Violence: Protocol for a Systematic Review and Meta-Analysis

Introduction

Family violence is a major public health problem and can include behaviors such as physical, sexual, and verbal abuse; coercive control; and emotional, spiritual, religious, and financial abuse perpetrated by 1 adult partner to another. Definitions of family violence vary in the research literature, which has implications for reported rates of family violence and, importantly, policy []. For this article, family violence will be defined by Section 4AB (1) of the Australian Family Law Act 1975 as “threatening or other behavior by a person that coerces or controls a member of the person’s family (the family member) or causes the family member to be fearful.” Family violence is most commonly, but not exclusively, perpetrated by men against women [], and these experiences can have considerable mental health impacts, such as depression, posttraumatic stress disorder and anxiety [], and physical consequences for survivors, including serious injury and death []. Recent estimates suggest that globally 27% of women aged between 15 and 49 years old have experienced intimate partner violence []. In Australia, the most recent statistics suggest that approximately 20% of adults have experienced family violence at some point since the age of 15 years [].

The prevalence and impact of family violence underscores the importance of early detection and prevention. Addressing family violence requires prevention strategies across a range of contexts, such as social policy, culture, health care as well as the familial level []. Prevention of family violence has relied primarily upon education strategies, with a large emphasis on educating survivors of family violence [], which have reported mixed results. For instance, Noughani and Mohtashami [] found that dissemination of an educational booklet to women did not change the reported incident rate of family violence in their sample. However, a significant reduction in family violence after three 60-minute educational classes for women was reported by Taghdisi et al []. Education strategies have also targeted health professionals such as nurses []. Given the varying levels of success of these initiatives and the often-hidden nature of family violence, novel approaches for detecting family violence are needed.

Family violence increased during the COVID-19 pandemic, exacerbating the mental and physical health difficulties experienced by individuals, families, and communities []. Telehealth services and online mental health support tools proliferated during the COVID-19 pandemic [,]. The use of technology in psychology has been found to be useful across a range of settings. For instance, telehealth is effective in treating several mental health disorders [] and previous research has examined the use of technology regarding family violence screening and interventions []. According to a review by El Morr and Layal [], information and communication technologies facilitate greater levels of disclosure of family violence victimization than is achieved though in-person screening; however, only standardized screeners were studied and alternate means of disclosure (eg, free-text questions and voice signals) were not considered.

Progress in information technology, particularly artificial intelligence (AI) has led to some significant advancements in medical and public health interventions. In a recent systematic review, Qui et al [] found that large AI models have been used in bioinformatics, medical diagnoses, imaging, informatics, education, robotics, and public health. These models include large language models, large vision models, and large multimodal models. The authors caution that large language models are not yet reliable and may generate information that can mislead users. Further challenges and risks identified by the review include bias, privacy issues, the resource intensive requirements associated with updating large models, and emphasize the importance of situating the models in line with human ethics.

Recent research has examined the use of machine learning in mental health, including screening for suicide risk [], assessment of psychological distress [], and detection of family violence in medical records [,] as well as using voice signal informed classification []. Voice signal data such as vocal pitch [] and articulation [] have been found to distinguish between the voice recordings of distressed and nondistressed individuals. Similar approaches using text-based analysis have been developed to identify and to detect reports of family violence on Twitter []. The analysis of social media content can provide valuable insights into risk factors, in particular textual markers of family violence. A similar approach has been taken where advanced AI models, such as transformer architecture have been applied to the analysis of annotated clinical notes to detect family violence []. In a review by Iyer and Meyer [], timing patterns of speech were able to detect high risk of suicide callers compared with their comparison group with a median accuracy of 95%. Other vocal characteristics such as power spectral density sub-bands and mel-frequency cepstral coefficients demonstrated at least 80% accuracy in differentiating groups.

Machine learning and AI have also been used in predictive analytics across a range of settings, including crime and policing using Tweets [] and mobile phone behavioral data [] with high degrees of accuracy (70%-81%). Historically, predictive analysis regarding family violence has relied on police responses to questionnaires such as the Domestic Violence Safety Assessment Tool, and has poor predictive accuracy []. Given the success and accuracy previous AI tools have demonstrated at identifying psychological distress across both vocal and text-based settings, as well as predictive accuracy in some settings, research into predictive analytics and the use of AI to detect risk of family violence may have real world consequences, particularly in terms of prevention. However, to date, no research has amalgamated these findings to assess the accuracy of AI models in identifying the risk of family violence.

Detecting individuals who are at risk of perpetrating family violence is critical for the implementation of prevention strategies. The primary aim of this systematic review is to assess the accuracy of AI models in differentiating between individuals at risk of perpetrating family violence, versus those who are not, using textual or voice signal data. The following questions will inform this review: (1) What research using AI and machine learning has used textual or voice signal data to identify risk of family violence? and (2) What is the accuracy of such tools in differentiating between individuals at risk of perpetrating family violence versus those who are not?

MethodsInclusion and Exclusion Criteria

An overview of inclusion and exclusion criteria is presented in .

Textbox 1. Inclusion and exclusion criteria.

Inclusion criteria

Include human participants.Involve adult participants.Use machine learning methods.Differentiate between low and high risk of family violence perpetration.Use voice signal data or linguistic (textual) data.Nonexperimental and experimental studies.Reporting metrics of classification accuracy.

Exclusion criteria

Animal models.Child or adolescent perpetrators.Reviews (eg, narrative, systematic, meta-analysis, and meta-regression).Comments and editorials.The treatment of family violence.Involving child abuse.Involving elder abuse.The focus of enquiry is a condition other than family violence.

Additional inclusion criteria are that the papers are available in full text, and published in peer reviewed journals. Theses and dissertations will also be eligible for inclusion to yield the most exhaustive search possible. Languages other than English will be considered and translations conducted, subject to time and resource availability.

The decision to restrict the study population to adult participants is consistent with most research that recognizes adults as the key perpetrator population.

Search Strategy

The search strategy and terms were informed by a preliminary search for relevant publications. The search will be restricted to the following databases: IEEE Xplore, PubMed, PsycINFO, EBSCOhost (Psychology and Behavioral Sciences Collection), and Computers and Applied Sciences Complete. ProQuest Dissertations and Theses A&I will also be used to search the grey literature. The full text will be searched using the following representative syntax (ie, Pubmed) (((“Domestic Violence“[Mesh] NOT (“Child Abuse”[Mesh] OR “Elder Abuse”[Mesh])) OR “family violence”[All Fields] OR “domestic violence”[All Fields] OR “domestic abuse”[All Fields] OR “intra-familial violence”[All Fields] OR “spousal violence”[All Fields] OR “spousal abuse”[All Fields] OR “interpersonal violence”[All Fields] OR “interpersonal abuse”[All Fields] OR “intimate partner violence”[All Fields]) AND ((“natural language processing”[Mesh] OR “natural language processing”[All Fields] OR “word embeddings”[All Fields]) OR (“signal processing, computer-assisted”[MeSH Terms] OR “speech analysis”[All Fields] OR “acoustic”[All Fields] OR “emotional speech”[All Fields] “voice”[All Fields] OR “MFCC”[All Fields]))) NOT “review”[publication type]. The reference lists of all articles included for review will be searched for any additional publications as well as relevant reviews.

Study Selection

Retrieved studies will be loaded into NVivo (Lumivero) and title and abstract screening will be conducted by 2 researchers. A full-text review will be conducted by 2 authors and will resolve any discrepancies through discussion, or a third reviewer as needed. An overview of the study selection process is provided in .

Studies that meet all inclusion criteria will be included in the review. An overview of the participant, intervention, comparator, outcomes, and time (PICOT) [] is provided below () and forms the basis of data extraction. Screening and reporting of included publications will be in accordance with the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines (refer to for PRISMA checklist).

‎

Figure 1. Overview of study selection. Searches are yet to be performed. Textbox 2. Summary of the participant, intervention, comparator, outcomes, and time requirements.

Patient population

Intervention of interest

Reviewing the use of text and voice signal-based machine learning tools used to predict risk of family violence.

Comparison interventions

Primary outcomes

Accuracy of classification between low and high risk of family violence.

Time (Year published)

Other considerations

Language: all languages.Study designs: quantitative, experimental, and nonexperimental.Population: nonanimal investigations.Quality Assessment

Quality assessment will be performed by 2 researchers using the risk of bias tool in nonrandomized studies of interventions []. Included studies will be rated across the 7 domains including bias due to confounding, selection of participants, classification of interventions, deviations from intended interventions, missing data, measurement of outcomes (eg, reporting training versus test dataset accuracy), and selection of the reported result. A final rating of risk of bias will then be assigned to each included publication and across included publications as either low, moderate, serious, critical, or no information provided. This step is essential to ensure the validity and reliability of reported findings.

Data Extraction

Data extraction will be conducted on NVivo and will be performed by 2 researchers, with regular meetings to discuss results. If there is disagreement regarding extraction, a third researcher will be involved in mediating. The following categories will be used to extract the data: study characteristics, participant characteristics, intervention and comparator characteristics, and outcome characteristics. Specific information will also be extracted relevant to vocal signal characteristics used, natural language processing methods, and machine learning methods.

Synthesis Method

Initially, included studies will be synthesized using a narrative synthesis following guidelines proposed by Rodgers et al []. Data will be tabulated based on the information described above. The tabulated data will then be analyzed and clustered into groups based on various characteristics, such as population demographics and data type (eg, text or speech). Short summaries of each included paper will also be developed. Throughout the synthesis process, the research team will meet to critically evaluate the process and to resolve issues that may arise during the synthesis.

A random-effects meta-analysis will be conducted using the area under the receiver operating curve (AUC) and its SE, as reported in the included studies. Where an AUC value is not reported, a point estimate for the AUC will be obtained from the confusion matrix []. To be included in the analysis, the SE or CI must be reported. In the absence of SE values, these will be computed by the equivalence of the AUC to the Wilcoxon statistic or derived from reported CIs. If not provided, the corresponding authors will be contacted with a request to provide the necessary data. Separate random-effects meta-analyses will be performed by classification algorithm subgroup (eg, deep learning, gradient boosting random forest etc), where enough studies (>3) are available. Higgins I2 test will be used to evaluate the level of heterogeneity between included studies, with a level >56% indicating substantial levels of heterogeneity between included studies []. Publication bias will be ascertained through Egger regression and funnel plot analyses []. All analyses will be conducted using R software (version 4.2.0; R Foundation for Statistical Computing) and the “metafor” (ie, meta-analyses and funnel plots).

Ethical Considerations

No participants will be involved in this review, as data will be extracted from existing published studies. Thus, ethics review is not required for this study. This review aims to contribute to a paucity of research in the areas of family violence risk prediction that uses textual and voice-signal based data. Findings from this review will be of particular importance internationally to health care providers such as telehealth services that leverage verbal cues to assess risk. Given the reported high incidence of family violence in the community, this will clarify the state of the literature, providing meaningful information that may be of interest to law enforcement, health care providers, and policy makers and may clarify the next steps needed to advance research in this area. In addition, this review will provide important information about the relevance of specific voice and text features in risk detection for the development of future algorithms.

Results

As of October 2024, preliminary searches have not been conducted. Data will be extracted in line with the aims of this review. The results will include a narrative summary. The meta-analysis results will be presented using a forest plot. It is expected results will be published with the established protocol in a peer-reviewed journal.

Discussion

The primary aim of this systematic review is to assess the accuracy of AI models in differentiating between individuals at risk of perpetrating family violence versus those who are not. Given that previous research has been accurate at identifying other forms of risk for example, suicide risk [], and based on preliminary searches [], it is anticipated that these models may report a high degree of accuracy in identifying individuals who are at high risk of perpetrating family violence.

Given that this is a nascent area of research, it is possible that there may be a high level of heterogeneity across the included studies, differences in the definitions used to identify family violence and there may be diversity in the vocal or textual characteristics used to discriminate high and low risk, as has been found in previous reviews []. Despite these potential limitations, the results of this review will clarify the state of the literature on the accuracy of machine learning models to identify individuals who are at high risk of perpetuating family violence. Importantly, they may contribute to the development of machine learning and AI models that can accurately predict individuals at risk of perpetrating family violence, thus contributing to the development of potential prevention and intervention strategies. In addition, the findings from this review will summarize what systems are involved in the detection of family violence.

Findings from this review may inform the development of surveillance models and contribute to evolving dialogue concerning ethical and political considerations on surveillance on domestic spaces, abusers co-opting smart speakers and the privatization response to family violence as outlined by Sparrow and colleagues [].

Results from this literature review will be disseminated through academic journals and will likely be presented at academic conferences.

The authors gratefully acknowledge the financial assistance of On the Line Australia, a not-for-profit organization that provides telephone and online assisted counselling services.

Data sharing is not applicable to this article as no datasets were generated or analyzed during this study.

KdB, RI, and DM were involved in conceptualization and development of the methodology. KdB and RI wrote the original draft. DM, JLM, and MN contributed to reviewing and editing drafts.

None declared.

Edited by A Mavragani; submitted 29.11.23; peer-reviewed by D Chrimes, T Abd El-Hafeez; comments to author 20.02.24; revised version received 11.04.24; accepted 17.10.24; published 02.12.24.

©Kathleen de Boer, Jessica L Mackelprang, Maja Nedeljkovic, Denny Meyer, Ravi Iyer. Originally published in JMIR Research Protocols (https://www.researchprotocols.org), 02.12.2024.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Research Protocols, is properly cited. The complete bibliographic information, a link to the original publication on https://www.researchprotocols.org, as well as this copyright and license information must be included.

View original article

JMIR RESEARCH PROTOCOLS

Share Bookmark

0 0 0 0 0 0 0

More from this channel

Using Artificial Intelligence to Detect Risk of Family Violence: Protocol for a Systematic Review and Meta-Analysis

Comments (0)