In an operational context, most human activities revolve around the interaction between two or more individuals engaged in group tasks. Consequently, efficiency and safety in the workplace depend not only on the individuals' performance but also on the collective workers' capability to cooperate with each other. In addition, the interaction between the individual and the surrounding working environment is significantly influenced by the individual's cognitive and emotional state. In the last decade, human factor (HF) research has focused on objectively quantifying these cognitive and emotional states as well as the way in which various states from different individuals interact, thereby influencing processes such as team communication, collaboration, and decision-making (Cooke and Gorman, 2009; Heaphy and Dutton, 2008). As a result, the capability to objectively evaluate the performance and cognitive resources of operators and their teams becomes critical to prevent fatal and serious human errors, especially in an operational environment (OE). It is important to highlight that OE refers to settings where people undertake particular tasks that demand technical and professional skills, wherein the level of risk must always be kept under control. In fact, according to National Transportation Board (NTSB) reports, in the last 20 years, ~85% of aviation accidents have been caused by human error (Risk and Management Handbook, 2022). Other statistics presented a slightly different percentage, ~69% (Waraich et al., 2013), but it remains significantly high and potentially preventable.
In this context, the HF related to cooperation could play a crucial role, especially within the field of aviation in which multiple operators are often asked to act as one by coordinating and finalizing each action to the same shared goal. Indeed, it has been demonstrated that in most types of interaction tasks, group-based systems perform better than individual-based and mixed systems, encouraging more cooperative behavior (Ladley et al., 2015). In addition, previous findings have elucidated an interesting phenomenon: Individuals who perceived heightened effort from their crew partner, particularly under high effort conditions, tend to invest more effort in the task and perform better. These findings are consistent with the idea that appreciating another person's effort increases one's own sense of commitment to a cooperative endeavor (Chennells, 2018). Thus, the capability to evaluate cooperation can lead to (i) improving performance, since better teams' performance can be induced by the awareness coming from evaluation of cooperation degree between members, (ii) improving safety, through the evaluation of cognitive and emotive mental state large scale mistakes can be avoided, while through the cooperation evaluation a constant degree of communication and collaboration can be maintained to avoid wrong management of emergency, and (iii) improving operators' wellbeing and satisfaction by increasing their confidence with respect to the environment, thereby contributing to improve performance too. These aspects are particularly valuable when applied in an OE as it is important that individuals' wellbeing and performance remain consistent to avoid human errors that could lead to serious consequences, including, in some cases, fatalities.
To date, especially in OE, scientific literature shows various investigated approaches for cooperation assessment. These approaches consist of methodologies based on subjective or behavioral measures (Ellis et al., 2023; Lapierre et al., 2023). Although subjective measurements are acknowledged to be useful in various contexts, it is important to be aware of the pertinent limits of such evaluations due to their inherent subjectivity and inability to capture the “unconscious” process underlying human behavior (Borghini et al., 2016; Dienes and Perner, 2004). In addition, to obtain these measurements from operators, it is necessary to interact with them, and this leads to difficulties to evaluate their state throughout particular activities. To address these limitations, in the last decade, technological developments have enabled the use of neurophysiological measurements also in OE (Borghini et al., 2012, 2020a; Vecchiato et al., 2014). As an example, measuring the brain and autonomic nervous system activities enables to obtain objective measures of specific mental states with low invasiveness and without negatively interfering with the operators (Arico et al., 2017). Moreover, since these kinds of measures are not subjective, the results could be generalized to assess the dynamic conditions of different subjects in the same operational environment.
Electroencephalography (EEG) and autonomic nervous system-based measures of workers' mental and emotional states have already been investigated during the recent decades to determine brain and autonomic cues of incoming risky psychophysical states (e.g., stress, drowsiness, inattention, and overload; Borghini et al., 2012, 2020a,b; Di Flumeri et al., 2022; Ronca et al., 2022; Vecchiato et al., 2014). Regarding cooperation assessment, the most recent scientific literature has demonstrated that specific indexes, such as functional connectivity and phase synchrony indexes, can reflect the state of cooperation between individuals (Réveillé et al., 2024). More specifically, the most proposed indexes to assess cooperation using EEG signals rely on functional connectivity indexes such as coherence and Granger causality, and synchrony indexes such as phase locking value (PLV), phase lag index (PLI), and circular correlation (Ccor; Réveillé et al., 2024; Sciaraffa et al., 2021). More importantly, these considered recent scientific contributions demonstrated how in the last decade cooperation evaluation through neurophysiological measures was conducted mostly in laboratory contexts and under hyperscanning conditions. The current limitations of functional connectivity indexes stem from the underlying hypotheses of these methods as they necessitate high-quality and high-density data, prolonged registration periods, and hyperscanning conditions to ensure accuracy (Bevilacqua et al., 2019; Liu et al., 2018; Richard et al., 2021; Toppi et al., 2016). In this regard, it has to be underlined that by hyperscanning condition, it is intended the one that foresees the simultaneous recording of brain electrical activity between individuals; hence, this experimental condition guarantees perfect time synchronization between signals. As presented earlier, other investigated methods to assess cooperation between individuals in the scientific literature are based on phase synchrony between signals (i.e., PLV, PLI, and Ccor). These methods aim to find similarities between EEG patterns over time that reflect the sharing of attention or psychological states necessary to coordinate actions aimed to achieve the same objective; phase synchrony also reflects a common response to the same environmental stimuli, similar actions aimed at the same goal, and similar engagement in the task (Burgess, 2013; Buzsáki and Wang, 2012; Stuldreher et al., 2022). Currently, these methods have exclusively been utilized within hyperscanning contexts (Wikström et al., 2022). However, the circular correlation (Ccor) method, due to its dependence on the circular covariance of differences between observed and expected phases, might demonstrate accuracy even in non-hyperscanning scenarios. Therefore, this is one of the approaches to be investigated by the present study.
A second methodology for the objective cooperation evaluation explored in the present study corresponds to one based on mutual information (MI). Such an approach relies on the information theory (IT; Cover and Thomas, 2006) according to the concept of Shannon entropy. The Shannon entropy measures the amount of information in a signal and provides an estimate of the level of uncertainty surrounding the signal-related event. In contrast to other indices based on IT, MI is capable of catching non-linear pattern similarity between two or more variables (Alonso et al., 2015); this property makes this methodology suitable for the study of EEG signals and their correlated features (i.e., mental states).
Considering the existing literature and taking advantage of modern technologies in terms of wearable and comfortable sensors, which can provide high-quality neurophysiological signals, the present study aimed to assess pilots' cooperation degree while performing flight simulations under realistic conditions through the use of neurophysiological measures. The cooperation assessment stands as a highly debated and challenging topic within scientific literature (Toppi et al., 2016). Currently, the lack of broadly validated methodologies, especially in real-world contexts, presents a substantial obstacle to the advancement of this research field. Particularly, for the application of these methods in OEs, it is important that the measuring system is non-invasive and compatible with environmental constraints, but at the same time it has to guarantee the high quality of the signals recorded. To address these needs, in this study, a wearable device for high-quality EEG data collection has been employed. Starting from this premise, the present study investigated the application and comparison of two novel techniques to assess cooperation between pilots during a real-flight simulation, contributing to state of the art by providing new methodologies for objectively evaluating cooperation in OE using mobile neurophysiological data collection systems. The potential innovation characterizing these methodologies corresponds to their robustness to experimental environments compatible with out-of-the-lab applications, which are notably violating the hypothesis of hyperscanning and lacking the high spatial resolution of traditional EEG measurements employed in controlled settings.
The present study is structured as follows: The selected methods for the cooperation neurophysiological characterization were technically described; subsequently, the experimental protocol, including all the collected data (i.e., neurophysiological, behavioral, and subjective measurements), was described; the experimental results and their conceptual implications were first presented and then discussed; finally, a conclusive section was presented to underline what are the main research outcomes of the presented study.
2 Materials and methodsAs already mentioned in the Introduction section, the first proposed approach to evaluate cooperation was based on phase synchrony analysis of operators' EEG signals through the application of the circular correlation (Ccor) technique (Frassineti et al., 2021; Shahsavari Baboukani et al., 2019).
The second approach was based on empirical evidence suggesting that cooperation can be assessed as the output of a multivariate system derived from the cognitive and emotional states of users (Sciaraffa et al., 2021). Such a multivariate system was therefore computed through the MI. In this regard, the cognitive and emotional states were associated with the mental workload (MW) and approach-withdrawal (AW), respectively. More specifically, EEG-derived indexes for MW and AW have been selected for each individual engaged in the cooperative task since the MW reflects the level of cognitive effort, while the AW indicates the emotional tendency to approach or withdraw from a stimulus.
To technically validate both the proposed approaches, simulated crews (FAKE crews) were constructed, randomizing pilots within the crews and experimental phases so that their input in the method would be considered white noise in terms of cooperation. These simulated crews served to highlight the sensibility and reliability of the proposed indexes to cooperative behavior, as by construction, they cannot reflect cooperative behaviors. Therefore, it was expected that these indexes would consistently demonstrate a fixed response pattern for FAKE crews, whereas for REAL crews they must be sensitive to the cooperation phase. The choice of Ccor and MI-based approaches among all the most investigated and validated methodologies within the state of the art is motivated by their ability to capture both linear and non-linear relationships between time series, providing a more comprehensive view of information exchange. It is important to note that both the proposed methods investigate the same phenomenon from complementary perspectives. To standardize the terminology in the present study, the cooperation index assessed through mutual information will be referred to as the MI-Cooperation Index (MICI), while the one assessed through circular correlation will be termed the Ccor-Cooperation Index (CCI).
2.1 Cooperation assessmentAs previously mentioned, the cooperation index [C(t)] over time has been computed through two approaches that will be described along the respective following subchapters.
2.1.1 Ccor-Cooperation Index assessmentCCI method is based on the similarity between the instantaneous phases of two time series which are directly the EEG signals from each pilot involved in the task. Therefore, the CCI was computed according to the following formula (Berens, 2009; Shahsavari Baboukani et al., 2019).
Ccorϑ,φ = ∑k=0N−1sin(ϑk− ϑ¯)sin(φk− φ¯)∑k=0N−1sin2(ϑk− ϑ¯)sin2(φk− φ¯) (1)• ϑk and φk: instantaneaous phases evaluated at k-th sample;
• ϑ̄ and φ̄: circular means of ϑk and φk computed using 2.
ϑ¯=arg(∑n=0N−1ej ϑ¯[n]) (2)Phases have been considered in radians.
CCI has been computed first on the overall preprocessed signal and second on each individual frequency band as reported in Table 1. For each individual, individual alpha frequency (IAF) was computed on the eyes closed EEG signal (Klimesch, 1999).
Table 1. Frequency band definition.
The percentage of cooperation time has been computed by determining how long the cooperation neurophysiological index was above the cooperation threshold. The cooperation threshold has been established as the CCI median evaluated along the calibration tasks (i.e., low and high difficulty).
2.1.2 MI-Cooperation Index assessmentConsidering the two discrete variables of MW and AW, the theoretical formulation of MICI is described by the following formulas (Cover and Thomas, 2006; Kraskov et al., 2004):
I(X;Y)= ∑i,jp(i,j)logp(i,j)px(i)py(j) (3) I(X;Y)= ∬p(x,y)logp(x,y)px(x)py(y)dxdy (4)where p(i,j) is the joint probability of the two discrete time series, px(i) and py(j) are the marginal probabilities of each time series, and i,j are bins of the 2D discrete space in which the two series are defined.
The reported discrete formula (3) is already an approximation of the continuous MI evaluated on a continuous space (4). To technically implement the aforementioned formulation, Kraskov et al. (2004) proposed an estimator of MI based on entropy (H) concept:
I(X;Y)= H(X)+H(Y)−H(X,Y) (5)Thus, MICI(t) was assessed as the mutual information between four time series (i.e., MW index and AW index assessed on both individuals), using the Kraskov estimator (Kraskov et al., 2004). This is a KNN-based estimator that aims to estimate the Shannon entropy of every signal from the average distance of the k-th nearest neighbor averaged over all other points of the signal. Since the computation of mental workload and approach-withdrawal indices involves the evaluation of power across distinct EEG signal bands, the analysis of the MICI index's contribution was constrained to the entire preprocessed signal's band, precluding examination within specific frequency bands like done with CCI.
The percentage of cooperation time has been computed as according to the procedure described in the previous paragraph.
2.2 Experimental protocol design and participantsSixteen (16) participants have been selected and organized in eight flight crews, four of which were composed of Unexperienced pilots (UNEXPs, i.e., pilots that had just got the integrated ATPL—Airline Transport Pilot License to become commercial pilots) and four by Experienced ones (EXPs, i.e., pilots working for a commercial airline for at least 10 years). Every crew was flanked by an expert trainer with the role of supervising and providing information about the pilots' mental states while performing the experimental tasks. Furthermore, to mimic a more realistic situation, the expert trainer was designated to act as an air traffic controller to simulate the communications with the crews. The flight simulations were performed on the Mechtronix Ascent XJ Trainer Boeing 737–800 simulator, located at the Urbe Aero training center in Rome.
The experimental protocol was organized into two major sections described in Table 2. In the first phase, defined as the “calibration scenario,” each pilot performed the experimental flight simulation alone. In the second phase, both pilots shared the same cockpit to perform a mission designed to induce cooperative behavior between them. The pilots' EEG signals were collected throughout the two mentioned experimental phases. The first scenario, which was completed by each pilot alone, was designed to derive a sort of baseline in an uncooperative environment. This phase was necessary to determine the effective cooperation threshold for each pilot crew. In addition, two preliminary 60-s EEG data collections were performed, while the participants were relaxed with the eyes closed, to evaluate the individual alpha frequency (IAF; Klimesch, 1999) for each individual, and while the participants were laying still relaxed, with the eyes open for calibrating the eyeblink correction algorithm (Di Flumeri et al., 2016). At the end of each flight mission, the trainer who supervised all the activities provided subjective behavioral assessments every 30 s of the pilots' cooperation effectiveness through a rating scale on a tablet (Figure 1). According to the trainer's behavioral assessment, a performance index representative of the outcome of the mission and the mental condition of each crew was assessed as described in Section 2.4.
Table 2. Experimental Protocol's scenario description.
Figure 1. On the right is shown the experimental setting in the Mechtronix simulator and on the left the Mindtooth Touch EEG headset (BrainSigns srl, Rome, Italy). The system has been designed to be wearable, comfortable for a long use, easy to wear, and fully compatible with aviation tools (e.g., headphones and glasses).
2.3 Neurophysiological data recording and processingThe Mindtooth system (Brain Products GmbH, Gilching, Germany, and BrainSigns srl, Rome, Italy) has been employed for EEG signal acquisition. This system was developed within the homonymous project (H2020-EIC-FTI-GA950998), and it is able to record high-quality EEG data while maintaining high comfort for the operator (Sciaraffa et al., 2022). The EEG data were collected with a sample rate of 125 Hz, on eight recording channels (AFz, AF3, AF4, AF7, AF8, PZ, P3, and P4), referenced to the right mastoid, and grounded on the left one. Electrodes' impedance has been maintained below 50 kOhm (Sciaraffa et al., 2022). All measurements were subjected to a 50 Hz notch filter to remove main line power interference. AF7 and AF8 channels were removed from the analysis due to a too high percentage of movement artifacts coupled with the EEG signal, while the other channels were preserved with an average artifact percentage below 8% as shown in Table 3.
Table 3. Mean artifact percentage on each preserved channel.
The EEG recordings were also band-pass filtered [low-pass filter cutoff frequency: 30 Hz, high-pass filter cutoff frequency: 2 Hz] with a 5-th order Butterworth filter. The Reblinca method was used to identify the blink artifacts and the ocular component, which were corrected using a multi-channel Wiener filter (MWF; Di Flumeri et al., 2016). EEG signals were segmented into epochs of 1 s, and on every epoch, only the threshold criterium was applied to mark artifactual epochs through the EEGLAB toolbox (Delorme and Makeig, 2004). In particular, the EEG epochs were identified as “artifact” if the EEG amplitude was >±80 (μV). Once the artifact-free EEG signal was computed, global field power (GFP) was estimated for each frequency band as a preliminary step to compute the evaluation of cooperation-related neurophysiological metrics (i.e., MW and AW). Frequency bands have been defined in Table 3.
The GFP-derived (Skrandies, 1990; Vecchiato et al., 2014) features computed among the aforementioned frequency bands and EEG channels have been used to assess mental workload index [W(t)] and approach-withdrawal index [AW(t)] over time:
W(t)= GFPϑ(AF3,AFz,AF4)GFPα(P3,Pz,P4) (6) AW(t)= GFPα(AF4) − GFPα(AF3) (7)where GFPx represents the GFP evaluated on the individual x-band.
The overall preprocessing and cooperation evaluation process is graphically presented in Figure 2.
Figure 2. Overview of the preprocessing steps and analytical methods applied to EEG signal analysis, including noise reduction, artifact removal, feature extraction, and cooperation index evaluation. Cooperation indexes are highlighted by a green framework.
2.4 Behavioral dataAs previously anticipated, during each flight mission, an instructor, acting as subject matter expert (SME), supervised all the activities and provided a subjective assessment of cooperation degree between pilots by a rating scale on a tablet (Figure 1) every 30 s.
Crew's total behavioral cooperation assessment was computed as the mean between the behavioral cooperation assessment of the pilots composing the crew.
2.5 Statistical analysisThe Shapiro–Wilk test was used to assess the normality of the distribution related to each of the considered parameters evaluated through the pilots' dyads. If normality was confirmed, Student's unpaired t-test would have been performed to pairwise comparing Experienced (i.e., EXPs) and Unexperienced (i.e., UNEXPs) groups. Otherwise, the same differences have been tested through the Mann–Whitney test.
ANOVA test has been applied in different phase comparisons after verification of the ANOVA assumptions.
In addition, to validate the proposed indexes, the relationships between each neurophysiological cooperation index (i.e., CCI and MICI) and crews' behavioral cooperation assessment were examined using Pearson's correlation analyses. Furthermore, an additional validation method involved comparing the results obtained from REAL and FAKE crews, assuming that the methods applied to the FAKE crews would return constant results. This last analysis was conducted using both ANOVA and Student's unpaired t-test to compare between phases and between REAL and FAKE in each phase.
3 ResultsThe result paragraph has been organized in two different subsections to neatly distinguish the results from different methodologies. The results from MI and Ccor methodologies to assess cooperation (i.e., the CCI and MICI) will be presented, respectively, on the first and second subchapters.
3.1 Mutual information approach: MICI resultsTwo statistical unpaired Student's t-tests, regarding the difference in cooperation time and performance between EXPs crews and UNEXPs ones, have been performed as shown in Figure 3. The first analysis (left in Figure 3) aimed to compare average cooperation time in the two experimental groups, which was found to be statistically higher in the EXP group (p = 0.017). The second analysis presented on the right side in Figure 3 compares instead the cooperation assessment provided by the trainer in the same experimental groups, resulting once more to be statistically higher in the EXP group (p = 0.015). As shown, both neurophysiological and behavioral assessments reveal the same trend in the two experimental groups suggesting that the measurements truly observe directly related phenomena such as evaluated teamwork and tracked performances.
Figure 3. Percentage of cooperation time evaluated through MICI and trainer performance assessment between EXP and UNEXP pilots. *Indicates the statistical differences between experimental groups (both p < 0.05).
The Pearson correlation analysis in Figure 4 conducted between the two cooperation measures (i.e., MICI and the SME assessment) has shown a high positive and significant correlation, confirming that the MICI index follows the judgment of the SME in evaluating cooperation (R = 0.78, p = 0.032). This result particularly suggests that the two measures describe the same phenomenon (i.e., neurophysiological and behavioral cooperation) and serve to validate MICI index as a representative cooperation index.
Figure 4. Pearson's correlation between cooperation time and trainer performance assessment (R = 0.78, p < 0.05).
Figure 5 represents the REAL vs. FAKE crews ANOVA analysis revealing statistically significant (p < 0.001, F = 28.697, ω2 = 0.491) differences in behavior along phases. In addition, the t-test between REAL and FAKE on cooperation phase resulted in a significant difference (p < 0.001).
Figure 5. ANOVA comparison of MICI through different experimental phases in real and simulated crews. **Indicates the statistical difference in the behavior during the experimental phases between real and simulated crews (both p < 0.001).
3.2 Circular correlation approach: CCI resultsAll the presented results regarding CCI are assessed on the overall signal band (i.e., 2–30 Hz). Individual power band analysis did not result in significant results.
Figure 6 shows the Friedman statistical comparisons between cooperative experimental conditions and non-cooperative ones (i.e., Low and High diff). This statistical analysis demonstrated that the CCI was significantly higher when evaluated along the cooperative experimental tasks compared with the non-cooperative ones (p = 0.005, F = 7.833, ω2 = 0.061) over the frontal area.
Figure 6. Raincloud of CCI through different experimental phases on the frontal electrodes. *Indicates the statistical difference between the cooperation condition and the solo ones (i.e., Low and High diff; p < 0.010).
By considering the second dimension of comparison, that is, distinguishing between pilot categories EXPs and UNEXPs based on their cooperation levels, the statistical analysis revealed a significant difference in the percentage of cooperation time between the two groups, with EXP crews demonstrating notably higher cooperation time on frontal electrodes compared to the UNEXP group (p = 0.029, W = 16.000). This distinction is represented in Figure 7, illustrating the percentage of cooperation time assessed on frontal electrodes, and the Mann–Whitney test has been used for this analysis. The same analysis conducted on parietal electrodes did not report statistically significant evidence.
Comments (0)