Delayed neurocognitive recovery after surgery is associated with poor outcome. Most surgeries require general anesthesia, of which sevoflurane and propofol are the most commonly used inhalational and intravenous anesthetics. The authors tested the primary hypothesis that patients with laparoscopic abdominal surgery under propofol-based anesthesia have a lower incidence of delayed neurocognitive recovery than patients under sevoflurane-based anesthesia. A second hypothesis is that there were blood biomarkers for predicting delayed neurocognitive recovery to occur.
A randomized, double-blind, parallel, controlled study was performed at four hospitals in China. Elderly patients (60 yr and older) undergoing laparoscopic abdominal surgery that was likely longer than 2 h were randomized to a propofol- or sevoflurane-based regimen to maintain general anesthesia. A minimum of 221 patients was planned for each group to detect a one-third decrease in delayed neurocognitive recovery incidence in propofol group compared with sevoflurane group. The primary outcome was delayed neurocognitive recovery incidence 5 to 7 days after surgery.
A total of 544 patients were enrolled, with 272 patients in each group. Of these patients, 226 in the propofol group and 221 in the sevoflurane group completed the needed neuropsychological tests for diagnosing delayed neurocognitive recovery, and 46 (20.8%) in the sevoflurane group and 38 (16.8%) in the propofol group met the criteria for delayed neurocognitive recovery (odds ratio, 0.77; 95% CI, 0.48 to 1.24; P = 0.279). A high blood interleukin-6 concentration at 1 h after skin incision was associated with an increased likelihood of delayed neurocognitive recovery (odds ratio, 1.04; 95% CI, 1.01 to 1.07; P = 0.007). Adverse event incidences were similar in both groups.
Anesthetic choice between propofol and sevoflurane did not appear to affect the incidence of delayed neurocognitive recovery 5 to 7 days after laparoscopic abdominal surgery. A high blood interleukin-6 concentration after surgical incision may be an independent risk factor for delayed neurocognitive recovery.
Postoperative neurocognitive disorders are common in older surgical patients
It is unclear whether there are differences in neurocognitive outcomes between propofol-based intravenous anesthesia and the use of volatile anesthetics
This prospective randomized study found no differences in neurocognitive disorder at postoperative days 5 to 7 between patients anesthetized with a propofol-based compared to a sevoflurane-based anesthetic
Elevated interleukin-6 concentrations 1 h after skin excision may be predictive of the development of a postoperative neurocognitive disorder on postoperative days 5 to 7
More than 50 and 60 million patients have surgery annually in the United States and in China, respectively.1,2 Studies have shown acute and delayed onset postoperative cognitive dysfunction.3–5 Postoperative cognitive dysfunction is associated with an increased mortality, length and cost of hospital stay, and dropout from job market.4,6 Aging is identified as a risk factor for postoperative cognitive dysfunction.3,4 About 40 and 10% of elderly (60 yr and older) patients have this syndrome 7 days and 3 months, respectively, after noncardiac surgery.3,4 Recently, the term “delayed neurocognitive recovery” was adopted for cognitive impairment in the interval from 0 to 30 days postoperatively.7
Because the majority of patients require general anesthesia during surgery, the possible role of general anesthetics, especially volatile anesthetics, in postoperative cognitive dysfunction is proposed based on animal study findings.8–10 However, whether general anesthesia contributes to postoperative cognitive dysfunction development in humans is controversial. Patients having general anesthesia had a worse cognition function than patients having regional anesthesia at 1 week after surgery in some studies,11,12 but this difference was not revealed in other studies.13,14 Also, there was no difference in cognitive functions 3 or 6 months after surgery between patients receiving general and regional anesthesia.11,13,14
Because general anesthesia is often necessary in surgical patients, one important issue is whether anesthetic choice will influence postoperative cognitive dysfunction occurrence. Two types of general anesthetics are used: intravenous and inhalational anesthetics. Recent animal studies have been focused on identifying possible toxic effects of volatile anesthetics including inducing cell injury and inflammation.9,15,16 Neuroinflammation may be a critical pathologic process for postoperative cognitive dysfunction.16,17 These findings have prompted some anesthesiologists to propose reducing the use of volatile anesthetics.18,19 However, the intravenous anesthetic propofol can also induce cell injury.20 Although volatile anesthetics may induce inflammatory responses,8,21 extensive research has shown that volatile anesthetics can inhibit various insult-induced neuroinflammation and brain cell injury.22–25 Because surgery is an insult and induces neuroinflammation,26,27 volatile anesthetics may inhibit surgery-induced neuroinflammation. Thus, volatile anesthetics may not be a “bad player” in terms of postoperative cognitive dysfunction.28
In fact, the role of intravenous and volatile anesthetics in delayed neurocognitive recovery/postoperative cognitive dysfunction in humans remains unclear. Although many studies have investigated this issue, most studies are small in sample size, used substandard methods to diagnose the syndrome, or were not designed primarily to determine this issue.29–32 There is no multicenter study to address this issue. Because anesthetic choice can be effectively managed by anesthesiologists, determining the role of anesthetic choice in delayed neurocognitive recovery/postoperative cognitive dysfunction is important. The primary aim of our study was to compare the effects of propofol versus sevoflurane on the incidence of delayed neurocognitive recovery in elderly patients undergoing laparoscopic abdominal surgery. A secondary aim was to determine whether there were blood biomarkers for delayed neurocognitive recovery. These studies aimed to test the hypotheses that general anesthesia maintained mainly by propofol reduces delayed neurocognitive recovery compared with sevoflurane-based general anesthesia and that there are early blood biomarkers for predicting whether delayed neurocognitive recovery will occur.
Materials and Methods
Study Design and Participants
A multicenter, double-blind, randomized controlled trial was performed at Sun Yat-sen Memorial Hospital (Sun Yat-sen University, Guangzhou, China), the Sun Yat-sen University Cancer Center (Guangzhou, China), the First People’s Hospital of Foshan (Foshan, China), and the Third Affiliated Hospital of Sun Yat-sen University (Guangzhou, China). The trial protocol was approved by the Clinical Research Ethics Committee of the leading hospital (Sun Yat-sen Memorial Hospital) and then the committee of each hospital (approval No. 2012-25). The study was registered with ClinicalTrials.gov (identifier NCT01809041; principal investigator: Zhiyi Zuo) on March 12, 2013. Initially, eight hospitals agreed to participate and were on the register. However, four hospitals dropped out due to insufficient manpower. The remaining four hospitals have about 7,500 beds and perform 140,000 surgeries annually.
Written informed consent was obtained from each participant enrolled in the study, which occurred between March 23, 2013, and March 11, 2019. Inclusion criteria included that patient’s age was 60 yr or older and that the patient was scheduled to undergo elective major laparoscopic abdominal surgery (e.g., gastrointestinal and gynecologic surgery). Additional inclusion criteria were that surgery was expected to last 2 h or longer and that patients did not have serious hearing and vision impairment and were able to read. The exclusion criteria included: (1) severe diseases in cardiovascular, respiratory, liver, kidney, or central nervous systems and having a life span of less than 3 months; (2) a mini-mental status examination score of less than 23; (3) a history of dementia or psychiatric illness; (4) current use of sedatives, antidepressants or corticosteroids; (5) alcoholism and drug dependence; (6) previous inclusion in this study; (7) difficulty with follow-up or poor compliance; and (8) uncontrolled hypertension (more than 180/100 mmHg).
Each participating hospital had at least one investigative team composed of two people. One person evaluated the eligibility of the patients, enrolled patients, received group randomization codes from the leading center, and made sure that the management of the patients was in compliance with the study design. The second person was responsible for performing neuropsychological assessment without knowing the group assignment of the patients.
Randomization and Masking
A study statistician in the leading center generated random numbers without restriction (simple randomization) by using a computer. The randomization codes were sealed in sequentially numbered envelopes and sent to a research coordinator of the research team the day before surgery by a research nurse. The coordinator communicated the group assignment to the anesthesiologist caring for the patient and assigned participants to study groups according to the random codes.
The patients were randomized at 1:1 ratio to receive propofol-based (propofol group) or sevoflurane-based (sevoflurane group) general anesthesia. This trial used a parallel design to test the hypothesis that propofol-based anesthesia reduced the incidence of delayed neurocognitive recovery compared with sevoflurane-based anesthesia (superiority in nature). Preoperative interview, evaluation of eligibility, obtaining written informed consent, and enrolled the participants and postoperative follow-up were performed by investigators who did not participate in perioperative patient care and had been trained to perform neuropsychological assessment before the study (all were trained and certified by Yujuan Li). Both patients and investigators were blinded to study group assignment.
To adjust the practice effect from repeated neuropsychological tests used in this study, we enrolled 184 control subjects who were not exposed to surgery. They were volunteers from the community who were 60 yr old or older (similarly aged to patients with surgery) and were not patients evaluated in the hospitals or clinics. The inclusion and exclusion criteria for control subjects were identical to those of surgical patients. The evaluation dates of control subjects overlapped with those of surgical patients in the study. These control subjects were recruited exclusively for this study.
Anesthesia and Perioperative Care
No preoperative anxiolytic medication was administered. Intraoperative monitoring included the American Society of Anesthesiologists (ASA) mandatory monitoring, Bispectral Index (BIS), and end-tidal gas monitoring. Anesthesia was induced intravenously with fentanyl, lidocaine, propofol, and cisatracurium for both groups of patients. Anesthesia was maintained with inhaled sevoflurane (1.0 to 1.5 minimum alveolar concentrations) and intravenous remifentanil infusion (0.1 to 0.5 µg · kg–1 · min–1) for patients in the sevoflurane group. For the other group, anesthesia was maintained with intravenous propofol infusion (50 to 150 µg · kg–1 · min–1) and remifentanil infusion (0.1 to 0.5 µg · kg–1 · min–1). There were no limitations for the use of muscle relaxant and vasoactive medications. However, glucocorticoid drugs, nonsteroidal analgesics, and dexmedetomidine were avoided during surgery.
During the operation, blood pressure was maintained within the range of ± 30% of the baseline, nasopharyngeal temperature was between 36 to 37.5°C, end-tidal pressure of carbon dioxide was between 35 and 45 mmHg, and blood glucose was in a range of 3.9 to 11.1 mM. The BIS values were maintained between 40 and 60 by adjusting propofol or sevoflurane dosages. Postoperative analgesia during the first 48 h was provided with a patient-controlled analgesia pump, which infused fentanyl or sufentanil. Pain relief was by tramadol or meperidine hydrochloride when needed 2 days after surgery. Hormones, nonsteroidal anti-inflammatory drugs, and long-acting sedatives were avoided after surgery.
Data Collection
Preoperative evaluation was performed within 3 days of surgery. The functional effects of comorbid diseases were evaluated with the New York Heart Association functional classification. Fitness and functionality before surgery were assessed with the ASA physical status classification and instrumental activities of daily living scores.33 Cognitive function was evaluated with mini-mental status examination.34 The presence and severity of depression were assessed by the Beck depression inventory.35 State anxiety and trait anxiety were measured by the State–Trait Anxiety Inventory.36 Adverse events during and after surgery until the patient was discharged were recorded. A data safety–monitoring board was not formed for this study because propofol and sevoflurane have been commonly used in clinical practice.
Venous blood (5 ml) was drawn before anesthesia (T1), 15 min after endotracheal intubation (T2), 1 h after operation had started (T3), at the end of surgery (T4), and 24 h after surgery (T5) for measuring biomarkers possibly related to delayed neurocognitive recovery/postoperative cognitive dysfunction. Delirium was assessed in the postanesthesia care unit when patients were awake by using the intensive care delirium screening check inventory.37
Enzyme-linked Immunosorbent Assay
Blood was maintained for 30 min at room temperature. Serum was obtained by centrifugation of blood at 4,000g for 20 min at 4°C and stored at –80°C. The concentrations of tumor necrosis factor-α, interleukin-6, interleukin-1β, interleukin-10, vascular endothelial growth factor, β-melanocyte–stimulating hormone, interferon-γ, intercellular adhesion molecule, transforming growth factor-β1, apolipoprotein E, complement C3a, advanced glycation end products, and myeloperoxidase were measured using enzyme-linked immunosorbent assay kits. Malondialdehyde was measured using a lipid peroxidation malondialdehyde assay kit.
Neuropsychological Assessment
Neuropsychological tests for patients were carried out the day before and 5 to 7 days after surgery with only the patient and an investigator present in a quiet room. Similarly, the second neuropsychological test time for control subjects was 6 to 9 days after the first time. The test was also done with a control subject and an investigator in a quiet room. All tests were administered and scored in a standardized manner to minimize differences between investigators.
The neuropsychological test battery included the tests used to evaluate patients for delayed neurocognitive recovery in the International Study of Post-Operative Cognitive Dysfunction 13,4 : (1) word learning: visual verbal learning test based on the Rey’s auditory recall of words38 ; (2) word recall: the number or words recalled from visual verbal learning trials after a 20-min delay; (3) cognitive flexibility: including trail making test A and B39 ; (4) distractibility: Stroop color word interference test40 ; and (5) working memory: letter–digit coding.41 If a patient exhibited delirium at a testing time, neuropsychological evaluation was postponed 3 days.
Delayed neurocognitive recovery was diagnosed using the International Study of Post-Operative Cognitive Dysfunction 1 definition.3,4 In brief, the average practice effect (ΔXcontrol) and standard deviation (SD[ΔXcontrol]) of each neuropsychological test were determined by comparing the test scores of control subjects at baseline with those 1 week later. For each patient, a Z score for each test was calculated by determining the difference between postoperative and preoperative scores (ΔX), subtracting ΔXcontrol from ΔX, and then dividing it by SD(ΔXcontrol) as shown by this equation: Z = (ΔX − ΔXcontrol)/SD(ΔXcontrol). A combined Z score (Zcombined) was obtained by summation of the Z scores of all tests (∑Z) in an individual patient divided by the standard deviations for this sum of Z scores in control subjects (SD[∑Zcontrol]) as shown by the equation: Zcombined = (∑Z)/(SD[∑Zcontrol]). SD(∑Zcontrol) was calculated based on ∑Z of all participants in the control group. A patient was diagnosed to have delayed neurocognitive recovery if the Z scores on two individual tests or Zcombined was 1.96 or greater.
Statistical Analysis
The sample size estimate of this study was predetermined and posted on a publicly accessible server (ClinicalTrials.gov, identifier NCT01809041). The principal of data analysis and the statistical plan were decided before the initiation of the study. However, imputation strategies for missing data were decided after the data was accessed for post hoc sensitivity analyses.
Sample Size Calculation
The incidence of delayed neurocognitive recovery in comparable patient populations was about 40% at 1 week after noncardiac surgery.4 We planned to detect 33% reduction in delayed neurocognitive recovery incidence in the propofol group compared with sevoflurane group or vice versa. Per group, 221 patients would provide 80% power to detect this difference based on a two-tailed significance level at 0.05. The sample size in the control group would be 164, assuming 4% incidence of decreased cognitive functions over time, and the estimated error range was 3% (δ = 3%, zα/2 = 1.96). Considering a 10% loss-to-follow-up rate, the final sample size was determined to be 250 patients in each surgery group and 184 nonsurgery elderly people in the control group. Additional patients were planned if the dropout rate was more than 10% to maintain minimally 221 patients per group. When this sample size was achieved, the enrollment would cease.
Outcome Analysis
The primary outcome of this study was delayed neurocognitive recovery incidence at postoperative 5 to 7 days. This outcome was analyzed using chi-square with both intention-to-treat and per-protocol principles. Patients who were enrolled, did not violate the research protocol, and completed all needed neuropsychological tests were included in the analysis with per-protocol principle. All enrolled patients were included in the analysis with intention-to-treat principle. Three strategies for managing missing data were taken in the intention-to-treat analyses: (1) no imputation of missing data, (2) imputing the missing data with last observation carried forward strategy, and (3) hot-deck imputation strategy. The data of patients who were enrolled and completed all needed neuropsychological tests were analyzed with the first strategy. Missing values were imputed from preoperative values with last observation carried forward strategy. For hot-deck imputation strategy, a patient with missing data was matched with patient(s) who had complete set(s) of data and similar characteristics including age, ASA physical status, education level, mini-mental state examination score, Beck depression inventory score, and insurance status. The missing Z scores of the patients with missing data were imputed with the Z scores of the matched subject. If there were two or more matched subjects, the average Z scores of these matched subjects were used to impute the Z scores of the patient with missing data.
The secondary outcomes were the changes of serum concentrations of inflammation-related cytokines (interleukin-10, interleukin-1β, interleukin-6, and tumor necrosis factor-α), oxidative stress biomarkers (myeloperoxidase, malondialdehyde, advanced glycation end products, and apolipoprotein E), and immune modulators (vascular endothelial growth factor, intercellular adhesion molecule, β-melanocyte–stimulating hormone, transforming growth factor-β1, interferon-γ, and C3a)42,43 at various time points in patients with or without delayed neurocognitive recovery. These data were analyzed by two-way (between group comparisons with the time and group as two factors to be analyzed) repeated-measures ANOVA followed by Bonferroni multiple comparison correction.
Categorical data were reported as frequencies (percentage) and analyzed using the chi-square or Fisher exact test. Continuous variables are presented as means ± SD or median (25th, 75th percentiles) and were compared with independent t test or Mann–Whitney U test depending on the distribution of data. Univariate logistic regression analysis was used as an initial step to identify possible prognostic factors for delayed neurocognitive recovery, and these variables with a P value of <0.05 from this analysis were included in the multiple regression analysis with forced entry. Variance inflation factor was used to evaluate multicollinearity of enrolled variables, the Hosmer–Lemeshow test was used to examine the goodness of fit of model, and the discrimination of the models was estimated by using analysis of the area under the curve. No analyses were adjusted for additional variables or stratification variables. Two-tailed tests were performed whenever appropriate, and a P value of <0.05 was considered statistically significant. The SPSS 25.0 for Windows (SPSS Inc., USA) software was used for all statistical analyses.
Results
Patient Characteristics
In total, 580 patients were screened for this study from March 23, 2013, to March 11, 2019. Among them, 36 patients did not meet the inclusion criteria, and 544 subjects were enrolled. Among the enrolled patients, 12 patients did not receive allocated intervention (surgery was changed to open procedure in 8 patients and 4 patients withdrew their consents to participate), 52 patients were lost to follow-up, and 33 patients were excluded because of incompletion of the tests (7 patients) or receiving medications that should be avoided (26 patients). Therefore, 221 patients in the sevoflurane group and 226 patients in the propofol group completed the study per protocol (fig. 1). In addition, 184 subjects were enrolled in the control group. The trial was ended because the planned number of patients was achieved.
The baseline characteristics of the patients were listed in table 1 and were well balanced between the sevoflurane and propofol groups. The age was 64 (62 to 68) years in the propofol group and 65 (62 to 69) years in the sevoflurane group (P = 0.023 per protocol principle and P = 0.056 per intention-to-treat principle). The sex distribution was different between the two groups (P = 0.034) in the per-protocol data set but was not different between the two groups in the intention-to-treat data set (P = 0.139). The education level distributions were different between the two groups in both data sets (table 1). The perioperative variables were comparable between the two groups (table 2). One patient each in the propofol and sevoflurane groups had an intensive care delirium screening check score of 4 or higher (the threshold to indicate delirium) in the postanesthesia care unit. The incidence of adverse events was similar between two groups except that more patients in the sevoflurane group developed intraoperative hypotension requiring intravenous vasopressors (table 3).
Primary Outcome
No patient exhibited delirium at a preset testing time. Therefore, neuropsychological evaluation was not postponed for any patient. Delayed neurocognitive recovery rates were not different between sevoflurane and propofol groups when the analysis was performed with per-protocol principle (20.8% vs. 16.8%, P = 0.279; table 4). We also analyzed the data of patients who were enrolled and completed the study with intention-to-treat principle. This analysis included the data of patients with per-protocol principle plus the data of patients receiving additional medications (13 in the sevoflurane group and 13 in the propofol group). The delayed neurocognitive recovery rates were also not different between sevoflurane and propofol groups: 21.4% versus 17.2% (P = 0.245; table 4). There were 12 patients who did not receive allocated intervention (9 in the sevoflurane group and 3 in the propofol group) because the surgery was changed to an open procedure, 22 patients in the sevoflurane group and 24 patients in the propofol group who were lost to follow-up because of earlier discharge, 6 patients (3 in the sevoflurane group and 3 in the propofol group) who declined to have postoperative neuropsychological tests, and 7 patients (4 in the sevoflurane group and 3 in the propofol group) who had incomplete data. When missing values were imputed with last observation carried forward strategy, there was no difference in the delayed neurocognitive recovery rates between sevoflurane and propofol groups: 18.7% versus 15.1% (n = 272 for each group, P = 0.253; table 4). When missing values were imputed with hot-deck imputation strategy, delayed neurocognitive recovery rates between sevoflurane and propofol groups were also not different: 20.2% versus 17.3% (n = 272 for each group, P = 0.380; table 4). However, the delayed neurocognitive recovery incidences in propofol or sevoflurane groups were higher than that in control subjects (3.8% [4 of 184]; P < 0.001; table 4).
Secondary Outcome
We planned to draw blood samples from all enrolled patients. However, some patients declined to have blood drawn or missed some time points of blood samples. Some samples had hemolysis. Therefore, for each biomarker, there were 53 to 66 patients who had a full set of blood samples for analysis. The basic characteristics of patients with blood biochemical measurements were similar to those of patients without blood biochemical measurements except for mini-mental status examination scores, number of patients with coronary arterial diseases, and number of patients with smoking history (Supplemental Digital Content 1, https://links.lww.com/ALN/C534). Our analysis showed that only interleukin-6 concentrations were higher in patients with delayed neurocognitive recovery than patients without nondelayed neurocognitive recovery (P = 0.020) and had a significant difference among the values from T1 to T5 within the same group of patients (P < 0.001; fig. 2). The interaction between time and group was significant (P = 0.013). Further analysis showed that interleukin-6 concentrations at T3 between patients with or without delayed neurocognitive recovery were significantly different (odds ratio, 1.04 [1.01 to 1.08]; P = 0.007; table 5). Although some other blood indicators had time-dependent changes (such as vascular endothelial growth factor, intercellular adhesion molecule, transforming growth factor-β1, C3a, myeloperoxidase, apolipoprotein E, and advanced glycation end products), there were no differences in these indicators between patients with and without delayed neurocognitive recovery at various times (fig. 2).
Univariate Logistic Regression Analysis of Possible Predictive Factors for Delayed Neurocognitive Recovery

Comparison of serum cytokines, oxidative stress biomarkers, or immune modulators between patients with or without delayed neurocognitive recovery. The data are presented as the means ± SD. The P values are the results from two-way repeated-measures ANOVA with time and group as two factors to be analyzed. The serum concentrations of interleukin (IL)–6 were significantly higher in patients with delayed neurocognitive recovery than those in patients without nondelayed neurocognitive recovery (P = 0.020) and had a significant difference among the times from T1 to T5 within the same group of patients (P < 0.001). There was a significant interaction between time and group in interleukin-6 changes (P = 0.013). T1, before anesthesia; T2, 15 min after endotracheal intubation; T3, 1 h after surgical incision; T4, at the end of surgery; T5, 24 h after surgery. ApoE, apolipoprotein E; AGE, advanced glycation end product; C3a, complement C3a; ICAM, intercellular adhesion molecule; IFN-γ, interferon-γ; MPO, myeloperoxidase; bMSH, β-melanocyte–stimulating hormone; TGF-β1, transforming growth factor β1; TNFα, tumor necrosis factor α; VEGF vascular endothelial growth factor.
Comparison of serum cytokines, oxidative stress biomarkers, or immune modulators between patients with or without delayed neurocognitive recovery. The data are presented as the means ± SD. The P values are the results from two-way repeated-measures ANOVA with time and group as two factors to be analyzed. The serum concentrations of interleukin (IL)–6 were significantly higher in patients with delayed neurocognitive recovery than those in patients without nondelayed neurocognitive recovery (P = 0.020) and had a significant difference among the times from T1 to T5 within the same group of patients (P < 0.001). There was a significant interaction between time and group in interleukin-6 changes (P = 0.013). T1, before anesthesia; T2, 15 min after endotracheal intubation; T3, 1 h after surgical incision; T4, at the end of surgery; T5, 24 h after surgery. ApoE, apolipoprotein E; AGE, advanced glycation end product; C3a, complement C3a; ICAM, intercellular adhesion molecule; IFN-γ, interferon-γ; MPO, myeloperoxidase; bMSH, β-melanocyte–stimulating hormone; TGF-β1, transforming growth factor β1; TNFα, tumor necrosis factor α; VEGF vascular endothelial growth factor.
Prognostic Factors for Delayed Neurocognitive Recovery
The univariate logistic regression analysis showed that risk factors for delayed neurocognitive recovery were age (P = 0.044), mini-mental status examination scores (P = 0.001), ASA physical status classification (P = 0.018), insurance status (P = 0.042), duration of hospitalization (P = 0.005), and interleukin-6 concentrations at T3 (P = 0.007; table 5). Elder age, ASA physical status classification III, a long hospitalization time, and a high interleukin-6 concentration at T3 increased delayed neurocognitive recovery incidence. Better mini-mental status examination scores and employer’s medical insurance were associated with a lower delayed neurocognitive recovery rate.
The variables, such as age and ASA physical status classification, that were reported as risk factors for delayed neurocognitive recovery in previous studies4,44 also had a P value of <0.05 in the univariate logistic regression analysis of the present study. Therefore, we chose these variables with a P value of <0.05 in this analysis to be enrolled into multiple regression analysis with forced entry. In the regression model, age, mini-mental status examination scores, interleukin-6 concentration at T3, and duration of hospitalization were analyzed as continuous variables; insurance was analyzed as a categorical variable; and ASA physical status classification was analyzed as a ranked variable. Variance inflation factor was used to evaluate multicollinearity between these factors. We found that the variance inflation factor values were less than 2 for all predictors enrolled into the multiple regression analysis (ranging from 1.079 to 1.356), suggesting that there was minimal collinearity among predictors in the regression analysis. The Hosmer–Lemeshow test was used to examine the goodness of the fit of a model and showed a P at 0.815, suggesting that our model predictions match well with the observed data. The discrimination of the models was performed by using analysis of the area under the curve. The area under the curve of our model was 0.780 (95% CI, 0.654 to 0.906). The blood interleukin-6 concentrations at T3 were an independent risk factor for delayed neurocognitive recovery at 5 to 7 days after surgery (table 6). The other factors were not independent risk factors for delayed neurocognitive recovery by our analysis (table 6).
Discussion
Propofol and sevoflurane are the most frequently used inhalational and intravenous anesthetics. It has been a recent focus to determine whether anesthetic choice plays a role in delayed neurocognitive recovery/postoperative cognitive dysfunction, a syndrome whose importance is well recognized now.17,19 This determination is necessary because it will define whether anesthetic choice is a modifiable risk factor for delayed neurocognitive recovery/postoperative cognitive dysfunction. Our study showed that delayed neurocognitive recovery incidences in patients with sevoflurane- or propofol-based general anesthesia after a laparoscopic abdominal surgery were not different no matter whether the analysis was performed with per-protocol principle or intention-to-treat principle with various imputation strategies for missing data, suggesting that anesthetic choice is not a modifiable risk factor for delayed neurocognitive recovery. This finding is consistent with that in our animal study.28
Many clinical studies have determined the role of anesthetic choice in delayed neurocognitive recovery. A meta-analysis analyzed seven studies with a total of 869 patients published before November 2017 and has concluded that there is evidence with low certainty that propofol-based anesthesia may reduce delayed neurocognitive recovery compared with volatile anesthetic-based anesthesia (odds ratio, 0.52 [0.31 to 0.87]).29 However, the certainty of the evidence was classified to be low by the authors because some studies had insufficiently reported methods of randomization, and one study might have a high risk of attrition bias. In addition, the majority of these previous studies only used the decrease of the mini-mental status examination score to diagnose delayed neurocognitive recovery.29 Three randomized studies were published after this meta-analysis. A prespecified subanalysis of one center’s data of a clinical trial designed to determine anesthetic choice on the outcome of patients after cancer surgery has shown that patients anesthetized with propofol have a lower delayed neurocognitive recovery incidence than those anesthetized with sevoflurane,32 a finding that is different from our study. Our study is a multicenter randomized study designed specifically to determine whether there was a difference in delayed neurocognitive recovery incidence between patients whose general anesthesia was maintained mainly by propofol or sevoflurane. The type of surgery in our study is laparoscopic abdominal surgery, whereas the previous study included laparoscopic and open thoracic/abdominal surgery for patients with cancer. In addition, some medications, such as midazolam, dexmedetomidine, glucocorticoids, and nonsteroidal analgesics, that may affect cognitive functions were not permitted in our study but were used in the previous study. These major differences may have led to the different findings between our study and the previous study.32 Another study showed no difference in delayed neurocognitive recovery incidence in patients with spinal surgery under sevoflurane-based or propofol-based anesthesia.30 The third study showed that patients anesthetized by sevoflurane had a lower delayed neurocognitive recovery incidence than those anesthetized by propofol after carotid endarterectomy.45 It is important to note that most of the previous studies have small patient samples (15 to 100 patients per group), are not registered in a publicly accessible server, do not include a nonsurgery control group to adjust the practice effects of repeated testing, and use inappropriate neuropsychological tests to diagnose delayed neurocognitive recovery.29,30,45,46 Our study is a multicenter study designed to avoid these weaknesses to determine whether anesthetic choice plays a role in delayed neurocognitive recovery.
Associated with the issue of anesthetic choice is the question of whether the depth of general anesthesia plays a role in postoperative cognitive dysfunction. A meta-analysis that included four small studies published before August 2017 did not show a difference in delayed neurocognitive recovery/postoperative cognitive dysfunction rates between patients maintained at deep or light general anesthesia.47 This meta-analysis combined the information of cognitive impairment assessed within 1 month or at a time point longer than 1 month after surgery. Two studies have published after this meta-analysis. One showed that light anesthesia had advantages,48 and the other showed that deep anesthesia reduced delayed neurocognitive recovery.49 Thus, there is no evidence to indicate that the depth of general anesthesia plays a role in delayed neurocognitive recovery/postoperative cognitive dysfunction development.
Interestingly, patients with delayed neurocognitive recovery had a lower mini-mental status examination score than patients without delayed neurocognitive recovery, suggesting that patients with delayed neurocognitive recovery have lower cognitive functions. Also, patients with delayed neurocognitive recovery had worse ASA physical status classification. These results suggest that patients with worse physical status are more likely to have delayed neurocognitive recovery, similar to the finding reported previously.4 Patients with delayed neurocognitive recovery stayed longer in the hospital, again as shown previously.4,50 Univariate logistic regression analysis showed that age, ASA physical status classification, mini-mental status examination scores, and duration of hospitalization were predictors for delayed neurocognitive recovery, similar to that reported before.4 Also, we found that patients with employer’s health insurance had a lower delayed neurocognitive recovery incidence, indicating that less financial burden on the family for health care reduces delayed neurocognitive recovery. Finally, a high blood interleukin-6 concentration at 1 h after surgical incision is a risk factor for developing delayed neurocognitive recovery. In fact, interleukin-6 concentration in the blood at this time is the only independent risk factor for delayed neurocognitive recovery as determined by multiple regression model analysis. Surgery-induced interleukin-6 increase has been shown in multiple studies.51,52 However, its concentration as a risk factor for delayed neurocognitive recovery has not been reported.
Neuroinflammation is considered a critical neuropathological process for postoperative cognitive dysfunction.17 This consideration is mostly based on results from animal studies.17 We have shown that surgery on peripheral tissues causes systemic inflammation that then induces neuroinflammation in rodents.16,28 Consistent with this finding, serum interleukin-6 concentrations were increased after surgery in patients with or without delayed neurocognitive recovery, but the interleukin-6 concentrations were higher in patients with delayed neurocognitive recovery than that in patients without delayed neurocognitive recovery, providing initial evidence that heightened inflammation may be a pathologic process for delayed neurocognitive recovery in humans. There were no changes in the serum interleukin-1β, interleukin-10, and tumor necrosis factor-α within 24 h after the surgery. These results suggest that only selected cytokines, like interleukin-6 in this surgical population, are induced after surgery. Interestingly, the serum concentrations of vascular endothelial growth factor, intercellular adhesion molecule, transforming growth factor-β1, C3α, and advanced glycation end products were first decreased and then recovered after surgery. The first four factors can modulate immune functions. Advanced glycation end products are an oxidative stress marker.42,43 The reasons for this pattern of change are not known but may indicate a decreased immune function at the end of surgery, which recovers with time after surgery.
Perioperative neurocognitive disorder including delayed neurocognitive recovery (the subject of our current study), postoperative delirium, and longer-lasting postoperative cognitive dysfunction has been a focus of recent studies.17,19 Identifying modifiable risk factors, intervention, and biomarkers for perioperative neurocognitive disorder in humans and determining the mechanisms for this syndrome in animal and human studies will help reduce its occurrence and ultimately improve the outcome of perioperative patients.
One obvious limitation of our study is the unintended low power. A landmark study has shown that 41% of patients who were 60 yr of age or older developed delayed neurocognitive recovery at 7 days after various noncardiac surgeries including minimally invasive surgery.4 The sample size was calculated assuming a 40% delayed neurocognitive recovery rate and an 80% power to detect a one-third decrease in delayed neurocognitive recovery incidence in one group of patients compared with the other group. Because our study had 20.8% patients with delayed neurocognitive recovery in one group, the power to detect the same magnitude of decrease is decreased to 0.499 with the number of our patients. To achieve 80% power with the incidence in one group, we will need 430 patients in each study group to detect a one-third decrease in delayed neurocognitive recovery incidence in another group. Nevertheless, our study did not show the magnitude of one-third decrease in the propofol-based general anesthesia group, and the delayed neurocognitive recovery incidences were not different between patients anesthetized with propofol or sevoflurane. Thus, our study does not support that the general anesthetic choice may play an important role in delayed neurocognitive recovery development.
Although the incidence of delayed neurocognitive recovery in our study is lower than that in the study using the same method and criteria to diagnose delayed neurocognitive recovery,4 our incidence is similar to those in other well designed studies. The incidence of delayed neurocognitive recovery for noncardiac surgery patients (60 yr or older) in International Study of Post-Operative Cognitive Dysfunction 1 is 25.8% 7 days after the surgery.3 The other two very recent studies showed 14.8 to 23.2% of patients to have delayed neurocognitive recovery 7 days after surgery.32,53
In summary, our study showed that patients with laparoscopic abdominal surgery under propofol-based anesthesia had a delayed neurocognitive recovery incidence similar to that of patients under sevoflurane-based anesthesia. Anesthetic choice may not be a modifiable factor for delayed neurocognitive recovery. Our study has identified that a high concentration in serum interleukin-6 is an independent risk factor for delayed neurocognitive recovery. This finding provides clinical evidence for the role of inflammation in delayed neurocognitive recovery. Finally, our study has shown that delayed neurocognitive recovery is associated with an increased hospitalization length in Chinese patients, similar to patients in the United States and Australia.4,50
Research Support
Supported by grant No. 17ykjc26 from the fundamental research fund for the central universities, an internal fund from each participating institute, and the Baxter International Chinese Branch (Shanghai, China). The fund of the Baxter International Chinese Branch was used to print the data collection form and purchase blood sample collection tubes and a freezer to store blood samples.
Competing Interests
This study was partly sponsored by Baxter International Chinese Branch (Shanghai, China). The company had no role in the study design, data collection, data analysis, data interpretation, or writing of the article. The authors declare no competing interests.
Reproducible Science
Full protocol available at: zz3c@virginia.edu or liyuj@mail.sysu.edu.cn. Raw data available at: zz3c@virginia.edu or liyuj@mail.sysu.edu.cn.