Systematic Review and Meta Analysis of Validated P

European Journal of Surgical Oncology 48 (2022) 492e499
Contents lists available at ScienceDirect
European Journal of Surgical Oncology

journal homepage: www.ejso.com
Systematic review and meta-analysis of validated prognostic models

for resected hepatocellular carcinoma patients
Berend R. Beumer a, Stefan Buettner a, Boris Galjart a, Jeroen L.A. van Vugt a,
Robert A. de Man b, Jan N.M. IJzermans a, Bas Groot Koerkamp a, *
a
Erasmus MC Transplant Institute, Department of Surgery Division of HPB & Transplant Surgery, Erasmus MC, University Medical Centre Rotterdam,
Rotterdam, the Netherlands
b
Erasmus MC Transplant Institute, Department of Gastroenterology and Hepatology, Erasmus MC, University Medical Centre Rotterdam, Rotterdam, the
Netherlands
a r t i c l e i n f o a b s t r a c t
Article history: Background: Many prognostic models for Hepatocellular Carcinoma (HCC) have been developed to
Received 21 June 2021 inform patients and doctors about individual prognosis. Previous reviews of these models were quali-
Received in revised form tative and did not assess performance at external validation. We assessed the performance of prognostic
13 September 2021
models for HCC and set a benchmark for biomarker studies.
Accepted 15 September 2021
Methods: All externally validated models predicting survival for patients with resected HCC were sys-
Available online 21 September 2021
tematically reviewed. After selection, we extracted descriptive statistics and aggregated c-indices using
meta-analysis.
Keywords:
External validation
Results: Thirty-eight validated prognostic models were included. Models used on average 7 (IQR:4e9)
Prognostic model prognostic factors. Tumor size, tumor number, and vascular invasion were almost always included.
Liver resection Alpha-fetoprotein (AFP) was commonly incorporated since 2007. Recently, the more subjective items
Hepatocellular carcinoma ascites and encephalopathy have been dropped. Eight established models performed poor to moderate at
Survival external validation, with a pooled C-index below 0.7; including the Barcelona Clinic Liver Cancer (BCLC)
Recurrence system, the American Joint Committee on Cancer (AJCC) 7th edition, the Cancer of the Liver Italian (CLIP)
C-index Program, and the Japan Integrated Staging (JIS) score. Out of 24 prognostic models predicting OS, only 6
(25%) had good performance at external validation with pooled C-indices above 0.7; the Li-post (0.77), Li-
OS (0.74), Yang-pre (0.74), Yang-post (0.76), Shanghai-score (0.70), and Wang-nomogram (0.71). Models
improved over time, but overall performance and study quality remained low.
Conclusions: Six validated prognostic models demonstrated good performance for predicting survival
after resection of HCC. These models can guide patients and doctors and are a benchmark for future
models incorporating novel biomarkers.
© 2021 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY license
(http://creativecommons.org/licenses/by/4.0/).
1. Introduction models may describe mere artefacts caused by local practices in

patient selection, care, and data collection. Therefore, external
Prognosis is important in the care for patients with hepatocel- validation, preferably by an independent research group, is
lular carcinoma (HCC). To aid in the difficult task of predicting a required prior to implementation into clinical practice.
patient's prognosis, previous experiences can be used to develop a Numerous prognostic models for HCC have been developed. The
prognostic model. most established ones are the Barcelona Clinic Liver Cancer (BCLC)
Immediately after prognostic models are created, concerns are system, the American Joint Committee on Cancer (AJCC) 7th edi-
raised about their generalizability. Rather than a general pattern, tion, the Cancer of the Liver Italian Program (CLIP), and the Japan
Integrated Staging (JIS) score. Nonetheless, new proposals for
staging systems are being published on a monthly basis [1e3].
Several literature overviews of prognostic models for HCC have
* Corresponding author. 40 3015 GD Rotterdam Department of surgery Room, been published [4e8]. These reviews broadly discuss the historical
RG-219, the Netherlands.
context and classification schemes of the most common staging
E-mail address: b.grootkoerkamp@erasmusmc.nl (B.G. Koerkamp).
https://doi.org/10.1016/j.ejso.2021.09.012
0748-7983/© 2021 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
B.R. Beumer, S. Buettner, B. Galjart et al. European Journal of Surgical Oncology 48 (2022) 492e499
systems. These reviews, however, lack a systematic approach to the observations of two random patients are concordant. A c-index of
inclusion of models. Moreover, conclusions are biased, because they 0.5 indicates no predictive discrimination; a value of 1.0 indicates a
are primarily informed by the performance in development studies, perfect risk ordering of all patients. A c-index of below 0.60 was
rather than external validations studies. considered poor; a value between 0.60-0.69 was considered mod-
Therefore, the primary aim of this study is to assess and erate; and a value above 0.70 was considered good [11].
compare model performance of all externally validated prognostic A meta-analysis was performed using meta package for R 3.6.1 if
models for survival outcomes of patients after resection of HCC. there were multiple external validations available that reported the
Secondly, we aim to determine trends in the selection of prognostic confidence interval or a standard error alongside their c-index
factors and model performance. [12,13]. Pooling was performed on the probability scale using in-
verse variance weighting [14]. We applied a random effects model
2. Method with the DerSimonian-Laird method to estimate between study
variance. For studies with overlapping populations, only the largest
This study was embedded in the ‘Validated surgical models of cohort was considered for pooling. Heterogeneity was investigated
hepatobiliary malignancies’ (VALIDATE) initiative, in which we by means of a forest plot and I2 statistic [15]. To summarize these
appraised all externally validated prognostic models developed for results and inspect the across model performance, the pooled c-
hepato-pancreato-biliary malignancies. In the present study, indices and year of publication are displayed in a weighted scatter
prognostic models used to stage patients with HCC were investi- plot.
gated. The scope of this research was further narrowed to external
validations in patients who had undergone first time liver resec- 2.3. Development studies
tion. The study was not prospectively registered. We performed a
systematic review and meta-analysis in accordance with the The development study is the publication in which the model is
PRISMA and CHARMS guidelines (Supplemental Digital Content 1) first presented. The reference lists of included validation studies
[9,10]. were screened to identify the development studies of the prog-
nostic models. From these studies, we extracted the following data:
2.1. Selection criteria year of publication, country, medical center, inclusion period,
number of patients, patient selection criteria, estimation technique,
An experienced librarian performed a search of Embase, Med- end point definition, number of events, median follow-up
line, Web of Science, the Cochrane database and Google Scholar (months), number of coefficients (incl. candidate prognostic fac-
using the search terms provided in Supplemental Digital Content 2. tors, transformations, indicator variables of categorical prognostic
The last search was conducted on the 18th of July 2019. factors, and interaction terms), type of information incorporated in
Eligible studies were those in which an external validation was the final model, number of patients in internal validation cohort,
performed of a prognostic model in resected HCC patients with and model performance at internal validation. Effective sample size
either disease-free survival (DFS) or overall survival (OS) as was calculated as the number of events divided by the total number
endpoint. Studies written in English and published after 1990 were of coefficients. For interpretation the one in ten rule of thumb was
considered. Reviews, expert opinions, abstracts, and posters were used [16,17]. If the number of events was poorly reported, the
excluded. Studies with prognostic models containing prognostic fraction of the cohort size and total number of parameter estimates
factors not readily available in clinical practice (e.g., RNA/DNA was used as a proxy.
sequencing data or liquid biopsies) were also excluded. Further-
more, studies on recurrence of malignancies were excluded. Studies 2.4. Methodological and reporting quality
that in their analysis pooled data of resected patients with data of
patients treated without resection were excluded. Lastly, studies The quality of included studies was assessed independently by
were excluded that did not report model performance in terms of a two authors (BB and SB). As widely adopted consensus guidelines
Harrell's concordance index (c-index) or area under the curve are lacking, we used the items of the CHARMS checklist and TRIPOD
(AUC) of the receiver operating characteristic (ROC) with specifi- statement to score the development studies [9,18]. Items were
cation of the censor time. grouped in categories (e.g. patient selection, handling of missing
Three reviewers (SB, BGa, and BB) independently screened the data, and candidate predictors). The aggregate score per category
abstracts of all studies identified by the search. Eligibility was was low (1) if all items were listed in the paper, medium (2) if at
determined by reviewing the full manuscript of potentially relevant least one item was not reported, and high (3) if more than one item
studies. Disagreement between the reviewers was resolved by was missing. An aggregate score per study was constructed by
discussion. Descriptive, methodological, and outcome data for each summation of the category scores. Study was considered at low risk
of the included studies was extracted by BB and SB. of bias if the sum score was lower than 12, medium between 13-14
and high for scores above 15. We only scored the studies in which
2.2. Validation studies the model was derived using statistical methods (rather than
expert opinion), as most items concern collection, preparation, and
Descriptive statistics included year of publication, country, analysis of the data. The consensus-derived models were consid-
medical center, inclusion period, selection criteria, sample size, and ered to be at high risk of bias by default.
type of outcome. If a study externally validated more than one
prognostic model, or used multiple independent cohorts, data was 3. Results
extracted for each instance.
Discriminatory performance was assessed using the c-index or The search identified 10,282 potentially relevant citations for all
the AUC-ROC at 1, 3, and 5 years after surgery. The c-index served as HPB malignancies. After screening of the titles and abstracts, we
the principal measure of external validity, because it is a commonly selected 451 studies for full-text assessment. Ultimately, we
reported discrimination measure, remains unbiased in the pres- included 36 external validation studies in this review (Fig. 1). These
ence of censoring, and does not require an arbitrary cut-off point. 36 studies performed 150 validations for DFS and 190 validations
The c-index reflects the probability that predictions and for OS of 38 different prognostic models.
493
value of each factor; 20 of the 38 models (53%) used Cox propor-

tional hazards, 3 logistic regression, and only one (HKCL model)
used a decision-tree algorithm. Sixteen (42%) prognostic models
created risk groups using arbitrary cut-off points and displayed
Kaplan-Meier curves for each risk group.
Models marked with an * have a pooled C-index > 0.7. For
consensus models only the pooled c-index for OS validations is
shown. For models with NA pooling was not performed as no
confidence intervals were reported in the validation studies. Ab-
breviations: C-reactive protein (CRP), Eastern cooperative oncology
group (ECOG), Alkaline Phospatase (Aphos), Gamma-glutamyl
transferase (gGT), Aspartate aminotransferase (AST), Sodium (Na),
Lymphocytes (Lympho), Neutrocytes (Neutro), Hepatitis B (HBV).
Most models (45%) divided the study population in a separate
training and test set. In these studies, on average 70% (IQR 67e75)
of the data was assigned to the training cohort. Two studies divided
the population multiple times and averaged model performance
(i.e. cross-validation) [34,39] and one model used bootstrap
resampling [37]. Three studies used only the development cohort to
describe model performance [26,40,47].
3.2. Validation studies
A total of 32,825 patients from 36 validation studies provided

data for 340 validations [39,40,42,43,45e76]. Apart from two
studies [49,57] all validation cohorts were Asian, of which the
majority originated from China. A full listing of all validation
studies, their design, and results per model are presented for DFS in
Supplemental Digital Content 4 and for OS in Supplemental Digital
Content 5.
The median sample size in which prediction models were vali-
Fig. 1. Flowchart studies' screening and selection. dated was 349 (IQR: 212e768). The BCLC system and AJCC 7th
edition were validated most frequently, with 30 and 29 validations
for OS, and 24 and 22 validations for DFS. For DFS 7 out of 27 (26%)
3.1. Development studies models were externally validated only once. For OS this was 6 out of
24 model (25%).
Table 1 displays the key characteristics of the 38 prognostic Eligibility criteria for patients included in the validation studies
models. Additional information about the development studies is were typically: good performance status, no previous treatment,
presented in Supplemental Digital Content 3. The prognostic resection, and pathologically proven HCC. Additional criteria in one
models were developed between 1984 and 2019. They were pri- or more studies required patients with an R0 resection [45,46,51],
marily based on Asian cohorts with the exception of the CLIP, BCLC, the presence or absence of HBV [43,46,50], PVTT [51], large HCC
(modified) Glasgow prognostic score (GPS), and the Memorial [42], and multiple-lesion HCC [44].
Sloan Kettering Cancer Center (MSKCC) nomogram. The sample size
for model development was on average 451 patients (interquartile 3.3. Meta-analysis of model performance (c-indices)
range (IQR): 310e850). Inclusions spanned on average 9 years (IQR:
6e12). The median number of patients per estimate was 13 Meta-analysis of c-indices was performed for 33 models. Data
(IQR:7e23), and the median number of events per coefficient was from 17 of the 36 validation studies could be used. Fig. 2 shows the
12 (IQR:7e24). forest plots of 8 established models for OS validations. A complete
Models developed before 2002 typically included the total HCC overview of all forest plots for DFS and OS of all prognostic models
population, regardless of stage or treatment. In comparison, models is available in Supplemental Digital Content 6 and 7 and in table
developed after 2015 increasingly target subgroups such as patients format in Supplemental Digital Content 8 and 9.
undergoing resection of multiple tumors or patients with portal The pooled c-indices at validation of the 22 prognostic models
vein tumor thrombosis (PVTT). Before 2015, the MSKCC nomogram for OS ranged from 0.52 to 0.77. Several established models per-
was the only model for DFS, rather than OS. After 2015, DFS was the formed poor to moderate at external validation, with a pooled C-
endpoint in the majority (10/18) of models (Supplemental Digital index below 0.7; BCLC (0.61), AJCC 7th (0.62), CLIP (0.63), and JIS
Content 3). (0.61). Six models had good performance with a pooled c-index
The median number of prognostic factors in the risk score was 7 above 0.7: the Li-post (0.77), Li-OS (0.74), Yang-pre (0.74), Yang-
(IQR: 4e9). Traditional tumor characteristics such as tumor size, post (0.76), Shanghai-score (0.70), and Wang-nomogram (0.71)
tumor number, and vascular invasion were almost always included. [42e46]. These models were, however, only validated on a limited
Alpha-fetoprotein (AFP) was commonly incorporated since 2007; number of external validation cohorts (<3) and is there no confir-
15 out of 25 models (60%). Recently, the more subjective prognostic mation from an independent research group.
factors ascites and encephalopathy have been dropped. Prognostic The number of prognostic factors for the 6 best models ranged
factors were selected based on the literature and expert opinion from 7 (Li models) to 15 (Shanghai model). All 6 models require
(i.e. consensus based) in thirteen models (34%). All other models tumor size and number of tumors, 5 require vascular invasion and
used statistical methods to determine the independent prognostic AFP, and 3 require encapsulation, HBV, bilirubin, and creatinine
494
Table 1
All validated prognostic models [19e51].
(Table 1). Model discriminatory performance at internal validation was

Fig. 3 displays the (pooled) C-indices from external OS valida- assessed in 11 of the 30 studies (36%) using the c-index and
tions versus year of publication of the development study. The accompanied by a confidence interval in 8 studies (72%). Only 9
figure for the DFS setting is displayed in Supplemental Digital development studies (30%) inspected the calibration of the model.
content 10. For both outcome measures model performance
increased over time.
4. Discussion
3.4. Risk of bias In this systematic review and meta-analysis, we compared all 38
validated prognostic models for survival after resection of HCC.
The risk of bias for the development studies is summarized in Established models, such as the BCLC, CLIP, and AJCC performed
Supplemental Digital Content 11, and a detailed overview is pre- moderate to poor at external validation. Six models had good per-
sented in Supplemental Digital Content 12. Out of the 38 models 13 formance for OS at external validation; the Li-post, Li-OS, Yang-pre,
models were expert based leaving 25 statistically derived models to Yang-post, Shanghai score, and Wang nomogram [42e46]. More-
be investigated for risk of bias. The risk of bias was scored low for 7 over, we found that prognostic models improved over time with
models (28%), intermediate for 10 models (40%), and high for 8 better performance and fewer ambiguous prognostic factors such
models (32%). None of the studies reported the data source (e.g., as ascites and encephalopathy.
cohort study, randomized control trial or registry) and whether the Several other reviews of prognostic models for HCC have been
data was collected retrospectively or prospectively. The inclusion published [4e8]. These reviews, however, were qualitative in na-
criteria, baseline characteristics, and inclusion period were always ture, did not require external validation, and did not perform meta-
stated. However, the number of patients with missing values was analysis for model performance (c-indices). They conclude that
only reported for 4 of the 25 models (16%). there is no ideal staging system accounting for the geographical and
In 10 of the 25 models, the definitions of OS and DFS did not etiological differences [4,5]. Most reviews conclude that the BCLC is
specify how and when they were assessed. Only 8 of the studies the most comprehensive and commonly used staging system. The
(27%) reported the time of measurement of the prognostic factors main advantage of the BCLC system is that each stage has specific
(e.g., whether tumor size was measured preoperatively on imaging treatment recommendations [24]. The prognostic performance of
or postoperatively at pathological examination). None of the the BCLC system, however, is limited because some known prog-
studies investigated non-linearity of continuous prognostic factors. nostic factors (e.g., AFP) are unaccounted for. We found that the
495
Fig. 2. Pooled c-indices of validations of 8 established models. The forest plots show the individual validations in relation to each other for eight established prognostic models. A
study can appear multiple times if in the same study multiple independent cohorts were used.
BCLC staging system had poor to moderate prognostic performance because they are concise and perform well. The Li models cannot be
in 17 external validation studies with a pooled c-index of 0.61. used in the preoperative setting, because they require factors that
The six models that achieved a good prognostic performance all are only available after resection (e.g., microvascular invasion).
required tumor size and tumor number as prognostic factor. Most The two Yang models also had a good performance with a c-
models also included vascular invasion and AFP. They mainly index of about 0.75. The main difference with the Li models was
differed regarding tumor differentiation, capsulation, HBV status, that the Yang models required additional laboratory values. The
and laboratory values (i.e. bilirubin, INR, creatinine, and sodium). preoperative Yang model was based on only preoperatively avail-
The two Li models have a high c-index of about 0.75 and require able factors; the postoperative Yang model added 4 postoperative
only 7 prognostic factors. The Li-post model was developed for factors. Performance of the preoperative and postoperative Yang
patients with large HCC; the Li-OS model for patients with HBV- models were similar at external validation. The preoperative model
related early-stage HCC. The Li models are attractive models to of Yang is the best validated model to predict OS after resection of
predict OS after resection of HCC in the postoperative setting, HCC in the preoperative setting.
496
Fig. 3. Pooled c-indices for OS versus year of publication of the development study. Each point represents a model evaluated for OS with its size corresponding to the total
number of patients on which the performance estimate is based. The horizontal dashed lines represent the performance thresholds.
Prognostic models such as the preoperative model of Yang can Also, we recognize that the estimated performance of more
be used to inform patients and doctors in the preoperative setting recent models might decrease when more validations become
about life expectancy after resection of HCC. The potential benefit of available. Therefore, electing a single best model is complicated as
surgery should be balanced against surgical mortality that can be it requires a tradeoff between performance and the uncertainty in
high in patients with advanced age, cirrhosis, and requiring major the performance estimate. Nonetheless, our pooled analyses pro-
hepatectomy. Models that require postoperative factors such as the vide the best possible summary and a clear overview of the current
models of Li can be used to inform patients and doctors in the landscape. Second, we only included models with published
postoperative setting. These prognostic models can also be used to external validations. We recognize that due to this restriction
stratify patients in RCT's. Moreover, individual prediction of prog- several recent models (that have not yet been validated) were not
nosis could guide personalization of follow-up schemes. The included [74,84,85]. Also, models containing prognostic factors not
theoretical ground work that links prognostic models to the fre- readily available in clinical practice are missing in our analysis.
quency of scans has been published in 1983 [77]. Furthermore, we would like to emphasize that prognostic models
The risk of bias was high in most prognostic models. Moreover, are a means to an end rather than an end in itself. Most prognostic
the quality of most models was poor. Quality could be improved if models do not make the final step towards clinical decision making.
non-linear prognostic factors and missing values are better The BCLC system is the main exception. The link to treatment might
addressed. Moreover, explicitly creating separate pre- and post- be the reason why the European association for the study of the
operative models would increase applicability. We strongly liver (EASL) and the European society for medical oncology (ESMO)
recommend that adherence to reporting guidelines (TRIPOD still recommend the BCLC despite its moderate performance for
statement and CHARMS checklist) becomes a prerequisite for prognosis [86,87]. Further research should investigate how the best
publication of prognostic models [9,18]. Moreover, researchers prognostic models can guide treatment decisions [88,89].
should focus on independent external validation studies rather The American Association for the Study of Liver Diseases
than only developing more models. The field is too focused on (AASLD) guidelines also stated that with the typical set of clinical
model creation, whereas model validation, and implementation prognostic factors only marginal improvements can be made in
receive little attention. This trend is in line with systematic reviews prognostic models [90]. Serum biomarkers and gene signatures are,
from other specialties and needs to be addressed in order to in- however, a dynamic and promising field that aims to improve
crease applicability of prognostic models [78,79]. prognostic and predictive models. The best prognostic models of
Several limitations of our study are important. First, the out- this meta-analysis serve as benchmarks for novel biomarkers. Such
comes of pooled analyses cannot be regarded as the true model biomarkers are typically costly to measure and are only useful if it is
performance, because of considerable heterogeneity across cohorts. demonstrated that they can improve performance of the best
Furthermore, although good model performance at external vali- models that only require readily available patient and tumor
dation is necessary, it offers no guarantee that a model will characteristics.
generalize to other populations. Although, East-West differences in In conclusion, we identified 6 prognostic models for OS after
performance of resection models are in our view likely, due to the resection of HCC that achieved good performance at external vali-
focus on transplantation in western validation studies, they could dation [42e46]. These models are a benchmark for future models
not be examined. We hypothesize that the focus of validation incorporating novel biomarkers.
studies is a reflection of the preferred HCC treatment in the two
regions. Eastern cohorts are more often offered resection, in CRediT authorship contribution statement
contrast, in western cohorts increasingly more HCC patients receive
liver transplantation [80,81]. Nevertheless, the proportion of pa- Berend R. Beumer: screened the articles, performed the data
tients receiving liver resection in the West is still about 50% of those extraction, Formal analysis, wrote the paper which was edited by all
surgically treated [82,83]. These patients would benefit from co-authors. Stefan Buettner: screened the articles, performed the
dedicated external validation studies. data extraction. Boris Galjart: screened the articles. Jeroen L.A. van
497
Vugt: Conceptualization. Robert A. de Man: Methodology. Jan N.M. staging classification. Chin J Hepatobiliary Surg 1999;5:67e8.
[24] Llovet JM, Brú C, Bruix J. Prognosis of hepatocellular carcinoma: the BCLC
IJzermans: Methodology. Bas Groot Koerkamp: Methodology, All
staging classification. In: Seminars in liver disease. © 1999 by Thieme Medical
authors have approved the final version of the manuscript Publishers, Inc; 1999.
including the authorship list. [25] Greene FL, et al. AJCC cancer staging handbook: TNM classification of malig-
nant tumors. Springer Science & Business Media; 2002.
[26] Kudo M, Chung H, Osaki Y. Prognostic staging system for hepatocellular car-
Declaration of competing interest cinoma (CLIP score): its value and limitations, and a proposal for a new
staging system, the Japan Integrated Staging Score (JIS score). J Gastroenterol
2003;38(3):207e15.
None.
[27] Nanashima A, et al. The Japanese integrated staging score using liver damage
grade for hepatocellular carcinoma in patients after hepatectomy. Eur J Surg
Acknowledgements Oncol 2004;30(7):765e70.
[28] Minagawa M, et al. Staging of hepatocellular carcinoma: assessment of the
Japanese TNM and AJCC/UICC TNM systems in a cohort of 13,772 patients in
Wichor M. Bramer's help with constructing the search term is Japan. Annals of surgery 2007;245(6):909.
gratefully acknowledged. [29] McMillan DC, et al. Evaluation of an inflammation-based prognostic score
(GPS) in patients undergoing resection for colon and rectal cancer. Int J
Colorectal Dis 2007;22(8):881.
Appendix A. Supplementary data [30] Edge SB, Compton CC. The American Joint Committee on Cancer: the 7th
edition of the AJCC cancer staging manual and the future of TNM. Ann Surg
Oncol 2010;17(6):1471e4.
Supplementary data to this article can be found online at
[31] Kasymjanova G, et al. The predictive value of pre-treatment inflammatory
https://doi.org/10.1016/j.ejso.2021.09.012. markers in advanced non-small-cell lung cancer. Curr Oncol 2010;17(4):52.
[32] Kamarajah SK, et al. Critical evaluation of the American joint commission on
cancer (AJCC) staging system for patients with hepatocellular carcinoma
Financial support statement (HCC): a surveillance, epidemiology, end results (SEER) analysis. J Surg Oncol
2018;117(4):644e50.
None. [33] Tokumitsu Y, et al. A new prognostic model for hepatocellular carcinoma
recurrence after curative hepatectomy. Oncol Lett 2018;15(4):4411e22.
[34] Investigators C.o.t.L.I.P.. A new prognostic system for hepatocellular carci-
References noma: a retrospective study of 435 patients. Hepatology 1998;28(3):751e5.
[35] Leung TW, et al. Construction of the Chinese University Prognostic Index for
[1] Lin C-Y, et al. The ALBI grade is a good predictive model for very late recur- hepatocellular carcinoma and comparison with the TNM staging system, the
rence in patients with hepatocellular carcinoma undergoing primary resec- Okuda staging system, and the Cancer of the Liver Italian Program staging
tion. World J Surg 2019:1e11. system: a study based on 926 patients. Cancer 2002;94(6):1760e9.
[2] Tsilimigras DI, et al. Utilizing machine learning for pre-and postoperative [36] Forrest L, et al. Evaluation of cumulative prognostic scores based on the
assessment of patients undergoing resection for BCLC-0, A and B hepatocel- systemic inflammatory response in patients with inoperable non-small-cell
lular carcinoma: implications for resection beyond the BCLC guidelines. Ann lung cancer. British journal of cancer 2003;89(6):1028.
Surg Oncol 2019:1e9. [37] Tateishi R, et al. Proposal of a new prognostic model for hepatocellular car-
[3] Xue Y, et al. Establishment and validation of a prognostic model for hepato- cinoma: an analysis of 403 patients. Gut 2005;54(3):419e25.
cellular carcinoma after radical liver resection. Journal of BU ON.: official [38] Zheng S-S, et al. Liver transplantation for hepatocellular carcinoma: hangzhou
journal of the Balkan Union of Oncology 2019;24(4):1420e8. experiences. Transplantation 2008;85(12):1726e32.
[4] Tellapuri S, et al. Staging systems of hepatocellular carcinoma: a review. In- [39] Cho CS, et al. A novel prognostic nomogram is more accurate than conven-
dian J Gastroenterol 2018;37(6):481e91. tional staging systems for predicting survival after resection of hepatocellular
[5] Karademir S. Staging of hepatocellular carcinoma. 2018. carcinoma. J Am Coll Surg 2008;206(2):281e91.
[6] Adhoute X, et al. Usefulness of staging systems and prognostic scores for [40] Yang T, et al. A new staging system for resectable hepatocellular carcinoma:
hepatocellular carcinoma treatments. World J Hepatol 2016;8(17):703. comparison with six existing staging systems in a large Chinese cohort.
[7] Maida M, et al. Staging systems of hepatocellular carcinoma: a review of J Cancer Res Clin Oncol 2011;137(5):739e50.
literature. World J Gastroenterol: WJG 2014;20(15):4141. [41] Yau T, et al. Development of Hong Kong Liver Cancer staging system with
[8] Marrero JA, Kudo M, Bronowicki J-P. The challenge of prognosis and staging treatment stratification for patients with hepatocellular carcinoma. Gastro-
for hepatocellular carcinoma. Oncol 2010;15(Supplement 4):23e33. enterology 2014;146(7):1691e1700. e3.
[9] Moons KG, et al. Critical appraisal and data extraction for systematic reviews [42] Li Y, et al. Prognostic nomograms for pre-and postoperative predictions of
of prediction modelling studies: the CHARMS checklist. PLoS Med long-term survival for patients who underwent liver resection for huge he-
2014;11(10):e1001744. patocellular carcinoma. J Am Coll Surg 2015;221(5):962e74. e4.
[10] Moher D, et al. Preferred reporting items for systematic reviews and meta- [43] Li J, et al. Nomograms for survival prediction in patients undergoing liver
analyses: the PRISMA statement. Ann Intern Med 2009;151(4):264e9. resection for hepatitis B virus related early stage hepatocellular carcinoma.
[11] Borges LSR. Diagnostic accuracy measures in cardiovascular research. Inter- European Journal of Cancer 2016;62:86e95.
national Journal of Cardiovascular Sciences 2016;29(3):218e22. [44] Yang P, et al. Nomograms for pre-and postoperative prediction of long-term
[12] Schwarzer G. meta: an R package for meta-analysis. R News 2007;7(3):40e5. survival for patients who underwent hepatectomy for multiple hepatocellu-
[13] R Development Core Team R. A language and Environment for statistical lar carcinomas. Annals of surgery 2016;263(4):778e86.
Computing. Vienna, Austria: R Foundation for Statistical computing; 2019. [45] Sun HC, et al. Shanghai score: a prognostic and adjuvant treatment-evaluating
[14] van Klaveren D, et al. Assessing discriminative ability of risk models in clus- system constructed for Chinese patients with hepatocellular carcinoma after
tered data. BMC Med Res Methodol 2014;14(1):5. curative resection. Chin Med J 2017;130(22):2650e60.
[15] Higgins JP, Thompson SG. Quantifying heterogeneity in a meta-analysis. Stat [46] Wang Y, et al. Novel prognostic nomograms based on Inflammation-related
Med 2002;21(11):1539e58. markers for patients with hepatocellular carcinoma underwent Hepatec-
[16] Harrell Jr FE, Lee KL, Mark DB. Multivariable prognostic models: issues in tomy. 2019.
developing models, evaluating assumptions and adequacy, and measuring [47] Ang SF, et al. The Singapore Liver Cancer Recurrence (SLICER) Score for relapse
and reducing errors. Stat Med 1996;15(4):361e87. prediction in patients with surgically resected hepatocellular carcinoma. PloS
[17] Harrell Jr FE, et al. Regression modelling strategies for improved prognostic One 2015;10(4):e0118658.
prediction. Stat Med 1984;3(2):143e52. [48] Shim JH, et al. Prognostic nomograms for prediction of recurrence and sur-
[18] Collins GS, et al. Transparent reporting of a multivariable prediction model for vival after curative liver resection for hepatocellular carcinoma. Annals of
individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMC Med surgery 2015;261(5):939e46.
2015;13(1):1. [49] Chan AWH, et al. Development of pre and post-operative models to predict
[19] Onodera T, Goseki N, Kosaki G. Prognostic nutritional index in gastrointestinal early recurrence of hepatocellular carcinoma after surgical resection. J Hepatol
surgery of malnourished cancer patients. Nihon Geka Gakkai Zasshi 2018;69(6):1284e93.
1984;85(9):1001e5. [50] Qin W, et al. A novel score predicts HBV-related hepatocellular carcinoma
[20] Okuda K, et al. Natural history of hepatocellular carcinoma and prognosis in recurrence after Hepatectomy: a retrospective multicenter study. 2018.
relation to treatment study of 850 patients. Cancer 1985;56(4):918e28. [51] Zhang XP, et al. A nomogram to predict early postoperative recurrence of
[21] Sheng J, et al. The Chinese classification system compared with TNM staging hepatocellular carcinoma with portal vein tumour thrombus after R0 liver
in prognosis of patients with primary hepatic carcinoma after resection. resection: a large-scale, multicenter study. Eur J Surg Oncol 2019;45(9):
Hepatobiliary Pancreat Dis Int: HBPD INT 2005;4(4):561e4. 1644e51.
[22] Yan P, Yan L-N. Staging of hepatocellular carcinoma. Hepatobiliary Pancreat [52] Chen TW, et al. Comparison of clinical staging systems in predicting survival
Dis Int: HBPD INT 2003;2(4):491e5. of hepatocellular carcinoma patients receiving major or minor hepatectomy.
[23] Yang Bh RZ, Tang ZY. The consideration and proponent of the liver cancer Eur J Surg Oncol 2007;33(4):480e7.
498
[53] Noda T, et al. Usefulness of the CLIP scoring system for prediction of post- curative resection. J Surg Oncol 2018;117(4):625e33.
operative prognosis of patients with large hepatocellular carcinoma. [71] Liao R, et al. Combined preoperative ALBI and FIB-4 is associated with
J Hepatobiliary Pancreat Surg 2009;16(4):538e45. recurrence of hepatocellular carcinoma after curative hepatectomy.
[54] Xu L-B, et al. Staging systems for predicting survival of patients with hepa- J Gastrointest Surg 2018;22(10):1679e87.
tocellular carcinoma after surgery. World J Gastroenterol: WJG 2010;16(41): [72] Tokumitsu Y, et al. A new prognostic model for hepatocellular carcinoma
5257. recurrence after curative hepatectomy. Oncology letters 2018;15(4):4411e22.
[55] Huang JT, et al. A modified TNM-7 staging system to better predict the sur- [73] Zheng BH, et al. Radiomics score: a potential prognostic imaging feature for
vival in patients with hepatocellular carcinoma after hepatectomy. J. Cancer postoperative survival of solitary HCC patients. BMC Cancser 2018;18(1).
Res. Clin. Oncol. 2013;139(10):1709e19. [74] Ng KK, et al. A simplified prediction model for early intrahepatic recurrence
[56] Kinoshita A, et al. The Glasgow Prognostic Score, an inflammation based after hepatectomy for patients with unilobar hepatocellular carcinoma
prognostic score, predicts survival in patients with hepatocellular carcinoma. without macroscopic vascular invasion: an implication for adjuvant therapy
BMC Canc 2013;13. and postoperative surveillance. Surgical Oncology 2019;30:6e12.
[57] Santambrogio R, et al. External validation of a simplified BCLC staging system [75] Wang YY, et al. A modified staging of early and intermediate hepatocellular
for early hepatocellular carcinoma. Eur J Surg Oncol 2013;39(8):850e7. carcinoma based on single tumour >7 cm and multiple tumours beyond up-
[58] Huang J, et al. The inflammation-based scores to predict prognosis of patients to-seven criteria. Aliment Pharmacol Ther 2019;49(2):202e10.
with hepatocellular carcinoma after hepatectomy. Med Oncol 2014;31(4). [76] Xu L, et al. Novel prognostic nomograms for hepatocellular carcinoma patients
[59] Liu C, et al. Prognosis evaluation in patients with hepatocellular carcinoma with microvascular invasion: experience from a single center. Gut Liver
after hepatectomy: comparison of BCLC, TNM and Hangzhou criteria staging 2019;13(6):669e82.
systems. PloS One 2014;9(8). [77] Dwyer AJ, et al. Use of the hazard rate to schedule follow-up exams efficiently:
[60] Ruan DY, et al. Nomogram for preoperative estimation of long-term survival an optimization approach to patient management. Med Decis Making
of patients who underwent curative resection with hepatocellular carcinoma 1983;3(2):229e44.
beyond Barcelona clinic liver cancer stage A1. Oncotarget 2016;7(38): [78] Damen JA, et al. Prediction models for cardiovascular disease risk in the
61378e89. general population: systematic review. bmj 2016;353:i2416.
[61] Shen J, et al. Prognostic nomograms for patients with resectable hepatocelluar [79] Moons KG, et al. Prognosis and prognostic research: application and impact of
carcinoma incorporating systemic inflammation and tumor characteristics. prognostic models in clinical practice. Bmj 2009;338:b606.
Oncotarget 2016;7(49):80783e93. [80] Akamatsu N, et al. Surgery and hepatocellular carcinoma. Liver cancer
[62] Sposito C, et al. Development of a prognostic scoring system for resectable 2017;6(1):44e50.
hepatocellular carcinoma. World J Gastroenterol 2016;22(36):8194e202. [81] Belghiti J, Fuks D. Liver resection and transplantation in hepatocellular car-
[63] Wang JQ, et al. Comparison of the prognostic value of C-reactive protein- cinoma. Liver cancer 2012;1(2):71e82.
based prognostic scores in patients with hepatitis B virus-related hepatocel- [82] Chapman WC, et al. Surgical treatment of hepatocellular carcinoma in North
lular carcinoma. Int J Clin Exp Med 2016;9(5):8272e9. America: can hepatic resection still be justified? J Am Coll Surg 2015;220(4):
[64] Yang P, et al. Nomograms for pre- and postoperative prediction of long-term 628e37.
survival for patients who underwent hepatectomy for multiple hepatocellular [83] Liver E.A.F.T.S.O.T.. EASL clinical practice guidelines: management of hepato-
carcinomas. Ann Surg 2016;263(4):778e86. cellular carcinoma. J. Hepatol. 2018;69(1):182e236.
[65] Feng LH, et al. Novel microvascular invasion-based prognostic nomograms to [84] Cai B-B, et al. Use of a novel index, the A-index, and its associated nomogram
predict survival outcomes in patients after R0 resection for hepatocellular to predict overall survival rates after resection of primary hepatocellular
carcinoma. J Cancer Res Clin Oncol 2017;143(2):293e303. carcinoma. Clin Chim Acta 2020;500:34e41.
[66] Fu YP, et al. Prognostic nomograms stratify survival of patients with hepa- [85] Lin, Z.-X., et al., Controlling nutritional status (CONUT) score-based nomogram
tocellular carcinomawithout portal vein tumor thrombosis after curative to predict overall survival of patients with HBV-associated hepatocellular
resection. Oncol 2017;22(5):561e9. carcinoma after curative hepatectomy. Clin Transl Oncol: p. 1-11.
[67] Cao Y, et al. Prediction of long-term survival rates in patients undergoing [86] Galle PR, et al. EASL clinical practice guidelines: management of hepatocel-
curative resection for solitary hepatocellular carcinoma. Oncol Lett lular carcinoma. Journal of hepatology 2018;69(1):182e236.
2018;15(2):2574e82. [87] Vogel A, et al. Hepatocellular carcinoma: ESMO Clinical Practice Guidelines for
[68] Gan W, et al. Prognostic impact of lactic dehydrogenase to albumin ratio in diagnosis, treatment and follow-up. Ann Oncol 2018;29(Supplement_4):
hepatocellular carcinoma patients with childepugh I who underwent curative iv238e55.
resection: a prognostic nomogram study. Canc Manag Res 2018;10:5383e94. [88] Steyerberg EW. Clinical prediction models, 381. Springer; 2009.
[69] Hu Y, et al. Nomogram predicting survival of hepatocellular carcinoma with [89] Vickers AJ, Elkin EB. Decision curve analysis: a novel method for evaluating
portal vein tumour thrombus after curative resection. 2018. prediction models. Med Decis Making 2006;26(6):565e74.
[70] Huang JL, et al. A novel and validated prognostic nomogram based on liver [90] Heimbach JK, et al. AASLD guidelines for the treatment of hepatocellular
fibrosis and tumor burden for patients with hepatocellular carcinoma after carcinoma. Hepatology 2018;67(1):358e80.
499

Systematic Review and Meta Analysis of Validated P

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Systematic Review and Meta Analysis of Validated P

Uploaded by

Copyright:

Available Formats

European Journal of Surgical Oncology 48 (2022) 492e499

Contents lists available at ScienceDirect

European Journal of Surgical Oncology

Systematic review and meta-analysis of validated prognostic models

1. Introduction models may describe mere artefacts caused by local practices in

value of each factor; 20 of the 38 models (53%) used Cox propor-

3.2. Validation studies

A total of 32,825 patients from 36 validation studies provided

(Table 1). Model discriminatory performance at internal validation was

You might also like