Professional Documents
Culture Documents
research-article2020
CIX0010.1177/1176935120939898Cancer InformaticsTesfaw and Muluneh
ABSTRACT
Background: Cancer is the second leading cause of death globally. Despite developing countries including Ethiopia continuing to shoul-
der the greatest burden, insufficient research has been conducted to determine geographical and other characteristic effects. The main
objective of this study was to assess the distribution and risk of cancer and determine the effects of some common clinical patient charac-
teristics on current patient status by taking into account the spatial effect.
Methods: The data for this study were obtained from the oncology ward of Felege Hiwot Referral Hospital, Bahir Dar, Ethiopia. About 415
cancer patients were included in the study. Spatial mixed ordinal logistic regression model was used to explore the geographical patterns
of the incidence of cancer and identify the risk factors.
Results: The findings of this study show that only 1.45% of patients were cured and 46.02% improved, whereas the rest have shown no
change and even worse status after treatment. The estimated odds of patients who received chemotherapy was 4.284 times the estimated
odds of patients who received palliative care. Prognostic factor (stage of cancer tumor), complication of cancer such as anemia during diag-
nosis, and treatment of patients given in the hospital had significant effect on the patient status.
Conclusion: Patients without anemia were more likely to be cured and improved than patients with anemia during diagnosis. Most of the
patients had advanced stage (IV) of cancer tumor, which dismantles the capability of the treatment to be less effective. There was negative
spatial effect on the incidence of cancer, indicating that districts with higher cancer incidence were usually surrounded by districts with lower
incidence.
RECEIVED: June 11, 2020. ACCEPTED: June 16, 2020. Declaration of Conflicting Interests: The author(s) declared no potential
conflicts of interest with respect to the research, authorship, and/or publication of this
TYPE: Original Research article.
Funding: The author(s) received no financial support for the research, authorship, and/or CORRESPONDING AUTHOR: Lijalem Melie Tesfaw, Department of Statistics, Bahir Dar
publication of this article. University, P.O. Box 79, Bahir Dar 6000, Ethiopia. Email: lijalemmelie@gmail.com
Background men and 8.5 million in women. Globally, lung cancer is the first
Cancer is a genetic disease caused by changes to genes that most commonly diagnosed cancer that affects most cases
control the way our cells function, especially how they grow (12.3% of the total). Breast (2 088 849 cases, 12.2% of the total),
and divide, which leads to unregulated growth and division of colorectal (10.00%), prostate (9.8%), and stomach (5%) are
cells that form malignant tumor and invade the nearby parts within the top 5 most commonly diagnosed cancers, ranking
of our body. The cause of cancer, genetic changes, can be second, third, fourth, and fifth, respectively.4
inherited from parents and also during a person’s life time as Cancer is a major public health burden in both developed
a result of errors that occur as cells divide by certain environ- and developing countries. About 72% of all cancer deaths in
mental exposures. Cancer tumors are malignant, which can 2007 occurred in low- and middle-income countries.5 The
spread and invade nearby tissue, called metastatic cancer, same pattern of cancer deaths occurs in sub-Saharan African
which can start almost anywhere in the human body.1,2 countries, particularly in Ethiopia. In Ethiopia, cancer accounts
Recently, the National Cancer Institute2 showed that there for about 5.8% of the total national mortality. According to the
are more than 100 types of cancer, usually named after the 2015 national cancer report of Ethiopia, among the entire adult
organs or tissues where the cancers form. Among these, only population, breast cancer (30.2%), cervical cancer (13.4%), and
20 types of cancers were considered in this study based on the colorectal cancer (5.7%) were the most prevalent cancers. As
registry of cancer patients in the oncology ward of Felege Ethiopia has diversified geographical area, in this study the
Hiwot Referral Hospital (FHRH) in the Western Amhara spatial effect of cancer was also accounted for by determining
region of Ethiopia. the district where the patient resides. Assigning individuals to
According to the Global Burden of Disease Cancer result their place of residence also poses problems,6 although this is
explored in 2015, cancer was the second leading cause of death usually the only location information that is available. Often
globally, 8 million deaths, with cardiovascular diseases being the goal of a geographic analysis is to identify a common envi-
the first.3 Based on the 2018 world cancer statistics (excluding ronmental exposure in a population, but exposures that are
nonmelanoma skin cancer), there were an estimated 18 million occupational or recreational may not necessarily reveal them-
cancer cases around the world, of which 9.5 million were in selves in a residential analysis.
Creative Commons Non Commercial CC BY-NC: This article is distributed under the terms of the Creative Commons Attribution-NonCommercial
4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without
further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
2 Cancer Informatics
The data for this study were obtained from the registry of can-
Variable description
cer patients in the oncology ward of FHRH. Felege Hiwot
Referral Hospital is located in Bahir Dar, the capital city of Before clinical assessment was provided at the oncology ward
Amhara region and a metropolitan city in the North Western of FHRH, patients were asked to fill in their life history (for
region of Ethiopia. The oncology ward of the hospital is patient characteristics, see Table 1), and unique patient ID was
designed to register all cancer patients coming from different assigned. Patient characteristics/risk factors such as age, gender,
districts of Western Amhara region (see Figure 1). Nurses residence, and blood type; prognostic factors such as stage;
working in the oncology ward of the hospital were selected and complications such as anemia; and treatment and patient status
trained to collect the cancer data of patients from September indicated in Table 1 are variables considered in this study.
2016 to January 2019. Variables mentioned under risk factors are factors associated
Registries attempt to consolidate information by patient ID with causing a cancer and determined by looking at things that
so that each case appears only once. Completeness of the data influence the incidence of new cancer cases, whereas prognos-
was checked and confirmed by a supervisor engaged in con- tic factors can only be determined by following up people who
tinuous supervision during data collection. For patients with already have cancer. Risk factors are patient characteristics
cancer, a series of interventions, including psychosocial support associated with the risk of contracting cancer disease, which
or palliative care, surgery, and chemotherapy aimed at curing include age, gender, residence, and blood type. On the contrary,
the disease or prolonging life considerably while improving the details of prognostic factor, complication, and treatment were
patient’s quality of life, were carried out. Data were collected collected from patients during treatment in the hospital, and
from the oncology ward of the hospital by retrospectively finally, the patient status was recorded in their registry card
reviewing all new cancer patients in the cancer registry center after receiving treatment. The prognostic factor (stage of can-
report. A series of questions using oral and medical tools cer) and the presence of complication (anemia) were registered
regarding risk factors, main symptoms, complications, comor- at the beginning of the diagnosis, that is, as soon as they are
bidities, treatment options (if they had used any before coming registered at the hospital. Complications of cancer are unan-
to the hospital), and prevention and early detection measures of ticipated/unforeseen diseases that arise after and as a result of
Tesfaw and Muluneh 3
Table 1. Description of variables considered in this study. cancer such as anemia in our study. Based on the information
of patients in their registry card, the following variables were
Independent variables Frequency (%)
considered in this study (see Table 1).
Characteristics and risk factors
Female (0) 257 (61.93) Public health data, such as the cancer data under analysis, often
show differences or patterns across different geographical areas.
Male (1) 158 (38.07)
Ignoring such data during analysis may give faulty results and
Residence blood type conclusions. Spatial analysis is an analysis which includes the
A+ 106 (25.54) influence of space into the analysis. All statistical methods of
spatial data have to take the spatial arrangement, and the
A− 18 (4.34)
resulting correlations, of observations into consideration to
AB+ 17 (4.10) provide accurate and meaningful conclusions based on the
analysis.8,9
AB− 4(0.96)
Spatial data are distinguished by observations that are
B+ 101 (24.34) obtained at spatial locations s1 , s 2 ,..., si , where si are coordi-
B− 16 (3.86) nates8 in the plane ℜ 2 and rarely in the space ℜ3 . Spatial data
have different features such as point, line, area, and volume.
O+ 121 (29.16)
Point is a precise location, s, in space indicated by a dot on a
O− 32 (7.71) map; Line is a sequential collection of connected points like
Prognostic factor
road and rivers; and Area is a region enclosed by lines like
counties, states, and districts, 1 feature of spatial data that has
Stage
been considered in this study. Weighted matrix, sometimes
I 64 (15.42) called Contiguity matrix, describes the relationship between
II 98 (23.62) districts i and j in the specified area. The (i , j )th element of a
spatial proximity matrix W, denoted by wij , quantifies the spa-
III 114 (27.47)
tial dependence between regions i and j, and collectively, wij
IV 139 (33.49) defines a neighborhood structure over the entire area. The spa-
Complication tial correlation parameter W = (wij ) is a neighborhood matrix
for the areal units,10 which can be defined as
Anemia at diagnosis
N N
1 ∑i =1 ∑ j =1 wij (Yi − Y )(Y j − Y ) where j = 1, 2, . . ., J−1, and the probability P [Y ″ j | x ] can be
I = 2 estimated as:
S ∑iN=1 ∑ Nj =1 wij
N N
1 ∑i =1 ∑ j =1 wij Yij (2) exp(α j + X ’β)
= 2 N N
P [Y ≤ j | x ] = (5)
S ∑i =1 ∑ j =1 wij 1 + exp(α j + X ’β)
where Yi is the total number of incidence of cancer diagnosed The cumulative probabilities do not use the final one,
in the ith district, Y = ΣiN=1Yi / N is the overall mean, and S 2 is P(Y ⩽ J), because it necessarily equals 1. The parameter † is a
the sample variance observed in Yi’s that can be computed as vector of regression coefficients describing the effect of the cor-
responding independent variable X on the log odds of response
∑
N
S 2 = (1 / N ) (Y − Y ) 2 .
i =1 i in category j or below. When this model fits well, it requires a
In testing the presence of spatial autocorrelation, the null
single parameter rather than J−1 parameters to describe the
hypothesis states that the nearby districts do not affect one
effect of X, because the model assumes that the effect of X is
another, which implies that there is no dependence and spatial
identical for all J−1 cumulative logits. This is known as the pro-
randomness in the data. In contrast, the alternative hypothesis
portional odds model.
states that there is spatial association or dependence among the
districts. The research hypothesis for this study states that the
nearby districts in Western Amhara region have an association Random effects test
with or dependence on the risk of cancer. Spatial autocorrela-
The random effects model is commonly used to detect whether
tion in the nearby areas is considered to be present when the
the variable intended has random effect or not, beyond the
test statistic such as Moran’s I takes on a larger value, compared
fixed effect that is encompassed in the model.15,16 Random fac-
with what would be expected under the null hypothesis of no
tors are not restricted to linear mixed models. Researchers want
spatial association.9,10 Spatial variable Wij Yij is a variable of a
to incorporate random factors into nonlinear models to build a
product of weighted matrices Wij and Yij , a cross product
model that accommodates correlated data or to consider the
of values in district i and j that deviated from the average value
levels of a factor as selected from a population of levels to make
Y (equation (2)). Yij is needed to measure the proximity,
inference to that population.15 Common questions in mixed
which means the distance between the observed i value and
modeling are whether variance components are zero, whether
the neighboring j values. Although there are several types of
random effects are independent, and whether rows (columns)
distance-based methods,11 the most common distance method,
can be added or removed from the covariance matrix. The
Euclidean distance, Yij = (Yi − Y )(Y j − Y ) , was used in this
effect of cancer type was detected by including in the model as
study.
a random effect. The patient status within one cancer type is
likely to be correlated with each other. The goal here is to make
Ordinal logistic regression model inference for the population in the study area of cancer types.
Ordinal logistic regression analysis deals with the association This could be accommodated by incorporating the cancer type
of a dependent variable with independent variables when the into the model.
dependent variable has more than 2 categories having natural The random test can be conducted in 2 ways: (1) by adding
order or rank.12 The dependent variable Y is assumed to have the random effect to the model that contains the fixed effects
an ordinal scale with J categories, and X = x1 , x 2 ,..., x p is the and perceiving whether there is pragmatic change in the esti-
vector of explanatory variables. Then the chances of the varia- mated parameter or (2) using mixture χ2 statistic.13,15
ble response of the jth category of explanatory variable X in
particular can be expressed by P [Y = j | x ] = π j (x ) . Spatial mixed ordinal logistic regression model
When response categories are ordered, the logits can use the
Spatial mixed ordinal logistic regression (SMOLR) is an anal-
ordering, which results in greater power and simple interpreta-
ysis which incorporates spatial effects into the mixed ordinal
tion. The cumulative probability for Y is the probability that Y
logistic regression model. Scholars in the study by Muhammad
falls at or below a particular outcome category j and is given by
and Tuti Purwaningsih14,17 used different methods to account
P (Y ≤ j ) = π1 (x ) + + π j (x ), j = 1, 2,..., J (3) for spatial effect and estimate its effect in their model. The spa-
tial logistic regression model in Muhammad and Tuti
where J is the number of categories for the response variable Y. Purwaningsih14 accounted for the spatial effect by including
The cumulative logit model13,14 is given as follows: weighting of the ijth location through wij = 1 / hij , where hij
is the Euclid distance between the districts i, j and estimated
P [Y ≤ j | x ] existence of event. On the contrary, the SMOLR model is
logit( P [Y ≤ j | x ]) = log
1 − P [Y ≤ j | x ] (4)
established to handle the spatial relationship proposed by
π1 (x ) + π2 (x ) + + π j (x ) Muhammad and Tuti Purwaningsih,17 and the specific model
= log = α j + X’β
π j +1 ( x ) + π j + 2 ( x ) + + π J ( x ) used in this study is given by
Tesfaw and Muluneh 5
Table 3. Districts within zones, respective number of patients, and total number inhabitants (population at risk).
Abbreviations: AW, Awi; BN, Benshangul; EG, East Gojjam; SG, South Gondar; SW, South Wollo; WG, West Gojjam.
8 Cancer Informatics
Table 4. Incidence of cancer disease per 100 000 standardized population at risk of each district.
Anemia
Blood group
Treatment
Sex
Stage
using AIC and BIC. The estimated fit statistic of AIC (BIC) corresponding independent variables to be estimated, which
was 1056.067 (1072.180) and 924.848 (981.244) for intercept- describe the effect of the corresponding variable on the log
only OLR and SOLR models, respectively. The dummy varia- odds of patient status at j or below category. In the parameter
ble and the reference (*) variables indicated in Table 7 and the βs , there is no subscript j because the model assumes propor-
estimated SOLR model can be written by tional odds, and hence the effect of independent variables on
the patient status is identical for all J−1 cumulative logit
logit( P [Yi ≤ j ]) = α j + β1 Agei + β20 Anemiai models.
+ β30 chamotherapyi + β31combined i + β40Stage_Ii (7) The requirement of random effects of cancer type in the
+ β41Stage_IIi + β42Stage IIIi model was tested using a mixture of χ2 distributions. These
tests were done by adding cancer type as the random effect in
where α j is the intercept and j = 0, 1, 2, and 3 for cured, the SOLR model which becomes the SMOLR model. The
2
improved, same, and deteriorate status of patient, respectively, need for random effect was significant ( χ0:1 = 6.28 ; P = 0.0254),
and i = 1, 2, . . ., 415. β1 , β2 , . . ., and βm are parameters of the which indicates that the model without cancer-type random
Tesfaw and Muluneh 11
Table 6. Indicator of spatial autocorrelation. surgery (combined) treatment was 4.284 ( = e 1.4549 ) and 7.242
Indicator Statistic P value
( = e 1.9799 ) times the estimated odds of patients who received
palliative care, respectively. Hence, patients who received
Moran’s I –0.0491 .0022 chemotherapy and patients who received both chemotherapy
and surgery were likely to fall in the lower direction of patient
status category than patients who received palliative care.
effects does not fit the data well. The estimated variance of
However, patients who received both chemotherapy and sur-
random effects of cancer type was 0.2997 by assuming constant
gery were more likely to fall in the lower direction of patient
variance-covariance working correlation structure.
status category than patients who took chemotherapy and pal-
To estimate the parameters of the SMOLR model, estima-
liative care (Table 5).
tion methods such as commutative logistic mixed model
Stage of patients at diagnosis also has significant effect on
(clmm) and generalized linear mixed model (glimmix) using
the log odds of probability of patient status at j or below cate-
R and SAS statistical software, respectively, were used.
gory. The estimated odds of patients who had stage I cancer
Nevertheless, at the end, the estimated parameters for the
tumor at diagnosis to fall in the lower direction of patient sta-
models described in Table 7 were obtained using the glimmix
tus category was 0.524 ( = e −0.6460 ) times the estimated odds of
SAS procedure.
patients who had stage IV cancer tumor at diagnosis, which
Model comparison was done based on AIC and BIC, the
indicates that the estimated odds of patients who had stage I
model with the smallest AIC and BIC being preferred. The fit
cancer tumor at diagnosis to fall in the lower patient status
statistics AIC (BIC) for intercept-only OLR model, SOLR
category was lower by 47.6% of the estimated odds of patients
model, and SMOLR model were 1056.067 (1072.180), 924.848
who had stage I cancer tumor at diagnosis.
(981.244), and 923.040 (937.970), respectively (Table 7). The
It is noted that there is spatial autocorrelation between dis-
SMOLR model having the smallest AIC (BIC) is therefore the
tricts. In Table 7, P= .0126 also shows that there was spatial
best model for the data.
correlation of levels of patient status between districts. The
Therefore, the final estimated model can be written by
spatial variable correlation with patient status was a negative
logit( P [Yi ≤ j ]) = α j − 0.003 Agei + 0.6034 Anemiai value, −0.0208, which indicates that districts with lower levels
of patient status are usually surrounded by districts with higher
+ 1.4549 Chemotheraphyi + 1.9799 Combined i
(8) levels of patient status.
− 0.6460 Stage I i − 1.2438 Stage_IIi
The goodness of fit of the model was assessed using the
− 0.72855 Stage_IIIi Hosmer-Lemeshow test.13 The Hosmer-Lemeshow good-
ness-of-fit test statistic χ 2 =36.6198 with its corresponding
As patient status has 5 categories (J = 5), the model has 4 P = 0.3935 shows that in the goodness-of-fit model, the data
intercepts such as α 0 , α 1, α 2 and α 3 with estimated values of are held.
−6.2469, −1.6466, −0.03957, and 2.1793, respectively. Usually,
these are not of interest except for estimating the probability of Discussion
patient status that falls at category j or below, P[Y ⩽ j]. This article investigated the distribution of incidence of cancer
The SMOLR model (Table 7) shows anemia at diagnosis, disease across districts in the Western Amhara region, Ethiopia,
treatment, stage at diagnosis, and spatial variables have signifi- and identified the factors that affect patient status by consider-
cant effects (P ⩽ 0.05) on log odds of probability of patient sta- ing the limited number of patient characteristics from the
tus category at j or below, but not age. The ordered logit (log patient registry card in FHRH. Twenty types of cancer were
odds) for patients with no anemia at diagnosis being in a lower considered in the study, and of those cancer type, breast cancer
patient status category was 0.6034 greater than patients who was the highest diagnosed cancer type. A similar study con-
had anemia at diagnosis. The estimated odds among patients ducted in Addis Ababa, Ethiopia, also found that breast cancer
without anemia at diagnosis is 1.828 ( = e 0.6034 ) times greater was the most common malignant neoplasm among women.24
than the odds among patients who had anemia at diagnosis, Among patients who were included in the study and received
which reflects that the estimated odds of patients who did not treatment, only 1.45% were cured and 46.02% improved,
have anemia at diagnosis to fall in the lower direction of whereas the rest have no change in status because of treatment
patient status category was higher by 82.28% of estimated and were even worse after getting treatment.
odds of patients who had anemia, keeping other variables con- The incidence of cancer within each district was assessed.
stant. This means the patients without anemia were less likely To make the incidence of cancer comparable between districts,
to fall in the higher patient status category than patients with the population at risk was standardized10 into the same 100 000
anemia. population at risk per district. Incidence of cancer showed sig-
The estimated odds of patients who received chemotherapy nificant variation across the different districts included in the
treatment and patients who received both chemotherapy and study (19.59 per 100 000 population in Finote Selam to 1.11
12 Cancer Informatics
Table 7. Parameter estimates using intercept-only OLR model, spatial OLR model, and spatial mixed OLR model.
Estimate (SE) P value Estimate (SE) P value Overall Estimate (SE) P value Overall
P value P value
Intercept
Cure −4.222 (0.4112) <.0001 −4.3954 (0.5702) <.0001 −6.2469 (0.9357) <.0001
Improved −0.1013 (0.0983) .3028 0.1292 (0.4159) .7561 −1.6466 (0.8322) .0625
Same 1.0198 (0.1112) <.0001 1.6861 (0.4242) <.0001 −0.03957 (0.8276) .9624
Deteriorate 2.7473 (0.2063) <.0001 3.8532 (0.4754) <.0001 2.1793 (0.8518) .0192
Anemia – –
Treatment – –
Stage – –
Spatial variable – – −0.0201 (0.0082) .0141 .0141 −0.0208 (0.0083) .0126 .0126
Variance component
/ / 0.2997 (0.2339)
Sigma11 (σ 11
2
)
Abbreviations: –, the corresponding variable was not included in the model; /, not applicable in the model; *, reference categories; AIC, Akaike information criterion; BIC,
Bayesian information criterion; OLR, ordinary logistic regression.
per 100 000 population in Enemay). The spatial autocorrela- awareness of the people about the disease, most cancer patients
tion, association between the value of incidence of cancer were diagnosed at an advanced stage. Treating the disease at
observed in a given district unit and the value of incidence of advanced stage becomes difficult with the available technology
cancer observed in neighboring district units, was checked and very limited cancer specialists in poor countries, which
using Moran’s I index and showed a negative autocorrelation. ultimately leads to most patients becoming incurable from the
This means that a district in the neighborhood of high inci- disease and needing palliative care.
dence of cancer is expected to have low incidence. Numerous studies18,26 have reported that cancer is a disease
It was found that one-third of patients visited the hospital of aging, in a sense that the occurrence of cancer becomes high
with advanced stage (IV) of cancer. This result which is in line when people get older. However, this was not the case in this
with results obtained in Africa7 and specifically in Ethiopia25 study as the incidence of cancer was predominantly concen-
tells us that as cancer treatment centers and health service trated in age between 41 and 55 years. It could be due to that
delivery are limited in the country and due to the lack of the life expectancy in developed country, which the study by
Tesfaw and Muluneh 13
Bray et al4 shows, is quite different from the life expectancy in Conclusion
developing countries like Ethiopia. Mostly in Ethiopia and Patients who did not have anemia complication during diagno-
particularly in our study area, West Amhara people mostly go sis were more likely to be cured and improved than those
to religious place while they become old instead of going to patients who had anemia complication. Most of the patients
hospital to be cured from diseases. In fact, it might be also due had advanced stage (IV) of cancer tumor, which dismantles the
to the fact that the population at risk within the age group was capability of the treatment to be less effective and increases the
not standardized as the population at risk within each age resistance of malignancy of the cancer tumor, and hence it was
group was not known to make it standardized. On the contrary, perceived that the stage of patients during diagnosis has its
age did not have a significant association with patient status, own impact on patient status after treatment. The age of the
which indicates that the malignancy of cancer tumor is not patient when the presence of cancer was disclosed has not sig-
dependent on the age group to which the patient belongs. nificant relationship with the current status of the patient after
It was found in our study that anemia complication with taking treatment. If the patients are capable of undergoing sur-
cancer has a significant effect on patient status. For patients gery, on average giving both chemotherapy and surgery treat-
with advanced stage of cancer, complication with anemia was a ment was effective for patients to be in a better status. There
common manifestation in some other studies such as Danaei was significant spatial correlation of incidence of patients with
et al26 and Benson et al.27 The estimated odds of patients who cancer between districts in which districts with high incidence
have no anemia at diagnosis to fall in the lower patient status of cancer were usually surrounded by districts with low inci-
category equals 1.828 times the estimated odds of patients who dence of cancer and vice versa.
have anemia at diagnosis. It means that patients with no ane-
mia complication at diagnosis fall in the lower patient status Acknowledgements
category compared with those patients who have anemia com- We thank the personnel in the hospital who helped us collect
plication during diagnosis. It was also found that the prognos- the data and provided expert assistance. We would also like to
tic factor stage during diagnosis has its own effect on the acknowledge Dr Teshome Alene, staff of Medical Health at
patients’ status after they were given appropriate treatment College of Medicine and Health Science, for technical and
based on the physician prescription. Different studies such as professional assistance.
that by Pakzad18 repeated that when stage (extent of tumor
Author Contributions
spread) increased during diagnosis, the patient status will
LMT conceived the idea of the research, conducted data analy-
become more likely very bad as curing or improving a patient
sis, interpreted the data, and prepared all figures and tables by
with advanced stage is difficult. It is advisable to have early
verifying the analytical method. Both authors discussed the
detection of the disease so that it becomes simple to decrease
results and wrote the manuscript.
the malignancy of the cancer tumor in the given treatment.
ABO blood type is associated with increased risk of lung
Availability of Data and Material
cancer based on the study by Urun et al.28 In contrast, the study
The data sets of generated analyses during the study were col-
conducted in China29 on gastric cancer explained that there is
lected from Felege Hiwot Referral Hospital (FHRH), Bahir
no significant association between blood group and cancer risk,
Dar, Ethiopia, by personnel in the hospital by initiation of
which is consistent with our study. Hence, it can be recognized
Department of Statistics on behalf of Bahir Dar University.
that the effect of blood group on cancer risk might depend on
Anyone can access the data from the Hospital by proposing
the cancer type.
reasonable proposal request.
The choice of treatment depends on where the cancer
tumor is located, stage of the disease, and health status,27 Ethical Consideration
which was partially consistent with our study that the treat- Permission to undertake the study was obtained from Felege
ment type has significant effect on the patient status after Hiwot Referral Hospital (FHRH) by the initiation of
taking treatment. Department of Statistics on behalf of Bahir Dar University.
This study reveals that the spatial variable has negative sig- The patients/participants were informed that participation was
nificant effect where districts with lower levels of patient status confidential and private information would be protected.
were usually surrounded by districts with higher levels of Identification of patients/participants was done only through
patient status and that districts with higher incidence of cancer numerical codes, and data collection of the study groups was
were usually surrounded by districts with lower incidence of carried out only when privacy was ensured based on Ministry
cancer, which is in line with studies by Pakzad et al19 and Shen of Health Legislation in the FHRH.
et al.20 This demonstrated that spatial epidemiology offers
insight into ways that individual characteristics, community ORCID iD
attributes, and physical environments interact to produce dis- Lijalem Melie Tesfaw https://orcid.org/0000-0002-2555
tinctive risk, illness, and disease management patterns. -8559
14 Cancer Informatics
References 1 5. Verbeke G, Molenberghs G. Linear Mixed Models for Longitudinal Data. New
York, NY and Park, CA: Springer-Verlag and SAGE; 2000.
1. Meyerson M, Gabriel S, Getz G. Advances in understanding cancer genomes
16. Seltman H. Experimental Design and Analysis. John Wiley and Sons; 2015.
through second-generation sequencing. Nat Rev Genet. 2010;11:685–696.
17. Muhammad N, Tuti Purwaningsih S. Modeling spatial ordinal logistic regres-
2. National Cancer Institute at the National Institutes of Health. https://www.can-
sion and the principal component to predict poverty status of districts in Java
cer.gov/about-cancer/understanding/what-is-cancer
Island. Int J Statis Appl. 2013;3(1):1–8.
3. Center M, Siegel R, Jemal A. Global Cancer: Facts and Figures. Atlanta, GA:
18. Pakzad R, Ghoncheh M, Pournamdar Z, et al. Spatial analysis of skin cancer
American Cancer Society; 2015.
incidence in Iran. Asian Pac J Cancer Prev. 2016;17:33–37.
4. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer
19. Pakzad R, Khani Y, Pakzad I, et al. Spatial analysis of stomach cancer incidence
statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide
in Iran. Asian Pac J Cancer Prev. 2016;17:27–32.
for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68:394–424.
2 0. Shen X, Wang L, Zhu L. Spatial analysis of regional factors and lung cancer
5. De Flora S, La Maestra S. Epidemiology of cancers of infectious origin and pre-
mortality in China. Cancer Epidemiol Biomarkers Prev. 2017;26:569–577.
vention strategies. J Prev Med Hyg. 2015;56:E15–E20.
21. Brown P, Jiang H, Ezzat S, Sawka AM. A detailed spatial analysis on contrasting
6. Boscoe FP, Ward MH, Reynolds P. Current practices in spatial analysis of cancer
cancer incidence patterns in thyroid and lung cancer in Toronto women. BMC
data: data characteristics and data sources for geographic studies of cancer. Int J
Public Health. 2016;16:950.
Health Geogr. 2004;3:28.
22. Roshandel G, Boreiri M, Sadjadi A, Malekzadeh R. A diversity of cancer incidence
7. Jennifer D, Cathyryne K, Milner D, et al. Africa’s emerging cancer crisis: a call
and mortality in West Asian populations. Ann Glob Health. 2014;80:346–357.
to action. https://bvgh.org/wp-content/uploads/2017/07/Africas-Emerging-
23. Gregorio DI, Cromley E, Mrozinski R, Walsh SJ. Subject loss in spatial analysis
Cancer-Crisis-A-Call-to-Action.pdf. Updated 2017. of breast cancer. Health Place. 1999;5:173–177.
8. Gelfand AE, Diggle P, Guttorp P, Fuentes M, eds. Handbook of Spatial Statistics. 24. Hadg E, Seifu D, Tigneh W, et al. Breast cancer in Ethiopia: evidence for geo-
Boca Raton, FL: CRC Press; 2010. graphic difference in the distribution of molecular subtypes in Africa. BMC
9. Waller LA, Gotway CA. Applied Spatial Statistics for Public Health Data Women’s Health. 2018;18:40.
(Vol. 368). Hoboken, NJ: John Wiley & Sons; 2004. 25. Woldeamanuel YW, Girma B, Teklu AM. Cancer in Ethiopia. Lancet Oncol.
10. Carlin BP, Gelfand AE, Banerjee S. Hierarchical Modeling and Analysis for Spatial 2013;14:289–290.
Data. Boca Raton, FL: Chapman and Hall/CRC Press; 2014. 26. Danaei G, Vander Hoorn S, Lopez AD, Murray CJ, Ezzati M; for Comparative
11. Betts MG, Diamond AW, Forbes GJ, Villard MA, Gunn JS. The importance of Risk Assessment Collaborating Group. Causes of cancer in the world: compara-
spatial autocorrelation, extent and resolution in predicting forest bird occurrence. tive risk assessment of nine behavioural and environmental risk factors. Lancet.
Ecol Model. 2006;191:197–224. 2005;366:1784–1793.
12. Midi H, Sarkar SK, Rana S. Collinearity diagnostics of binary logistic regression 27. Benson K, Balducci L, Aapro M. Anemia and cancer. In: Balducci L, Ershler WB,
model. J Interdiscip Math. 2010;13:253–267. Bennett JM, eds. Anemia in the Elderly. Boston, MA: Springer; 2008:99–113.
1 3. Hosmer DW Jr, Lemeshow S, Sturdivant R X. Applied Logistic Regression 28. Urun Y, Utkan G, Cangir AK, et al. Association of ABO blood group and risk of lung
(Vol. 398). Hoboken, NJ: John Wiley & Sons; 2013. cancer in a multicenter study in Turkey. Asian Pac J Cancer Prev. 2013;14:2801–2803.
14. Muhammad N, Tuti Purwaningsih S. Weighted Distance and Optimum Intersec- 29. Xiao S, Feng F, Sun L, et al. Blood type AB predicts promising prognosis in gas-
tion Spatial Logistic Regression for Prediction of Rural Poverty Status in West Java. tric cancer patients with positive preoperative serum CEA. Medicine (Baltimore).
Scientific and Academic publishing; 2009. 2017;96:e8496.
Copyright of Cancer Informatics is the property of Sage Publications Inc. and its content may
not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's
express written permission. However, users may print, download, or email articles for
individual use.