Professional Documents
Culture Documents
Ludovica Pavoni
Professor Stoian
EC 360-1
22 April 2021
Empirical Paper
Rural Italy
Abstract
An econometric model of depopulation is applied to the study of rural areas. In particular, eight
Italian regions–that have a population density lower than 150 inhabitants per square kilometer–
are taken into account. In previous studies, the depopulation rate is been explained as a
phenomenon affected by both demographic and socio-economic variables. This paper then
aims to investigate the Depopulation Rate in Italian rural areas, using data collected from Istat
and Eurostat and to see if depopulation is affected by both types of variables. This study finds
that the demographic variables have much more explanatory power than economic ones; in
particular, the Natural Increase Rate and the Total Fertility Rate are significant when talking
about depopulation.
Pavoni 2
Table of Contents
Literature Review....................................................................................................................... 4
Descriptive Statistics.................................................................................................................. 9
Conclusions .............................................................................................................................. 19
References ................................................................................................................................ 20
Appendix .................................................................................................................................. 20
Pavoni 3
I developed an interest in this field of study while assisting Professor Schwarten with
his research on depopulation in rural areas, especially the rural areas of Abruzzo. The question
of depopulation is quite close to my heart because my grandmother comes from a small village
in the mountains of Abruzzo that, over the years, has consistently been losing its inhabitants.
Working with professor Schwarten enabled me to familiarize myself with the subject matter,
understand how diverse factors bring about depopulation, and appreciate quite how pressing
the issue of depopulation is. Unless action is taken to stem it, depopulation will kill off many
small towns up and down the country. This econometric project will therefore look at the main
determinants/causes of depopulation in rural Italy. Indeed, the study is not limited to Abruzzo,
but it will also take into account other rural areas to see if they depopulate in the same fashion.
Despite the extensive existing literature on the topic, there is no universally accepted way of
identifying or classifying what constitutes a rural area. In very simple words, a rural area is a
scarcely populated zone that is located outside towns and cities. For the purpose of this study,
however, we shall use the definition of the Organization for Economic Co-operation and
Development (OECD)1, which defines as rural any area with fewer than 150 inhabitants per
square kilometer. According to the OECD categorization, eight out of 20 Italian regions qualify
as rural areas, namely Abruzzo, Basilicata, Calabria, Molise, Sardinia, Trentino Alto Adige
Abruzzo 121
Basilicata 55
1
Dijkstra, L.; Ruiz, V. Refinement of the OECD Regional Typology: Economic Performance of Remote Rural
Regions; Regio, D.G., Ed.; European Commission; OECD: Paris, France, 2010.
Pavoni 4
Calabria 126
Molise 68
Sardinia 68
Umbria 104
Valle d’Aosta 38
Literature Review
Quite extensive research has been done on the topic, the most relevant of which for the
purposes of this paper is that done by Ayuda et al.’s. Their study of the Aragon region in Spain
considers the high rate of aging and a low rate of births. The researchers observe that the smaller
a village is, the more it is affected by depopulation and the faster it depopulates. They mention
how out-migration resulted in the phenomenon of negative natural growth, caused by the aging
of the population. The smaller the village, the faster its rate of aging, which, in turn, leads to a
low birth rate. The aging is caused by young people being the first to leave and, in particular,
by the fact more women than men tend to migrate out, which means fewer people are left who
The study carried out by Rizzo on the depopulation of Sicily, on the other
hand, emphasizes a possible link between depopulation and altitude. He shows how between
1961 and 2011 populations declined sharply in mountainous regions (i.e. regions with an
altitude of 501–1000m), while the percentage of people living at sea level (0–200 m) increased
by around 60%. Meanwhile, Mickovic et al. (2020), examine the depopulation of the
from the transformation of the economy in the 1990s, the neglect of agriculture, and the
Finally, the literature also suggests that depopulation follows a certain pattern. With
particular regard to rural areas, there seems to be plenty of agreement on the fact that the
territories being abandoned first are those that lie farthest from urbanized centers and have the
worst infrastructure and services. Rural territories are the most backwards and underdeveloped,
so people tend to abandon them, which sets in motion a vicious cycle from which it is difficult
to escape.
The data employed in the study are taken from official Italian national resources, the
National Institute of Statistics (ISTAT)2, from Eurostat3, and from a study by Cavalieri and
Guccio4 on health expenditure in Italy. In particular, the main model of the study follows the
eight regions over a period spanning from 2002 to 2019, while the three ‘extra’ models that
have restricted observations will cover data respectively from 2003 to 2017, from 2010 to 2019,
and also the comparison between 2003 and 2019. The variables taken into account can be
divided between strictly demographic and socio-economic. Indeed, part of the study is to
understand whether the depopulation affecting the rural regions of Italy is due to ‘natural
causes’ or if there is a specific problem and reason why people decide to leave.
1. Natural Increase Rate – this is calculated as the difference between the live births and
the deaths divided by the resident population of each region on January 1st for each
year. It is expected that if the natural increase rate is negative the depopulation rate will
increase, so the two are negatively related. If the natural increase rate is positive then
2
https://www.istat.it/it/popolazione-e-famiglie?dati
3
http://appsso.eurostat.ec.europa.eu/nui/show.do?dataset=nama_10r_2gdp&lang=en
4
http://www.siepweb.it/siep/images/joomd/1401044478573.pdf
Pavoni 6
2. Total Fertility Rate – defined as the number of births per 1,000 women of childbearing
age (between 15 to 49 years old) per year. It is predicted that as the fertility rate
increases, the depopulation rate would decrease because the two variables are
negatively related.
3. Net Domestic Migration – defined as the change in residence from the rest of Italy to
the region. As out-migration is greater than in-migration the depopulation rate will
increase.
4. Net Foreign Migration – defined as the change in residence from the rest of world to
the region. As migration increases so does depopulation. From the data available it is
possible to see that in-migration is greater than out-migration, so net foreign migration
1. Regional GDP – it is expected to have a negative relation with the depopulation rate
because as GDP goes up, the richer the region, the less people are expected to leave or
it could also be that as GDP grows people are more willing to move; this depends on
2. Annual per capita Household Income – it is expected that it will be negatively related
with the depopulation rate because as income increases or even if it stays relatively
uniform people don’t have reason to move away (data available for this variable spans
labor force, seasonally adjusted. As the unemployment rate increases people will be
more willing to leave the region to look for better opportunities, so the two variables
are positively related. However, it is important to mention that since the unemployment
Pavoni 7
rate doesn’t have a steady decrease or increase it would also make sense for people to
4. Number of crimes – defined as the number of crimes reported by the police forces to
the judicial authorities. It is predicted that as the number of crimes decreases so does
the depopulation rate, the two variables are positively related (data available for this
5. Per Capita Public Expenditure on Healthcare – defined as the amount that each region
spends on health, for both individual and collective services. It is expected that as per
capita public expenditure increases depopulation should decrease because people have
an incentive to remain in the region (data available for this variable is only for 2003
and 2019).
6. University Graduates – defined as the number of people, per region, that have a tertiary
level education. It is expected that as the number of graduates increases so will do the
depopulation rate because people will try to move away in order to have better work
The following table summarizes the type of variable, the unit of measurement, the expected
5
It describes the expected impact that each independent variable will have on the dependent one.
Pavoni 8
Household Income
expenditure on Guccio’s
healthcare study
The Quality of Life Index6 is a thematic analysis conducted by an Italian national daily
business newspaper, Il Sole 24 Ore, based on 90 indicators that span from wealth and
consumption, demographics and health, business and work, environment and services, justice
and security, culture and leisure. The analysis takes into account all the Italian cities and rates
them based on their score. With regards to this research, only the top 10 cities will be taken
into account; in particular, if a region has one of its cities in the top 10 of the Index then it takes
value 0 whereas if it’s not in the top 10 it takes value 1. It is expected that the region which
takes value 0 in the given year, and so that has one of its cities in the top 10 of the index, will
experience less depopulation or, better yet, will experience internal migration; therefore, the
Descriptive Statistics
Table 2 summarizes the variables that will be used in the main econometric model.
increase rate
rate
migration
6
https://lab24.ilsole24ore.com/qualita-della-vita-2019/
Pavoni 10
migration
rate
Table 3 is a correlation matrix that shows the eventual multicollinearity among the
variables under examination. Most of the explanatory variables are not highly correlated with
one another, the only exception are the variables Regional GDP and Net Domestic Migration
that have a negative correlation of -0.86 while for all the other variables the correlation is less
than 0.8.
Here are also reported some interesting graphs representing the relationship between
the Depopulation Rate, the Regional GDP, and the Net Domestic Migration.
The first analysis that was run was a simple OLS model that included observations from
2002 to 2019, so it was run a regression with the entire main model. Then, taking into account
that the data under analysis is panel data, through the use of the command xtset Region Year
we were able to make understand Stata that the observations belonged to eight different regions
across the same time frame. In Regression (1) it was observed that the dummy variable was not
statistically significant and that the Unemployment Rate wasn’t either. Regression (2) was run
omitting the dummy variable and generating a new variable for unemployment that accounted
for its marginal effect. By doing so the Unemployment became statistically significant at 5%
and the R2 slightly improved. Considering that the variables Regional GDP and Net Domestic
Migration have a negative correlation of -0.86 (see Table 3) we run first another regression (3)
without them and with Unemployment Rate and then (4) with Ln(unemployment); this last
model has all variables statistically significant at 1% but the difference in R2 is minimal. In
Regression (4) Regional GDP, Net Domestic Migration, and Unemployment rate have been
omitted. Finally it was run one extra model (5) only omitting the dummy variable and the
Unemployment Rate because not statistically significant. Table 4 reports the five regressions:
VARIABLES OLS 1 2 3 4 5
(0.000765) (0.00107)
(0.000818) (0.000811)
Number of Region 8 8 8 8 8
Pavoni 13
Among the five regressions, Regression (5) is the better model because all the variables
are statistically significant and we omitted Unemployment Rate that was not contributing to
the analysis. This model has a slightly lower R2 but for the purposes of this study the result is
still pretty good. We tested for heteroskedasticity (Figure 3 in Appendix) and corrected for it
(Figure 4 in Appendix) but the difference with the model is once again very little so we prefer
We then ran the regression for the first restricted model, the one with observations from
2003 to 2017 with the added variables Number of Graduates and Annual per capita Household
Income. The first regression (1) run included all the explanatory variables and it was found that
Net Domestic Migration, Unemployment Rate, the Number of University Graduates and the
Annual per capita Household Income were all not statistically significant. We tried generating
a new variable, the natural logarithm of the unemployment rate, but despite an improved R2
both the new variables as well as the dummy variable were not statistically significant, as seen
in Regression (2). We then removed the dummy variable, having checked for multicollinearity
through a matrix of correlation (table 5 in Appendix), also Regional GDP and Net Domestic
Migration were taken out of the Regression (3). Since University Graduates and Annual
Average Households Income were not significant we run a fourth model (Regression (4)) where
the only variables that were statistically significant were the demographic ones suggesting the
idea that depopulation in rural Italy is mainly driven by the increase in deaths and the decrease
in births.
The second restricted model is the one that takes into account the Per Capita Public
Expenditure on Healthcare between the year 2003 and 2019. Once again, the first model that
was run was the one comprising all the explanatory variables, as shown in Table 7 in
Regression (1). All the variables result not statistically significant. It was tested for
multicollinearity (table 8 in Appendix) and none of the variables, except for Regional GDP and
Net Domestic Migration, resulted correlated by -0.83. A second regression (2) was run omitting
the dummy variable and the two previously mentioned variables, but all the variables were still
Pavoni 15
not significant. Also in this case Ln(unemployment) was generated but the variables continued
to be non-significant (3). It is interesting to comment on the fact that Per Capita Public
Expenditure on Healthcare has no explanatory power, but when gathering the data it was
believed the opposite. Despite having Italy a free Healthcare system the per capita increase on
Expenditures on Healthcare should be a good incentive to remain in the region and not move
away. If we take Abruzzo’s case, we can see that Per Capita Public Expenditure on Healthcare
Ln(unemployment) -0.0113
(0.00845)
Constant 0.0728 0.0559 0.0438
(0.0123) (0.0321) (0.0260)
Observations 16 16 16
R-squared 0.998 0.932 0.943
Number of Region 8 8 8
The third restricted model was then analyzed. This model comprehended data from
2010 to 2019 and the new variable added was Number of Crimes. A regression (1) with all the
Pavoni 16
variables was run once again. Many of the variables were not statistically significant; we run a
second (2) model taking out the dummy variable, that given the previous tests was never really
helpful in understanding the depopulation rate, and the Net Domestic Migration. The second
model had three significant variables–Natural Increase Rate, Net Foreign Migration and
Regional GDP–while all the others weren’t significant at 1%. A third model (3) was run after
having checked for multicollinearity (table 8 in appendix) and removed Regional GDP because
showed a correlation of 0.81 with Number of Crimes and of -0.85 with Net Domestic
Observations 80 80 80
R-squared 0.810 0.807 0.800
Number of Region 8 8 8
Pavoni 17
After having run several regressions, we can say that the original model, the one with
the most observations is the most efficient in explaining depopulation rates. In particular,
Regression (5) we run can be considered the best in order to explain depopulation.
.
𝐷𝑒𝑝𝑜𝑝𝑢𝑙𝑎𝑡𝚤𝑜𝑛 𝑟𝑎𝑡𝑒 = 𝛽! + 𝛽" ∗ 𝑛𝑎𝑡𝑢𝑟𝑎𝑙 𝑖𝑛𝑐𝑟𝑒𝑎𝑠𝑒 𝑟𝑎𝑡𝑒 + 𝛽# ∗ 𝑛𝑒𝑡 𝑑𝑜𝑚𝑒𝑠𝑡𝑖𝑐 𝑚𝑖𝑔𝑟𝑎𝑡𝑖𝑜𝑛
.
𝐷𝑒𝑝𝑜𝑝𝑢𝑙𝑎𝑡𝚤𝑜𝑛 𝑟𝑎𝑡𝑒 = 0.023 + 2.51 ∗ 𝑛𝑎𝑡𝑢𝑟𝑎𝑙 𝑖𝑛𝑐𝑟𝑒𝑎𝑠𝑒 𝑟𝑎𝑡𝑒 + 2.62 ∗ 10'(
(5)
VARIABLES 5
Ln(unemployment)
Pavoni 18
Constant 0.0233***
(0.00322)
Observations 144
R-squared 0.836
Number of Region 8
• Ceteris paribus, a percentage point increase in the Natural Increase Rate determines an
• Ceteris paribus, every additional person that migrates within Italy increases, on average,
• Ceteris paribus, every additional person that migrates outside of Italy increases, on
• Ceteris paribus, every additional increase in Regional GDP (in million of euros)
increases by 3.40 ∗ 10'( percentage points the depopulation rate, on average. The
regression showed that Regional GDP and depopulation rate are positively related, this
could be explained by the fact that GDP increases all over Italy thus people would be
more willing to move to a different region, where the economic growth is more
significant.
• Ceteris paribus, 1 more birth per woman, on average, decreases the depopulation rate
This study could be expanded and improved if more time was available for the
collection of data. Indeed, further research should be done on the topic because some
Pavoni 19
explanatory variables included did not cover the chosen timeframe of the analysis, therefore,
the various results from the different regressions could not be compared. Moreover, more
the subject matter. In particular, the model presented in the analysis could be improved by
including a more detailed analysis of the territory. It would be interesting to see how the
population density changes within each region; in particular, how being closer to big cities
changes the depopulation rates. Going more in depth, it would be interesting to see if it possible
to categorize the rural regions in additional subgroups based on population density and also on
distance from bigger urbanized centers. The rural areas could be divided as predominantly
rural, predominantly rural close to cities and predominantly rural remote. Or the rural areas
could be subdivided based on economic performance because not all rural areas are the same;
some are exploiting their natural resources as promotors of economic growth and tourism. As
briefly shown in Rizzo’s study there is quite a clear difference between the rural areas which
have undergone a restructuring of their internal equilibrium and those areas that are
Conclusions
panel data analysis. It was found that demographic variables are much more significant in
explaining depopulation while economic and socio-economic variables influence less the
depopulation rates. The Natural Increase Rate determines a substantial explanatory power for
depopulation rates even though it doesn’t follow its predicted outcome. Migration also plays
an important role in explaining depopulation because as the more people are out-migrants the
more depopulation increases. The Total Fertility Rate however has a negative impact on
depopulation because if more people are born, so if there is an increase in births, then it will be
Pavoni 20
easier to compensate for depopulation. Despite the small weight it has and the fact that it wasn’t
included in the final regression, it is quite interesting to see how unemployment was not
significant while it was predicted that it would have had good explanatory power. It was
assumed that with the worsening of the labor market people would be more willing to leave,
but this study allows to understand that even if unemployment increases depopulation might
decide whether to move or not. Similarly, the impact of Regional GDP is interesting: as GDP
grew, depopulation would have to decrease while, despite having a small coefficient, GDP
determines an increase in depopulation. This might be explained by the fact that as people feel
References
Cavalieri, Martina and Calogero Guccio. “Health Expenditure in Italy: a Regional Analysis of
Mickovic, Biljana, et al. “Contribution to the Analysis of Depopulation in Rural Areas of the
Rizzo, Agatino. “Declining, Transition and Slow Rural Territories in Southern Italy
Characterizing the Intra-Rural Divides.” European Planning Studies, 2016, vol. 24,
Appendix