Professional Documents
Culture Documents
JOINT DIME/ITDG
PLENARY MEETING
Item 2
1. INTRODUCTION
The purpose of this document is to inform the DIME and the ITDG of the draft
detailed lists of high-value datasets for:
business statistics (Annex A.)
macroeconomic statistics (Annex B.)
social statistics (Annex C.).
that have been transmitted to the competent ESS Directors’ Groups (BSDG, DMES
and DSS, respectively) for their opinion, taking into account (i) the capacity of the
ESS to deliver, bearing in mind that it is the responsibility of the national
authorities in the EEA to make sure that the HVDs are disseminated:
(a) free of charge;
(b) machine readable;
(c) via Application Programme Interfaces (APIs); and
(d) made available for bulk download.
and (ii) the valued added for the users.
Jointly, these will form a list of high-value datasets for the statistics category – to
be submitted by Eurostat on behalf of the ESSC to the European Commission’s
Directorate-General for Communications Networks, Content and Technology
(Connect).
The DIME and ITDG are invited to take note of the draft lists, and reflect on
relevant accessibility/interoperability criteria and (joint) development needs, in the
view of the fact that preparedness to meet criteria (c) and (d) appears1 to be uneven
across the ESS.
2. BACKGROUND
The digital recovery will require the emergence of a real data economy as a motor
for innovation and job creation, as outlined in the Communication 2 adopted by the
Commission on 27 May as part of the Recovery Plan. In this context, making
government datasets available for the common good has an important role to play.
The Directive on ‘Open Data and the re-use of public sector information’ (the Open
1
Summary of the outcome of the VIN consultation on HVDs (annex to document ESSC 2020/43/3/EN)
2
The EU budget powering the recovery plan for Europe 27.5.2020; Communication from the
Commission to the European Parliament, the European Council, the Council, the European Economic
and Social Committee and the Committee of the Regions; (COM(2020) 442 final):
https://ec.europa.eu/transparency/regdoc/index.cfm?fuseaction=list&coteId=1&year=2020&number=442&version=F
High-value datasets (HVDs) – Detailed dataset specifications DIME-ITDG/2020/2
2
Data Directive; ODD)3 should improve re-use of governments’ data by the private
sector.
The Open Data Directive introduces the concept of high-value datasets (see
Article 2 list item 10 of the ODD) that are associated with high commercial
potential. Their re-use ‘…is associated with important benefits for society, the
environment and the economy, in particular because of their suitability for the
creation of value-added services, applications and new, high-quality and decent
jobs, and of the number of potential beneficiaries of the value-added services and
applications based on those datasets’.
As set out in chapter V of the ODD, a specific list of high-value datasets will be
adopted by the European Commission by means of an implementing act, together
with conditions and arrangements for access and re-use, e.g. terms applicable to re-
use, formats of data and metadata and technical arrangements for dissemination.
The specific high-value datasets shall be:
(a) available free of charge;
(b) machine readable;
(c) provided via Application Programme Interfaces (APIs); and
(d) made available for bulk download.
Statistics is explicitly listed (in Annex I to the ODD) as one of the thematic
categories in scope of the high-value dataset concept. Other categories comprise
geospatial, earth observation and environment, companies and company ownership,
and mobility. This is perceived as complementary to the European Data Spaces 4,
initiative, which foresees ‘data spaces for public administrations’ mainly devoted to
data on law and public procurement.
The European Commission will provide opportunities for funding projects to
improve the availability and use of high-value datasets under future Digital Europe
Programme. The programme will focus on building strategic digital capacities and
on facilitating the deployment of digital technologies within the EU. One strand will
be dedicated to ensuring that the public sector can deploy state of the art digital
technologies and to provide interoperable public services across the EU.
The definitive list of high-value datasets by category will be defined in an
implementing act. The European Commission (Connect) has commissioned an
impact assessment study on the list of high-value datasets to be made available by
the Member States under the Open Data Directive5. A tentative list of HVDs for all
areas has been proposed by Eurostat to Connect in April 2020, and full details for
each of the HVDs proposed should be provided by Eurostat in June 2020 in order
for it to be possible to use in the impact assessment study.
In its interactions with Connect, Eurostat has clarified that for the statistics category,
this detailed list will be developed jointly with the National Statistical Institutes of
the ESS.
3
Directive (EU) 2019/1024 of the European Parliament and of the Council of 20 June 2019 on open data
and the re-use of public sector information; available via
https://eur-lex.europa.eu/legal-content/EN/TXT/?qid=1561563110433&uri=CELEX:32019L1024.
4
A European strategy for data (Communication COM(2020) 66 from the Commission to the European
Parliament, the Council, the European Economic and Social Committee and the Committee of the
Regions); available via
https://ec.europa.eu/info/sites/info/files/communication-european-strategy-data-19feb2020_en.pdf
5
See: https://etendering.ted.europa.eu/cft/cft-display.html?cftId=4676
High-value datasets (HVDs) – Detailed dataset specifications DIME-ITDG/2020/2
3
3. ESS EFFORTS TO DATE
3.1. Dedicated group
Following the timetable of the Commission for the preparation of the
implementing act, it would be timely to provide an indicative list of high-
value statistical products by May 2020.
In order to achieve this goal, the joint Steering Group of the ESS Directors
of Methodology (DIME) and the ESS IT Directors’ Group (ITDG) agreed at
its meeting on 12 Feb 2020 to extend the mandate of the group on using
privately held data for official statistics to elaborate a proposal for a common
list of high-value datasets in statistics.
3.2. ESSC opinion
The work of the dedicated group led to an ESS Vision Implementation
Network (VIN) consultation6, involving all ESS NSIs. Based on the
consultation outcome1 of, a proposal7 for statistics HVDs was made to the
ESSC. In its opinion (see Annex D.), the ESSC inter alia:
accepted the approach to have a short list of high-value datasets
underlined the importance of engaging with potential users when defining
the scope, content and granularity of the exercise
tasked the Directors’ Groups with defining, for each dataset, the
exact scope and breakdown taking into account the capacity of the ESS
to deliver and the valued added for the users.
6
The VIN consultation documents are available to each NSI (via their VIG/VIN representative) on
CIRCABC at https://europa.eu/!BY86RQ.
7
Proposed list of high-value datasets in the statistics category (document ESSC 2020/43/3/EN):
https://circabc.europa.eu/ui/group/93919500-a22c-4172-ab6d-68ae21509331/library/7db2813e-b216-44cf-8c6a-fcf735d0df98/details
High-value datasets (HVDs) – Detailed dataset specifications DIME-ITDG/2020/2
4
4. NEXT STEPS
Table 1 below outlines the main on-going and future steps of the procedure leading
to the formulation and subsequent adoption of the implementing act.
Table 1. On-going and future steps of the procedure leading to the formulation and
subsequent adoption of the implementing act regarding high-value datasets
Eurostat Transmit draft statistics HVD proposal (all domains) to Connect 4 June 2020
DMES, DSS, BSDG Opinions on the HVD proposals for the various domains 8-18 June 2020
Eurostat Transmit final statistics HVD proposal (all domains) to Connect 22 June 2020
Use of draft & final statistics HVD proposal in the study 5 underlying the impact
Connect assessment for the list of high-value datasets
throughout June 2020
Reflection on accessibility/interoperability criteria and joint development
DIME/ITDG 2-3 July 2020
needs/opportunities
Connect Drafting of Commission impact assessment July-Sep 2020
Updating and finalisation of the list statistics HVDs
Connect, RSB 2nd half of 2020
(as part of the full list of HVDs from all categories)
Open Data Committee Vote on the implementing act Feb 2021
Adoption of the implementing act
European Commission 1st half of 2021
(following formal consultations and Committee vote)
Member States Transposition of the Open Data Directive into national law 21 July 2021
Partner12
Product13
Flow14
Country of origin
Type of trip
Breakdowns Duration of the trip
Destination country
Means of transport
Means of accommodation
Booking modalities
Profile of the tourist/visitor
Coastal vs non- Degree of urbanisation
Region –
coastal areas + specific cities
Frequency Yearly
Quarterly
8
By NACE Rev 2
9
Breakdown up to 4-digit NACE activity code, depending on the size of the reporting country
10
By Classification of Types of Construction (CC)
11
The granularity of the activity (CC) breakdown is depending on the size of the reporting country
12
By Nomenclature of Countries and Territories for the External Trade Statistics of the Community and Statistics of Trade between Member States (GEONOM); breakdown by partner Member State/country.
13
By Combined Nomenclature (CN); at 8-digit level.
14
Exports/imports.
High-value datasets (HVDs) – Detailed dataset specifications DIME-
ITDG/2020/2
6
Monthly
SDG indicators
No sufficiently closely associated SDG indicators were identified for any of the proposed business statistics high-value datasets
Table A3. Details on the breakdowns proposed for tourism flows in Europe
Dimension Categories Scope
Country of origin Residents; non-residents Only for Nights spent (key variable 1)
Type of trip Long; short; domestic; outbound Only for Participation (key variable 2)
Duration of the trip Units, in number of nights
Destination country By country of the world (sample size permitting) Only for demand side information (key variables 3, 4, 5)15:
Means of transport 7 categories16 Tourism trips made by EU residents
Tourism nights spent by EU residents
Means of accommodation 7 categories16
Tourism expenditure made by EU residents
Booking modalities Various attributes16
15
The sample size can determine the possible level of granularity.
16
See Annex II to Regulation (EU) No 692/2011 of the European Parliament and of the Council of
6 July 2011 concerning European statistics on tourism
High-value datasets (HVDs) – Detailed dataset specifications DIME-ITDG/2020/2
8
ANNEX B. PROPOSED HIGH-VALUE DATASETS FOR MACROECONOMIC STATISTICS
The six proposed high-value datasets for macroeconomic statistics are presented in Table B1 below. Some additional details on key variables are
provided in Table B2.
Table B1. Proposed high-value datasets for macroeconomic statistics
National accounts – GDP main aggregates (including National accounts – key indicators National accounts – key indicators Consolidated government
Dataset Consumer Price Index Government expenditure and revenue
population and employment) on corporations on households gross debt
HICP Monthly data P51G - Gross fixed capital Government gross debt
GDP[B1GQ] – Gross domestic product at market Disposable income of households
(index) formation of nonfinancial Total general government revenue
prices – gross (B6G)/net (B6N)
corporations
HICP Monthy data B2A3G - Gross operating surplus P51G - Gross fixed capital
GVA[B1G] – Value added, gross Total expenditure
(monthly rate of change) and mixed income formation for households
HICP Monthly data HFC[P31_S14] – Final consumption expenditure of F.A - Total households sector Net lending/ net borrowing of general
Key variables (annual rate of change) households F.A - Total financial sector assets assets government (B.9)
HICP Annual data F.L - Total financial sector F.L - Total households sector
GFCF[P51G] – Gross fixed capital formation
(index) liabilities liabilities
HICP Annual data (rate
CE[D1] – Compensation of employees
of change)
POP – population and EMP[ETO] – Total
employment figures transmitted in national accounts
Purpose17
Industry18 19
Asset type20 21
Breakdowns Revenue category22
Expenditure category23
Gross debt category24
Regional NUTS 225 NUTS226
Yearly
Frequency Quarterly 27
Monthly
17
By Classification of individual consumption by purpose (COICOP)
18
By NACE Rev. 2
19
For gross value added, income and employment
20
By AN_F6
21
The granularity of the activity (CC) breakdown is depending on the size of the reporting country
22
Defined in ESA 2010 8.100 and chapter 20 by reference to a list of categories: Market output, output for own final use, payments for non-market output, taxes on production and imports, other subsidies on production,
receivable property income, current taxes on income, wealth, etc., net social contributions, other current transfers and capital transfers
23
Defined in ESA 2010 8.100 and chapter 20 by reference to a list of categories: intermediate consumption, gross capital formation, compensation of employees, other taxes on production, subsidies, payable property income,
current taxes on income, wealth, etc., social benefits other than social transfers in kind, social transfers in kind - purchased market production, other current transfers, adjustments for the change in pension entitlements, capital
transfers and acquisitions less disposals of non-financial non-produced assets.
24
Categories of government liabilities (as defined in ESA 2010): currency and deposits (AF.2), debt securities (AF.3) and loans (AF.4).
25
NUTS 3 for GDP/GVA at current prices
26
NUTS 2 currently available only for net disposable income (B6N) of households
27
Quarterly frequency only for key variable 1 (Gross fixed capital formation of nonfinancial corporations) for sector S1M combining households (S14) and non-profit institutions serving households (S15)
High-value datasets (HVDs) – Detailed dataset specifications DIME-
ITDG/2020/2
9
sdg_08_11 – Investment share of sdg_08_11 – Investment share of sdg_17_50 – Share of environmental sdg_17_40 – General
SDG indicator sdg_08_10 – Real GDP per capita
GDP by institutional sectors GDP by institutional sectors taxes in total tax revenues government gross debt
28
Key variable 1 to be used for calculating the Gross investment rate of non-financial corporations (P51 / B1G*100)
29
Key variable 2 to be used for calculating the Gross profit share of non-financial corporations (B2G_B3G / B1G*100)
High-value datasets (HVDs) – Detailed dataset specifications DIME-
ITDG/2020/2
11
ANNEX C. PROPOSED HIGH-VALUE DATASETS FOR SOCIAL STATISTICS
The eight proposed high-value datasets for social statistics are presented in Table C1 below. Some additional details on key variables and SDG indicators
are provided in Table C2 and Table C3, respectively. The proposed breakdowns of total healthcare expenditure are presented in detail in Table C4.
30
Classes: 20-24 years, 25-29 years, 30-34 years, … , 55-64 years
31
Classes: 15-24 years, 25-54 years, 55-74 years
32
By ISCED 2011
33
Only for S80/S20 – no subnational breakdown for the Gini coefficient.
High-value datasets (HVDs) – Detailed dataset specifications DIME-
ITDG/2020/2
12
Frequency Yearly Yearly Yearly Yearly Yearly Yearly Yearly & quarterly Yearly & quarterly
Table C3. Details on the SDG indicators to be provided as part of the social statistics HVDs
Dataset SDG indicator code SDG indicator title
sdg_03_41 Standardised death rate due to tuberculosis, HIV and hepatitis by sex
sdg_03_42 Standardised preventable and treatable mortality
Death rate sdg_08_60 People killed in accidents at work
sdg_11_40 People killed in road accidents
sdg_16_10 Standardised death rate due to homicide by sex
sdg_01_10 People at risk of poverty or social exclusion
Poverty sdg_01_20 People at risk of income poverty after social transfers
sdg_01_30 Severely materially deprived people
sdg_10_41 Income distribution (quintile share ratio)
Inequality
sdg_10_50 Income share of the bottom 40 % of the population
sdg_04_50 Employment rates of recent graduates by sex
Employment sdg_05_30 Gender employment gap
sdg_08_30 Employment rate by sex
sdg_08_20 Young people neither in employment nor in education and training by sex (NEET)
Unemployment
sdg_08_40 Long-term unemployment rate by sex
Table C4. Details on the breakdowns proposed for total healthcare expenditure
Dimension Description Categorisation34 Classes
H1+HC2: HC.1.1+HC.2.1, HC.1.2+HC.2.2, HC1.3+HC.2.3, HC1.4+HC.2.4
HC.3: HC.3.1, HC.3.2, HC.3.3, HC.3.4
Healthcare Type of need that healthcare goods and services aim to satisfy or
ICHA-HC HC.4;
functions the kind of health objective pursued.
HC.5: HC.5.1, HC.5.2
HC.6; HC.7; HC.9.
The organisations and actors that deliver healthcare goods and
Healthcare
services as their primary activity, as well as those for which ICHA-HP HP.1; HP.2; HP.3; HP.4; HP.5; HP.6; HP.7; HP.8; HP.9; HP.0.
providers
healthcare provision is only one among a number of activities.
Healthcare HF.1: HF.1.1, HF.1.2: HF.1.2.1, HF.1.2.2;
Types of financing arrangements through which healthcare goods
financing ICHA-HF HF.2: HF.2.1, HF.2.2, HF.2.3;
and services are paid for and obtained by people.
schemes HF.3; HF.4.
34
As per SHA 2011 (https://ec.europa.eu/eurostat/web/products-manuals-and-guidelines/-/KS-05-19-103)
High-value datasets (HVDs) – Detailed dataset specifications DIME-ITDG/2020/2
14
ANNEX D. FINAL OPINION35 OF THE 43RD ESSC REGARDING ITEM ESSC 2020/43/3
Opinion
1. The ESSC stressed that all official statistics are of equal high value. While some delegates
emphasised that the full ESS statistical acquis should be given the status of high-value dataset, the
majority was in favour of a narrowed down approach. To practically contribute to the Open Data
Directive exercise, a list of high-value datasets for the statistics thematic category was agreed to start
with.
2. The ESSC underlined the importance of engaging with potential users when defining the scope,
content and granularity of the exercise.
3. The ESSC acknowledged that official statistics based on new privately held data sources (such as
data held by Mobile Network Operators) should be included in the list of HVDs for the statistics
thematic category when available.
4. The ESSC indicated the need for subsequent revisions of the implementing act and expected that
the initial list of HVDs for the statistics thematic category is expanded in a next revision in a few
years’ time.
5. The ESSC underlined the importance of the availability of high-value datasets for the thematic
categories of geospatial data, earth observation, and companies and company ownership, given their
importance as input for the production and enrichment of European statistics.
6. The ESSC tasked the Directors’ Groups with defining, for each dataset, the exact scope and
breakdown taking into account the capacity of the ESS to deliver and the valued added for the users.
7. The ESSC asked Eurostat to submit this opinion to DG Connect and expressed its wish to continue
discussing the topic.
35
https://circabc.europa.eu/ui/group/93919500-a22c-4172-ab6d-68ae21509331/library/58a270fa-666e-4315-9929-b3c7987373de
High-value datasets (HVDs) – Detailed dataset specifications DIME-ITDG/2020/2
15