You are on page 1of 36

Analysis of the 2nd National

Nutrition and Health Survey


Third Installment of the Final Project

January 6, 2022
1
Introduction 3
Dataset Description 4
Objective and scope 4
Tables and their variables 5
Relationship-Entity Diagram 9
Modifications to Main Tables 18
Anthropometry Table: 18
Comorbidity Table: 18
Disability Table 18
Exercise Table 18
Table Frequency Eating Habits 18
Eating Habits Table 19
Members Table 19
Table Province Region 19
Region Table 19
General Health Table 19
Sociodemographic Table 19
Vaccination Table 19
Incorporation of Additional Tables 20
Index Comorbidity 20
Disability Index 20
Vaccination Index 20
Individual Income Index 20
Selected Segmentations 21
Explanation of the Report 22
General Data 22
Glossary of Calculated Measures 28
Future Lines 35
Bibliography: 36
Introduction
This document is a final work of integration of the knowledge obtained during the Course of
Data Analytics.
The objective is to process a set of data given by a Database, to present pertinent
information within a report using the Power Bi tool.

The topic was part of the 2nd National Survey of Nutrition and Health (ENNyS).
In the transformation process, the total of the database was taken, and different tables were
formed to represent the selected variables in a more organized and summarized way.
Then it worked on each of these tables to give it a format that allows to be represented in a
more efficient graphic and dynamic way.
A series of measures have also been generated to represent important information
according to this model.
Dataset Description

The dataset is composed of a subset of variables from the dataset corresponding to the data
of the 2nd National Nutrition and Health Survey, collected between 2018-2019 and published
in 2021 by the Ministry of Health of Argentina. For each subject, results were collected on
their sociodemographic variables, general health, eating habits and physical activity (among
other data that was excluded from the project).

The survey is a cross-sectional study of the Argentine Republic in which a survey is applied
to individuals residing in private households in urban localities of 5,000 inhabitants or more.
These localities allowed the representativeness of the following regions:

· GBA: Autonomous City of Buenos Aires (CABA) and 24 parties of the Buenos Aires
Conurbano.

· CENTER: Rest of the province of Buenos Aires, Córdoba, Entre Ríos and Santa Fe.

· NORTHEAST (NEA): Chaco, Corrientes, Formosa and Misiones.

· NORTHWEST (NOA): Catamarca, Jujuy, La Rioja, Salta, Santiago del Estero and
Tucumán.

· CUYO: Mendoza, San Juan and San Luis.

· PATAGONIA: Chubut, La Pampa, Neuquén, Río Negro, Santa Cruz and Tierra del Fuego.

On the other hand, for reasons of maximizing the generability of the results and simplifying
their interpretation, only those subjects of legal age were selected, resulting in the following
sample:

Population GBA CENTER NEA NOA WHOSE PATAGONI Subtotal


by age A

18 and over 898 1262 1275 1354 1394 1184 7367

Objective and scope

The objective of the project is to make an optimal visualization for the results of the
descriptive statistics of the Argentine population in the areas of interest. In addition, it is
intended to visualize the relationship between general health and other variables that
possibly correlate with the former, such as eating, nutritional, physical activity habits, etc.

The end user of the project is anyone who is interested in observing graphically and
quantitatively the relationship between the health of Argentines and other variables, for
example their habits. On the other hand, it could also have value (as an exploratory study)
when planning public policies that aim to maximize health and prevent illness in the general
population.

Tables and their variables

Table 1: Survey

Contains the total of the subset of variables selected for the project. They are not "clean"
and, therefore, will not be used, since the rest of the tables contain the data already
processed and optimized.

Tabla 2: Miembros (Members)

1. ID (nvarchar): PK. Id of the subject.


2. Region_ID (nvarchar): FK. Region ID.

Tabla 3: Región.

1. Region_ID (nvarchar): PK Region ID.


2. Region (nvarchar): Region Name.
3. Población (int): Number of Inhabitants in the region.

Tabla 4: Provincia-Región

1. ID_Region (nvarchar): FK Region ID.


2. Region (nvarchar): Region Name.
3. Provincia (nvarchar): Province Name

Tabla 5: Sociodemografía.

1. ID (nvarchar): FK. Id of the subject.


2. Sexo (nvarchar): sex.
3. Edad (int): age.
4. Nacionalidad (nvarchar): nationality.
5. Cobertura de Salud (nvarchar): type of coverage (may not have)
6. Nivel Educativo (nvarchar): completed educational level.
7. Situación Laboral (nvarchar): whether or not are employed right now.
8. Ocupación (nvarchar): tipo de ocupación.
9. Ingreso Mensual Individual (nvarchar): individual income range
10. Ingreso Mensual del Hogar (nvarchar): Household income range
Tabla 6: Antropometría.

1. ID (nvarchar): FK. Id of the subject.


2. Peso Autopercibido (Decimal): estimation of one's own weight.
3. Talla Autopercibida (Decimal): estimation of one's own size.
4. Peso Real (Decimal): real weight.
5. Talla Real (Decimal): real size.
6. Conformidad con Peso (nvarchar): description of the degree of conformity.
7. Tiempo desde último pesaje (nvarchar): time range since last weighing.
8. Intento de Disminución del Peso (nvarchar): there is or is not such an attempt.
9. Intento Control de Peso (nvarchar): there is or is not such an attempt.
10. IMC(float)
11. IMC_Interpretacion(nvarchar)

Tabla 7: Ejercicio.

1. ID (nvarchar): FK. Id of the subject.


2. Trabajo Fisico Intenso (nvarchar): performs intense physical activity at work.
3. Frecuencia Semanal Trabajo Intenso (int): number of weekly days of intense physical
activity at work.
4. Duracion Trabajo Intenso (int): number of minutes each time
5. Trabajo Fisico Moderado (nvarchar): engages in moderate physical activity at work.
6. Frecuencia Semanal Trabajo Moderado (int): number of weekly days of moderate
physical activity at work.
7. Duracion Trabajo Moderado (int): number of minutes each time.
8. Cardio en Desplazamiento (nvarchar): walks/bikes at least 10' consecutive times
when commuting.
9. Frecuencia Cardio en Desplazamiento (int): number of weekly days of that
displacement.
10. Duracion Cardio en Desplazamiento (int): number of minutes each time.
11. Ejercicio Fisico Intenso (nvarchar): performs intense physical activity in his spare
time.
12. Frecuencia Semanal Ejercicio Fisico Intenso (int): number of weekly days of intense
physical activity in your free time.
13. Duracion Ejercicio Fisico Intenso (int): number of minutes each time.
14. Ejercicio Fisico Moderado (nvarchar): engages in moderate physical activity in their
spare time.
15. Frecuencia Semanal Ejercicio Fisico Moderado (int): number of weekly days of
moderate physical activity in your free time.
16. Duracion Ejercicio Fisico Moderado (int): number of minutes each time.
17. Total_Moderado (int): number of minutes per week of moderate physical activity.
18. Total_Intenso (int): number of minutes per week of intense physical activity.
19. Total (int): number of minutes per week of moderate and intense physical activity.
20. min_dia (int): number of minutes a day spent sitting or lying down.
Tabla 8: Salud General.

1. ID (nvarchar): FK. Id of the subject.


2. Salud autopercibida (nvarchar): How you perceive your health from Excellent to Bad.
3. disc_Carnet (int): whether have a disability card or not
4. Carnet_vacunacion (nvarchar): whether you have a vaccination card or not

Tabla 9: Comorbilidad.

1. ID (nvarchar): FK. Id of the subject.


2. Diabetes (int): have or do not have diabetes.
3. enfermedad_renal (int): have or do not have kidney disease.
4. hipertension_presion_alta (int): have or do not have hipertension_presion_alta.
5. dislipemia_colesterol_trigliceridos_altos(int): have or do not have dyslipidemia, high
cholesterol or triglycerides.
6. anemia (int): ): have or do not have anemia
7. cardiopatía (int): have or do not have heart disease.
8. sonda_vesical (int): have or do not have a bladder catheter.
9. riñon_transplantado (int): have or do not have a transplanted kidney.
10. obesidad (int): are or are not obese.
11. epoc (int): have or do not have COPD.
12. infarto (int): had or did not have a heart attack.
13. cancer (int): have or do not have cancer.
14. leucemia (int): you do or do not have leukemia.
15. hipotiroidismo (int): have or do not have hypothyroidism.
16. artrosis (int): hve or do not have osteoarthritis.
17. asma (int): have or do not have asthma.
18. chagas (int): have or do not have chagas.
19. gastritis (int): have or do not have gastritis.
20. artritis (int): have or do not have arthritis.
21. hernia (int): have or do not have a hernia.
22. hipertiroidismo (int): have or do not have hyperthyroidism.
23. epilepsia (int): have or do not have epilepsy.
24. osteoporosis (int): have or do not have osteoporosis.
25. acv (int): have or do not have acv.
26. Otra (int): have or do not have another disease.

Tabla 10: Discapacidad

1. ID_Miembro (nvarchar): FK. Subject ID.


2. Discapacidad (str): type of disability. Values:
• Discapacidad Visual: permanent difficulty or limitation to see, even with
glasses or lenses on.
• Discapacidad Auditiva: permanent difficulty or limitation in hearing, even
when wearing a hearing aid
• Discapacidad para Caminar: permanent difficulty or limitation in walking or
climbing steps.
• Discapacidad Manual: difficulty or permanent limitation to grasp objects and /
or open containers with your hands.
• Discapacidad Intelectual: permanent difficulty or limitation to understand
and/or learn.

Tabla 11: Hábitos Alimenticios.

1. ID_Miembro (int): FK. Subject ID.


2. Desayuno Última Semana (int): how many times a week eat breakfast.
3. Almuerzo Última Semana (int): how many times a week have lunch.
4. Merienda Última Semana (int): how many times a week snack.
5. Cena Última Semana (int): how many times a week dinner.
6. Tipos Comida Extra en la Semana (nvarchar): consumed or not, outside the 4 main
meals, some kind of extra food in the week such as sweets, sweets, sweet cookies,
pastry kneading, bills, cereals with sugar.
7. Frecuencia Sal (nvarchar): frequency with which consume salt.
8. Cantidad de Sal (nvarchar): if consume salt, how much of it use.
9. Lectura Información Nutricional (nvarchar): how often read about reading about
nutrition facts.
10. Comprensión Información Nutricional (nvarchar): what is the degree of understanding
about nutritional information.
11. Influencia de Medios (nvarchar): do receive any kind of influence from the media
about this issue?
12. Autopercepción de Alimentación Saludable (nvarchar): how perceive your level of
healthy eating.
13. Frecuencia de Comida Frente a Dispositivo Electrónico (nvarchar): frequency of
meals in front electronic devices.
14. Cantidad de comidas semanales (int): what is the number of weekly meals.

Tabla 12: Frecuencia Hábitos Alimenticios

1. ID_Miembro (int): FK. Id of the subject.


2. Tipo alimento (nvarchar): In general, taking the last three months as a reference,
what kind of food did consume?
3. Frecuencia (int): In general, based on the last three months, how often do consume
the type of food?
4. Tipo Alimento_ID (int): Unique code that identifies the type of food.

Tabla 13: Vacunas.


1. ID (nvarchar): FK. Id of the subject.
2. Antitetánica (nvarchar): did or did not receive the Tetanus vaccine.
3. Hepatitis_B(nvarchar): did or did not receive the Hepatitis B vaccine.

Relationship-Entity Diagram

The Entity-Relationship diagram originally proposed (from which modifications were made)
focuses on a single primary key (Subject _ID), therefore, the beginning is represented by the
following image corresponding to Table 2:

1. Table 3, corresponding to Geography, is represented by the following image:


Being the only table in the model that shows a ratio of many to one (M:1) since there are
many subjects within each region.

2. Table 4, corresponding to Sociodemography, is represented by the following image:

3. Table 5, corresponding to Anthropometry, is represented by the following image:


4.
5. Table 6, corresponding to Exercise, is represented by the following image:

6. Table 7, corresponding to Disability, is represented by the following image:


7. Table 8, corresponding to Diseases, is represented by the following image:
8. Table 9, corresponding to Food Habits, is represented by the following image:
9. Table 10, corresponding to Frequency of Eating Habits, is represented by the
following image:

10. Table 11, corresponding to Smoking, is represented by the following image:


11. Table 12, corresponding to Vaccination, is represented by the following image:
The general diagram is as follows:
After all the changes made (and explained in the next section), the entity-relationship
diagram (seen from the Power BI tool) is as follows:

For more specificity, the relationships are shown in the following list:
Modifications to Main Tables

Anthropometry Table:
1. Column name changed "ID" to "ID_Miembro" to better identify the Primary Key

Comorbidity Table:
1. Column headers were renamed to begin with a capital letter
2. Given the need to have the names of diseases in a single column, the Unpivot tool is
used. This generates duplicate records from the ID column.
3. The "Value" column is eliminated, which previously served as a binary model to
determine if the registry had the disease or not. By doing this, those respondents
who do not have comorbidities were automatically removed from the table (leaving
those without comorbidity reflected in the Members Table).
4. The "attribute" column containing diseases is renamed "comorbidity" and the "ID"
column "ID_Miembro" to better identify the Primary Key
5. From this table is generated Index_Comorbilidad which will be explained in the
section "additional tables"

Disability Table
1. Column headers renamed to begin with a capital letter
2. Column is removed without _disc, which reported that the respondent did not have
disabilities. In this way, the table only contains members with some type of disability
(leaving those without disabilities reflected in the Members Table).
3. An ordering of records according to column ID was carried out
4. Given the need to have the names of the disabilities in a single cotulum, the Unpivot
tool is used. This generates duplicate records from the ID column.
5. The "Value" column, which previously served as a binary model to determine
whether the registry had a disability or not, is deleted.
6. The columna "attribute" containing disabilities is renamed to "Disability" and the "ID"
column to "ID_Miembro" to better identify the Primary Key
7. From this table is generated Index_Discapacidad which will be explained in the
section "additional tables"

Exercise Table
1. Column name changed "ID" to "ID_Miembro" to better identify the Primary Key

Table Frequency Eating Habits


1. Given the need to have the types of food in a single column, the Unpivot tool is used.
This generates duplicate records from the Attribute column.
2. Column name changed "Attribute" to "Food Type", "Value" to "Frequency" and "ID" to
"ID_Miembro" to better identify the Primary Key
Eating Habits Table
1. Values "99" (assumed to be a code and does not provide data relevant to the
analyzed model) are replaced by the value "0"
2. ID column renamed to ID_Miembro to better identify the Primary Key

Members Table
1. ID column renamed to ID_Miembro to better identify the Primary Key

Table Province Region


1. Original table as imported from the .xlsx file

Region Table
1. Original table as imported from the .xlsx file

General Health Table


2. Column name changed "ID" to "ID_Miembro" to better identify the Primary Key

Sociodemographic Table
3. Column name changed "ID" to "ID_Miembro" to better identify the Primary Key

Vaccination Table
4. Given the need to have the names of the vaccines in a single column, the Unpivot
tool is used. This generates duplicate records from the ID column.
5. The "Value" column is filtered leaving only results equivalent to "Yes". By doing this,
those respondents who do not have vaccines or did not answer the question were
eliminated.
6. The "Value" column is eliminated, which previously served to determine if the registry
had Vaccines or not, or if it had not answered.
7. ID column renamed to ID_Miembro to better identify the Primary Key
8. From this table is generated Index_Vacunacion which will be explained in the section
"additional tables"
Incorporation of Additional Tables

Index Comorbidity

Used to summarize the number of comorbidities in a table with a single record and unique
index assigned.
It is not an intermediate table, but it works to improve the order by removing duplicates and
applying filters more effectively when manipulating the information.

Disability Index
It is used to summarize the number of disabilities in a table with a single record and a single
index assigned.
It is not an intermediate table, but it works to improve the order by removing duplicates and
applying filters more effectively when manipulating the information.

Vaccination Index

It is used to summarize the number of vaccines in a table with a single record and a single
index assigned.
It is not an intermediate table, but it works to improve the order by removing duplicates and
applying filters more effectively when manipulating the information.

Individual Income Index

Serves to sort people's individual income ranges

Index Types of Food

It is used to summarize the types of food in a table with a single record and a single index
asignado.

Index Tag Reading

It serves to summarize the answers regarding the reading of the nutritional information
present on the label of the foods and beverages, in a table with a single record and a single
index assigned.
In order for all the tablas mentioned above to work, calculated columns were made in the
associated tables.

Selected Segmentations

Sex
This segmentation allows you to filter by sex. We estimate that it is a strongly determining
variable of the rest of the variables, so when filtered by sex interesting changes in the graphs
are observed. It is available on all tabs via a button called "Filter Panel".

Region
This segmentation allows you to filter by region. We estimate that it is a strongly determining
variable of the rest of the variables, so when filtered by region interesting changes in the
graphs are observed. It is available on all tabs via a button called "Filter Panel".

Comorbidity
This segmentation allows filtering by the different diseases and is available only in the
"Comorbidities" tab through a button called "Filter Panel". It was created with the aim of
observing how the relationship between the disease or diseases changes with the rest of the
variables available on the flap (e.g., distribution by sex, relationship with BMI, etc.)

Disability
This segmentation allows you to filter by the different disabilities and is available only on the
"Disability" tab through a button called "Filter Panel". It was created with the aim of observing
how the relationship between the disability or disabilities changes with the rest of the
variables available on the flap (e.g., age distribution).
Explanation of the Report

General Data

1. Card that shows what is the average age on the total of respondents. To do this, the
Average Age measure was used.

2. Card that reflects the percentage of people who have some comorbidity over the total
number of respondents. To achieve this, the measure of Percentage People with
Comorbidity had to be used.

3. Pie chart used in order to show how many out of the total number of respondents are
male and female, i.e., by sex. As an attribute of the legend sex was used, and the value
is Count of ID_Miembro.

4. Bar chart that shows the number of respondents according to their occupation. On the
y-axis, his occupation was put and on the x the number of people, while the value is
the top 5 of Count of ID_Miembro.

5. Finally, on this flap, another bar chart was made that shows how many respondents
there are according to their income. In the x-axis, again we have the number of people,
and in the y axis there are income ranges of the people surveyed. The value is Count
of ID_Miembro.

6. Button to open Filter Panel to obtain data according to age ranges, region and sex.
General Health

1. Card that shows the percentage of people who have some health coverage out of the
total number of respondents. To do this, the Percentage of People with Health
Coverage measure is used
2. Card showing the average Body Masa Index on the total number of respondents. To
do this, the measurement automatically generated by power bi, from the
"Anthropometry" table in "Average" mode, is used.
3. Card showing the percentage of people who are overweight/obese out of the total
number of respondents. For this, the percentage of overweight/obese people
measure is used
4. Bar chart showing the average comorbidities according to age, using as colors a
conditional format based on the same average. To do this, the Measure of Average
Comorbidities on the Total Respondents is used as a value and the Age column
of the Sociodemographic table as the axis.
5. Bar chart showing the percentage of obese/overweight people according to age,
using as colors a conditional format based on the same percentage. To do this, the
measure Percentage of Overweight/Obese People is used as a value and the Age
column of the Sociodemographic table as the axis.
6. Button to open Filter Panel to get data by age ranges, region and gender
Comorbidity

1. Card showing the average number of comorabilities over the total number of
respondents. To do this, the Average Comorbidity measure is used on the Total
Respondents
2. Card showing the percentage of people who perceive themselves to be in good
health over the total of respondents. To do this, the measure Percentage of People
with self-perceived health as good is used
3. Infographic showing the top 5 most prevalent of how many people possess that
comorbidity out of ten respondents. To do this, the measure of Average
Comorbidities over the Total Respondents is used as a value and the
comorbidities group as an axis.
4. Stacked Horizontal Bars Chart (100%) showing the percentage of people who have
some comorbidity over 100% (divided according to sex). To do this, the measure
Percentage people with Female Comorbidity, the measure Percentage people
with Male Comorbidity and Percentage comorbidity -1 as values are used; and
the attribute Sex of the Sociodemographic table.
5. AI graph of Key Influencers, which seeks to determine if the Body Mass Index is an
influence variable for the average comorbidities.
To do this, the Average Comorbidities measure on the Total Of Respondents is
used as data to be analyzed, IMC_Interpretacion and the Average BMI measure of
the Anthropometry table as supporting data and Sex of the Sociodemographic table
as data of expansion category
6. Button to open Filter Panel to obtain data according to age ranges, region, gender
and type of comorbidity.
Disability

1. Card showing the percentage of people with disabilities out of the total number of
respondents. For this, the measure Percentage of People with Disabilities is used.
2. Card showing the average number of disabilities over the total number of
respondents. To do this, the Average Disability measure is used over total
Respondents
3. Infographic showing the top 5 most prevalent of how many people have that disability
out of ten respondents. For this, the measure Percentage of People with
Disabilities is used as a value and the Disability group as the axis.
4. Bar chart showing the percentage of people with a disability according to age, using
as colors a conditional format based on the same average. To do this, the measure
Percentage of People with Disabilities is used as a value and the Age column of
the Sociodemographic table as the axis.
5. AI graph of Key Influencers, which seeks to determine if age is an influence variable
for the prevalence of disability. For this, the measure Percentage of People with
Disabilities is used as data to be analyzed and Age as justification data.
6. Button to open Filter Panel to obtain data according to age ranges, region, gender
and disability.
Exercise

1. Card showing the average number of minutes of exercise per week over the total
number of respondents.
2. Card that shows the percentage of people who perform some type of physical
activity. To do this, the measure Percentage of People who do some activity is
used.
3. Graph of stacked horizontal bars (100%) that shows how the moment in which the
exercise is performed (at work, in free time or moving) is distributed over 100% (I
divide according to sex). For this, the measure Number of People Who Do some
Activity Outside Work, the measure Number of People Who Do Some Activity at
Work and Number of People Who Do Some Displacement Activity; and the Sex
attribute of the Sociodemographic table are used.
4. Horizontal Bar Chart showing how many minutes per week of exercise is performed
according to sex. To do this, the average of the measure Total_Intenso and the
average of the measure Total_Moderado as tooltips are used, the average of the
Total measure as value, and the sex attribute of the Sociodemographic table as a
legend.
5. Scatter Chart that reflects the relationship between the average of the Total measure
(x axis) and the average measure of Comorbidities over the Total Respondents (y
axis). Within it we add the calculated measures R and R2 between both variables.
6. Button to open Filter Panel to get data by age ranges, region and gender.
Feeding

1. Card showing how many meals on average are made by respondents on a day-to-
day basis. For this, the measure Amount of Meals per day was used.
2. A card that was used to reflect how many people read the nutrition information out of
the total number of respondents. To achieve this, the measure Percentage of people
who read the nutritional information was used.
3. Bar graph that demonstrates based on the average, how often each food is
consumed in a month. For this, they had to put on the x-axis the frequency with
which they consumed each food within a month, while on the axis and the type of
food. The value is Frequency.
4. Scatter Chart tries to show in a first instance and without making a very deep or
detailed analysis, if there is any kind of correlation between the average BMI and the
reading of the nutrition labels. On the x-axis we find the Nutritional Information
Reading, so much so that on the y-axis the average BMI (Body Mass Index).
5. Button to open Filter Panel to get data by age ranges, region and gender.
Glossary of Calculated Measures

Number of Meals Per Day

AVERAGE(Tabla_Habitos_Alimenticios[Number of meals per week]) / 7

Measured to calculate the average number of meals per day of the total number of
respondents, based on the number of weekly meals.

Number of People Doing some activity

CALCULATE( DISTINCTCOUNT(Tabla_Miembros[ID_Miembro]),
FILTER(Tabla_Ejercicio ,
Tabla_Exercise[Intense Physical Exercise ] = "Yes" ||

Tabla_Exercise[Moderate Physical Exercise ] = "Yes"||


Tabla_Exercise[Cardio on the Move] = "Yes" ||
Tabla_Exercise[Intense Physical Work] = "Yes" ||
Tabla_Exercise [Moderate Physical Work ] = "Yes"))

Measure to calculate the number of people who do some activity, either inside or outside of
work, as well as the cardio that means the displacement.

Number of People Doing Some Displacement Activity

CALCULATE( DISTINCTCOUNT(Tabla_Miembros[ID_Miembro]),
FILTER(Tabla_Ejercicio ,
Tabla_Exercise[Cardio in Displacement] = "Yes"))

A measure to calculate the number of people doing cardio on the go either to or from work
Number of People Doing some Activity at Work

CALCULATE( DISTINCTCOUNT(Tabla_Miembros[ID_Miembro]),
FILTER(Tabla_Ejercicio ,
Tabla_Exercise[Intense Physical Work ] = "Yes" ||

Tabla_Exercise [Moderate Physical Work ] = "Yes"))

Measure to calculate the number of people who do physical activity within their work

Number of People Doing some Activity Outside of Work

CALCULATE( DISTINCTCOUNT(Tabla_Miembros[ID_Miembro]),
FILTER(Tabla_Ejercicio ,
Tabla_Exercise[Intense Physical Exercise ] = "Yes" ||

Tabla_Exercise[Moderate Physical Exercise ] = "Yes"))

Measure to calculate the number of people who do physical activity outside of their work

Average Daily Exercise

CALCULATE(AVERAGE('Tabla_Ejercicio'[Total]))/7

A measure to calculate the average number of minutes of daily exercise.

Comorbidity Number

DISTINCTCOUNT(Index_Commorbidity[Comorbidity])

Measure to calculate the number of comorbidities that exist in the model (taking into account
the category others as a unit)

Comorbidity percentage -1

( [Percentage People with Comorbidity] -1 ) *(-1)

Measure to calculate the percentage of personas without comorbidities over the total of
respondents

Percentage of People with Health Coverage


VAR CantCobSalud = CALCULATE(
COUNT(Tabla_Sociodemografia[Health Coverage]),
FILTER(Tabla_Sociodemografia,
Tabla_Sociodemography[Health Coverage] <> "Ns/Nr" &&
Tabla_Sociodemography [Health Coverage] <> "You Don't Have Any"))

VAR TotalPersons = COUNT(Tabla_Members[ID_Miemb ro])

Return
DIVIDE( CantCobSalud, TotalPersonas)

Measure to calculate the percentage of people with health coverage out of the total number
of respondents

Percentage of People with Self-Perceived Health as Good

VAR CantSaludBuena = CALCULATE(


COUNT(Tabla_Salud_General[ID_Miembro]),
FILTER(Tabla_Salud_General,
Tabla_Salud_General [Self-Perceived Health] = "Good" ))

VAR TotalPersonas = CALCULATE (COUNT(Tabla_Miembros[ID_Miembro]),


ALL(Tabla_Miembros[ID_Miembro]))

Return
DIVIDE( CantSaludBuena, TotalPersonas)

Measure to calculate the percentage of people who self-perceive their own health as good
over the total of respondents

Percentage of Overweight/Obese People

VAR CantObesidad = CALCULATE(


COUNT(Tabla_Antropometria[ID_Miembro]),
FILTER(Tabla_Antropometria,
Tabla_Anthropometry[IMC_interpretacion] = "Overweight" ||
Tabla_Anthropometry[IMC_interpretacion] = "Obesity" ))

VAR TotalPersons = COUNT(Tabla_Members[ID_Miembro])


Return
DIVIDE( CantObesidad, TotalPersonas)

Measure to calculate the percentage of people who are obese/overweight out of the total
number of respondents

Percentage of People who do some activity

VAR TotalPersons = COUNT(Tabla_Members[ID_Miembro])

Return
DIVIDE( [Number of People Doing Some Activity], TotalPeople)

Measure to calculate the percentage of people who perform some activity over the total of
respondents

Percentage of People Who Read Nutrition Info

VAR CantPersonasQueLeen = CALCULATE(


COUNT(Tabla_Habitos_Alimenticios[ Nutritional Information File]),
FILTER(Tabla_Habitos_Alimenticios,
Tabla_Habitos_Food[Reading Nutritional Information] = "Always" ||
Tabla_Habitos_Food[Reading Nutritional Information] = "Frequently" ||
Tabla_Habitos_Food[Reading Nutritional Information] = "Sometimes"))

VAR TotalPersons = COUNT(Tabla_Members[ID_Miembro])

Return
DIVIDE( CantPersonasQueLeen, TotalPersonas)

Measure to calculate the percentage of people reading the nutrition information on


packaging out of the total respondents

Percentage People with Comorbidity

VAR PeopleComor = DISTINCTCOUNT(Tabla_Commodity[ID_Miembro])


VAR TotalPersons = COUNT(Tabla_Members[ID_Miembro])

Return
DIVIDE(PersonasComor,TotalPersonas)
Measure to calculate the percentage of people who have some comorbidity over the total
number of respondents

Percentage of people with Female Comorbidity

CALCULATE( [Percentage people with Comorbidity],


FILTER(Tabla_Sociodemografia, Tabla_Sociodemografia[Sex] = "Female"))

Measure to calculate the percentage of people of female sex who have some comorbidity
over the total of respondents

Percentage of people with Male Comorbidity

CALCULATE( [Percentage people with Comorbidity],


FILTER(Tabla_Sociodemografia, Tabla_Sociodemografia[Sex] = "Male"))

Measure to calculate the percentage of male persons who have some comorbidity over the
total number of respondents

Percentage of People with Disabilities

VAR PeopleDisc = DISTINCTCOUNT(Tabla_Discapacity[ID_Miembro])


VAR TotalPersons = COUNT(Tabla_Members[ID_Miembro])

Return
DIVIDE(PersonasDisc,TotalPersonas)

Measure to calculate the percentage of people with a disability out of the total number of
respondents

Average Comorbidities Over Population with Comorbidities

DIVIDE(
CALCULATE(
COUNT(Tabla_Comorbilidad[ID_Miembro]),
ALLSELECTED(Tabla_Comorability[ID_Miembro])),
DISTINCTCOUNT(Tabla_Comorability[ID_Miembro]))
Measure to calculate the average of comorbidities over the population with comorbidities
(Not used in the model)

Average Comorbidities Over Population with Disabilities

DIVIDE(
CALCULATE(
COUNT(Tabla_Discapacity[ID_Miembro]),
ALLSELECTED(Tabla_Discapacity[ID_Miembro])),
DISTINCTCOUNT(Tabla_Discapacity[ID_Miembro]))

Measure to calculate the average of disabilities over the population with disabilities (Not
used in the model)

Average Comorbidities Over Total Respondents

DIVIDE(
CALCULATE(
COUNT(Tabla_Comorbilidad[ID_Miembro]),
ALLSELECTED(Tabla_Comorability[ID_Miembro])),
COUNT(Tabla_Members[ID_Miembro]))

Measure to calculate the average of comorbidities over the total of respondents

Average Disability Over Total Respondents

DIVIDE(
CALCULATE(
COUNT(Tabla_Discapacity[ID_Miembro]),
ALLSELECTED(Tabla_Discapacity[ID_Miembro])),
COUNT(Tabla_Members[ID_Miembro]))

Measure to calculate the average number of disabilities over the total number of
respondents

Average Age
AVERAGE(Tabla_Sociodemography[Age])

Measure to calculate the average age (in this way it allows to give it integer format)

VAR __CORRELATION_TABLE = VALUES('Tabla_Miembros'[ID_Miembro])


WHERE __COUNT =
COUNTX(
KEEPFILTERS(__CORRELATION_TABLE),
CALCULATE(
AVERAGE('Tabla_Ejercicio'[Total])
* [Average Comorbidities Over Total Respondents]))
WHERE __SUM_X =
SUMX(
KEEPFILTERS(__CORRELATION_TABLE),
CALCULATE(AVERAGE('Tabla_Ejercicio'[Total])))
WHERE __SUM_Y =
SUMX(
KEEPFILTERS(__CORRELATION_TABLE),
CALCULATE([Average Comorbidities Over Total Respondents]))
WHERE __SUM_XY =
SUMX(
KEEPFILTERS(__CORRELATION_TABLE),
CALCULATE(
AVERAGE('Tabla_Ejercicio'[Total])
* [Average Comorbidities Over Total Respondents] * 1. ))
WHERE __SUM_X2 =
SUMX(
KEEPFILTERS(__CORRELATION_TABLE),
CALCULATE(AVERAGE('Tabla_Ejercicio'[Total]) ^ 2))
WHERE __SUM_Y2 =
SUMX(
KEEPFILTERS(__CORRELATION_TABLE),
CALCULATE([Average Comorbidities Over Total Respondents] ^ 2))
RETURN
DIVIDE(
__COUNT * __SUM_XY - __SUM_X * __SUM_Y * 1. ,
SQRT(
(__COUNT * __SUM_X2 - __SUM_X ^ 2) * (__COUNT * __SUM_Y2 - __SUM_Y ^
2)))

Measure to calculate the correlation coefficient between the average total minutes of
exercise with the average of comorbidities over the total of respondents
R2

'Measures'[R]^2

Measure to calculate the coefficient of determination between the average total number of
minutes of exercise with the average of comorbidities over the total of respondents

Total People with Comorbidity

DISTINCTCOUNT(Tabla_Comorability[ID_Miembro])

Measure to calculate the total number of people with comorbidity

Technological tools implemented

The development of the project required the following tools:

1. Miro: for the planning of the entity-relationship model.


2. Power BI: for data cleaning, creation of variables and calculated measures, creation
of graphs and calculation of descriptive statistics.
3. Photoshop: for the creation of the buttons.
4. Google Meet: as support for the meetings necessary for the planning and
development of the project.

Future Lines

Future lines of research could delve into the experimentation and graphing of the causal
relationship between correlations and associations found, for example between minutes of
exercise and the number of diseases. Another aspect to investigate would involve the same
variables but making multiple shots over time, that is, adding a temporal dimension. This will
allow us to observe the evolution of the variables over time.
Bibliography:

Dataset and original documentation:


http://datos.salud.gob.ar/dataset/ennys2

You might also like