
Table of Contents

Problem Statement
Domain Overview:
Challenges
Audience response to the change
Problem Analysis
Source of data
Drawbacks in the current scenario
Solution to Problems
Design
Approach to solve the problem
Data storage process
Data processing method
Implementation approach
Type of database
Usage of query language
Tool used to analyse the data
Validation approach
Solution and Implementation
Problem Statement

Domain Overview:

Every year, around 800,000 people take their own lives, and many more attempt suicide.
Each suicide is a tragedy that has long-lasting effects on the people left behind and
affects families, communities and entire countries. In 2016, suicide was the second leading
cause of death among 15-29-year-olds. Suicide is a global phenomenon that occurs in all
regions of the world, not only in high-income countries: in 2016, over 79% of global
suicides occurred in low- and middle-income countries.

Nevertheless, suicide can often be prevented through early, evidence-based and often
low-cost interventions. Suicide is a serious public health problem. To implement regional
responses successfully, a comprehensive multisectoral suicide prevention program is needed.

Challenges

On the whole, the availability and quality of data on suicide and suicide attempts are
poor. Only 80 Member States have good-quality vital registration data that can be used
directly to estimate suicide rates. Low-quality mortality data is not a problem unique to
suicide. However, given the sensitivity of suicide and the illegality of suicidal behaviour
in some countries, under-reporting and misclassification are likely to be greater problems
for suicide than for most other causes of death.

In order to implement successful suicide prevention measures, better monitoring and
surveillance of suicide and suicide attempts is needed. Cross-national differences in
suicide trends, and changes in suicide rates, characteristics and methods, emphasize the
need for each nation to improve its suicide-related data. This includes vital registration
of suicides, hospital-based registries of suicide attempts, and nationally representative
surveys that collect information about self-reported suicide attempts.
Audience response to the change

Problem Analysis

Source of data

The World Bank aims to increase public access to, and use of, the data it collects and
releases. The World Bank Data Catalog is organized into datasets, which are collections of
information maintained by the World Bank in a variety of machine-readable formats. This
dataset includes the variables country, age and sex, number of suicides, population, annual
HDI and annual GDP, and contains 27,820 observations recorded from 1985 to 2016.

GDP per capita is an indicator of a nation's economic performance that accounts for its
population: the gross domestic product of the country is divided by its total population.
This makes it a useful measure of a country's standard of living, because it indicates how
prosperous the country is for each of its citizens.

HDI: The 2010 Human Development Report, released on 4 November 2010 (and updated 10 June
2011), measured the HDI in three dimensions:

• A long and healthy life: life expectancy at birth.
• Level of education: mean years of schooling and expected years of schooling.
• A decent standard of living: GNI per capita (PPP US$).

These indicators are also used for comparative analysis.
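
The per-population measures described above reduce to simple ratios. The following is a minimal Python sketch, not the method used in this report (which plans to use R). The function names are hypothetical, the per-100,000 scaling for suicide rates is an assumption, and the figures are made up for illustration.

```python
def gdp_per_capita(gdp: float, population: int) -> float:
    """Divide a country's gross domestic product by its total population."""
    if population <= 0:
        raise ValueError("population must be positive")
    return gdp / population


def suicides_per_100k(suicides: int, population: int) -> float:
    """Express a suicide count as a rate per 100,000 inhabitants (assumed scaling)."""
    if population <= 0:
        raise ValueError("population must be positive")
    return suicides * 100_000 / population


# Illustrative, made-up figures (not taken from the dataset).
print(gdp_per_capita(2_000_000_000, 1_000_000))
print(suicides_per_100k(150, 1_000_000))
```

Normalising by population in this way is what allows countries of very different sizes to be compared on the same scale.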

Drawbacks in the current scenario

The chosen dataset has a few drawbacks:

• The data covers only the years 1985-2016 and has not been updated since.
• The dataset does not cover all countries of the world; most of the data comes from
Canada, the US, Europe, Brazil and Australia.
• Some values are missing, which can be a significant drawback when analysing and
understanding the information.
• The data is not well suited to further study, because it concentrates on only a few
attributes and does not cover all the reasons for suicide.
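
The extent of the missing-data drawback can be measured before any analysis. The following Python sketch counts empty values per column; it is an illustration under stated assumptions, not the procedure used in this report. The column names and the three sample rows are hypothetical stand-ins for the real file.

```python
import csv
import io
from collections import Counter

# Tiny made-up sample standing in for the real file; column names are assumptions.
SAMPLE = """country,year,suicides_no,population,hdi
A,1985,10,1000,
A,1986,12,1100,0.70
B,1985,,2000,
"""


def count_missing(rows, fieldnames):
    """Return, for each column, how many rows have an empty value."""
    missing = Counter({name: 0 for name in fieldnames})
    for row in rows:
        for name in fieldnames:
            if (row.get(name) or "").strip() == "":
                missing[name] += 1
    return dict(missing)


reader = csv.DictReader(io.StringIO(SAMPLE))
rows = list(reader)
report = count_missing(rows, reader.fieldnames)
print(report)
```

A column with a high share of missing values may be better excluded from the analysis than filled in.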

Solution to Problems

We analysed the data using data analysis and visualization tools such as Tableau and
Orange, and obtained the following graphs, which help in understanding the data.

WORLD VIEW

This graph shows the median suicides across different countries

Percentage of Male and Female Suicides


Suicide rates with respect to different age groups

Correlation between GDP and Suicide rates


The graph shows no apparent correlation between GDP and suicide rates.
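
A visual impression of this kind can be cross-checked with a correlation coefficient. The report does not say which statistic, if any, was computed, so the Pearson coefficient below is an assumption, and the function name and sample values are hypothetical. A value near 0 indicates no linear relationship, while values near 1 or -1 indicate a strong one.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of products of deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Made-up values for illustration only (not taken from the dataset).
gdp_values = [1.0, 2.0, 3.0, 4.0]
rate_values = [1.0, -1.0, -1.0, 1.0]
print(pearson_r(gdp_values, rate_values))
```

Note that Pearson's coefficient only detects linear association; a non-linear relationship could still exist when the coefficient is close to zero.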

Median Suicide rates over different countries in Descending order

Suicide rates with respect to different generation


Suicide rates across different countries with Male and Female ratio

Number of suicides over different years (1985-2016)


Ratio of Male and Female suicide over different years

Trends of Median Suicide rates with respect to different years
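
The median-based graphs above were produced with Tableau and Orange. As an illustration of the underlying computation only, the following Python sketch groups records by a key and takes the median of each group, sorted in descending order. The function name, field names and values are hypothetical.

```python
from collections import defaultdict
from statistics import median


def median_by_group(records, key, value):
    """Median of `value` for each distinct `key`, sorted from highest to lowest."""
    groups = defaultdict(list)
    for rec in records:
        groups[rec[key]].append(rec[value])
    medians = {group: median(values) for group, values in groups.items()}
    return sorted(medians.items(), key=lambda item: item[1], reverse=True)


# Made-up records for illustration only (not taken from the dataset).
records = [
    {"country": "A", "year": 1985, "rate": 10.0},
    {"country": "A", "year": 1986, "rate": 14.0},
    {"country": "B", "year": 1985, "rate": 20.0},
    {"country": "B", "year": 1986, "rate": 22.0},
    {"country": "B", "year": 1987, "rate": 30.0},
]

by_country = median_by_group(records, "country", "rate")
by_year = median_by_group(records, "year", "rate")
print(by_country)
print(by_year)
```

The median is less sensitive than the mean to a few extreme country-year values, which is one reason to prefer it for summaries of this kind.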


Design

Approach to solve the problem

First, we will clean the data and improve the quality of the dataset by structuring it,
which makes it easier to analyse and to address the problems defined in our problem
statement. After cleaning the data, we will address each problem stated in our main
objective individually and try to derive results or insights from it. We will also analyse
the data in detail to help identify patterns in the behaviour of people who attempt or
die by suicide.

Data storage process

The dataset draws on research conducted in several countries across the globe. It was
compiled from four other datasets linked by time and place, and was built to find signals
correlated with increased suicide rates among different cohorts globally, across the
socio-economic spectrum. The dataset was stored mainly as an Excel file, and we have
retained that format because it is convenient for data analysis and for extracting
essential insights.
Data processing method

Implementation approach

Type of database

The dataset is taken from Kaggle, an open platform that hosts thousands of free datasets.
These are available to the public, both for exploring topics of personal interest and as
widely used datasets that have had a significant impact. The website provides data about
events and moments, as well as real-time data, which can be analysed to find meaningful
insights that generally benefit the public.

Usage of query language

We will not use a query language such as SQL, because this is a public dataset that anyone
can access and that can be downloaded in Excel format. Instead, we will use the R language
to extract and analyse the data.

Tool used to analyse the data

We will use data mining and visualization tools such as Orange, Excel and Tableau to
obtain meaningful information and insights from this large dataset. We will also apply
data cleaning techniques to remove the unwanted attributes the dataset contains and the
records with missing data.
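
The two cleaning steps mentioned above (removing unwanted attributes and removing records with missing data) can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the cleaning actually performed in Orange, Excel or Tableau. The function names, column names and rows are hypothetical.

```python
def is_present(value):
    """True when a cell holds a non-empty value."""
    return value is not None and str(value).strip() != ""


def clean_rows(rows, drop_columns, required_columns):
    """Drop unwanted columns, then drop rows missing any required value."""
    cleaned = []
    for row in rows:
        kept = {k: v for k, v in row.items() if k not in drop_columns}
        if all(is_present(kept.get(col)) for col in required_columns):
            cleaned.append(kept)
    return cleaned


# Made-up rows for illustration only (not taken from the dataset).
rows = [
    {"country": "A", "suicides_no": "10", "population": "1000", "country_year": "A1985"},
    {"country": "B", "suicides_no": "", "population": "2000", "country_year": "B1985"},
    {"country": "C", "suicides_no": "5", "population": None, "country_year": "C1985"},
]

cleaned = clean_rows(rows, {"country_year"}, ["suicides_no", "population"])
print(cleaned)
```

Dropping incomplete rows is the simplest option, but it reduces the sample; which columns count as required is a choice that should be recorded alongside the results.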

Validation approach

Solution and Implementation

Suicides are preventable. There are a number of measures that can be taken at the
population, sub-population and individual levels to prevent depression and suicide. These
include:
• Reducing access to the means of suicide (e.g. pesticides, firearms, certain medications).
• Reporting by media in a responsible way.
• School-based interventions.
• Introducing alcohol policies to reduce the harmful use of alcohol.
• Early identification, treatment and care of people with mental and substance use
disorders, chronic pain and acute emotional distress.
• Training of non-specialized health workers in the assessment and management of
suicidal behaviour.
• Follow-up care for people who attempted suicide and provision of community support.

Suicide is a complex issue, and efforts to prevent it therefore require coordination and
cooperation between several sectors of society, including healthcare, labour, agriculture,
business, law, defence, politics and the media. These efforts must be comprehensive and
integrated, as no single approach alone can have an impact on an issue as complex as
suicide.
