You are on page 1of 3

Data set: Suicide rates dataset

https://www.kaggle.com/russellyates88/suicide-rates-overview-1985-to-2016
Milestone 1

1) Personal objective:

Death by suicide is an extremely complex issue that causes pain to


hundreds of thousands of people every year around the world. Globally,
close to 800,000 people die from suicide every year. That’s one person
every 40 seconds. Due to the stigma associated with suicide – and the
fact that it is illegal in some countries – this figure is also likely to be an
underestimate, with some suicides being classified as unintentional
injuries. This makes it one of the leading causes of death globally.
Around twice as many dies from suicide as from homicide. Suicide is
more common than homicide across most countries in the world – often
as much as ten to twenty times higher.

2) Intended outcomes:

Studying the data collected by WHO and other organization helping in


preventing the suicide and homicides can help us understand the various
variables such as Age, reason, gender, etc. and how its leading to the
final decision. By studying this data, we can predict the numbers and can
also help in reducing the number by taking different scenarios in mind.

3) An intended audience:

What does intended audience mean?

An intended audience refers to the demographic that writers expect will


read and interact with their work whether it be an article, research
paper, or book. When reflecting on your intended audience, consider
factors such as age, geographic location, culture, and education.

Who will be the intended audience?

By looking at the raw data that is collected from different organization,


we can say that audience of every age group and any gender as well as
from any part of the world can be considered as an intended audience
for this project.
4) Foreseeable challenges:

Following are the challenges one can face while doing a predictive
Analysis project:

a) Finding the data: The first step of any data science project is
unsurprisingly to find the data assets needed to start working. The
surprising part is that the availability of the "right" data is still the
most common challenge of data scientists. There are some data
which are not published or have been updated and can be a harder
to access the data.
b) Understanding the data: Once the data is obtained, the second
challenge start is understanding the data and what problem we are
solving.
c) Data cleaning: Every dataset that is obtained may not be a proper
cleaned data set and hence cleaning the data and filling missing
values can be a challenging task.

You might also like