Professional Documents
Culture Documents
MOHAN KRISHNA
“ EXPLORING PUBLIC
DATASETS” SUBMITTED BY:
GAUNEKAR
MANGUIRISH EKNATH
SUBJECT: 19138
Concerns • From this data set we can come to know, on which basis the
people in Boston select their homes
housing values • The purpose behind choosing this data set was to understand the
in suburbs of nature of house buying and the factors dependent on it which
can be applied elsewhere
Boston. • Problem statement: whether Boston is a better place for
residential setup? And if it is then what are the key factors which
determine the selection procedure for the buying.
• By analysing the data we can get to know and guide new buyers
in Boston
continued…..
3Vs
• Volume: the data set consists of 510 responses which can be analysed.
• Variety: the dataset consists of quantitative variables for many variables. The data set is well structured
with proper notation and values for the variables as some variables are made using dummies.
• Velocity: the velocity of this data is slightly low as data in this field is not generated on high velocity
and data is generated from time to time not frequently.
Challenges:
• To use the data on big data tools and generate insights.
• To remove duplication of data which may occur during analysis
Potential insights
• We will get to know the buying behaviour while purchasing a house in Boston.
• With this analysis we can guide new and potential buyers in Boston area.
Thank you