You are on page 1of 1

DSA Individual

- Kushal Kaushik
210103084

1. In India brain drain is a common thing. Every year very large population of Indians
leave to US or Europe to make their career and then plan to spend their lives there. In
US every 3rd doctor is an Indian and every 1 out of 10 people in US is an Indian. This
generates an Indian specific need in the foreign market. For our study we have tried to
focus on the housing demands, as searching for home becomes a serious problem for
the Indians abroad.
2. To collect the data of the Indian Diaspora. This could be done by segregating Indian
families from property listings on online websites. Also, the touchpoints could be the
Indian people living abroad and listing their properties on a common website. The
data would consist of country, state, region, rent amount and other things. These data
would be then sorted to make it more meaningful.
3. This data would have many different parameters, a combination of a quantitative and
qualitative set of each family. This would include name, origin in India, language
known and other things. Every family's response would be saved according to the
distribution applied, and their preference of the kind of tenant they want would also
be recorded. After that, the data would be arranged and matched according to the
correlation between the choices of family and tenant. The dependent variable can be
the region in which a room is needed, and the independent variable can be the rent
amount, from where the family belongs from India etc.
4. The Exploratory Data Analysis will include segregating the information received and
analysing the various set by understanding, distributing and looking at our data to
inspect the result obtained. For data analysis, statistical tools like box plot, scatter
plots, and other charts can be used for initial data analysis. After that, we can
formulate a hypothesis and then check the model to find the dependence of main
parameters on different independent parameters to study the effect in major ones
5. We will fit a statistical model based on the collected values to extrapolate the
unknown values and establish a relationship through predictive analysis. We will
figure out the dominant variables through an initial examination, which will help us
choose the significant factors to consider while calculating our future estimation and
trend analysis.
6. This is a small glimpse of what data science is capable of and what it could achieve.
With our life in the lap of phones and the internet, every point of interaction with the
external environment is a data point. If collected and used as meaningful information,
these data points can lead to far more efficient and time-saving activities. Therefore,
in this era, it is correctly said that the data is the new gold and one who knows data
science knows the treasure's path.

You might also like