Professional Documents
Culture Documents
one hot encoding is preferred by most of the ml engineers despite increase dimensionality of
dataset. justify your answer with example
2. demonstrate the steps for performing forward feature selection and backward feature
selection. how will you check which one is well suited ?
3. suppose you have a dataset and you need to apply feature engineering. how will you decide
whether you will apply feature extraction or feature selection ?
4. suppose you have collected data through questionaries and need to pre process the data. give
answer of following questions
a. if we have a date column in our dataset then how will you perform feature engineering
India 49 86400 no
USA 35 64800 no
Brazil 43 73200 no
USA 45 yes
Brazil 62400 no
USA 55 99600 no
a. The data in data.txt file. Brief the step for reading and creating pandas data frame.
d. Which of all features should be chosen for model building, why and how ?
f. Will you generate some new features? If yes then why and how ?