Professional Documents
Culture Documents
======================================================
What is difference between using single tree in decision tree reg and 500+ trees in
random forest reg?
Maximum likely hood of logistic regression curve for finding best fit
Does KNN model memorises all the input data points and use them to identify the
classification of new data point?
ans:-
yes
How can naive bayes theorem could clasify the new data point without comparing with
original data points after model got trained?
ANS:-
It dont stores all the data , insted it finds the distribution of the data (multi
variate normal distribution)
In HC, one of the clustering technique is "method of minimum variance" (or 'ward').
how does it work?
Association rule learning:(Apriori model) why the rules are one directional. like,
for ex, rule1:(burger --> frenchFries). in this rule, why not (frenchFries -->
burger) dont you think there is no first preference and only equal likeness for
both as a collective set of items.
Ans:-
because in Apriori, we calculate Confidence(M1-->M2) which implies, what is the
probability of a person watch movie2 privided he watched movie1. it is equivalent
ot bayes theorem called as P(M2/M1)=P(M2&M1)/P(M1). clearly this relation is not
symmetrical.
Should we implement same code as in Apriori which checks for additional metricks
like confidence and lift for identifying frequent items sets in Eclat model.
(since there is no other efficient python library other than apriori for Eclat. So,
we have no choice)
what is AB test
upper confidence bound algorithm: how does the confidence interval shrinks after
each new trial. and by how much value.
PCA vs LDA - how this algorithms reduces dimentionality of a data with many columns
to preferred no of columns.
ensemble learning - 1. bagging and 2. boosting what are the types in it.
---------------------------------