Statistical technique used for investigating and modelling the relationship between
two or more variables is: ~ Regression
What is the type of learning where a function is inferred to describe hidden structure from unlabeled data ~ Supervised Probability of theft in an area is 0.03 with expected loss of 20% or 30% of things with probabilities 0.55 and 0.45. Insurance policy from A costs $150 pa with 100% repayment. Policy with B, costs $100 pa and first $500 of any loss has to be paid by the owner. Which data mining technique can be used to choose the policy? ~ Decision Tree If time is used as an independent variable in a simple linear regression analysis, which of the following assumptions could be violated? ~ Successive Which statistical technique deals with finding a structure in a collection of unlabeled data? ~ Clusturing Which of the following activities is performed as part of data pre processing? ~ All Detect Missing Values Noisy values are the values that are valid for the dataset, but are incorrectly recorded ~ TRUE Which of the following modelling type should be used for Labelled data? ~ Predictive What is the other name for Data Preparation stage of Knowledge Discovery Process ~ ETL Which of the following role is responsible for performing validation on analysis datasets ~ Statistacian The process of extracting valid, useful, unknown info from data and using it to make proactive knowledge driven business is called ~ Data Mining Which of the following is not applicable to Data Mining ~ Ivolves working with known information ~ Associate rule is known as� ~ Affinity Which data mining method groups together objects that are similar to each other and dissimilar to the other objects? ~ Clustering Which of the following are Multi-class Classification problem ~ Movie _________ are the values that mark the boundaries of the confidence interval. ~ Confidence Limits Regression is typically carried out to develop a mathematical model of the process ~ TRUE Machine learning task of inferring a function from labelled training data is known as ~ Supervised Simulations are carried out to develop a mathematical model of the process ~ FALSE