>> %ASSIGNMENT 05 OBJECTIVE PERFORMANCE STUDY
%ABYAN JADIDAN 2106780285

%Receiver Operating Characteristic (ROC)

%1. Plot ROC Curve for Classification by Logistic Regression

%Load the sample data.
load fisheriris

%Define a binary classification problem by using only the measurements that
%correspond to the species versicolor and virginica.
pred = meas(51:end,1:2); % use only 2 predictor variables

%Define the binary response variable.
resp = (1:100)'>50; % Versicolor = 0, virginica = 1

%Fit a logistic regression model.
mdl = fitglm(pred,resp,'Distribution','binomial','Link','logit');

%Compute the ROC curve. Use the probability estimates from
%the logistic regression model as scores.
scores = mdl.Fitted.Probability;
[X,Y,T,AUC] = perfcurve(species(51:end,:),scores,'virginica');

%Display the area under the curve.
AUC

AUC =

0.7918

>> %Plot the ROC curve
plot(X,Y)
xlabel('False positive rate')
ylabel('True positive rate')
title('ROC for Classification by Logistic Regression')

>> % 2. Compare Classification Methods Using ROC Curve

%Load the sample data.
load ionosphere

%X is a 351x34 real-valued matrix of predictors. Y is a character array
%of class labels: 'b' for bad radar returns and 'g' for good radar returns.

%Reformat the response to fit a logistic regression. Use the predictor
%variables 3 through 34.
resp = strcmp(Y,'b'); % resp = 1 if Y = 'b', 0 if Y = 'g'
pred = X(:,3:34);

%Fit a logistic regression model to estimate the posterior probabilities
%for a radar return to be a bad one.
mdl = fitglm(pred,resp,'Distribution','binomial','Link','logit');
score_log = mdl.Fitted.Probability; % Probability estimates

%Compute the standard ROC curve using the probabilities for scores.
[Xlog,Ylog,Tlog,AUClog] = perfcurve(resp,score_log,'true');

%Train an SVM classifier on the same sample data. Standardize the data.
mdlSVM = fitcsvm(pred,resp,'Standardize',true);

%Compute the posterior probabilities (scores).
mdlSVM = fitPosterior(mdlSVM);
[~,score_svm] = resubPredict(mdlSVM);
%The second column of score_svm contains the posterior probabilities
%of bad radar returns.
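%(Indexing the score matrix with the model's ClassNames, as done below for the
%SVM and naive Bayes scores, works because ClassNames is the logical vector
%[false true] for this logical response, so it picks out that second column.)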

%Compute the standard ROC curve using the scores from the SVM model.
[Xsvm,Ysvm,Tsvm,AUCsvm] = perfcurve(resp,score_svm(:,mdlSVM.ClassNames),'true');

%Fit a naive Bayes classifier on the same sample data.
mdlNB = fitcnb(pred,resp);

%Compute the posterior probabilities (scores).
[~,score_nb] = resubPredict(mdlNB);

%Compute the standard ROC curve using the scores from the naive Bayes classification.
[Xnb,Ynb,Tnb,AUCnb] = perfcurve(resp,score_nb(:,mdlNB.ClassNames),'true');

%Plot the ROC curves on the same graph.
plot(Xlog,Ylog)
hold on
plot(Xsvm,Ysvm)
plot(Xnb,Ynb)
legend('Logistic Regression','Support Vector Machines','Naive Bayes','Location','Best')
xlabel('False positive rate'); ylabel('True positive rate');
title('ROC Curves for Logistic Regression, SVM, and Naive Bayes Classification')
hold off

>> %Compare the area under the curve for all three classifiers.
AUClog
AUCsvm
AUCnb

AUClog =

0.9659

AUCsvm =

0.9489

AUCnb =

0.9393
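%Logistic regression gives the largest in-sample AUC here, followed by the
%SVM and then naive Bayes.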

>> % 3. Determine the Parameter Value for Custom Kernel Function
%This example shows how to determine the better parameter value for a
%custom kernel function in a classifier using the ROC curves.

%Generate a random set of points within the unit circle.
rng(1); % For reproducibility
n = 100; % Number of points per quadrant

r1 = sqrt(rand(2*n,1)); % Random radii
t1 = [pi/2*rand(n,1); (pi/2*rand(n,1)+pi)]; % Random angles for Q1 and Q3
X1 = [r1.*cos(t1) r1.*sin(t1)]; % Polar-to-Cartesian conversion

r2 = sqrt(rand(2*n,1));
t2 = [pi/2*rand(n,1)+pi/2; (pi/2*rand(n,1)-pi/2)]; % Random angles for Q2 and Q4
X2 = [r2.*cos(t2) r2.*sin(t2)];

%Define the predictor variables. Label points in the first and third
%quadrants as belonging to the positive class, and those in the second
%and fourth quadrants to the negative class.
pred = [X1; X2];
resp = ones(4*n,1);
resp(2*n + 1:end) = -1; % Labels

%Train an SVM classifier using the sigmoid kernel function.
%It is good practice to standardize the data.
SVMModel1 = fitcsvm(pred,resp,'KernelFunction','mysigmoid',...
'Standardize',true);
SVMModel1 = fitPosterior(SVMModel1);
[~,scores1] = resubPredict(SVMModel1);

SVMModel2 = fitcsvm(pred,resp,'KernelFunction','mysigmoid2',...
'Standardize',true);
SVMModel2 = fitPosterior(SVMModel2);
[~,scores2] = resubPredict(SVMModel2);
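%The calls above rely on mysigmoid.m and mysigmoid2.m being on the MATLAB path;
%those files are not part of this transcript. A minimal sketch of what they are
%assumed to contain (a sigmoid kernel tanh(gamma*U*V' + c) with gamma = 1 and
%gamma = 0.5, matching the legend below, and an assumed offset c = -1):
%
%   function G = mysigmoid(U,V)
%   % Sigmoid kernel with slope gamma and intercept c (assumed values)
%   gamma = 1;
%   c = -1;
%   G = tanh(gamma*U*V' + c);
%   end
%
%   function G = mysigmoid2(U,V)
%   % Same kernel with a smaller slope
%   gamma = 0.5;
%   c = -1;
%   G = tanh(gamma*U*V' + c);
%   end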

%Compute the ROC curves and the area under the curve (AUC) for both models.
[x1,y1,~,auc1] = perfcurve(resp,scores1(:,2),1);
[x2,y2,~,auc2] = perfcurve(resp,scores2(:,2),1);

>> %Plot the ROC curves.
plot(x1,y1)
hold on
plot(x2,y2)
hold off
legend('gamma = 1','gamma = 0.5','Location','SE');
xlabel('False positive rate'); ylabel('True positive rate');
title('ROC for classification by SVM');
>> auc1
auc2

auc1 =

0.9518

auc2 =

0.9985
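%The kernel with gamma = 0.5 (mysigmoid2) gives the larger in-sample AUC, so it
%is the better of the two parameter values on this training set.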

>> % 4. Plot ROC Curve for Classification Tree

%Load the sample data.
load fisheriris

% The column vector, species, consists of iris flowers of three different
% species: setosa, versicolor, virginica. The double matrix meas consists
% of four types of measurements on the flowers: sepal length, sepal width,
% petal length, and petal width. All measures are in centimeters.

% Train a classification tree using the sepal length and width as the
% predictor variables. It is good practice to specify the class names.
Model = fitctree(meas(:,1:2),species, ...
'ClassNames',{'setosa','versicolor','virginica'});

%Predict the class labels and scores for the species based on the tree Model.
[~,score] = resubPredict(Model);
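%The score passed to perfcurve below is a single column: the posterior
%probability of versicolor minus the larger of the other two class posteriors.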

diffscore = score(:,2) - max(score(:,1),score(:,3));
[X,Y,T,~,OPTROCPT,suby,subnames] = perfcurve(species,diffscore,'versicolor');

% X, by default, is the false positive rate (fallout or 1-specificity) and Y,
% by default, is the true positive rate (recall or sensitivity). The positive
% class label is versicolor. Because a negative class is not defined, perfcurve
% assumes that the observations that do not belong to the positive class are
% in one class and uses that group as the negative class.

OPTROCPT

suby

subnames

OPTROCPT =

0.1000 0.8000

suby =

0 0
0.1800 0.1800
0.4800 0.4800
0.5800 0.5800
0.6200 0.6200
0.8000 0.8000
0.8800 0.8800
0.9200 0.9200
0.9600 0.9600
0.9800 0.9800
1.0000 1.0000
1.0000 1.0000

subnames =

1×2 cell array

{'setosa'} {'virginica'}
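%suby contains the Y (true positive rate) values computed for each negative
%subclass, in the order given by subnames. Because the true positive rate
%depends only on the positive class, its two columns are identical here.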

>> %Plot the ROC curve and the optimal operating point on the ROC curve.
plot(X,Y)
hold on
plot(OPTROCPT(1),OPTROCPT(2),'ro')
xlabel('False positive rate')
ylabel('True positive rate')
title('ROC Curve for Classification by Classification Trees')
hold off

>> %Find the threshold that corresponds to the optimal operating point.
T((X==OPTROCPT(1))&(Y==OPTROCPT(2)))

ans =

0.2857

%To treat only virginica as the negative class, again supply perfcurve with a
%score that factors in the scores of the negative class, and specify the
%negative class explicitly.
diffscore = score(:,2) - score(:,3);
[X,Y,~,~,OPTROCPT] = perfcurve(species,diffscore,'versicolor', ...
'negClass','virginica');

>> figure, plot(X,Y)
hold on
plot(OPTROCPT(1),OPTROCPT(2),'ro')
xlabel('False positive rate')
ylabel('True positive rate')
title('ROC Curve for Classification by Classification Trees')
hold off
>> OPTROCPT

OPTROCPT =

0.1800 0.8200

>> % 5. Compute Pointwise Confidence Intervals for ROC Curve

%Load the sample data.
load fisheriris

%Use only the first two features as predictor variables.
pred = meas(51:end,1:2);

%Define the binary response variable.
resp = (1:100)'>50; % Versicolor = 0, virginica = 1

%Fit a logistic regression model.
mdl = fitglm(pred,resp,'Distribution','binomial','Link','logit');

%Compute the pointwise confidence intervals on the true positive rate
%(TPR) by vertical averaging (VA) and sampling using bootstrap.
[X,Y,T] = perfcurve(species(51:end,:),mdl.Fitted.Probability,...
'virginica','NBoot',1000,'XVals',[0:0.05:1]);
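%Because 'NBoot' is set, Y has three columns: Y(:,1) is the TPR estimate and
%Y(:,2) and Y(:,3) are the lower and upper pointwise confidence bounds used by
%the errorbar call below.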

>> %Plot the pointwise confidence intervals.
errorbar(X,Y(:,1),Y(:,1)-Y(:,2),Y(:,3)-Y(:,1));
xlim([-0.02,1.02]); ylim([-0.02,1.02]);
xlabel('False positive rate')
ylabel('True positive rate')
title('ROC Curve with Pointwise Confidence Bounds')
legend('PCBwVA','Location','Best')
>> [X1,Y1,T1] = perfcurve(species(51:end,:),mdl.Fitted.Probability,...
'virginica','NBoot',1000);
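%Without 'XVals', perfcurve computes the bounds by threshold averaging (TA) at
%all score thresholds, so X1 also has three columns here.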

>> %Plot the confidence bounds.
figure()
errorbar(X1(:,1),Y1(:,1),Y1(:,1)-Y1(:,2),Y1(:,3)-Y1(:,1));
xlim([-0.02,1.02]); ylim([-0.02,1.02]);
xlabel('False positive rate')
ylabel('True positive rate')
title('ROC Curve with Pointwise Confidence Bounds')
legend('PCBwTA','Location','Best')
>> %Specify the threshold values to fix and compute the ROC curve. Then plot the curve.
[X1,Y1,T1] = perfcurve(species(51:end,:),mdl.Fitted.Probability,...
'virginica','NBoot',1000,'TVals',0:0.05:1);

>> figure()
errorbar(X1(:,1),Y1(:,1),Y1(:,1)-Y1(:,2),Y1(:,3)-Y1(:,1));
xlim([-0.02,1.02]); ylim([-0.02,1.02]);
xlabel('False positive rate')
ylabel('True positive rate')
title('ROC Curve with Pointwise Confidence Bounds')
legend('PCBwTA','Location','Best')