Assignment 2


Data Mining

Assignment No. 1

Data Visualization

..::: Submitted To :::..

Brig. Dr. Usman Akram

..::: Submitted By :::..

(CMS ID: 281112) (CMS ID: NNNNN)


NUST College of E&ME, Islamabad
Assignment No. 1: Data Visualization

Table of Contents

Introduction
Dataset Specifications
Data Visualization
1. Histogram
2. Scatter Plot
2.1 Card1 and Card2
2.2 Card1 and Card6
2.3 Card4 and Card6
2.4 Card2 and Card4
3. Parallel Projects
4. Box Plot
4.1 Card1 (All Training Set – Both Classes)
4.2 Card2 (All Training Set – Both Classes)
4.3 Card1 (Class 1 i.e. isFraud=0)
4.4 Card1 (Class 2 i.e. isFraud=1)
4.5 Card2 (Class 1 i.e. isFraud=0)
4.6 Card2 (Class 2 i.e. isFraud=1)
5. Common user train and test
6. Unique user train and test
7. No. of transaction vs Time
8. First and last transaction (span)
9. Attributes Correlation: Highly correlated (Scatter plots)
10. Dissimilarity Index (within same class): Highlighted outliers in scatter plots (Yes or No)
11. Data Analysis
11.1 PCA
11.2 LDA

Submitted By: Muhammad Waqas Ahmad Page 2 of 12



Introduction
This report is submitted as the solution to assignment no. 1 (Data Visualization) of the
“Data Mining” subject. The purpose of the submission is to exercise various data visualization
techniques. The “IEEE-CIS Fraud Detection - Can you detect fraud from customer transactions?”
dataset is used for this purpose. The essence of the dataset is to predict the probability
that an online transaction is fraudulent, as denoted by the binary target isFraud. We
have used RapidMiner Studio for data analysis and visualization.

Dataset Specifications
The dataset is relatively large.

                           Training Data    Test Data
No. of features
No. of records             590,540          506,691
No. of Positive Examples   20,663           ?
No. of Negative Examples   569,877          ?

A snapshot of the data describing both positive and negative examples is attached below.

Data Visualization
1. Histogram
The histogram for the class label, i.e. isFraud, is attached below:
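
The bar heights of such a histogram are simply the per-class record counts. The following is a minimal sketch of that tally, assuming the labels are rebuilt from the training counts listed under Dataset Specifications; the function name class_histogram is illustrative and is not part of RapidMiner.

```python
from collections import Counter


def class_histogram(labels):
    """Return {class: (count, share of all records)} for a list of labels."""
    counts = Counter(labels)
    total = sum(counts.values())
    return {cls: (n, n / total) for cls, n in sorted(counts.items())}


# Assumption: labels reconstructed from the reported training counts
# (569,877 negative and 20,663 positive examples).
labels = [0] * 569_877 + [1] * 20_663
hist = class_histogram(labels)

for cls, (n, share) in hist.items():
    print(f"isFraud={cls}: {n:>7,} records ({share:.2%})")
```

The two bars differ by more than an order of magnitude, which is why the positive class is barely visible on a linear axis.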

2. Scatter Plot
Scatter plots for various combinations of attributes are shown below:

2.1 Card1 and Card2

2.2 Card1 and Card6

2.3 Card4 and Card6

2.4 Card2 and Card4

3. Parallel Projects

4. Box Plot
4.1 Card1 (All Training Set – Both Classes)

4.2 Card2 (All Training Set – Both Classes)

4.3 Card1 (Class 1 i.e. isFraud=0)

4.4 Card1 (Class 2 i.e. isFraud=1)

4.5 Card2 (Class 1 i.e. isFraud=0)

4.6 Card2 (Class 2 i.e. isFraud=1)
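
Each box plot in this section summarizes one attribute by its quartiles, whiskers, and outliers. The sketch below shows one common way to compute those values, assuming the usual 1.5 x IQR whisker rule; RapidMiner's exact quartile method may differ, and the sample values are hypothetical rather than taken from the dataset.

```python
from statistics import quantiles


def box_plot_stats(values):
    """Quartiles, whisker ends, and outliers using the 1.5 * IQR rule."""
    data = sorted(values)
    q1, median, q3 = quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    # Whiskers extend to the most extreme points that lie inside the fences.
    inside = [v for v in data if low_fence <= v <= high_fence]
    outliers = [v for v in data if v < low_fence or v > high_fence]
    return {
        "q1": q1,
        "median": median,
        "q3": q3,
        "whisker_low": inside[0],
        "whisker_high": inside[-1],
        "outliers": outliers,
    }


# Hypothetical attribute values, for illustration only.
sample = [1, 2, 3, 4, 5, 6, 7, 8, 9, 100]
stats = box_plot_stats(sample)
print(stats)
```

Running the same function separately on the isFraud=0 and isFraud=1 records would give the per-class summaries that sections 4.3 to 4.6 plot.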

5. Common user train and test


No. of records common in both training and test data: 52762

6. Unique user train and test


No. of unique records in Training Data: 348394

No. of unique records in Test Data: 280310
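
Counts like those in sections 5 and 6 can be obtained with set operations once each record is reduced to a user key. The report does not state how a user is identified, so the key used below (a tuple of card attributes) is purely hypothetical, as are the sample values; this is a sketch of the counting step only.

```python
def user_overlap(train_keys, test_keys):
    """Count distinct user keys per split and how many appear in both."""
    train_set = set(train_keys)
    test_set = set(test_keys)
    return {
        "train_distinct": len(train_set),
        "test_distinct": len(test_set),
        "common": len(train_set & test_set),
        "train_only": len(train_set - test_set),
        "test_only": len(test_set - train_set),
    }


# Hypothetical user keys: (card1, card2, card4) tuples, for illustration only.
train_keys = [(1001, 111, "visa"), (1002, 222, "visa"), (1001, 111, "visa")]
test_keys = [(1001, 111, "visa"), (1003, 333, "mastercard")]

overlap = user_overlap(train_keys, test_keys)
print(overlap)
```

The sketch reports both the distinct keys per split and the keys exclusive to each split, since "unique" could refer to either.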

7. No. of transaction vs Time

8. First and last transaction (span)

9. Attributes Correlation: Highly correlated (Scatter plots)


Here we show the mutual correlation between different attributes. The correlation is not strong,
but a pattern is visible to some extent.
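
How strong a linear pattern is can be put into a number with the Pearson correlation coefficient, which ranges from -1 to 1. The sketch below is a plain implementation of that coefficient, offered as an assumption about how "highly correlated" might be quantified; the report itself judges correlation visually from the scatter plots, and the sample values are hypothetical.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


# Hypothetical values, for illustration only.
r_up = pearson([1, 2, 3, 4], [2, 4, 6, 8])    # perfectly increasing
r_down = pearson([1, 2, 3, 4], [8, 6, 4, 2])  # perfectly decreasing
print(r_up, r_down)
```

A value near zero would correspond to the shapeless scatter plots, while values closer to -1 or 1 correspond to a visible trend.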

Figure: Card2 and Transaction Amount correlation

Figure: Card2 and Card1 correlation

10. Dissimilarity Index (within same class): Highlighted outliers in scatter plots (Yes or No)

Here we show some uncorrelated attributes, which have almost no relation to the
fraud or no-fraud class.

Figure: Card1 and card4=Visa correlation

Figure: Card2 and card6=Debit correlation

Figure: card4=Master Card and Transaction Amount correlation

11. Data Analysis


11.1 PCA

11.2 LDA

Linear Discriminant Model Result:

A priori probabilities:

isFraud (Class)    Probability
NO                 0.9652
YES                0.0348
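
LDA typically estimates the a priori probability of a class as that class's share of the records the model was fit on. The following is a minimal sketch of that estimate, assuming the training counts from the Dataset Specifications table; the function name class_priors is illustrative and is not a RapidMiner API.

```python
def class_priors(class_counts):
    """Estimate each class prior as its share of the total record count."""
    total = sum(class_counts.values())
    if total == 0:
        raise ValueError("no records to estimate priors from")
    return {label: count / total for label, count in class_counts.items()}


# Assumption: counts taken from the reported training set
# (569,877 negative and 20,663 positive examples).
training_counts = {"NO": 569_877, "YES": 20_663}
priors = class_priors(training_counts)

for label, p in priors.items():
    print(f"{label}: {p:.4f}")
```

With the full training counts this gives roughly 0.9650 and 0.0350, close to the 0.9652 and 0.0348 reported above. The small gap may mean the model was fit on a sample or filtered subset; that is an assumption, since the report does not say.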
