You are on page 1of 32

10

l^âçjÖ]<ÌÒ

 K
 ‫א‬‫א‬‫א‬‫א‬‫א‬
F  EoutliersF‫א‬ EanomalousF ‫א‬‫א‬

 ‫א‬‫א‬‫א‬K  ‫א‬E
‫א‬‫א‬‫א‬ Edeviation detectF
 ‫א‬  ،Eexception miningF  ‫א‬‫א‬  ‫א‬  ، ‫א‬
KEoutlierFEanomalyF‫א‬‫א‬‫א‬K‫א‬
‫א‬    ،   ‫א‬ ‫א‬     
   ‫א‬ K  ‫א‬  ‫א‬Emachine learningF ‫א‬ ‫א‬
       ‫א‬   EanomalousF ‫א‬ ‫א‬ 
F  ‫א‬  ‫א‬‫א‬  ‫א‬   ‫א‬ K  ‫א‬ ‫א‬
،K ‫א‬‫א‬،E
K‫א‬‫א‬‫א‬‫א‬‫א‬?‫א‬?‫א‬
   ،‫א‬ ‫א‬  ،‫א‬ ‫א‬  ‫א‬ ‫א‬  ‫א‬  
‫א‬ K ‫א‬،‫א‬
،‫א‬‫א‬‫א‬‫א‬‫א‬K ‫א‬‫א‬‫א‬
K
 ‫א‬‫א‬،‫א‬‫א‬
W ‫א‬     ‫א‬ ‫א‬ ‫א‬ ‫א‬ 
‫א‬‫א‬،‫א‬‫א‬،‫א‬‫א‬‫א‬
K‫א‬‫א‬
†^ÃÖ]<Ø’ËÖ] 654

K‫א‬‫א‬‫א‬
‫א‬‫א‬ :(Fraud Detection) ‫ אآﺘﺸﺎف اﻻﺣﺘﻴﺎل‬
‫א‬K  ‫א‬‫א‬
K
 ‫א‬ ‫א‬     ‫א‬  ‫א‬    ‫א‬
K‫א‬‫א‬
 ‫א‬   ‫א‬  :(Intrusion Detection) ‫ اﻟﻜﺸﻒ ﻋﻦ اﻟﺘﻄﻔﻞ‬
     K  ‫א‬‫א‬  ‫א‬   ‫א‬‫א‬ 
،‫א‬‫א‬‫א‬‫א‬،‫א‬‫א‬‫א‬
‫א‬،‫א‬‫א‬‫א‬‫א‬
‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬ K
K
‫א‬‫א‬‫א‬ :(Ecosystem Disturbance) ‫ اﺿﻄﺮاب اﻟﻨﻈﺎم اﻟﺒﻴﺌﻲ‬
‫א‬‫א‬‫א‬‫א‬‫א‬ K ‫א‬
   ‫א‬   ‫א‬K
 ‫א‬‫א‬    ‫א‬‫א‬ ‫א‬‫א‬
K‫א‬‫א‬
 ‫א‬  ‫א‬ ‫א‬‫א‬ ‫א‬  :(Public Health) ‫ اﻟﺼﺤﺔ اﻟﻌﺎﻣﺔ‬
‫א‬،K‫א‬‫א‬
   ،E   ‫א‬F      ‫א‬   
‫א‬‫א‬‫א‬
K‫א‬‫א‬
  ‫א‬‫א‬    ،    :(Medicine) ‫ اﻟﺪواء‬
K ‫א‬‫א‬‫א‬
 K  ‫א‬،
‫א‬‫א‬F 
KE‫א‬‫א‬‫א‬
655 l^âçjÖ]<ÌÒ<V10

‫א‬‫א‬‫א‬‫א‬ ‫א‬
‫א‬E‫א‬F‫א‬‫א‬،‫א‬‫א‬
‫א‬، K ‫א‬‫א‬
‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬EmeanF‫א‬
K‫א‬‫א‬‫א‬E‫א‬F‫א‬K

 ‫א‬K  ‫א‬‫א‬‫א‬‫א‬
K،‫א‬‫א‬‫א‬

íè‚éã³<l]çŞ} <1.10
   ‫א‬     ‫א‬ ‫א‬  
‫א‬ (2) ،‫א‬ (1)  K‫א‬
‫א‬‫א‬‫א‬‫א‬‫א‬ (3) ،‫א‬ ‫א‬
 (4) ،Eclass labelF ‫א‬
K‫א‬‫א‬

l^âçi<î×Â<Ùç’£]<h^f‰_ <1.1.10
، W
 ‫א‬‫א‬
K‫א‬،‫א‬‫א‬
 E  F  .‫ﺑﻴﺎﻧﺎت ﻣﻦ أﺻﻨﺎف ﻣﺨﺘﻠﻔﺔ‬
‫א‬‫א‬ K  
‫א‬‫א‬‫א‬ 
F ‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬ K  ‫א‬
‫א‬ E  ‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬
‫א‬ K  ‫א‬‫א‬
K‫א‬‫א‬‫א‬‫א‬
 EF ‫א‬‫א‬‫א‬
KDouglas Hawkins‫א‬‫א‬
†^ÃÖ]<Ø’ËÖ] 656

‫א‬‫א‬‫א‬ .(‫ ﻟﻠﺸﻮاذ‬Hawkins ‫ )ﺗﻌﺮﻳﻒ‬1.10 ‫اﻟﺘﻌﺮﻳﻒ‬


K‫א‬‫א‬‫א‬‫א‬
 ‫א‬   ‫א‬   .(Natural Variation) ‫اﻻﺧﺘﻼف اﻟﻄﺒﻴﻌﻲ‬
‫א‬،E‫א‬F‫א‬،
‫א‬ K ‫א‬‫א‬
E  F‫א‬
‫א‬‫א‬‫א‬
 ،K ‫א‬‫א‬‫א‬
‫א‬‫א‬،‫א‬‫א‬
‫א‬‫א‬  K ‫א‬‫א‬ E‫א‬F
K‫א‬
 ‫א‬ ‫א‬  ‫א‬      .‫أﺧﻄﺎء ﻗﻴﺎس وﺟﻤﻊ اﻟﺒﻴﺎﻧﺎت‬
،،K
  ،‫א‬   ‫א‬  ‫ א‬KEnoiseF    ،‫א‬
 K‫א‬
 ‫א‬‫א‬
K‫א‬‫א‬‫א‬‫א‬E‫א‬EcleaningFF‫א‬‫א‬
      ‫א‬        .‫ﻣﻠﺨﺺ‬
 ،  ‫א‬      ‫א‬‫א‬   K
‫א‬‫א‬K‫א‬‫א‬
‫א‬،‫א‬‫א‬
K‫א‬‫א‬ K‫א‬

l^âçjÖ]<àÂ<ÌÓÖ]<ц <2.1.10
K ‫א‬ ‫א‬  ‫א‬  ‫א‬     
K‫א‬1‫א‬،‫א‬‫א‬‫א‬
‫א‬ .(Model-Based) ‫اﻟﺘﻘﻨﻴﺎت اﻟﺘﻲ ﺗﺴﺘﻨﺪ إﻟﻰ اﻟﻨﻤﻮذج‬
K ‫א‬‫ א‬ K
 ‫א‬
657 l^âçjÖ]<ÌÒ<V10

‫א‬‫א‬‫א‬‫א‬‫א‬
F  ‫א‬ K  ‫א‬‫א‬ EparametersF
 ،‫א‬   ‫א‬  ‫א‬K ‫א‬  ‫א‬    ‫א‬E
،Eregression modelF‫א‬‫א‬‫א‬‫א‬K ‫א‬
KEpredictedF‫א‬‫א‬‫א‬
  ،     ‫א‬ ‫א‬ ‫א‬    
‫א‬‫א‬ K  ‫א‬‫א‬‫א‬‫א‬‫א‬
‫א‬‫א‬‫א‬‫א‬
‫א‬‫א‬‫א‬،‫א‬ KEtraining setF 
KE7.5‫א‬‫א‬FK‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬
‫א‬‫א‬‫א‬،‫א‬‫א‬
‫א‬‫א‬‫א‬K  
 
K‫א‬،
‫א‬ .(Proximity-Based) ‫اﻟﺘﻘﻨﻴﺎت اﻟﺘﻲ ﺗﺴﺘﻨﺪ إﻟﻰ اﻟﻘﺮاﺑﺔ‬
‫א‬K‫א‬‫א‬   ‫א‬  ‫א‬     ،‫א‬  ‫א‬
‫א‬K  ‫א‬‫א‬‫א‬‫א‬
 K ‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬
‫א‬ ‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬
K‫א‬‫א‬،‫א‬
‫א‬‫א‬ .(Density-Based) ‫اﻟﺘﻘﻨﻴﺎت اﻟﺘﻲ ﺗﺴﺘﻨﺪ إﻟﻰ اﻟﻜﺜﺎﻓﺔ‬
‫א‬ K
 ‫א‬‫א‬‫א‬،‫א‬
K
 ‫א‬،‫א‬‫א‬‫א‬
‫א‬‫א‬‫א‬
 ‫א‬ ،
K‫א‬
†^ÃÖ]<Ø’ËÖ] 658

Í^ß‘ù]<l^éÛŠi<Ý]‚~j‰] <3.1.10
،‫א‬‫א‬،EunsupervisedF ‫א‬‫א‬ W
 ‫א‬
F ‫א‬‫א‬‫ א‬ KEsemi-supervisedF ‫א‬‫א‬
K‫א‬‫א‬EEnormalFEanomalyF
  .(Supervised anomaly detection) ‫اﻟﻜﺸﻒ اﻟﻤُﺮاﻗﺐ ﻋﻦ اﻟﺘﺸﻮهﺎت‬
      ‫א‬  ‫א‬‫א‬ ‫א‬
KE  ‫א‬FK
(rare class)‫א‬‫א‬‫א‬
‫א‬K ‫א‬     ‫א‬    
K7.5‫א‬
 .(Unsupervised anomaly detection) ‫اﻟﻜﺸﻒ ﻏﻴﺮ اﻟﻤُﺮاﻗﺐ ﻋﻦ اﻟﺘﺸﻮهﺎت‬
     ‫א‬K
 ‫א‬  ‫א‬ 
 ‫א‬  ‫א‬ 
‫א‬EinstanceFE FEscoreF
 ‫א‬    ‫א‬ ‫א‬     K
             
‫א‬  ‫א‬‫א‬  ‫א‬    KElow outlier scoreF
،‫א‬‫א‬،
K‫א‬‫א‬
.(Semi-supervised anomaly detection) ‫اﻟﻜﺸﻒ ﺷﺒﻪ اﻟﻤُﺮاﻗﺐ ﻋﻦ اﻟﺘﺸﻮهﺎت‬

   ،ElabledF      ‫א‬  
‫א‬‫א‬‫א‬‫ א‬K ‫א‬‫א‬
‫א‬‫א‬‫א‬‫א‬‫א‬
‫א‬‫א‬‫א‬‫א‬K ‫א‬‫א‬
        ‫א‬ ‫א‬   
    ‫א‬ ‫א‬   ‫א‬   K‫א‬‫א‬
K‫א‬‫א‬‫א‬‫א‬
659 l^âçjÖ]<ÌÒ<V10

‫א‬  ‫א‬ ‫א‬  ‫א‬ ‫א‬  ‫א‬   ‫א‬‫א‬ 
‫א‬‫א‬‫א‬‫א‬ K   ‫א‬‫א‬‫א‬‫א‬
K7.5‫א‬‫א‬Erare classF‫א‬‫א‬

íÚ^â<Øñ^ŠÚ <4.1.10
K‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬
‫א‬‫א‬‫א‬ .‫ﻋﺪد اﻟﺴﻤﺎت اﻟﻤﺴﺘﺨﺪﻣﺔ ﻟﺘﻌﺮﻳﻒ ﺗﺸﻮﻩ‬
K ‫א‬‫א‬‫א‬‫א‬
K ‫א‬،‫א‬،
 ‫א‬
EF‫א‬‫א‬K‫א‬‫א‬
K 300‫א‬،300
‫א‬‫א‬‫א‬‫א‬‫א‬
K‫א‬ K‫א‬
،‫א‬‫א‬ .‫اﻟﻤﻨﻈﻮر اﻟﺸﺎﻣﻞ ﻓﻲ ﻣﻘﺎﺑﻞ اﻟﻤﺤﻠﻲ‬
‫א‬ K  ‫א‬‫א‬
،‫א‬‫א‬ 5 ‫א‬ 6 
K‫א‬‫א‬
‫א‬‫א‬‫א‬ .ً‫إﻟﻰ أي درﺟﺔ ﺗﻜﻮن ﻧﻘﻄﺔ ﺗﺸﻮهﺎ‬
‫א‬‫א‬‫א‬‫א‬ K  ‫א‬ W 
  ‫א‬   K ‫א‬   ‫א‬   ‫א‬
‫א‬  ‫א‬ ‫א‬ ‫א‬ K       
KEanomaly or outlier scoreF
   .‫ﺗﺤﺪﻳﺪ ﺗﺸﻮﻩ واﺣﺪ ﻓﻲ آﻞ ﻣﺮة أم ﺗﺤﺪﻳﺪ ﻋﺪة ﺗﺸﻮهﺎت دﻓﻌﺔ واﺣﺪة‬
‫א‬‫א‬‫א‬،‫א‬‫א‬‫א‬‫א‬‫א‬
 K‫א‬  K ‫א‬‫א‬
†^ÃÖ]<Ø’ËÖ] 660

،EmaskingF‫א‬‫א‬
‫א‬‫א‬K
 ‫א‬
 
 ،EswampingF‫א‬ ‫א‬
‫א‬‫א‬،Emodel-basedF‫א‬‫א‬‫א‬K ‫א‬
K‫א‬ ‫א‬
‫א‬،‫א‬‫א‬‫א‬‫א‬‫א‬ .‫اﻟﺘﻘﻴﻴﻢ‬
K7.5‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬
،‫א‬‫א‬ ‫א‬‫א‬‫א‬
Efalse positive errorF ‫א‬‫א‬  EprecisionF ‫א‬ ErecallF ‫א‬ 
،‫א‬‫א‬  KEaccuracyF ‫א‬
‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬
K‫א‬
K ‫א‬‫א‬‫א‬‫א‬.‫اﻟﻔﻌﺎﻟﻴﺔ‬
،‫א‬‫א‬‫א‬‫א‬
‫א‬‫א‬  K 
‫א‬‫א‬‫א‬‫א‬‫א‬ K
 
‫א‬ ‫א‬  ‫א‬ ‫א‬    ،‫א‬   m  ،O(m)2 
‫א‬‫א‬‫א‬ KEproximity matrixF ‫א‬‫א‬
  ‫א‬  ،‫א‬  ‫א‬   ،  
K‫א‬‫א‬‫א‬3‫א‬K‫א‬

Ðè†ŞÖ]<íŞè†}
،‫א‬ W  ‫א‬‫א‬‫א‬‫א‬‫א‬
‫א‬K ‫א‬‫א‬‫א‬،‫א‬‫א‬‫א‬،‫א‬‫א‬‫א‬‫א‬
 
    ‫א‬  ‫א‬ ‫א‬   K  ‫א‬    
K
661 l^âçjÖ]<ÌÒ<V10

íéñ^’uý]<цŞÖ] <2.10
،،‫א‬‫א‬‫א‬
‫א‬‫א‬‫א‬‫א‬K  ‫א‬
‫א‬ K
 ‫א‬‫א‬‫א‬‫א‬
K‫א‬2.10
‫א‬    ‫א‬ ‫א‬ .(‫ )اﻟﺘﻌﺮﻳﻒ اﻹﺣﺼﺎﺋﻲ ﻟﻠﻜﺎﺋﻦ اﻟﺸﺎذ‬2.10 ‫اﻟﺘﻌﺮﻳﻒ‬

K‫א‬‫א‬
‫א‬K ‫א‬‫א‬‫א‬
EmaenF ‫א‬  ‫א‬   ،   ‫א‬  ‫א‬
 K  ‫א‬ ‫א‬‫א‬   ‫א‬ ‫א‬ ‫א‬ ‫א‬‫א‬
K‫א‬‫א‬‫א‬
‫א‬2.10‫א‬‫א‬‫א‬‫א‬  ‫א‬‫א‬‫א‬‫א‬
K‫א‬Ediscordant observationsF‫א‬‫א‬،‫א‬‫א‬
‫א‬        ‫א‬ ‫א‬‫א‬   
‫א‬‫א‬‫א‬‫א‬K  ‫א‬‫א‬‫א‬
K‫א‬،
íÚ^â<Øñ^ŠÚ
W‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬
‫א‬‫א‬‫א‬ .‫ﺗﺤﺪﻳﺪ ﺗﻮزﻳﻊ ﻣﺠﻤﻮﻋﺔ ﺑﻴﺎﻧﺎت‬
‫א‬  ،EGaussianF ‫א‬  ،‫א‬ ‫א‬   
     ‫א‬   ، EbinominalF ‫א‬   ،EPoissonF
‫א‬‫א‬‫א‬ K 
‫א‬‫א‬   K 
‫א‬‫א‬‫א‬‫א‬،
‫א‬ K  ‫א‬‫א‬ E  ‫א‬F
KEheavy-tailed distributionsF‫א‬‫א‬‫א‬
†^ÃÖ]<Ø’ËÖ] 662

‫א‬‫א‬‫א‬.‫ﻋﺪد اﻟﺴﻤﺎت اﻟﻤﺴﺘﺨﺪﻣﺔ‬


K(multivariate)‫א‬‫א‬،‫א‬
،‫א‬‫א‬ .‫ﺧﻠﻴﻂ ﻣﻦ اﻟﺘﻮزﻳﻌﺎت‬
‫א‬،‫א‬  K ‫א‬‫א‬‫א‬‫א‬
  ‫א‬ K ‫א‬‫א‬ ‫א‬     ،‫א‬  
 ‫א‬K‫א‬   ‫א‬      ‫א‬  ‫א‬
K2.2.9‫א‬‫א‬‫א‬EM‫א‬‫א‬‫א‬

Çj¹]<ì‚éuæ<íéÃéf<l^Ãè‡çi<»<ƒ]çÖ]<àÂ<ÌÓÖ] <1.2.10
،‫א‬‫א‬‫א‬‫א‬  E‫א‬F ‫א‬‫א‬
µ   ‫א‬ ‫א‬ K ‫א‬‫א‬     
‫א‬ KN(µ,σ) ‫א‬‫א‬،E  ‫א‬‫א‬‫א‬Fσ  E‫א‬F
KN(0, 1)‫א‬1.10
‫اﻟﻜﺜﺎﻓﺔ اﻻﺣﺘﻤﺎﻟﻴﺔ‬

1 ‫ واﻧﺤﺮاف ﻣﻌﻴﺎري‬0 ‫ﺗﺎﺑﻊ اﻟﻜﺜﺎﻓﺔ اﻻﺣﺘﻤﺎﻟﻴﺔ ﻟﺘﻮزﻳﻊ ﻏﻮﺻﻲ ﺑﻤﺘﻮﺳﻂ‬ 10.1 ‫اﻟﺸﻜﻞ‬
663 l^âçjÖ]<ÌÒ<V10


 K‫א‬EtailFN(0, 1)‫א‬EF
K ‫א‬‫א‬‫א‬±3‫א‬‫א‬ 0.0027‫א‬
|x| ≥ c‫א‬ ،‫א‬xc‫א‬
 1.10‫א‬Kα = prob(|x| ≥ c)Kc‫א‬‫א‬
‫א‬‫א‬ KN(0,1) ‫א‬‫א‬ α  c ‫א‬
K‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬4

0 ‫ ﻣﻦ أﺟﻞ ﺗﻮزﻳﻊ ﻏﻮﺻﻲ ﺑﻤﺘﻮﺳﻂ‬،α = prob(|x| ≥ c) ‫ ﺣﻴﺚ‬،(c, α) ‫أﻣﺜﻠﺔ ﻋﻦ أزواج‬ 1.10 ‫اﻟﺠﺪول‬
1 ‫واﻧﺤﺮاف ﻣﻌﻴﺎري‬

c N(0,1) ‫ﻣﻦ أﺟﻞ‬α


1.00 0.3173
1.50 0.1336
2.00 0.0455
2.50 0.0124
3.00 0.0027
3.50 0.0005
4.00 0.0001

‫א‬،‫א‬ N(0,1) ‫א‬ c ‫א‬ 


K3.10‫א‬‫א‬EF‫א‬‫א‬‫א‬
.(N(0,1) ‫ )اﻟﻜﺎﺋﻦ اﻟﺸﺎذ ﻣﻦ أﺟﻞ ﺳﻤﺔ وﺣﻴﺪة ﺗﺨﻀﻊ ﻟﻠﺘﻮزﻳﻊ اﻟﻐﻮﺻﻲ‬3.10 ‫اﻟﺘﻌﺮﻳﻒ‬
W‫א‬‫א‬1‫א‬‫א‬0‫א‬x

|x| ≥ c (1.10)

Kprob(|x| ≥ c) = α‫א‬c
E‫א‬F‫א‬Kα‫א‬‫א‬‫א‬‫א‬
‫א‬ α  ،‫א‬
‫א‬‫א‬ K ‫א‬‫א‬
K‫א‬α،N(0,1)‫א‬
†^ÃÖ]<Ø’ËÖ] 664

µ   E ‫א‬ ‫א‬  F ‫א‬     ‫א‬
‫א‬‫א‬‫א‬،EN(µ,σ)Fσ‫א‬‫א‬
 KN(0,1) ،z  x ‫א‬ EtransformF  3.10
 KEEz scoreFz  z FKz = (x-µ)/σ ‫א‬
‫א‬‫א‬‫א‬ x ‫א‬‫א‬ σµ
 K‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬ Ksx
‫א‬   ‫א‬  KN(0, 1)   z     
K7‫א‬EGrubbs‫א‬F

l÷çvj¹]<ì‚ÃjÚ<íéÃéfŞÖ]<l^Ãè‡çjÖ]<»<ìƒ^Ö]<l^ßñ^ÓÖ] <2.2.10
‫א‬‫א‬‫א‬‫א‬‫א‬
 ‫א‬K ‫א‬
K  ‫א‬‫א‬
K‫א‬‫א‬،‫א‬‫א‬
‫א‬ ‫א‬  ،E‫א‬F ‫א‬ ‫א‬ EcorrelationF ‫א‬ 
‫א‬2.10‫א‬K EsymmetricalF‫א‬ ‫א‬
 (0, 0) ‫א‬‫א‬ ‫א‬
WEcovariance matrixF
⎛ 1.00 0.75 ⎞
∑ = ⎜⎜ 0.75 ⎟
3.00 ⎟⎠

،‫א‬‫א‬  EthresholdF ‫א‬



  Mahalanobis  K
 ‫א‬‫א‬
xMahalanobis2.10‫א‬K14.2‫א‬‫א‬K  ‫א‬‫א‬
K x ‫א‬
mahalanobis (x, x) = (x − x)S −1 (x − x)T (2.10)

K‫א‬‫א‬S
665 l^âçjÖ]<ÌÒ<V10

‫اﻟﻜﺜﺎﻓﺔ‬
‫اﻻﺣﺘﻤﺎﻟﻴﺔ‬

3.10 ‫اﻟﻜﺜﺎﻓﺔ اﻻﺣﺘﻤﺎﻟﻴﺔ ﻟﺘﻮزﻳﻊ ﻏﻮﺻﻲ ﺗﻢ اﺳﺘﺨﺪاﻣﻪ ﻟﺘﻮﻟﻴﺪ ﻧﻘﺎط اﻟﺸﻜﻞ‬ 2.10 ‫اﻟﺸﻜﻞ‬

 ‫א‬ ‫א‬     Mahalanobis     ‫א‬ 
‫א‬ElogF   Mahalanobis  K ‫א‬   
K5‫א‬‫א‬K‫א‬
 .(‫ )اﻟﻜﺎﺋﻨﺎت اﻟﺸﺎذة ﻓﻲ اﻟﺘﻮزﻳﻌﺎت اﻟﻄﺒﻴﻌﻴﺔ ﻣﺘﻌﺪدة اﻟﻤﺘﺤﻮﻻت‬1.10 ‫اﻟﻤﺜﺎل‬
     E ‫א‬  FMahalanobis  3.10 ‫א‬
 B(5,5)  A(-4,4) ‫א‬ K ‫א‬
   K
 ‫א‬   Mahalanobis     ،‫א‬
K2.10‫א‬‫א‬‫א‬‫א‬2000‫א‬‫א‬
 A ، K  Mahalanobis  B  A 
‫א‬   ‫א‬ ‫א‬E(0,0)      x   F ‫א‬
 Mahalanobis  Mahalanobis  B  ،‫א‬
،24  Mahalanobis  5 2   B ‫א‬ K ‫א‬‫א‬
„ K35Mahalanobis 4 2 A
†^ÃÖ]<Ø’ËÖ] 666

‫ﻣﺴﺎﻓﺔ‬
Mahalanobis

‫ ﺛﻨﺎﺋﻴﺔ اﻷﺑﻌﺎد‬2002 ‫ ﻟﻠﻨﻘﺎط ﻣﻦ ﻣﺮآﺰ ﻣﺠﻤﻮﻋﺔ ﻧﻘﺎط ﻋﺪدهﺎ‬Mahalanobis ‫ﻣﺴﺎﻓﺔ‬ 3.10 ‫اﻟﺸﻜﻞ‬

åçjÖ]<àÂ<ÌÓ×Ö<¼×j~¹]<tƒçÛßÖ]<íÏè† <3.2.10
‫א‬F‫א‬K‫א‬‫א‬‫א‬‫א‬‫א‬
‫א‬     ‫א‬  ‫א‬ ‫א‬   ،E2.2.9 ‫א‬
‫א‬‫א‬‫א‬، K‫א‬
K‫א‬‫א‬‫א‬‫א‬،
 ‫א‬    ‫א‬  ‫א‬ ‫א‬    ‫א‬
 ،‫א‬  K E‫א‬F ‫א‬ ‫א‬ EmaximizeF 
‫א‬‫א‬‫א‬‫א‬ K  ‫א‬ EM ‫א‬
‫א‬‫א‬‫א‬ K 
‫א‬‫א‬‫א‬‫א‬‫א‬K ‫א‬‫א‬
K‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬
667 l^âçjÖ]<ÌÒ<V10

MW
 ‫א‬   D‫א‬
‫א‬‫א‬ K ‫א‬‫א‬ A ،E‫א‬F ‫א‬
W‫א‬

D(x) = (1 - λ)M(x) + λA(x) (3.10)

 M ‫א‬  K ‫א‬ ‫א‬‫א‬   1  0   λ   x 
 At  Mt KEuniformF    A ‫א‬    ،‫א‬
A0M0 = D،t = 0 ‫א‬ Kt ‫א‬ E  ‫א‬F ‫א‬‫א‬‫א‬
‫א‬‫א‬ ‫א‬ElogFt‫א‬K
W‫א‬‫א‬D

⎛ ⎞⎛ A ⎞
∏ PD (xi ) = ⎜⎜ (1 − λ) ∏ PM ⎟⎜ λ t
∏ PA (xi ) ⎟⎟
Mt
Lt ( D) = ( x ) (4.10)
t i ⎟⎜ t
x i ∈D ⎝ x i ∈M t ⎠⎝ x i ∈ At ⎠

LLt ( D) = M t log(1 − λ) + ∑ log PM t


(xi ) + At log(λ) + ∑ log PA (xi )
t
(5.10)
x i ∈M t x i ∈ At

K
 ‫א‬ At  Mt  D ‫א‬‫א‬‫א‬ PA   PM   PD  t t

‫א‬F6.9 ‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬ ‫א‬


0‫א‬‫א‬‫א‬‫א‬‫א‬KE2.2.9
M(2)،A(1)W  ‫א‬‫א‬
K1.10‫א‬‫א‬‫א‬K
‫א‬‫א‬،‫א‬‫א‬‫א‬‫א‬
‫א‬K ‫א‬‫א‬‫א‬
‫א‬‫א‬K‫א‬‫א‬‫א‬
‫א‬‫א‬،‫א‬‫א‬‫א‬
‫א‬‫א‬‫א‬‫א‬ K ‫א‬‫א‬‫א‬
EλF‫א‬‫א‬‫א‬‫א‬ 
†^ÃÖ]<Ø’ËÖ] 668

 KE1-λ F ‫א‬‫א‬‫א‬‫א‬‫א‬


‫א‬     ‫א‬  ‫א‬ ‫א‬     ‫א‬‫א‬ 
K‫א‬‫א‬‫א‬

‫اﻟﻜﺸﻒ ﻋﻦ اﻟﺸﻮاذ اﺳﺘﻨﺎداً إﻟﻰ اﻷرﺟﺤﻴﺔ‬ 1.10 ‫اﻟﺨﻮارزﻣﻴﺔ‬

1: Initialization: At time t = 0, let Mt contain all the objects, while At is empty.


Let LLt (D) = LL(Mt) + LL(At) be the log likelihood of all the data.
2: for each point x that belongs to Mt do
3: Move x from Mt to At to produce the new data sets At+1 and Mt+1.
4: Compute the new log likelihood of D, LLt+1(D) = LL(Mt+1) + LL(At+1)
5: Compute the difference, ∆= LLt(D) - LLt+1(D)
6: if ∆ > c, where c is some threshold then
7: x is classified as an anomaly, i.e., Mt+1 and At+1 are unchanged and
become the current normal and anomaly sets.
8: end if
9: end for

‫א‬‫א‬ 1.10 ‫א‬‫א‬‫א‬‫א‬‫א‬


،K ‫א‬‫א‬‫א‬‫א‬‫א‬
FB  A ‫א‬ 3.10 ‫א‬‫א‬‫א‬‫א‬‫א‬
‫א‬‫א‬‫א‬،  K  E
 ‫א‬
،‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬
‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬
‫א‬ K ‫א‬‫א‬‫א‬
KEmultimodalF‫א‬‫א‬‫א‬

ÌÖÖ]æ<ìçÏÖ]<äqæ_ <4.2.10
‫א‬‫א‬‫א‬‫א‬
‫א‬  K  ،
‫א‬ K‫א‬‫א‬‫א‬‫א‬
669 l^âçjÖ]<ÌÒ<V10

 K     ‫א‬‫א‬   ‫א‬ ‫א‬ ‫א‬‫א‬
‫א‬‫א‬،  ‫א‬‫א‬
K‫א‬

íe]†ÏÖ]<±c<ğ]^ßj‰]<l^âçjÖ]<àÂ<ÌÓÖ] <3.10
،‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬
K
 ‫א‬‫א‬‫א‬K ‫א‬‫א‬
‫א‬،‫א‬‫א‬
K‫א‬‫א‬
k ‫א‬‫א‬‫א‬‫א‬ ‫א‬‫א‬‫א‬
   K   4.10 ‫א‬ KEk-nearest neighborF ‫א‬ 
،‫א‬‫א‬‫א‬‫א‬،0‫א‬EscoreF
KE

‫درﺟﺔ اﻟﺸﺬوذ‬

‫درﺟﺔ اﻟﺸﺬوذ اﺳﺘﻨﺎداً إﻟﻰ اﻟﻤﺴﺎﻓﺔ إﻟﻰ اﻟﺠﺎر اﻷﻗﺮب اﻟﺨﺎﻣﺲ‬ 4.10 ‫اﻟﺸﻜﻞ‬
†^ÃÖ]<Ø’ËÖ] 670

‫درﺟﺔ اﻟﺸﺬوذ‬
‫ ﺗﻜﻮن ﻟﻠﻜﺎﺋﻨﺎت اﻟﺸﺎذة‬.‫درﺟﺔ اﻟﺸﺬوذ اﺳﺘﻨﺎداً إﻟﻰ اﻟﻤﺴﺎﻓﺔ إﻟﻰ أول ﺟﺎر أﻗﺮب‬ 5.10 ‫اﻟﺸﻜﻞ‬
‫اﻟﻤﺠﺎورة درﺟﺎت ﺷﺬوذ ﻣﻨﺨﻔﻀﺔ‬

‫א‬ .(‫ ﺟﺎر اﻷﻗﺮب‬k ‫ )اﻟﻤﺴﺎﻓﺔ إﻟﻰ اﻟـ‬4.10 ‫اﻟﺘﻌﺮﻳﻒ‬


Kk
‫א‬K ‫א‬ 4.10‫א‬
KCKk = 5‫א‬‫א‬
 E  1F  k ‫א‬ Kk ‫א‬
K ‫א‬‫א‬‫א‬‫א‬
‫א‬‫א‬‫א‬5.10‫א‬
CKk=1‫א‬‫א‬‫א‬KC
‫א‬‫א‬،‫א‬k‫א‬K 
‫א‬‫א‬ 6.10 ‫א‬ K‫א‬‫א‬ k 
 k = 5 K30  5 
k ‫א‬‫א‬ K‫א‬‫א‬‫א‬‫א‬‫א‬
Kk‫א‬‫א‬4.10‫א‬
‫‪671‬‬ ‫‪l^âçjÖ]<ÌÒ<V10‬‬

‫درﺟﺔ اﻟﺸﺬوذ‬

‫درﺟﺔ اﻟﺸﺬوذ اﺳﺘﻨﺎداً إﻟﻰ اﻟﻤﺴﺎﻓﺔ إﻟﻰ اﻟﺠﺎر اﻷﻗﺮب اﻟﺨﺎﻣﺲ‪ .‬ﻳﺼﺒﺢ اﻟﻌﻨﻘﻮد‬ ‫اﻟﺸﻜﻞ ‪6.10‬‬
‫اﻟﺼﻐﻴﺮ ﺷﺎذاً‬

‫درﺟﺔ اﻟﺸﺬوذ‬

‫درﺟﺔ اﻟﺸﺬوذ اﺳﺘﻨﺎداً إﻟﻰ اﻟﻤﺴﺎﻓﺔ إﻟﻰ اﻟﺠﺎر اﻷﻗﺮب اﻟﺨﺎﻣﺲ‪ .‬ﻋﻨﺎﻗﻴﺪ ذات آﺜﺎﻓﺎت‬ ‫اﻟﺸﻜﻞ ‪7.10‬‬
‫ﻣﺨﺘﻠﻔﺔ‬
†^ÃÖ]<Ø’ËÖ] 672

ÌÖÖ]æ<ìçÏÖ]<äqæ_ <1.3.10
‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬  
 KO(m2) ‫א‬‫א‬‫א‬‫א‬ K ‫א‬‫א‬
   ‫א‬  ،     ‫א‬  ‫א‬  
 K ‫א‬  ‫א‬   ‫א‬‫א‬   ‫א‬ ‫א‬‫א‬
 
         K ‫א‬   ‫א‬
          ‫א‬   ‫א‬ 
K‫א‬‫א‬
‫א‬ K7.10 ‫א‬‫א‬‫א‬‫א‬‫א‬،
D  C ،‫א‬ ‫א‬،‫א‬‫א‬‫א‬
 4.10 ‫א‬‫א‬ K ‫א‬‫א‬
، C  k = 5 
‫א‬ D‫א‬‫א‬‫א‬KD‫א‬
K‫א‬‫א‬‫א‬‫א‬

íÊ^nÓÖ]<±c<ğ]^ßj‰]<ƒæ„Ö]<àÂ<ÌÓÖ] <4.10
‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬
K‫א‬
       .(‫ )اﻟﻜﺎﺋﻦ اﻟﺸﺎذ اﺳﺘﻨﺎداً إﻟﻰ اﻟﻜﺜﺎﻓﺔ‬5.10 ‫اﻟﺘﻌﺮﻳﻒ‬

K‫א‬‫א‬
‫א‬‫א‬  ‫א‬‫א‬ ‫א‬‫א‬    ‫א‬  ‫א‬‫א‬ ‫א‬‫א‬  ‫א‬ 
‫א‬‫א‬‫א‬K ‫א‬‫א‬‫א‬
،‫א‬‫א‬ K  ‫א‬ k ‫א‬‫א‬‫א‬
K6.10‫א‬‫א‬K،‫א‬
673 l^âçjÖ]<ÌÒ<V10

.(‫ )ﻣﻘﻠﻮب اﻟﻤﺴﺎﻓﺔ‬6.10 ‫اﻟﺘﻌﺮﻳﻒ‬

−1
⎛ ∑ y∈N ( x, k ) distance(x, y ) ⎞
density (x, k ) = ⎜ ⎟ (6.10)
⎜ N ( x, k ) ⎟
⎝ ⎠

 |N( x, k)| ،x ‫א‬ k ‫א‬‫א‬‫א‬ N(x, k) 


Ky،‫א‬
K4.8‫א‬‫א‬KDBSCAN‫א‬‫א‬
‫א‬ .(‫ )ﻋﺪّ اﻟﻨﻘﺎط ﺿﻤﻦ ﻧﺼﻒ ﻗﻄﺮ ﻣُﻌﻄﻰ‬7.10 ‫اﻟﺘﻌﺮﻳﻒ‬
K‫א‬d‫א‬‫א‬
‫א‬‫א‬ ‫א‬‫א‬ d ‫א‬ K  d ‫א‬‫א‬
‫א‬‫א‬d‫א‬K 
K‫א‬‫א‬EF
    ‫א‬    ‫א‬ ‫א‬‫א‬   
K3.10 ‫א‬    ‫א‬ ‫א‬‫א‬  ‫א‬‫א‬ ‫א‬‫א‬  ‫א‬  ‫א‬
‫א‬     ‫א‬ ‫א‬      
 ‫א‬ ‫א‬   KE7.10 ‫א‬ ‫א‬FK   ‫א‬ 
  ‫א‬            
6.10F‫א‬7.10‫א‬D‫א‬K ‫א‬‫א‬
K‫א‬‫א‬‫א‬،A‫א‬E7.10
 ‫א‬  ‫א‬  K  ‫א‬ ‫א‬    
K8.4.9‫א‬‫א‬SNN‫א‬‫א‬‫א‬‫א‬‫א‬
‫א‬‫א‬‫א‬ xEratioF‫א‬‫א‬
Wy‫א‬
density (x, k )
average relative density(x, k ) = (7.10)
∑y∈N ( x, k ) density( y, k ) / N (x, k )
†^ÃÖ]<Ø’ËÖ] 674

íéfŠßÖ]<íÊ^nÓÖ]<Ý]‚~j‰^e<ƒ]çÖ]<àÂ<ÌÓÖ] <1.4.10
F ‫א‬ K ‫א‬‫א‬‫א‬‫א‬
K2.10‫א‬‫א‬‫א‬،ELocal Outlier FactorFLOF‫א‬‫א‬
K      ،  ‫א‬‫א‬    
EkF‫א‬‫א‬
‫א‬‫א‬‫א‬  K  ‫א‬‫א‬‫א‬‫א‬ density(x, k)
K7.10 ‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬
‫א‬‫א‬ x ‫א‬‫א‬‫א‬‫א‬‫א‬
Kx‫א‬‫א‬‫א‬

‫ﺧﻮارزﻣﻴﺔ ﺣﺴﺎب درﺟﺔ ﺷﺬوذ اﺳﺘﻨﺎداً إﻟﻰ اﻟﻜﺜﺎﻓﺔ اﻟﻨﺴﺒﻴﺔ‬ 2.10 ‫اﻟﺨﻮارزﻣﻴﺔ‬

1: {k is the number of nearest neighbors}


2: for all objects x do
3: Determine N(x, k), the k- nearest neighbors of x.
4: Determine density(x, k), the density of x using its nearest neighbors, i.e., the
objects in N(x, k).
5: end for
6: for all objects x do
7: Set the outlier score(x, k) = average relative density(x, k) from Equation 10.7.
8: end for

  .(‫ )اﻟﻜﺸﻒ ﻋﻦ اﻟﻜﺎﺋﻨﺎت اﻟﺸﺎذة اﺳﺘﻨﺎداً إﻟﻰ اﻟﻜﺜﺎﻓﺔ اﻟﻨﺴﺒﻴﺔ‬2.10 ‫اﻟﻤﺜﺎل‬
‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬
 8.10 ‫א‬ Kk = 10  K7.10 ‫א‬  ‫א‬‫א‬ ‫א‬ 
‫א‬،K  ‫א‬‫א‬
‫א‬ C  B  A ‫א‬ K ‫א‬
‫א‬،‫א‬‫א‬‫א‬‫א‬K ‫א‬‫א‬
   ‫א‬ ‫א‬ ،EcompactF ‫א‬ ‫א‬    ‫א‬
„ K‫א‬‫א‬
675 l^âçjÖ]<ÌÒ<V10

LOF
‫( ﻣﻦ أﺟﻞ اﻟﻨﻘﺎط ﺛﻨﺎﺋﻴﺔ اﻷﺑﻌﺎد‬LOF) ‫درﺟﺎت اﻟﺸﺬوذ اﺳﺘﻨﺎداً إﻟﻰ اﻟﻜﺜﺎﻓﺔ اﻟﻨﺴﺒﻴﺔ‬ 8.10 ‫اﻟﺸﻜﻞ‬
7.10 ‫اﻟﻮاردة ﻓﻲ اﻟﺸﻜﻞ‬

ÌÖÖ]æ<ìçÏÖ]<äqæ_ <2.4.10
‫א‬      ‫א‬ ‫א‬  ‫א‬‫א‬ ‫א‬‫א‬  ‫א‬ 
 K ‫א‬‫א‬
mFO(m2)‫א‬‫א‬  ‫א‬‫א‬‫א‬
 O(m log m) ‫א‬‫א‬‫א‬، E‫א‬
‫א‬‫א‬‫א‬K ‫א‬‫א‬
k‫א‬ ‫א‬LOF‫א‬‫א‬
‫א‬ ‫א‬ ‫א‬  ‫א‬     K  ‫א‬ ‫א‬  
K‫א‬‫א‬‫א‬

ì‚ÏßÃÖ]<±c<‚ßjŠi<l^éßÏi <5.10
 ‫א‬،‫א‬
‫א‬   K
 ‫א‬     ‫א‬ ‫א‬  ‫א‬
†^ÃÖ]<Ø’ËÖ] 676

‫א‬ K ‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬


K‫א‬
‫א‬‫א‬ ‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬
 ،    ‫א‬  ‫א‬‫א‬ K
 ‫א‬ ‫א‬  ‫א‬ 
‫א‬‫א‬‫א‬‫א‬
K
 ‫א‬‫א‬‫א‬ K‫א‬
   ‫א‬   K ‫א‬ ‫א‬  ‫א‬  ‫א‬ ‫א‬ 
  ‫א‬   ‫א‬   K  ‫א‬ ‫א‬ ‫א‬ 
،‫א‬‫א‬‫א‬‫א‬ EextendF
K‫א‬
‫א‬‫א‬
‫א‬‫א‬،Eprototype-basedF‫א‬K 
،K ‫א‬
‫א‬‫א‬‫א‬‫א‬،EobjectiveFEF 
‫א‬،K ‫א‬
 K ‫א‬،‫א‬‫א‬
‫א‬ K-means 
‫א‬KESSEF‫א‬
K‫א‬8.10‫א‬K‫א‬‫א‬‫א‬‫א‬
‫א‬‫א‬‫א‬‫א‬ .(‫ )اﻟﻜﺎﺋﻦ اﻟﺸﺎذ اﺳﺘﻨﺎداً إﻟﻰ اﻟﻌﻨﻘﺪة‬8.10 ‫اﻟﺘﻌﺮﻳﻒ‬
K‫א‬‫א‬
EF ‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬
‫א‬‫א‬ K  ‫א‬‫א‬‫א‬‫א‬‫א‬
  ‫א‬ ‫א‬  ‫א‬         8.10
EconnectivityF ‫א‬ ‫א‬ ‫א‬  ‫א‬‫א‬ ‫א‬     ،
 ‫א‬‫א‬‫א‬‫א‬ K ‫א‬‫א‬
677 l^âçjÖ]<ÌÒ<V10

‫א‬‫א‬‫א‬،‫א‬‫א‬
K‫א‬‫א‬‫א‬
 ‫א‬‫א‬‫א‬‫א‬ 
KK-means،‫א‬‫א‬‫א‬‫א‬K‫א‬

çÏßÂ<±c<àñ^Ò<ð^ÛjÞ]<ï‚Ú<ÜééÏi <1.5.10
K ‫א‬‫א‬‫א‬‫א‬
‫א‬‫א‬ ‫א‬‫א‬‫א‬
‫א‬  K‫א‬ Eoutlier scoreF 
‫א‬‫א‬‫א‬‫א‬‫א‬
E     ‫א‬     ‫א‬F  ‫א‬ K‫א‬
KMahalanobis‫א‬‫א‬
‫א‬‫א‬
 K ‫א‬‫א‬‫א‬‫א‬
‫א‬‫א‬K ‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬
K‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬
 ‫א‬‫א‬ ‫א‬   ‫א‬ ‫א‬  .(‫ )ﻣﺜﺎل ﻳﺴﺘﻨﺪ إﻟﻰ اﻟﻌﻨﻘﺪة‬3.10 ‫اﻟﻤﺜﺎل‬
،K-means ‫א‬‫א‬‫א‬‫א‬ ‫א‬‫א‬ K7.10 ‫א‬
EcentroidF ‫א‬ (1) W  ‫א‬
E‫א‬F‫א‬،‫א‬‫א‬(2)،
Emedian distanceF‫א‬‫א‬‫א‬‫א‬‫א‬
‫א‬‫א‬‫א‬‫א‬ K  ‫א‬‫א‬‫א‬
K‫א‬‫א‬‫א‬‫א‬
‫א‬  ‫א‬K10.109.10‫א‬‫א‬‫א‬
‫א‬K  ‫א‬E  ‫א‬‫א‬‫א‬‫א‬‫א‬F
‫א‬‫א‬  ‫א‬ ‫א‬ K  
†^ÃÖ]<Ø’ËÖ] 678

‫א‬‫א‬،‫א‬‫א‬  ‫א‬K‫א‬D
 ،‫א‬
„ KEDCAFLOF‫א‬‫א‬

‫اﻟﺒُﻌﺪ‬
‫ﺑُﻌﺪ اﻟﻨﻘﺎط ﻋﻦ أﻗﺮب ﻣﺮآﺰ ﺛﻘﻞ‬ 9.10 ‫اﻟﺸﻜﻞ‬

‫اﻟﺒُﻌﺪ‬

‫اﻟﺒُﻌﺪ اﻟﻨﺴﺒﻲ ﻟﻠﻨﻘﺎط ﻋﻦ أﻗﺮب ﻣﺮآﺰ ﺛﻘﻞ‬ 10.10 ‫اﻟﺸﻜﻞ‬


679 l^âçjÖ]<ÌÒ<V10

íéÖæù]<ì‚ÏßÃÖ]<î×Â<ƒ]çÖ]<m`i <2.5.10
‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬
W ‫א‬‫א‬‫א‬‫א‬‫א‬ K ‫א‬‫א‬‫א‬
K  ‫א‬،‫א‬‫א‬‫א‬،‫א‬
‫א‬   ،   ‫א‬        ‫א‬
‫א‬ ‫א‬ K‫א‬‫א‬
K ‫א‬‫א‬‫א‬‫א‬ K ‫א‬
‫א‬‫א‬  K ‫א‬‫א‬
‫א‬‫א‬،  ‫א‬‫א‬‫א‬
‫א‬‫א‬‫א‬‫א‬‫א‬
K ‫א‬‫א‬‫א‬‫א‬ K ‫א‬‫א‬
‫א‬ 
،K‫א‬‫א‬‫א‬
‫א‬ K ‫א‬ EnoiseF 
K‫א‬E‫א‬F‫א‬‫א‬‫א‬‫א‬

íÚ‚~jŠ¹]<‚éÎ^ßÃÖ]<‚ <3.5.10
K ‫א‬K-means
‫א‬،‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬
،10 ،K ‫א‬
  ‫א‬  K     ‫א‬     ‫א‬ 
   ‫א‬       ، ‫א‬ ‫א‬ ‫א‬ ‫א‬
K
‫א‬    K
 ‫א‬ ‫א‬     ،‫א‬      
K ‫א‬‫א‬ K ‫א‬    ‫א‬
‫א‬‫א‬ (2)،‫א‬‫א‬ (1)‫א‬
K ‫א‬‫א‬،‫א‬‫א‬ 
K‫א‬‫א‬‫א‬
†^ÃÖ]<Ø’ËÖ] 680

ÌÖÖ]æ<ìçÏÖ]<äqæ_ <4.5.10
 ElinearF ‫א‬ EK-means F ‫א‬
 ‫א‬‫א‬ ‫א‬‫א‬  ‫א‬    ‫א‬   ،‫א‬  ‫א‬
،‫א‬‫א‬  K ‫א‬‫א‬
‫א‬‫א‬ K
 ‫א‬‫א‬‫א‬‫א‬‫א‬
‫א‬          ‫א‬ ‫א‬‫א‬ 
‫א‬‫א‬‫א‬، K  ‫א‬‫א‬‫א‬
 ‫א‬ ‫א‬ ‫א‬  K
 ‫א‬    ‫א‬  ‫א‬
‫א‬ 98‫א‬K ‫א‬‫א‬‫א‬‫א‬‫א‬
‫א‬‫א‬،‫א‬
K‫א‬‫א‬

àè…^ÛjÖ] <6.10
K2.1.10 ‫א‬   ‫א‬‫א‬ ‫א‬   ‫א‬ ‫א‬   K1
‫א‬‫א‬‫א‬ ‫א‬‫א‬،
   ‫א‬    ‫א‬ ‫א‬    ‫א‬ ‫א‬
K‫א‬‫א‬‫א‬‫א‬K
 ‫א‬‫א‬   K
   ‫א‬  ‫א‬‫א‬  ‫א‬‫א‬  K2
‫א‬‫א‬‫א‬‫ א‬ K  ‫א‬
‫א‬‫א‬‫א‬‫א‬،  K‫א‬
K ‫א‬     8.6 ‫א‬    ‫א‬EhypercliqueF
،‫א‬ h-confidence‫א‬،
‫א‬‫א‬ KEmaximal hyperclique patternF 
K‫א‬‫א‬‫א‬‫א‬
W ‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬‫א‬ .3
K ‫א‬،‫א‬‫א‬‫א‬‫א‬،‫א‬‫א‬ (model-based)‫א‬‫א‬‫א‬
681 l^âçjÖ]<ÌÒ<V10

 ‫א‬ ‫א‬ ‫א‬    K     
K‫א‬‫א‬،
‫א‬‫א‬‫א‬ E3.10 ‫א‬‫א‬‫א‬FGrubb ‫א‬ K4
‫א‬‫א‬‫א‬K3.10‫א‬‫א‬‫א‬
‫א‬‫א‬‫א‬Ez-scoreFz‫א‬
‫א‬ z ‫א‬‫א‬‫א‬ K ‫א‬‫א‬
 EcriticalF ‫א‬‫א‬،gc  z 
‫א‬‫א‬ Kα  Esignificance levelF 
‫א‬‫א‬‫א‬  K‫א‬
K‫א‬gc

‫ ﻻﺳﺘﺒﻌﺎد اﻟﻜﺎﺋﻨﺎت اﻟﺸﺎذة‬Grubb ‫ﻃﺮﻳﻘﺔ‬ 3.10 ‫اﻟﺨﻮارزﻣﻴﺔ‬

1: Input the values and α


{m is number of values, α is a parameter, and tc is a value chosen so that
α = prob(x ≥ tc for a t distribution with m-2 degrees of freedom.}
2: repeat
3: Compute the sample mean ( x ) and standard deviation (sx).
4: Compute a value gc so that prob(|z| ≥ gc) = α
m −1 tc2
(In terms of tc and m, g c = .)
m m − 2 + tc2

5: Compute the z-score of each value, i.e., z = ( x − x ) / s x


6: Let g = max|z|, i.e., find the z-score of largest magnitude and call it g.
7: if g > gc then
8: Eliminate the value corresponding to g.
9: mÅm-1
10: end if
11: until No objects eliminated.
†^ÃÖ]<Ø’ËÖ] 682

t c2
 Grubb ‫א‬‫א‬ m − 1  ‫א‬‫א‬ (a)
m m − 2 + t c2

K0.05‫א‬‫؟‬‫א‬m
K‫א‬‫א‬EF‫ א‬(b)
E‫א‬‫א‬F ‫א‬     x  ‫א‬ ‫א‬  K5
W‫א‬‫א‬Σµ
( x − µ ) ∑ −1 ( x −µ )
1 −
prob(x) = 1/ 2
e 2 (8.10)
( 2π )m ∑

µ   S  ‫א‬ ‫א‬  x  ‫א‬  ‫א‬ ‫א‬
  log prob(x)   ،E  ‫א‬ FΣ ‫א‬ ‫א‬ 
Kx x ‫א‬xMahalanobis
‫א‬‫א‬‫א‬  E ‫א‬‫א‬FK-means ‫א‬ K6
KE10.10‫א‬F‫א‬‫א‬‫א‬5.10‫א‬
 10.10 ‫א‬‫א‬‫א‬‫א‬‫א‬ (a)
‫א؟‬K‫א‬‫א‬‫א‬‫א‬
 K
  10 ،‫א‬ ‫א‬ (b)
  ‫א‬  ‫א‬ ‫א‬ ‫א‬    ‫א‬ ‫א‬
‫؟‬،‫؟‬‫א‬
K ‫א‬   ‫א‬  ‫א‬E‫א‬F ‫א‬ ‫א‬‫א‬  (c)
K‫א‬‫א‬‫א‬
‫א‬0.01‫א‬‫א‬ K7
Efalse alarm rateF‫א‬‫א‬‫א‬،0.99
KE‫א‬‫א‬‫א‬‫א‬F‫؟‬‫א‬99%‫א‬Edetection rateF‫א‬
683 l^âçjÖ]<ÌÒ<V10

number of anomalies detected


detection rate = (9.10)
total number of anomalies

number of false anomalies


false alarm rate = (10.10)
number of objects classified as anomalies

   ‫א‬     ،‫א‬      K8
‫א‬ K ‫א‬،‫א‬
،‫א‬‫א‬‫א‬
‫א‬‫א‬‫א‬‫א‬‫א‬K
 ‫א‬
‫؟‬‫א‬‫א‬
‫א‬ K[0, 1] ‫א‬‫א‬ K9
‫א‬‫א‬‫א‬‫א‬
‫؟‬‫א‬
‫א‬‫א‬ K10
‫א‬‫א‬‫א‬K‫א‬
K‫א‬
‫א‬  ‫א‬ ‫א‬  ‫א‬   ‫א‬    (a)
   ‫א‬ ‫א‬   ،    ‫א‬FK‫א‬
KE‫א‬
‫א‬‫א‬‫א‬‫א‬‫א‬ (b)
‫؟‬‫א‬‫א‬
†^ÃÖ]<Ø’ËÖ] 684

You might also like