
University of Idlib

Faculty of Informatics

Fifth Academic Year

Data Mining - Practical Session 1

Data Mining Using the WEKA Workbench

Prepared by Eng. Osama Al-Mustafa and his assistants. Prof. Dr. Mohammad Kurdi



Introduction
WEKA was developed at the University of Waikato in New Zealand. The name stands for Waikato Environment for Knowledge Analysis. We just call it Weka.

Weka is data mining software. It is a collection of machine learning algorithms for data mining tasks, and it contains tools for data preparation, classification, regression, clustering, association rule mining, and visualization. Weka is a comprehensive workbench, and it is free and open source. It is written in Java and runs on any computer: Linux, Windows, and Mac.

WEKA is available from http://www.cs.waikato.ac.nz/ml/weka. You can download either a platform-specific installer or an executable Java jar file, which you run in the usual way if Java is installed.

The Main Window

There are five interfaces in Weka. Besides the Explorer, which is explained below, there is the Experimenter, for carrying out large-scale performance comparisons of different machine learning methods on different datasets; the KnowledgeFlow interface, which is a graphical interface to the Weka tools; and a command-line interface.

Exploring the Explorer

The Explorer has six panels: the Preprocess panel; the Classify panel, where you can build classifiers for datasets; Cluster, because clustering is another procedure that Weka is good at; Associate, for association rules; Select attributes; and Visualize.


In this lecture we will mainly use the Preprocess panel to open files and so on, the Classify panel to experiment with classifiers, and the Visualize panel to visualize our datasets.

Let's open the weather data.

You can find the sample datasets that come with WEKA in the folder where the program was downloaded. Open weather.nominal.arff; the following figure shows how to do so. Weka data files use the ARFF format (Attribute-Relation File Format).


It has 14 instances, 14 days, and for each of these days we have recorded the values of five attributes. Four relate to the weather: outlook, temperature, humidity, and windy. The fifth, play, is whether or not we are going to play. In fact, what we are going to do is predict the play attribute from the other attributes.

This is the weather data. If you select one of the attributes (outlook is selected in the figure), you can see its values. The values of the outlook attribute are sunny, overcast, and rainy. The number of times each appears in the dataset is: 5 sunny days, 4 overcast days, and 5 rainy days, for a total of 14 days, 14 instances. If we look at the temperature attribute, the possible values are hot, mild, and cool. If we go to the play attribute, there are two values for play, yes and no.

Blue corresponds to yes, and red corresponds to no.

If you look at one of the other attributes, such as outlook, you can see that when the outlook is sunny (this is like a bar chart) there are three "no" instances and two "yes" instances. When the outlook is overcast, there are four "yes" instances and zero "no" instances. These are like histograms of the attribute values in terms of the attribute we are trying to predict.
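
The counts and the class-coloured bars described above can be reproduced with a few lines of plain Python. This is only a sketch: it assumes the 14 rows below are the ones shipped in weather.nominal.arff, and the helper names `value_counts` and `class_histogram` are made up for illustration (they are not Weka functions).

```python
from collections import Counter

# Assumption: the 14 rows of weather.nominal.arff, in the order
# outlook, temperature, humidity, windy, play.
ATTRIBUTES = ["outlook", "temperature", "humidity", "windy", "play"]
WEATHER = [
    ("sunny", "hot", "high", "FALSE", "no"),
    ("sunny", "hot", "high", "TRUE", "no"),
    ("overcast", "hot", "high", "FALSE", "yes"),
    ("rainy", "mild", "high", "FALSE", "yes"),
    ("rainy", "cool", "normal", "FALSE", "yes"),
    ("rainy", "cool", "normal", "TRUE", "no"),
    ("overcast", "cool", "normal", "TRUE", "yes"),
    ("sunny", "mild", "high", "FALSE", "no"),
    ("sunny", "cool", "normal", "FALSE", "yes"),
    ("rainy", "mild", "normal", "FALSE", "yes"),
    ("sunny", "mild", "normal", "TRUE", "yes"),
    ("overcast", "mild", "high", "TRUE", "yes"),
    ("overcast", "hot", "normal", "FALSE", "yes"),
    ("rainy", "mild", "high", "TRUE", "no"),
]


def value_counts(rows, attribute):
    """Count how often each value of one attribute occurs."""
    col = ATTRIBUTES.index(attribute)
    return Counter(row[col] for row in rows)


def class_histogram(rows, attribute, class_attribute="play"):
    """Count (attribute value, class value) pairs, like the coloured bars."""
    col = ATTRIBUTES.index(attribute)
    cls = ATTRIBUTES.index(class_attribute)
    return Counter((row[col], row[cls]) for row in rows)


outlook_counts = value_counts(WEATHER, "outlook")
outlook_by_play = class_histogram(WEATHER, "outlook")
print(outlook_counts)
print(outlook_by_play)
```

A `Counter` returns 0 for a pair that never occurs, which matches the empty "no" bar for overcast days.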


If you go to the Edit panel, you will see the data in a table, with the 14 days running down and the five attributes across. This is another view of the data, and you can actually change the dataset here.


Exploring Datasets

The weather data has 14 days, or instances, and each instance is described by five attributes. Four of them relate to the weather, and the last attribute is what we call the "class" value, the thing we are trying to predict: whether or not to play the game. This is called a classification problem. We are trying to predict the class value.

As you can see, the panel shows the size of the dataset, the number of instances (14), and the attributes, and you can click on any of these attributes to get its values. You also get, at the bottom, a histogram of the attribute values with respect to the different class values. The class values are shown in blue for "yes" (play) and in red for "no".

By default, the last attribute in Weka is always the class value. You can change this if you like. If you change it here, you can decide to predict a different attribute instead of the last one.
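
As a rough illustration of what "choosing the class attribute" means, the sketch below separates one instance into its input values and its class value. The function name `split_features_and_class` and the `class_index` parameter are assumptions made for this example; they do not correspond to anything in Weka.

```python
def split_features_and_class(row, class_index=-1):
    """Separate one instance into (feature values, class value).

    class_index=-1 mirrors the default described above: the last
    attribute is treated as the class.
    """
    values = list(row)
    class_value = values.pop(class_index)
    return values, class_value


row = ("sunny", "hot", "high", "FALSE", "no")

# Default: predict the last attribute (play).
features, label = split_features_and_class(row)

# Changed class: predict the first attribute (outlook) instead.
features_alt, label_alt = split_features_and_class(row, class_index=0)
```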


As we said, this is a classification problem, sometimes called a supervised learning problem. It is supervised because you know the class values of the training instances. We take as input a dataset of classified examples; these examples are independent examples with a class value attached.

The idea is to automatically produce some kind of model that can classify new examples. This is the "classification" problem.


These attributes, or features, can be discrete or continuous. What we looked at in the weather data was discrete; we call such attribute values nominal when they belong to a certain fixed set. Alternatively, they can be numeric, or continuous, values. The class, too, can be discrete or continuous. We are looking at a discrete class, "yes" or "no", in the case of the weather data. Another type of machine learning problem involves continuous classes, where we are trying to predict a number. This is called a "regression" problem.

There is a dataset similar to the last weather dataset: the numeric weather dataset. Open it in Weka: weather.numeric.arff.

It is very similar, almost identical in fact, with 14 instances and 5 attributes, the same attributes. If you look at this dataset in the Edit panel, you can see that two of the attributes, temperature and humidity, are now numeric, whereas previously they were nominal. So here there are numbers. When we look at the attribute values for outlook, just as before, we have sunny, overcast, and rainy. For temperature, we cannot enumerate the values; there are too many numbers to enumerate. Instead we have the minimum and maximum value, the mean, and the standard deviation. That is what Weka gives for numeric attributes.
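
The four summary figures shown for a numeric attribute can be computed by hand, which is a useful sanity check. The sketch below assumes that the list holds the 14 temperature values of weather.numeric.arff and that the standard deviation is the sample (n - 1) version; `numeric_summary` is a name invented for this example.

```python
import statistics

# Assumption: the temperature column of weather.numeric.arff.
temperature = [85, 80, 83, 70, 68, 65, 64, 72, 69, 75, 75, 72, 81, 71]


def numeric_summary(values):
    """Return the minimum, maximum, mean and sample standard deviation."""
    return {
        "min": min(values),
        "max": max(values),
        "mean": statistics.mean(values),
        "stdev": statistics.stdev(values),  # sample (n - 1) standard deviation
    }


summary = numeric_summary(temperature)
for name, value in summary.items():
    print(f"{name}: {value:.3f}")
```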

Now open the glass dataset, a somewhat more extensive dataset. It is a real-world dataset. We have 214 instances and 10 attributes. Look at the class, the last attribute, which is shown by default: there are seven values for the class, and the labels of these values give some indication of what this dataset is about. We have headlamps, tableware, and containers, and then building windows and vehicle windows, both float and non-float. These are seven different types of glass.

Consider the ARFF file format, using the glass file as an example. It starts with comments about the glass database; lines beginning with a percent sign (%) are comments. You can see the attributes: refractive index, sodium, magnesium, and so on, and the type of glass. The relation has a name, and the attributes are declared; they are real-valued, numeric attributes.

Then we have the "@data" line, and after that the ARFF format simply lists the instances, one after another, with all the attribute values on one line, ending by default with the class.
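
To make the layout concrete, here is a minimal reader for the subset of ARFF described above: comment lines, the relation name, attribute declarations, and comma-separated instances after "@data". It is a sketch, not Weka's loader: it ignores quoted values, sparse instances, and other parts of the format, and the sample text is a tiny hand-written file with made-up values, not the real glass data.

```python
def parse_arff(text):
    """Parse a simple ARFF document into (relation, attributes, rows)."""
    relation, attributes, rows = None, [], []
    in_data = False
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("%"):
            continue  # blank lines and % comments carry no data
        lower = line.lower()
        if in_data:
            # One instance per line, values separated by commas.
            rows.append([value.strip() for value in line.split(",")])
        elif lower.startswith("@relation"):
            relation = line.split(None, 1)[1]
        elif lower.startswith("@attribute"):
            _, name, kind = line.split(None, 2)
            attributes.append((name, kind))
        elif lower.startswith("@data"):
            in_data = True
    return relation, attributes, rows


# Hypothetical miniature file; the numbers are invented for illustration.
SAMPLE = """% A tiny hand-written example
@relation toy-glass
@attribute RI numeric
@attribute Na numeric
@attribute Type {headlamps, tableware}
@data
1.52,13.6,headlamps
1.51,14.1,tableware
"""

relation, attributes, rows = parse_arff(SAMPLE)
print(relation, attributes, rows)
```

The class value ends up as the last item of each row, matching the default convention that the final attribute is the class.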

Building a Classifier

Now we are going to build a classifier. We will use a system called J48 to analyze the glass dataset we just looked at.

To build a classifier, go to the Classify panel and choose a classifier. There are different kinds of classifiers: Weka has bayes classifiers, functions classifiers, lazy classifiers, meta classifiers, and so on. We will use a tree classifier: J48 is a tree classifier. Go to "trees" and click J48.


To run it, just press "Start", and you have your classifier.

Let's take a look. There is some information about the dataset, the glass dataset: the number of instances and attributes. Then a representation of the tree is printed. Note that this tree has 30 leaves and 59 nodes in total. The overall accuracy is 66.8%. So it has done quite well.

At the bottom, we have a confusion matrix.


Remember that there were seven different types of glass. For building windows made of float glass, you can see that 50 of them were classified as "a", which is correct. 15 of them were classified as "b", which is building windows made of non-float glass, so those are errors, and 3 were classified as "c", and so on. Note that most of the weight lies on the main diagonal, which is what we want to see because it indicates correct classifications. Everything off the main diagonal indicates a misclassification.
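
The relationship between the confusion matrix and the accuracy figure can be sketched in a few lines: rows are actual classes, columns are predicted classes, and accuracy is the diagonal sum divided by the total. The labels and predictions below are a small hypothetical example, not the glass results, and both function names are invented here.

```python
def build_confusion_matrix(actual, predicted, labels):
    """Rows are actual classes, columns are predicted classes."""
    index = {label: i for i, label in enumerate(labels)}
    matrix = [[0] * len(labels) for _ in labels]
    for a, p in zip(actual, predicted):
        matrix[index[a]][index[p]] += 1
    return matrix


def accuracy(matrix):
    """Fraction of instances that lie on the main diagonal."""
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return correct / total


# Hypothetical three-class example.
labels = ["a", "b", "c"]
actual = ["a", "a", "a", "b", "b", "c", "c", "c"]
predicted = ["a", "a", "b", "b", "b", "c", "a", "c"]

matrix = build_confusion_matrix(actual, predicted, labels)
for label, row in zip(labels, matrix):
    print(label, row)
print("accuracy:", accuracy(matrix))
```

In this toy case six of the eight instances sit on the diagonal; the two off-diagonal entries are the misclassifications.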

Let's investigate this a bit further. We will open the configuration panel. Open it by clicking the text field next to the "Choose" button:

Change the "unpruned" parameter to "True" to build an unpruned tree.

Run it again. Now you have a different classifier: 67% correctly classified, whereas we got an accuracy of 66.82% for the pruned tree.

The numbers in parentheses are the number of instances that reach the leaf. When there are two numbers, the second counts the misclassified instances: in the example shown, one incorrectly classified instance reached this leaf and five correctly classified instances reached it.
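
The leaf annotation can be read mechanically. The sketch below assumes the printed form "(n)" or "(n/m)" at the end of a leaf line, where n instances reached the leaf and m of them were misclassified; the two leaf lines and their thresholds are hypothetical, written only to exercise the parser.

```python
import re

# Matches "(6.0)" or "(6.0/1.0)" at the end of a printed leaf line.
LEAF_COUNTS = re.compile(r"\((\d+(?:\.\d+)?)(?:/(\d+(?:\.\d+)?))?\)\s*$")


def leaf_counts(leaf_line):
    """Return (reached, misclassified, correct) for one printed leaf."""
    match = LEAF_COUNTS.search(leaf_line)
    if match is None:
        raise ValueError("no (n) or (n/m) annotation found")
    reached = float(match.group(1))
    wrong = float(match.group(2)) if match.group(2) else 0.0
    return reached, wrong, reached - wrong


# Hypothetical leaf lines; the thresholds are made up.
mixed_leaf = leaf_counts("K <= 0.03: tableware (6.0/1.0)")
pure_leaf = leaf_counts("Ba > 0.27: headlamps (4.0)")
print(mixed_leaf, pure_leaf)
```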

Again, click the configuration panel and change the "minNumObj" parameter. What is that? It is the minimum number of instances per leaf. Change it from 2 to 15 to get larger leaves. Click Start.


Now we get a worse result, 61% correctly classified, but a much smaller tree, with only eight leaves. Now you can visualize this tree. If you right-click the result, you get a small menu; select "Visualize tree".

This is the decision tree. It says: first look at the barium (Ba) content. If it is large, then it must be headlamps. If it is small, then look at magnesium (Mg). If that is small, then look at potassium (K), and if that is small, then we have tableware.


This is a visualization of the tree, and the following is a different representation of the same tree.

From the configuration panel, use the "More" button to get more information about the classifier, here about J48. It is always useful to look at that to see where these classifiers come from.

J48 is based on a famous system called C4.5, described in the book "C4.5: Programs for Machine Learning" by an Australian computer scientist called Ross Quinlan. He started with a system called ID3, which then moved up to C4.5 and later C4.8, and then went commercial. Until then, these were all open-source systems. The WEKA developers took the latest version, C4.8, and rewrote it. Weka is written in Java, so it is called J48.

Using a Filter

Filters are preprocessing tools, so they are usually applied before a classifier.


We will use a filter to remove an attribute from the weather data. Open the weather data. We will remove the humidity attribute: that is attribute number 3. You can look at the filters: just as you chose a classifier with the "Choose" button in the Classify panel, you choose filters with the "Choose" button in the "Filter" panel.

There are many different filters. AllFilter and MultiFilter are ways of combining filters. We have supervised and unsupervised filters. Supervised filters are those that use the class value in their operation. They are not as common as unsupervised filters, which do not use the class value. There are attribute filters and instance filters. We want to remove an attribute, so we are looking for an attribute filter. There are so many filters in Weka that you just have to learn to look around and find what you want.

We are looking for a filter that removes an attribute: the filter is "Remove". By clicking on the filter panel we can configure the filter. This is "a filter that removes a range of attributes from the dataset". You can specify a range of attributes. We just want to remove one, attribute number 3. You can also invert the selection, removing all the other attributes and leaving attribute 3. Click "OK", and watch humidity disappear when we apply the filter. Fortunately, you can undo the effect and bring the attribute back by pressing the Undo button.

In fact, there is a much easier way to remove an attribute: you do not need a filter at all. If you just want to remove an attribute, you can select it and click the "Remove" button at the bottom. It does the same job.
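
What removing an attribute (with or without inverting the selection) does to the data can be mimicked on a plain table. This is an illustrative stand-in written for this handout, not the Weka filter itself: `remove_attributes` and its `invert` flag are hypothetical names, and indices are 1-based to match the attribute numbering used above.

```python
def remove_attributes(header, rows, indices, invert=False):
    """Drop the 1-based attribute indices, or keep only them if invert."""
    selected = {i - 1 for i in indices}
    keep = [col for col in range(len(header)) if (col in selected) == invert]
    new_header = [header[col] for col in keep]
    new_rows = [[row[col] for col in keep] for row in rows]
    return new_header, new_rows


HEADER = ["outlook", "temperature", "humidity", "windy", "play"]
ROWS = [
    ["sunny", "hot", "high", "FALSE", "no"],
    ["overcast", "cool", "normal", "TRUE", "yes"],
]

# Remove attribute number 3 (humidity).
header_without, rows_without = remove_attributes(HEADER, ROWS, [3])

# Inverted selection: remove everything except attribute number 3.
header_only, rows_only = remove_attributes(HEADER, ROWS, [3], invert=True)

print(header_without, rows_without)
print(header_only, rows_only)
```

Because the function returns new lists and leaves its inputs untouched, "undo" here simply means going back to the original `HEADER` and `ROWS`.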


Filters are really useful and can do much more complex things than that. Let's, for example, imagine removing not an attribute but all instances where humidity has the value "high". That is, attribute number 3 has its first value. Let's look for a filter to do that. We want to remove instances, so it will be an instance filter.

How about RemoveWithValues? Select the RemoveWithValues filter. You can configure it: set the attribute index, since we want the third attribute (humidity), and the first value. We could remove a number of different values; we will just remove the first value. Nothing happens until we apply the filter. Watch what happens when we apply it.

We still have the humidity attribute there, but we have no instances with high humidity. In fact, the dataset has been reduced to only 7 instances. You can save the result.
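
The effect of this instance filter can be checked on the raw rows. The sketch below is a plain-Python stand-in for the behaviour described above, not the Weka implementation: it assumes the 14 rows are those of weather.nominal.arff, and `remove_with_values` is a name made up for this example.

```python
# Assumption: the 14 rows of weather.nominal.arff, in the order
# outlook, temperature, humidity, windy, play.
WEATHER = [
    ("sunny", "hot", "high", "FALSE", "no"),
    ("sunny", "hot", "high", "TRUE", "no"),
    ("overcast", "hot", "high", "FALSE", "yes"),
    ("rainy", "mild", "high", "FALSE", "yes"),
    ("rainy", "cool", "normal", "FALSE", "yes"),
    ("rainy", "cool", "normal", "TRUE", "no"),
    ("overcast", "cool", "normal", "TRUE", "yes"),
    ("sunny", "mild", "high", "FALSE", "no"),
    ("sunny", "cool", "normal", "FALSE", "yes"),
    ("rainy", "mild", "normal", "FALSE", "yes"),
    ("sunny", "mild", "normal", "TRUE", "yes"),
    ("overcast", "mild", "high", "TRUE", "yes"),
    ("overcast", "hot", "normal", "FALSE", "yes"),
    ("rainy", "mild", "high", "TRUE", "no"),
]


def remove_with_values(rows, attribute_index, values):
    """Drop every instance whose attribute (1-based index) has one of the values."""
    col = attribute_index - 1
    return [row for row in rows if row[col] not in values]


# Attribute 3 is humidity; "high" is its first declared value.
filtered = remove_with_values(WEATHER, 3, {"high"})
print(len(WEATHER), "->", len(filtered))
```

Every remaining row still has five values: the humidity column survives, only the instances with high humidity are gone.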

We have removed the instances where humidity is high. When looking for filters, we have to think about whether we want a supervised or an unsupervised filter and whether we want an attribute filter or an instance filter, and then use common sense to look down the list of filters to see which one we want.

Sometimes, when you filter data, you get a much better classification.

Here is a really simple example. We will open the glass dataset that we saw before and use J48, as we did before. It is a tree classifier. Press Start and you get a classifier with 66.8% accuracy. Let's remove Fe, that is, iron. Remove this attribute and you get a smaller dataset. Go and run J48 again. Now we get an accuracy of 67.3%. So we have improved the accuracy slightly by removing that attribute.

Visualizing the Data

It is essential to get close to your data and look at it in every possible way. We will look at data visualization.

We will use the Visualize panel. Open the iris dataset. It has numeric attributes, four of them: sepal length, sepal width, petal length, and petal width. The class is the three types of iris flower: Iris setosa, Iris versicolor, and Iris virginica.

Let's go to the Visualize panel and visualize this data.


There is a matrix of two-dimensional plots, a five-by-five matrix of plots. You can select one of these plots; for example, if you select the plot with sepalwidth on the x-axis and petalwidth on the y-axis, you get that plot of the data. The colors correspond to the three classes. You can actually change the colors if you don't like them.

You can look at individual data points by clicking on them. This one is instance number 86, with a sepal length of 6, a sepal width of 3.4, and so on. It is a versicolor, which is why the spot is colored red. We can look at individual instances.


We can change the x-axis and y-axis using the menus at the top. Better still, the set of small bars on the right represents the attributes. Left-clicking on one of them changes the x-axis, and right-clicking changes the y-axis, so you can browse through these different plots quickly.

Sometimes points sit exactly on top of each other, and you can use Jitter to add a little randomness to the x- and y-axes. With a little jitter, the darker spots turn out to represent multiple instances. If you click on one of them, you can see that the point represents three separate instances, all of class Iris-setosa, and all with the same value of petallength and sepalwidth: the sepal width and petal length are 3.0 and 1.4 for each of the three instances.
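
The idea behind jitter can be sketched as adding a small random offset to each coordinate before plotting, so that coincident points separate. This is an assumption about the general technique, using uniform noise; it is not a description of how Weka computes its jitter, and the `jitter` function and its `amount` and `seed` parameters are invented for this example.

```python
import random


def jitter(points, amount, seed=0):
    """Shift every (x, y) point by uniform noise in [-amount, amount]."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    return [
        (x + rng.uniform(-amount, amount), y + rng.uniform(-amount, amount))
        for x, y in points
    ]


# Three instances sharing sepalwidth 3.0 and petallength 1.4, as in the text.
points = [(3.0, 1.4), (3.0, 1.4), (3.0, 1.4)]
jittered = jitter(points, amount=0.05)

print("distinct positions before:", len(set(points)))
print("distinct positions after: ", len(set(jittered)))
```

Only the plotted positions move; the underlying attribute values stay as they are in the dataset.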


Another thing we can do is select part of this dataset. Choose the rectangle selection here. If you draw a rectangle now, you can select these points. If you submit this rectangle, all other points are excluded and only these points appear on the plot, with the axes rescaled appropriately. This could be a way to clean up outliers in your data, by selecting rectangles and saving the new dataset. Then click the Reset button to show the entire dataset again.
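
Submitting a rectangle amounts to keeping the points that fall inside it and then recomputing the axis ranges from what is left. The sketch below illustrates that idea on a handful of hypothetical (x, y) points, one of which plays the role of an outlier; the function names are assumptions for this example.

```python
def select_rectangle(points, x_min, x_max, y_min, y_max):
    """Keep only the points inside the rectangle (borders included)."""
    return [
        (x, y)
        for x, y in points
        if x_min <= x <= x_max and y_min <= y <= y_max
    ]


def axis_ranges(points):
    """Return ((x_min, x_max), (y_min, y_max)) for rescaling the plot."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    return (min(xs), max(xs)), (min(ys), max(ys))


# Hypothetical points; (9.0, 9.5) stands in for an outlier.
points = [(3.0, 1.4), (3.4, 1.6), (2.8, 4.5), (3.1, 5.0), (9.0, 9.5)]

selected = select_rectangle(points, 2.5, 3.5, 1.0, 5.5)
print(selected)
print(axis_ranges(points), "->", axis_ranges(selected))
```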

That is visualizing the dataset itself. What about visualizing the result of a classifier?

Let's go back to the Preprocess panel. We will use a classifier: choose J48 and run it, and then, if you right-click the result in the log area, you can view the classifier errors.


Here we have the class plotted against the predicted class. The square boxes represent errors. If you click on one of these boxes, you can see where the errors are. There are two instances where the predicted class is virginica and the actual class is versicolor.

We can see these in the confusion matrix.

There is a filter that lets you add the classifications as a new attribute. Let's go and have a look at that. We want to add an attribute, and the filter is supervised because it uses the class: it is the attribute filter AddClassification. In its configuration panel, choose the machine learning scheme, J48, and set outputClassification to "True", then apply the filter. It adds a new attribute, and this attribute is the classification according to J48.
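
The idea of appending a classifier's output as an extra attribute can be sketched without any learning library. To keep the example self-contained, the learner below is a deliberately trivial stand-in that always predicts the most frequent class; it is not J48, and the rows are illustrative values, not the full iris data. All function names are assumptions for this example.

```python
from collections import Counter


def train_majority_classifier(rows, class_index=-1):
    """Stand-in learner (not J48): always predicts the most frequent class."""
    majority = Counter(row[class_index] for row in rows).most_common(1)[0][0]
    return lambda row: majority


def add_classification(header, rows, classifier, name="classification"):
    """Append the classifier's prediction to every instance as a new attribute."""
    new_header = list(header) + [name]
    new_rows = [list(row) + [classifier(row)] for row in rows]
    return new_header, new_rows


HEADER = ["sepallength", "sepalwidth", "petallength", "petalwidth", "class"]
# Illustrative rows only.
ROWS = [
    (5.0, 3.4, 1.5, 0.2, "Iris-setosa"),
    (6.0, 3.4, 4.5, 1.6, "Iris-versicolor"),
    (5.9, 3.0, 4.2, 1.5, "Iris-versicolor"),
]

classifier = train_majority_classifier(ROWS)
new_header, new_rows = add_classification(HEADER, ROWS, classifier)
for row in new_rows:
    print(row)
```

With the prediction stored next to the actual class, misclassified instances are simply the rows where the last two values differ.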


References:
1. Eibe Frank, Mark A. Hall, and Ian H. Witten. The WEKA Workbench. Online appendix for "Data Mining: Practical Machine Learning Tools and Techniques", 4th edition.
2. https://www.cs.waikato.ac.nz/ml/weka/mooc/dataminingwithweka/
