Idlib University
Faculty of Informatics
Fifth academic year
Introduction
WEKA was developed at the University of Waikato in New Zealand. The name stands for Waikato Environment for Knowledge Analysis; we simply call it Weka.
WEKA is data mining software. It is a comprehensive workbench, and it is free and open source. It runs on any computer: it is written in Java and works on Linux, Windows, and Mac. It is a collection of machine learning algorithms for data mining tasks, with tools for data preparation, classification, regression, clustering, association rule mining, and visualization.
The Weka main window
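Exploring the Explorer
The Explorer has six panels: the Preprocess panel; the Classify panel, where you can build classifiers for datasets; the Cluster panel (clustering is another procedure that Weka is good at); the Associate panel for association rules; the Select attributes panel; and the Visualize panel.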
Data Mining with the WEKA Workbench
In this lecture we will mainly use the Preprocess panel to open files and so on, the Classify panel to experiment with classifiers, and the Visualize panel to visualize our datasets.
You can find the sample datasets that ship with WEKA in the folder where the program was installed. Open weather.nominal.arff; the following figure shows how to do this. All Weka data files use the ARFF format (Attribute-Relation File Format).
The dataset has 14 instances, that is 14 days, and for each day we have recorded the values of five attributes. Four relate to the weather: outlook, temperature, humidity, and windy. The fifth, play, records whether or not we play. What we will actually do is predict the play attribute from the other attributes.
This is the weather data. If you select one of the attributes (outlook is selected in the figure), you can see its values. The values of the outlook attribute are sunny, overcast, and rainy. Their counts in the dataset are 5 sunny days, 4 overcast days, and 5 rainy days, for a total of 14 days, or 14 instances. If we look at the temperature attribute, the possible values are hot, mild, and cool. The play attribute has two values, yes and no.
If you look at one of the other attributes, such as outlook, you can see that when the outlook is sunny there are three "no" instances and two "yes" instances. When the outlook is overcast, there are four "yes" instances and zero "no" instances. The display is a histogram of the attribute's values broken down by the attribute we are trying to predict.
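The per-value counts and the class breakdown that the Preprocess panel shows can be reproduced by hand. The following is a minimal sketch in plain Python, not Weka code; the 14 rows are transcribed from the standard nominal weather data and should be treated as an assumption, and the variable names are ours.

```python
from collections import Counter

# Columns: outlook, temperature, humidity, windy, play.
# Transcribed from the nominal weather data (assumption: check against your file).
WEATHER = """\
sunny,hot,high,FALSE,no
sunny,hot,high,TRUE,no
overcast,hot,high,FALSE,yes
rainy,mild,high,FALSE,yes
rainy,cool,normal,FALSE,yes
rainy,cool,normal,TRUE,no
overcast,cool,normal,TRUE,yes
sunny,mild,high,FALSE,no
sunny,cool,normal,FALSE,yes
rainy,mild,normal,FALSE,yes
sunny,mild,normal,TRUE,yes
overcast,mild,high,TRUE,yes
overcast,hot,normal,FALSE,yes
rainy,mild,high,TRUE,no
"""

rows = [line.split(",") for line in WEATHER.strip().splitlines()]

# How often each outlook value occurs.
outlook_counts = Counter(row[0] for row in rows)

# How often each (outlook, play) pair occurs: the histogram split by class.
by_class = Counter((row[0], row[4]) for row in rows)

for value in ("sunny", "overcast", "rainy"):
    print(value, outlook_counts[value],
          "yes:", by_class[(value, "yes")],
          "no:", by_class[(value, "no")])
```

Counting pairs of (attribute value, class value) is all the coloured histogram does; the same two lines work for any nominal attribute by changing the column index.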
If you open the Edit panel, you see the data as a table, with the 14 days running down and the five attributes across. This is another view of the data, and you can actually change the dataset here.
Exploring datasets
The weather data has 14 days, or instances, and each instance is described by five attributes. Four of them relate to the weather, and the last one is what we call the "class" value: the thing we are trying to predict, namely whether or not to play. This is called a classification problem. We are trying to predict the class value.
The panel shows the size of the dataset, that is the number of instances (14), and the list of attributes. You can click any attribute to see its values. At the bottom you also get a histogram of the attribute's values with respect to the different class values. The class values are colour coded: blue for "yes" (play) and red for "no".
As we said, this is a classification problem, sometimes called a supervised learning problem. It is supervised because you know the class values of the training instances. We take as input a dataset of classified examples; these are independent examples, each with a class value attached.
The idea is to automatically produce some kind of model that can classify new examples. That is the "classification" problem.
These attributes, or features, can be discrete or continuous. What we looked at in the weather data was discrete: we call attribute values nominal when they belong to a certain fixed set. Alternatively, they can be numeric, or continuous, values. The class can also be discrete or continuous. In the weather data we are looking at a discrete class, "yes" or "no". There is another type of machine learning problem that involves continuous classes, where we try to predict a number. That is called a "regression" problem.
There is a dataset similar to the previous weather dataset: the numeric weather dataset. Open weather.numeric.arff in Weka.
It is very similar, almost identical in fact, with 14 instances and the same 5 attributes. If you look at this dataset in the Edit panel, you can see that two of the attributes, temperature and humidity, are now numeric, whereas before they were nominal; here the values are numbers. When we look at the values of the outlook attribute we have, just as before, sunny, overcast, and rainy. For temperature we cannot enumerate the values, because there are too many numbers to list. Instead we get the minimum and maximum value, the mean, and the standard deviation. That is what Weka reports for numeric attributes.
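The four summary statistics reported for a numeric attribute are easy to compute directly. This is a minimal sketch using Python's standard library; the temperature values are transcribed from the numeric weather data and should be treated as an assumption, and we use the sample standard deviation without claiming that this is exactly how Weka computes its figure.

```python
import statistics

# Temperature column of the numeric weather data
# (assumption: transcribed by hand, check against your file).
temperature = [85, 80, 83, 70, 68, 65, 64, 72, 69, 75, 75, 72, 81, 71]

summary = {
    "min": min(temperature),
    "max": max(temperature),
    "mean": statistics.mean(temperature),
    "stdev": statistics.stdev(temperature),  # sample standard deviation
}

for name, value in summary.items():
    print(f"{name}: {value:.3f}")
```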
Now open the glass dataset, which is a more extensive dataset. It is a real-world dataset, with 214 instances and 10 attributes. Look at the class, which by default is the last attribute shown. There are seven values for the class, and their labels give some indication of what this dataset is about. We have headlamps, tableware, and containers, and then building windows and vehicle windows, each either float or non-float. These are seven different types of glass.
Consider the ARFF file format, taking the glass file as an example. It begins with comments about the glass database; lines that begin with a percent sign (%) are comments. You can then see the attributes, which are the refractive index, sodium, magnesium, and so on, followed by the type of glass. The relation has a name, and the attributes are defined; they are real-valued, numeric attributes.
Then we have the "@data" line. After that, the ARFF format simply lists the instances, one after another, with all the attribute values of an instance on one line, ending by default with the class.
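To make the layout concrete, here is a minimal sketch of a reader for the parts of ARFF described above: comment lines, the relation name, attribute declarations, and the data section. It is an illustration only, not Weka's own loader; the sample text is a tiny hand-written file (not the real glass file), the relation name `tiny_weather` is made up, and the sketch ignores quoting, sparse data, and other features of the full format.

```python
ARFF_TEXT = """\
% A tiny hand-written example, not a file shipped with Weka.
@relation tiny_weather

@attribute outlook {sunny, overcast, rainy}
@attribute temperature numeric
@attribute play {yes, no}

@data
sunny,85,no
overcast,83,yes
rainy,70,yes
"""


def parse_arff(text):
    """Return (relation, attributes, instances) for a simple dense ARFF text."""
    relation, attributes, instances = None, [], []
    in_data = False
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("%"):
            continue  # skip blank lines and comments
        keyword = line.lower()
        if in_data:
            # After @data, every line is one instance; the class comes last.
            instances.append([value.strip() for value in line.split(",")])
        elif keyword.startswith("@relation"):
            relation = line.split(None, 1)[1]
        elif keyword.startswith("@attribute"):
            _, name, spec = line.split(None, 2)
            attributes.append((name, spec))
        elif keyword.startswith("@data"):
            in_data = True
    return relation, attributes, instances


relation, attributes, instances = parse_arff(ARFF_TEXT)
print(relation, [name for name, _ in attributes], len(instances))
```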
Building a classifier
Now we are going to build a classifier. We will use a system called J48 to analyze the glass dataset that we have just looked at.
To build a classifier, go to the Classify panel and choose a classifier. There are different kinds of classifiers: Weka has Bayes classifiers, functions classifiers, lazy classifiers, meta classifiers, and so on. We will use a tree classifier; J48 is a tree classifier. Go to "trees" and click J48.
To run it, just press "Start", and you have your classifier.
Let us take a look at the output. First there is some information about the dataset, the glass dataset: the number of instances and attributes. Then a textual representation of the tree is printed. Note that this tree has 30 leaves and 59 nodes in total. The overall accuracy is 66.8%, which is a reasonable result.
Remember that there are seven different types of glass. For building windows made of float glass (class "a"), you can see that 50 instances were classified as "a", which is correct. Fifteen of them were classified as "b", which is building windows made of non-float glass, so those are errors, and 3 were classified as "c", and so on. This table is the confusion matrix. Note that most of the weight lies on the main diagonal, which is what we want to see, because the diagonal holds the correct classifications. Everything off the main diagonal is a misclassification.
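The relation between the confusion matrix and the accuracy figure can be sketched in a few lines. In the example below only the first row (50, 15, 3) echoes the numbers quoted in the text; the other rows, and the reduction to three classes, are made up purely for illustration and are not Weka output.

```python
# Rows are actual classes, columns are predicted classes.
# Only the first row comes from the text; the rest is hypothetical.
matrix = [
    [50, 15, 3],
    [16, 47, 6],
    [5, 5, 6],
]

# Correct classifications sit on the main diagonal.
correct = sum(matrix[i][i] for i in range(len(matrix)))
total = sum(sum(row) for row in matrix)
accuracy = correct / total

# Everything off the diagonal is a misclassification.
errors = total - correct

print(f"correct={correct} errors={errors} accuracy={accuracy:.1%}")
```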
Let us investigate this a little further. Open the configuration panel by clicking the text field next to the "Choose" button:
Change the "unpruned" parameter to "True" to build an unpruned tree, and run the classifier again. Now you have a different classifier, with 67% of the instances correctly classified, whereas we had an accuracy of 66.82% for the pruned tree.
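The numbers in brackets are the number of instances that reach the leaf. When there are two numbers, the first is the number of instances reaching the leaf and the second is how many of them are misclassified; in the leaf shown, one incorrectly classified instance and five correctly classified instances reached that leaf.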
Again, click to open the configuration panel and find the "minNumObj" parameter. What is it? It is the minimum number of instances per leaf. Change it from 2 to 15 to get larger leaves, and click "Start".
Now we get a worse result, 61% correctly classified, but a much smaller tree with only eight leaves. You can now visualize this tree: if you right-click the result in the result list, you get a small menu; select "Visualize tree".
This is the decision tree. It says to look first at the barium (Ba) content. If it is large, the glass must be headlamps. If it is small, look at magnesium (Mg). If that is small, look at potassium (K), and if that is small too, we have tableware.
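A decision tree is just a chain of nested tests, which the following sketch tries to show for the one path described above. The cut points (`ba_cut`, `mg_cut`, `k_cut`) are made-up placeholders, since the text does not give the thresholds, and the function returns `None` for the branches the text does not describe; it is not the tree Weka produced.

```python
def classify_glass(ba, mg, k, ba_cut=0.3, mg_cut=2.4, k_cut=0.03):
    """Follow the single path described in the text.

    The cut points are hypothetical placeholders, not values from Weka.
    """
    if ba > ba_cut:
        return "headlamps"  # large barium content
    if mg <= mg_cut and k <= k_cut:
        return "tableware"  # small Ba, small Mg, small K
    return None  # remaining branches are not described in the text


print(classify_glass(ba=1.0, mg=3.0, k=0.5))
print(classify_glass(ba=0.0, mg=0.0, k=0.0))
```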
From the configuration panel, use the "More" button to get more information about the classifier, in this case J48. It is always useful to look at this to see where these classifiers come from.
Using a filter
Filters are preprocessing tools, so they are usually applied before a classifier is applied.
We will use a filter to remove an attribute from the weather data. Open the weather data. We will remove the humidity attribute, which is attribute number 3. Just as you chose a classifier with the "Choose" button on the Classify panel, you choose a filter with the "Choose" button in the Filter area.
We are looking for a filter that removes an attribute; the filter is called "Remove". Clicking the filter's text field lets us configure it. Its description reads "A filter that removes a range of attributes from the dataset", and you can specify a range of attributes. We only want to remove one, attribute number 3. You can also invert the selection, which removes all the other attributes and leaves number 3. Click "OK" and watch humidity disappear when we apply the filter. Fortunately, you can undo the effect and bring the attribute back by pressing the Undo button.
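What the Remove filter does to the table, including the inverted selection, can be sketched as follows. This is plain Python for illustration, not the Weka filter itself; the function name `remove_attributes` and its parameters are ours, and attribute indices are 1-based as in the text.

```python
def remove_attributes(rows, indices, invert=False):
    """Drop the given 1-based attribute indices from every row.

    With invert=True, keep only those attributes and drop all others.
    """
    selected = {i - 1 for i in indices}
    keep = [j for j in range(len(rows[0])) if (j in selected) == invert]
    return [[row[j] for j in keep] for row in rows]


rows = [
    ["outlook", "temperature", "humidity", "windy", "play"],
    ["sunny", "hot", "high", "FALSE", "no"],
    ["overcast", "hot", "high", "FALSE", "yes"],
]

without_humidity = remove_attributes(rows, [3])
only_humidity = remove_attributes(rows, [3], invert=True)

print(without_humidity[0])
print(only_humidity[0])
```

Because the function returns a new table and leaves `rows` untouched, "undo" in this sketch is simply going back to the original list.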
In fact, there is a much easier way to remove an attribute, and you do not need a filter at all. If you only want to remove an attribute, you can select it and click the "Remove" button at the bottom. It does the same job.
Filters are really useful, and they can do much more complex things than that. Suppose, for example, that we want to remove not an attribute but all the instances in which humidity has the value "high", that is, in which attribute number 3 has its first value. Let us look for a filter that does this. We want to remove instances, so it will be an instance filter.
After applying it, we still have the humidity attribute, but there are no instances with high humidity. In fact, the dataset has been reduced to only 7 instances. You can save the result.
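The effect of such an instance filter can be sketched in plain Python. The text does not name the filter; in Weka this job is typically done with the unsupervised instance filter RemoveWithValues, but treat that name as our assumption. The rows below are transcribed from the nominal weather data (also an assumption), and the helper name is ours.

```python
# Columns: outlook, temperature, humidity, windy, play
# (assumption: transcribed by hand from the nominal weather data).
WEATHER = """\
sunny,hot,high,FALSE,no
sunny,hot,high,TRUE,no
overcast,hot,high,FALSE,yes
rainy,mild,high,FALSE,yes
rainy,cool,normal,FALSE,yes
rainy,cool,normal,TRUE,no
overcast,cool,normal,TRUE,yes
sunny,mild,high,FALSE,no
sunny,cool,normal,FALSE,yes
rainy,mild,normal,FALSE,yes
sunny,mild,normal,TRUE,yes
overcast,mild,high,TRUE,yes
overcast,hot,normal,FALSE,yes
rainy,mild,high,TRUE,no
"""


def remove_with_value(rows, attribute_index, value):
    """Drop every instance whose 1-based attribute equals the given value."""
    return [row for row in rows if row[attribute_index - 1] != value]


rows = [line.split(",") for line in WEATHER.strip().splitlines()]
kept = remove_with_value(rows, attribute_index=3, value="high")

print(len(rows), "->", len(kept))
```

The attribute itself is still present in every remaining row; only the instances with the unwanted value are gone.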
We have removed the instances in which humidity is high. When looking for a filter, we have to think about whether we want a supervised or an unsupervised filter, and whether we want an attribute filter or an instance filter; after that, use common sense to look down the list of filters and find the one you want.
Sometimes filtering the data gives you a much better classification.
Here is a really simple example. Open the glass dataset that we saw before and use J48, the tree classifier, as we did earlier. Press Start and you get a classifier with 66.8% accuracy. Now let us remove Fe, that is, iron. Remove this attribute and you get a smaller dataset. Go and run J48 again. Now we get an accuracy of 67.3%, so we have improved the accuracy slightly by removing this attribute.
Visualizing the data
It is essential to get close to your data and look at it in every possible way. We will now look at data visualization, using the Visualize panel. Open the iris dataset. It has four numeric attributes: sepal length, sepal width, petal length, and petal width. The class is one of three types of iris flower: Iris setosa, Iris versicolor, and Iris virginica.
The panel shows a matrix of two-dimensional plots, a five-by-five matrix. You can select any one of these plots; for example, selecting the plot with sepalwidth on the x axis and petalwidth on the y axis gives a scatter plot of the data. The colours correspond to the three classes, and you can change the colours if you do not like them.
You can look at individual data points by clicking on them. Here the panel reports instance number 86, with a sepal length of 6, a sepal width of 3.4, and so on. It is a versicolor, which is why this point is coloured red. In this way we can inspect individual instances.
We can change the x axis and the y axis using the menus at the top. Better still, the set of small bars on the right represents the attributes: left-clicking a bar changes the x axis, and right-clicking changes the y axis. This lets you browse through the different plots quickly.
Sometimes points sit exactly on top of each other, and you can use Jitter to add a little randomness to the x and y axes. With a little jitter, the darker spots turn out to represent multiple instances. If you click on one of them, you can see that the point represents three separate instances, all of class Iris-setosa, and all with the same values of petallength and sepalwidth: the sepal width and petal length are 3.0 and 1.4 for each of the three instances.
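The idea behind jitter can be sketched as adding a small random offset to each coordinate before plotting, so that coincident points separate. This is an illustration of the concept only; the function name, the `amount` parameter, and the use of uniform noise are our assumptions and are not a description of how Weka implements its Jitter slider.

```python
import random


def jitter(points, amount, seed=0):
    """Return a copy of the points with uniform noise in [-amount, amount]."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    return [
        (x + rng.uniform(-amount, amount), y + rng.uniform(-amount, amount))
        for x, y in points
    ]


# Three instances with the same sepal width (3.0) and petal length (1.4)
# are drawn as a single spot until jitter is applied.
overlapping = [(3.0, 1.4)] * 3
spread = jitter(overlapping, amount=0.05)

print(len(set(overlapping)), "distinct point before,", len(set(spread)), "after")
```

The noise is applied to the plotted positions only; the underlying data values stay unchanged.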
Another thing we can do is select part of the dataset. Choose "Rectangle" as the selection mode here. If you now draw a rectangle, you select the points inside it. If you submit this rectangle, all the other points are excluded and only these points appear on the plot, with the axes rescaled accordingly. This can be a way to clean up outliers in your data, by selecting rectangles and saving the new dataset. Then click the Reset button to show the whole dataset again.
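Submitting a rectangle amounts to keeping the points that fall inside it and then recomputing the axis ranges from what is left. The sketch below illustrates that idea in plain Python; the function names and the five sample points (including one deliberate outlier) are hypothetical.

```python
def select_rectangle(points, x_min, x_max, y_min, y_max):
    """Keep only the points that lie inside the rectangle (edges included)."""
    return [
        (x, y) for x, y in points
        if x_min <= x <= x_max and y_min <= y <= y_max
    ]


def axis_ranges(points):
    """Return ((x_low, x_high), (y_low, y_high)) for rescaling the axes."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    return (min(xs), max(xs)), (min(ys), max(ys))


# Hypothetical points; the last one plays the role of an outlier.
points = [(3.0, 1.4), (3.4, 1.5), (2.9, 4.7), (3.1, 5.6), (9.9, 9.9)]

selected = select_rectangle(points, 2.5, 3.5, 1.0, 6.0)
print(selected)
print(axis_ranges(points), "->", axis_ranges(selected))
```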
Let us go back to the Preprocess panel and then use a classifier. In the Classify panel, choose J48 and run it; then, if you right-click the result in the result list, you can visualize the classifier errors.
Here we have the actual class plotted against the predicted class. The square boxes represent errors. If you click one of these boxes, you can find out where the errors are. There are two instances where the predicted class is virginica and the actual class is versicolor.
There is a filter that lets you add the classifications as a new attribute. Let us go and have a look at it. We want to add an attribute, and the filter is supervised because it uses the class: it is the attribute filter AddClassification. In its configuration panel, choose the machine learning scheme, J48, then find outputClassification, set it to "True", and apply the filter. It adds a new attribute, and this attribute is the classification according to J48.
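The result of adding the classification as an attribute can be sketched as appending one column of predictions to the table. In the sketch below a hand-written rule stands in for the trained classifier, and the three data rows are hypothetical; nothing here is J48 or the Weka filter itself, and the function names are ours.

```python
def add_classification(rows, classify, name="classification"):
    """Append the classifier's prediction to every instance as a new column."""
    header, *data = rows
    return [header + [name]] + [row + [classify(row)] for row in data]


def toy_rule(row):
    # Hypothetical stand-in for a trained classifier such as J48.
    return "Iris-setosa" if float(row[0]) < 2.5 else "Iris-versicolor"


# Hypothetical instances: petal length followed by the actual class.
rows = [
    ["petallength", "class"],
    ["1.4", "Iris-setosa"],
    ["4.7", "Iris-versicolor"],
    ["5.1", "Iris-virginica"],
]

augmented = add_classification(rows, toy_rule)

# With the prediction stored next to the actual class, errors are easy to list.
errors = [row for row in augmented[1:] if row[-1] != row[-2]]
print(augmented[0])
print(errors)
```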
References:
1. Eibe Frank, Mark A. Hall, and Ian H. Witten. The WEKA Workbench. Online appendix for "Data Mining: Practical Machine Learning Tools and Techniques", 4th edition.
2. https://www.cs.waikato.ac.nz/ml/weka/mooc/dataminingwithweka/