Attribute Selection

1. Attribute Evaluators
Correlation-based Feature Subset Selection [1]
WEKA name : CfsSubsetEval
Type : Subset Evaluation
SVM Attribute Selection [3]
WEKA name : SVMAttributeEval
Type : Attribute Ranker
Rough Set Attribute Reduction [4][5]
WEKA name : RSARSubsetEval
Type : Subset Evaluation
Correlation Attribute Evaluation
WEKA name : CorrelationAttributeEval
Type : Attribute Ranker
Evaluates the worth of an attribute by measuring its correlation with the class.
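
As an illustration of correlation-based ranking, here is a minimal Python sketch that scores each attribute by the absolute Pearson correlation between its values and the class. It is not WEKA's implementation. The function names, the toy data, the 0/1 coding of the class, and the choice to take the absolute value are all assumptions made for this example.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length numeric lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # assumption: a constant column is treated as uncorrelated
    return cov / math.sqrt(var_x * var_y)

def correlation_scores(columns, labels):
    """Map each attribute name to |correlation with the class| (hypothetical helper)."""
    return {name: abs(pearson(values, labels)) for name, values in columns.items()}

# Toy data: the class is coded 0/1 so that it can be correlated numerically.
columns = {"a": [1, 2, 3, 4], "b": [1, 0, 1, 0], "c": [5, 5, 5, 5]}
labels = [0, 0, 1, 1]
scores = correlation_scores(columns, labels)
```

In this toy data, attribute "a" rises with the class and gets the highest score, while "b" and the constant "c" score zero.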
Gain Ratio
WEKA name : GainRatioAttributeEval
Type : Attribute Ranker
Evaluates the worth of an attribute by measuring the gain ratio with respect to the class.
GainR(Class, Attribute) = (H(Class) - H(Class | Attribute)) / H(Attribute)
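
The formula above can be worked through for nominal data with a short Python sketch. The function names and the toy data are assumptions made for this example, and it handles only discrete values with no missing entries, so it is an illustration of the formula rather than a reproduction of GainRatioAttributeEval.

```python
import math
from collections import Counter

def entropy(values):
    """Shannon entropy H(X) in bits of a list of discrete values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())

def conditional_entropy(classes, attribute):
    """H(Class | Attribute): class entropy within each attribute value, weighted by frequency."""
    n = len(classes)
    groups = {}
    for a, c in zip(attribute, classes):
        groups.setdefault(a, []).append(c)
    return sum(len(g) / n * entropy(g) for g in groups.values())

def gain_ratio(classes, attribute):
    """GainR = (H(Class) - H(Class | Attribute)) / H(Attribute)."""
    h_attr = entropy(attribute)
    if h_attr == 0:
        return 0.0  # assumption: a single-valued attribute carries no information
    return (entropy(classes) - conditional_entropy(classes, attribute)) / h_attr

classes = ["yes", "yes", "no", "no"]
perfect = gain_ratio(classes, ["a", "a", "b", "b"])  # attribute determines the class
useless = gain_ratio(classes, ["a", "b", "a", "b"])  # attribute is independent of the class
```

Dividing by H(Attribute) penalizes attributes with many distinct values, which plain information gain tends to favor.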
RELIEF [9][10]
WEKA name : ReliefAttributeEval
Type : Attribute Ranker
Principal Component Analysis
WEKA name : PrincipalComponents
Type : Attribute Ranker
2. Search Methods
Best First
WEKA name : BestFirst
Type : Subset Search
Greedy hill-climbing and backtracking
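
A rough Python sketch of greedy hill-climbing with backtracking over attribute subsets is given below. It is a simplified forward search written for this note, not the BestFirst source. The `evaluate` callback, the `max_stale` stopping rule, and the toy scoring function are assumptions made for this example.

```python
import heapq

def best_first_search(n_attributes, evaluate, max_stale=5):
    """Forward best-first search; evaluate(subset) returns a score, higher is better."""
    start = frozenset()
    best_subset, best_score = start, evaluate(start)
    # heapq is a min-heap, so scores are negated; the sorted tuple breaks ties.
    open_list = [(-best_score, (), start)]
    visited = {start}
    stale = 0  # consecutive expansions that did not improve the best score
    while open_list and stale < max_stale:
        # Backtracking: the most promising open subset may lie on an earlier path.
        _, _, subset = heapq.heappop(open_list)
        improved = False
        for attr in range(n_attributes):
            child = subset | {attr}
            if attr in subset or child in visited:
                continue
            visited.add(child)
            score = evaluate(child)
            heapq.heappush(open_list, (-score, tuple(sorted(child)), child))
            if score > best_score:
                best_subset, best_score = child, score
                improved = True
        stale = 0 if improved else stale + 1
    return best_subset, best_score

# Toy evaluator (hypothetical): attributes 0 and 2 are useful, the others cost 0.1 each.
useful = {0, 2}
def toy_score(subset):
    return len(subset & useful) - 0.1 * len(subset - useful)

found, found_score = best_first_search(4, toy_score)
```

In practice the evaluator would be a subset evaluator such as CfsSubsetEval; the toy scoring function only stands in for it here.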
A Fast Correlation-Based Filter Solution [2]
WEKA name : FCBFSearch
Type : Subset Search
Searches based on correlation
Tabu Search [5]
WEKA name : TabuSearch
Type : Subset Search
Heuristic method, proposed by Glover
Combinatorial optimization problem --> searches across all possible attribute subsets
Particle Swarm Optimization Search [6][7]
WEKA name : PSOSearch
Type : Subset Search
Heuristic methods
Genetic Algorithm, based on the method proposed by David E. Goldberg (1989)
WEKA name : GeneticSearch
Type : Subset Search
Heuristic methods
Scatter Search [8]
WEKA name : ScatterSearchV1
Type : Subset Search
Ranker
WEKA name : Ranker
Type : Single Attribute
Sorting method: ranks attributes by the individual scores assigned by a single-attribute evaluator
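
The ranking step can be sketched in a few lines of Python: sort the per-attribute scores in descending order, then optionally cut the list off. The function name, the `num_to_select` and `threshold` parameters, and the sample scores are hypothetical and chosen only for this illustration.

```python
def rank_attributes(scores, num_to_select=None, threshold=None):
    """Sort (attribute, score) pairs by score, highest first, with optional cutoffs."""
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    if threshold is not None:
        # Drop attributes whose score falls below the threshold.
        ranked = [(name, s) for name, s in ranked if s >= threshold]
    if num_to_select is not None:
        # Keep only the top-ranked attributes.
        ranked = ranked[:num_to_select]
    return ranked

# Made-up scores, as a single-attribute evaluator might produce them.
sample_scores = {"a": 0.2, "b": 0.9, "c": 0.5}
full_ranking = rank_attributes(sample_scores)
top_two = rank_attributes(sample_scores, num_to_select=2)
```

Because the ranking looks at one score per attribute, it only makes sense with attribute rankers such as GainRatioAttributeEval, not with subset evaluators.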
References
[1] M. A. Hall, Correlation-based feature selection for machine learning, Ph.D. thesis, Department of Computer Science, University of Waikato,
Hamilton, New Zealand, 1998.
[2] L. Yu and H. Liu, Feature selection for high-dimensional data: A fast correlation-based filter solution, Machine Learning: International
Workshop, 2003.
[3] I. Guyon, J. Weston, S. Barnhill, V. Vapnik (2002). Gene selection for cancer classification using support vector machines. Machine Learning.
46:389-422.
[4] A. Chouchoulas, Q. Shen (2001). Rough set-aided keyword reduction for text categorization. Applied Artificial Intelligence: An International
Journal. 15(9):843-873.
[5] A. Hedar, J. Wang, and M. Fukushima, Tabu Search for Attribute Reduction in Rough Set Theory pp. 115, 2006.
[6] Moraglio, A., Di Chio, C., Poli, R.: Geometric Particle Swarm Optimisation. In: Proceedings of the 10th European Conference on Genetic
Programming, Berlin, Heidelberg, 125-136, 2007.
[7] García-Nieto, J.M., Alba, E., Jourdan, L., Talbi, E.-G. (2009). Sensitivity and specificity based multiobjective approach for feature selection:
Application to cancer diagnosis. Information Processing Letters. 109(16):887-896.
[8] Felix Garcia Lopez (2004). Solving feature subset selection problem by a Parallel Scatter Search. Elsevier.
[9] Kenji Kira, Larry A. Rendell: A Practical Approach to Feature Selection. In: Ninth International Workshop on Machine Learning, 249-256, 1992.
[10] Igor Kononenko: Estimating Attributes: Analysis and Extensions of RELIEF. In: European Conference on Machine Learning, 171-182, 1994.