In large-scale, high-dimensional data, such as data coming from IoT sensors or healthcare applications with hundreds or thousands of features, we need to figure out which subset of features will yield a good, robust model.
A ‘raw’ dataset often comes with many irrelevant features that do not contribute much to the accuracy of your predictive model.
As an analogy, music engineers often employ various techniques to tune their recordings so that there is no unwanted noise and the voice is crisp and clear. Datasets likewise contain noise, and it is crucial to remove it for better model optimization.
Feature selection
• Reduces overfitting (‘The Curse of Dimensionality’) — if your dataset has more features/columns than samples, the model will be prone to overfitting. By removing irrelevant features and noise, the model gets to focus on the essential features, leading to better generalization.
• Simplifies models — high dimensionality adds many layers to a model, making it needlessly complicated. Over-engineering can be fun, but over-engineered models are often no better than their simpler counterparts, and simpler models are easier to interpret and debug.
• Reduces training time — fewer features/dimensions reduce the computational load, speeding up model training.
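As a minimal sketch in R (with invented toy data; the variance threshold and correlation ranking are illustrative choices, not a prescribed method), a simple filter-style feature selection might look like this:

```r
# Toy data: 100 samples, 5 features; only x1 and x2 carry real signal
set.seed(1)
x <- data.frame(x1 = rnorm(100), x2 = rnorm(100),
                x3 = rnorm(100), x4 = rnorm(100),
                x5 = rep(1, 100))            # a (near) zero-variance feature
y <- x$x1 + 2 * x$x2 + rnorm(100, sd = 0.1)

# Step 1: drop features with (near) zero variance -- they carry no information
keep <- sapply(x, function(col) sd(col) > 1e-8)
x <- x[, keep]

# Step 2: rank the remaining features by absolute correlation with the target
cors <- sapply(x, function(col) abs(cor(col, y)))
sort(cors, decreasing = TRUE)   # x2 and x1 should dominate the ranking
```

A model trained only on the top-ranked features is smaller, faster to fit, and less prone to chasing noise.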
We will implement the algorithm on the breast cancer dataset, also used in the work by Dag et al. (2019) and available in the mlbench package (Leisch and Dimitriadou, 2010). Before going further, we load the dataset and start processing the data.
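For concreteness, the loading and cleaning step might look as follows (a sketch: dropping the Id column, removing the rows with missing values, and converting the factor-coded predictors to numeric are our choices, not necessarily the original authors'):

```r
library(mlbench)

data("BreastCancer", package = "mlbench")

# Drop the Id column; keep the nine predictors and the Class label
bc <- BreastCancer[, -1]

# Remove rows with missing values (Bare.nuclei contains a few NAs)
bc <- na.omit(bc)

# The predictors are stored as factors; convert them to numeric scores
bc[, 1:9] <- lapply(bc[, 1:9], function(col) as.numeric(as.character(col)))

str(bc)
```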
The next step is defining the input and output variables and dividing the data into three sets: train (60%), validation (20%), and test (20%). For reproducibility of results, let's fix the seed number to 100. Then we obtain the number of observations in each fold.
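In code, this step could be sketched as follows (assuming the cleaned data sit in a data frame `bc` with the nine predictors in columns 1:9 and the class label in column 10; the variable names are our own):

```r
# Assumes `bc` is the cleaned dataset: predictors in columns 1:9,
# class label (benign / malignant) in column 10
x <- data.matrix(bc[, 1:9])   # input variables
y <- bc[, 10]                 # output variable

set.seed(100)                 # fix the seed for reproducibility

n      <- nrow(bc)
ntrain <- round(n * 0.60)     # observations in the train fold
nvalid <- round(n * 0.20)     # observations in the validation fold
ntest  <- n - ntrain - nvalid # the remainder goes to the test fold
```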
Now let's obtain the indices of the train, validation, and test sets. Before doing so, we shuffle the indices to prevent any bias based on the order of the observations. We then use the shuffled indices to construct the three sets.
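A sketch of the shuffling and subsetting (assuming `x`, `y`, and the fold sizes `ntrain` and `nvalid` defined as in the previous step; names are our own):

```r
# Shuffle the row indices to avoid any ordering bias, then cut them
# into consecutive train / validation / test chunks
idx      <- sample(1:n, n, replace = FALSE)
trainidx <- idx[1:ntrain]
valididx <- idx[(ntrain + 1):(ntrain + nvalid)]
testidx  <- idx[(ntrain + nvalid + 1):n]

x.train <- x[trainidx, ];  y.train <- y[trainidx]
x.valid <- x[valididx, ];  y.valid <- y[valididx]
x.test  <- x[testidx, ];   y.test  <- y[testidx]
```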
After obtaining the train, validation and test sets, we can apply the GMDH-type neural network algorithm, which is available in the GMDH2 package.
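The fitting call might be sketched as below; we assume GMDH2 exposes a `GMDH()` function that accepts the training set together with a validation set used for neuron selection (check `?GMDH2::GMDH` for the exact signature in your package version):

```r
library(GMDH2)

# Fit a GMDH-type neural network: the train set estimates the models in
# each layer, the validation set selects which neurons are carried forward
model <- GMDH(x.train, y.train, x.valid, y.valid)
```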
Now, let's obtain the performance measures on the test set.
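One way to compute these measures (assuming `predict()` works on the fitted object and that GMDH2's `confMat()` helper reports accuracy, sensitivity, and specificity; base R's `table()` is a fallback for the raw confusion matrix):

```r
# Predicted classes for the held-out test set
y.pred <- predict(model, x.test, type = "class")

# Confusion matrix with accuracy, sensitivity, specificity, etc.
confMat(y.pred, y.test)

# Fallback with base R only:
# table(Predicted = y.pred, Actual = y.test)
```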
Based on the above, the accuracy of the GMDH algorithm is estimated to be 0.9485; that is, the algorithm classifies 94.85% of persons into the correct class. Sensitivity and specificity are calculated as 0.8913 and 0.9778, respectively: the algorithm correctly identifies 89.13% of the persons having breast cancer and 97.78% of the persons not having it.