
4 Methods that Power Feature Selection in a Machine Learning Model

Feature selection in machine learning is the process of selecting the most impactful features, or columns, in a dataset. Does your dataset have many columns, and do you want to see which have the biggest impact? Do you want to discard those that aren't generating much value? By performing feature selection, you not only reduce the amount of data that needs to be processed, speeding up your analysis, but you also simplify the model, making it easier to interpret.

Depending on the types of data you have, several techniques are available, ranging from statistical tests to leveraging a machine learning model to make the selection. We'll look at a few of the most common techniques and see how they are applied in practice!

  1. Chi-Squared Test for categorical data
  2. Pearson's Correlation Coefficient for numeric data
  3. Principal Component Analysis for numeric data
  4. Feature Importance with Random Forests for both categorical and numeric data
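For the first technique, a minimal sketch of the Chi-Squared test using scikit-learn's `SelectKBest` with the `chi2` score function (the tiny dataset here is made up for illustration; `chi2` needs non-negative numeric input, so the categories are ordinal-encoded first):

```python
import numpy as np
from sklearn.feature_selection import SelectKBest, chi2
from sklearn.preprocessing import OrdinalEncoder

# Hypothetical categorical dataset: two columns, one clearly tied to the target
X = np.array([
    ["red", "small"], ["red", "small"], ["blue", "large"],
    ["blue", "large"], ["red", "small"], ["blue", "large"],
])
y = np.array([0, 0, 1, 1, 0, 1])

# chi2 requires non-negative numeric values, so encode the categories first
X_enc = OrdinalEncoder().fit_transform(X)

# Keep the single column with the strongest chi-squared statistic
selector = SelectKBest(score_func=chi2, k=1)
X_best = selector.fit_transform(X_enc, y)

print(selector.scores_)  # higher score = stronger dependence on the target
```

In practice you would set `k` to however many features you want to keep, or inspect `selector.scores_` directly to rank the columns.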
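For Pearson's Correlation Coefficient, a sketch that ranks numeric columns by their absolute correlation with the target, using pandas' `DataFrame.corr` (the column names, threshold of 0.5, and synthetic data are assumptions for illustration):

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(42)
n = 200
df = pd.DataFrame({
    "feature_a": rng.normal(size=n),  # informative column
    "feature_b": rng.normal(size=n),  # pure noise
})
# The target is mostly driven by feature_a
df["target"] = 2 * df["feature_a"] + rng.normal(scale=0.1, size=n)

# Absolute Pearson correlation of each feature with the target
corr = df.corr(numeric_only=True)["target"].drop("target").abs()

# Keep only features above a (hypothetical) threshold
selected = corr[corr > 0.5].index.tolist()
print(selected)  # feature_a correlates strongly; feature_b does not
```

Note that Pearson's coefficient only captures linear relationships, so a feature with a strong nonlinear link to the target can still score near zero.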
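For Principal Component Analysis, a sketch using scikit-learn's `PCA`; note that PCA reduces dimensionality by building new components rather than picking original columns. The synthetic data below (five raw columns generated from two latent directions) is an assumption for illustration:

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
# Five observed columns that are really mixtures of two underlying signals
latent = rng.normal(size=(100, 2))
mixing = rng.normal(size=(2, 5))
X = latent @ mixing + rng.normal(scale=0.05, size=(100, 5))

# Passing a float < 1 keeps enough components to explain that share of variance
pca = PCA(n_components=0.95)
X_reduced = pca.fit_transform(X)

print(X_reduced.shape)
print(pca.explained_variance_ratio_)  # variance explained per kept component
```

Because the five columns are driven by two latent signals, PCA can compress them to far fewer components with little information loss.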
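Finally, for feature importance, a sketch with scikit-learn's `RandomForestClassifier`, whose `feature_importances_` attribute ranks columns by how much they reduce impurity across the trees. The mixed numeric/categorical data below (with an integer-encoded category and a deliberate noise column) is made up for illustration:

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(1)
n = 300
# Hypothetical mix: an informative numeric column, a pure-noise column,
# and an informative categorical column (already integer-encoded)
informative = rng.normal(size=n)
noise = rng.normal(size=n)
category = rng.integers(0, 3, size=n)
y = ((informative > 0) & (category == 2)).astype(int)
X = np.column_stack([informative, noise, category])

forest = RandomForestClassifier(n_estimators=200, random_state=0).fit(X, y)

# Importances sum to 1; the noise column should rank lowest
for name, score in zip(["informative", "noise", "category"], forest.feature_importances_):
    print(f"{name}: {score:.3f}")
```

This approach handles categorical and numeric columns together, which is why it rounds out the list, but keep in mind that impurity-based importances can be biased toward high-cardinality features.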

Read more here: https://www.dataknowsall.com/featureselection.html
