Categorical data is a type of data that is used to group information with similar characteristics, while numerical data is a type of data that expresses information in the form of numbers.
Example of categorical data: gender
Most machine learning algorithms cannot handle categorical variables unless we convert them to numerical values Many algorithm’s performances even vary based upon how the categorical variables are encoded
Nominal: no particular order Ordinal: there is some order between values We will also refer to a cheat sheet that shows when to use which type of encoding.
Below are some screenshot to give you a glimps of the project. For more download the python file and install jupyter and run your file.