Basics of Machine Learning

Basics of machine learning, understading the decision tree, tree defth, leaf nodes
Loading csv data using Pandas, understanding data using describe and head methods of pandas
Building first Model, Chooing Feaures (know data used for prediction) and Target (the value to predict) Defining a model, Fitting a model, Predict, Evaulate (using sklearn lib. of python)
Validating the models, introdution to Mean Absolute Error (MEA- diff b/w the acutal and predicted values from the training data set), introduction to train_test_split - splitting the given data into training data and data set for prediction so that we can compare the results
Underfitting and overfitting concepts based on Shallow and Deep trees respectively. In underfitting we ignore a lot of features while in overfitting the issue is we have large splits that results in less number of records to predict [Lesser the tree depth the more we go to underfitting and more the depth we move towards overfitting. to overcome this issue we need to find a middle way!]
Introduction to Random Forest (RandomForestRegressor) to overcomes the problem of underfitting and overfitting resulting in better model selection. Provided by sklearn lib. itself

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
__pycache__		__pycache__
input		input
.DS_Store		.DS_Store
README.md		README.md
a1_load_data.py		a1_load_data.py
a2_define_model.py		a2_define_model.py
a3_validation.py		a3_validation.py
a4_split_test_data.py		a4_split_test_data.py
a5_under_over_fit.py		a5_under_over_fit.py
a6_random_forest.py		a6_random_forest.py
a7_testing_it_all.py		a7_testing_it_all.py
practice.py		practice.py

Provide feedback