Skip to content

The objective is to predict insurance policy renewals for existing customers using predictive modeling. To segment existing customers for better targeting through focused marketing strategies & Minimize the fraudulent claims..

License

Notifications You must be signed in to change notification settings

picoders1/HEALTH-INSURANCE-RENEWAL-PREDICTION-and-CUSTOMER-SEGMENTATION-ANALYSIS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

HEALTH-INSURANCE-RENEWAL-PREDICTION-and-CUSTOMER-SEGMENTATION-ANALYSIS

1. Medical Insurance

To be financially stable is everyone’s dream. However, the rise in medical costs and treatment for illness can strain your savings. Health insurance helps lessen the costs of medical expenses in the event of an illness or accident and for preventive medicine such as routine medical tests, check-ups, and screening tests. Cashless treatment, Pre and post-hospitalization cost coverage, Transportation facility, No Claim Bonus (NCB), Medical checkup, Room rent, and Tax benefits are the main benefits. Health insurance is typically offered as one to three years long contracts and required a renewal based on the chosen plan.

Objectives:

  1. The objective is to predict insurance policy renewals for existing customers using predictive modeling.
  2. To segment existing customers for better targeting through focused marketing strategies.

2. Technical Stacks

These are the Software, Tools, and Environments used in the project.

HTML, CSS, JS: Cascading Style Sheets (CSS) are used for presenting documents written in a markup language such as HTML. CSS is a cornerstone technology of the World Wide Web, alongside HTML and JavaScript.

Flask: Flask helps end users interact with your Python code (in this case our ML models) directly from their web browser without needing any libraries, or code files.

Tableau: A tool we used for visualization.

MySQL: A tool used for storing databases.

Heroku: A tool used for Deploying the model.

3. Project Architecture / Data Pipeline

image

4. Data Understanding

The first step in data understanding is Data Collection. The feature set used in this project was taken from the MySQL database, containing 42 features related to the health insurance renewal policy. The output variable for this project is “Renewal: Yes or No”, which is a discrete data type. So this project will focus on the classification machine learning algorithm.

5. Exploratory Data Analysis(EDA)

  1. Data Cleansing is a primary process that needs to be worked on after data collection.
  2. We have performed outlier treatment on features that had outliers, as outliers were affecting the mean values.
  3. Dummy variables were also created for categorical variables using Label Encoding.
  4. We have also performed Standardization of data.
  5. Business Moments Decisions and graphical interpretation of data are performed before and after data cleansing to analyze the statistics of the data
  6. Visualization of univariate and bivariate plots was done in Python and Tableau.

6. Model Building

Classification Algorithms:

  1. Shallow Model(KNN, Naive Bayes, Decision Tree)
  2. Ensemble Model(Random Forest)
  3. Regression Model(Logistic Regression)

Algorithms Used

  1. Support Vector Machine(SVM)
  2. Artificial Neural Network(ANN)

Model Segmentation

  1. Hierarchical Clustering
  2. Density-Based Clustering of Application with Noise
  3. K-Means Clustering

7. Model Evaluation

Model Hyper-parameters used:

  1. Cross Validation
  2. GridSearchCV
  3. RandomSearchCV

Model Accuracy Measures :

  1. Confusion matrix
  2. Accuracy
  3. F1 score
  4. ROC (Receiver Operating Characteristics) curve & AUC (Area Under Curve)

8. Deployment Strategy

  1. Flask :

  2. Flask is a micro-framework for building web applications in Python. It began as a simple wrapper around Wekzeug(WSGI protocol) and Jinja and has become
    the most popular Python web application Framework.

  3. Flask and Green-unicorn module must be installed in the project environment using “pip install flask gunicorn”.

  4. Gunicorn is a Python WSGI HTTPS Server that uses a worker model.

  5. Heroku : 2.1 Heroku is a cloud platform as a service (PaaS) supporting several programming languages.

9. Conclusion

  1. This project is an exploratory attempt to understand the factors, which affect renewal decisions in the health insurance market.
  2. The prediction model helped us to figure out features that contributed more to the renewal of health insurance policies.
  3. Customer segmentation provided further insights into the business which would provide a targeted marketing approach.
  4. The results also suggest customer satisfaction is a significant factor in influencing the renewal decision of policyholders.

Thank You!!!

About

The objective is to predict insurance policy renewals for existing customers using predictive modeling. To segment existing customers for better targeting through focused marketing strategies & Minimize the fraudulent claims..

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published