ProspectiveTopUpCustomerPrediction

MonPG provides its loan services to its customers and is interested in selling more of its Top-up loan services to its existing customers. The goal was to identify the customers who are likely to purchase MonPG's top-up services in the future. I developed and persisted a SparkML pipeline model to identify potential customers that may purchase any Top-up services in the future (Part 1). Furthermore, I created a SparkML machine learning pipeline stream using Kafka and Spark Streaming using the persisted pipeline model (Part 2).

Note: Part 2 of this project was implemented with the help of dedicated cloud servers at Monash University. These servers were used to start a new Kafka session. To reuse this code in your own environment, you may need to set up your own Kafka servers.

Dataset Information:

Customer data contains variables related to basic service information. For example, frequency of the loan, tenure of the loan, disbursal amount for a loan & LTV.
The bureau data includes the behavioural and transactional attributes of the customers, such as current balance, loan amount, overdue, etc., for various tradelines of a given customer.
Please refer to this link for more details on these datasets: https://www.kaggle.com/datasets/rizdelhi/analytics-vidya-ltfs-finhack-3?select=ltfs3_train_bureau.csv
Please refer to the releases for the customer and bureau datasets.

Part 1:

I derived a new column called “Top-up” from the column called "Top-Up Month" as the label where (label 0 corresponds to No Top-up Service event, and label 1 represents all other types of Top-Up service).
The series of steps that were performed to train the Spark ML model is given in the markdown sections of the part 1 notebook.

Part 2:

The order execution for the Part 2 notebooks is Producer -> spark streaming -> Consumer.
Please refer to the markdown sections of the notebook for more information on the code.

Please use appropriate referencing as decreed under the GNU 3.0 public license to reuse the code or findings of this study in your work.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
topup_pipeline_model		topup_pipeline_model
trainedmodels1/gb2		trainedmodels1/gb2
LICENSE		LICENSE
README.md		README.md
TopUpCustomerPredictionPart1.ipynb		TopUpCustomerPredictionPart1.ipynb
TopUpCustomerPredictionPart2_Step1Producer.ipynb		TopUpCustomerPredictionPart2_Step1Producer.ipynb
TopUpCustomerPredictionPart2_Step2SparkStreaming.ipynb		TopUpCustomerPredictionPart2_Step2SparkStreaming.ipynb
TopUpCustomerPredictionPart2_Step3Consumer.ipynb		TopUpCustomerPredictionPart2_Step3Consumer.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ProspectiveTopUpCustomerPrediction

About

Releases 2

Packages

Languages

License

Siddharth1989/ProspectiveTopUpCustomerPrediction

Folders and files

Latest commit

History

Repository files navigation

ProspectiveTopUpCustomerPrediction

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

Packages