Skip to content

This repository explores the application of supervised learning techniques to two key domains: banking credit data and relational datasets (Cora, CiteSeer, PubMed). It aims to tackle real-world challenges through a comparative analysis of methods such as Naive Bayes, KNN, SVM, and more, all implemented in Python.

Notifications You must be signed in to change notification settings

AbirOumghar/SupervisedLearning-BankingAndRelationalAnalysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 

Repository files navigation

Supervised Learning Analysis: Banking and Relational Data

Project Overview

This project utilizes Python to apply supervised learning techniques to two distinct real-world data domains: banking credit assessments and relational datasets (Cora, CiteSeer, PubMed). The objective is to navigate through the inherent challenges of these datasets, deploying a variety of classification methods to gain insights and predictive accuracy.

Methodology

I employed several classical and contemporary supervised learning algorithms, including but not limited to Naive Bayes, K-Nearest Neighbors (KNN), Support Vector Machines (SVM), and Decision Trees. Special emphasis was placed on preprocessing techniques suitable for each dataset's nature, ensuring robust model training and evaluation.

Key Highlights:

  • Data Exploration: Initial analysis to understand the datasets' characteristics, distribution, and potential challenges.
  • Feature Engineering: Crafting and selecting meaningful features to enhance model performance.
  • Model Selection and Training: Comparative analysis of various algorithms to identify the most effective models for these specific datasets.
  • Performance Evaluation: Utilizing accuracy, precision, recall, and F1-score metrics to assess model efficacy.

Results

The analysis revealed insightful patterns and predictive accuracies that underscore the complexities and potential of supervised learning in real-world applications.

About

This repository explores the application of supervised learning techniques to two key domains: banking credit data and relational datasets (Cora, CiteSeer, PubMed). It aims to tackle real-world challenges through a comparative analysis of methods such as Naive Bayes, KNN, SVM, and more, all implemented in Python.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published