Skip to content

bjam24/bank_customer_segmentation_methods

Repository files navigation

Bank Customer Segmentation Methods

Description

This project focuses on Clustering Bank Customers using various methods available in the scikit-learn library. My goal was to gain insights into the nature of clustering methods and learn how to use scikit-learn to implement these techniques in practical problems. A relevant business sector for this application is Finance and Banking, where understanding different customer segments is crucial for the survival of any institution providing financial services. This project was also created for a Kaggle challenge called Credit Card Dataset for Clustering. The link to this event can be found in the repository description.

Topics

There are many techniques used in this notebook, but only a fraction of them are presented here. Please refer to the notebook to learn about all the techniques used.

Principal Component Analysis (PCA)

Clustering methods

  • K-Means

- DBSCAN

- Agglomerative Clustering

- Affinity Propagation

- Spectral Clustering

- Gaussian Mixture Model

Installation

To run this notebook, you'll need to have Jupyter Notebook and an Anaconda environment set up on your system.

1. Clone the repository

Open your terminal or command prompt and run:

git clone https://github.com/bjam24/bank_customer_segmentation_methods.git
cd bank_customer_segmentation_methods

2. Create and activate a new Anaconda environment

conda create --name myenv python=3.8
conda activate myenv

3. Install required packages

pip install -r requirements.txt

4. Launch Jupyter Notebook

jupyter notebook

5. Navigate to the notebook and run it

Technology stack

  • Python programming language
  • Jupyter Notebook

Data source