is a statistical inference project focuses mainly on how to run inference on data and visuialize it applying data pre-processisng such as data-cleansing , normalizing and removing outliers
with the main task of clustring the data using various techinques like k-means, k-medoids and hirearchical clustring