data-splitting

Here are 22 public repositories matching this topic...

Data-Splitter is a Python script designed to split a large CSV file containing data into three different formats: JSON, a database table, and another CSV file. The script ensures a random distribution of data across the three output formats based on custom-defined ratios.

  • Updated Jul 24, 2023
  • Jupyter Notebook
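
The kind of ratio-based split described above could be sketched as follows. This is not the repository's code: the function names (`split_rows`, `split_csv`), the output file names, the SQLite backend, the `records` table name, and the default ratios are all assumptions made for illustration. It also reads the whole CSV into memory, which may not suit a truly large file.

```python
import csv
import json
import random
import sqlite3
from pathlib import Path


def split_rows(rows, ratios, seed=None):
    """Shuffle rows and cut them into len(ratios) parts sized by the ratios."""
    if abs(sum(ratios) - 1.0) > 1e-9:
        raise ValueError("ratios must sum to 1")
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n = len(shuffled)
    parts, start, cumulative = [], 0, 0.0
    for i, ratio in enumerate(ratios):
        cumulative += ratio
        # The last part takes whatever is left so that no row is dropped.
        end = n if i == len(ratios) - 1 else round(cumulative * n)
        parts.append(shuffled[start:end])
        start = end
    return parts


def split_csv(src, out_dir, ratios=(0.4, 0.3, 0.3), seed=None):
    """Split one CSV into a JSON file, a SQLite table, and a smaller CSV."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    with open(src, newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f)
        fields = reader.fieldnames or []
        rows = list(reader)
    if not fields:
        raise ValueError("input CSV has no header row")
    json_rows, db_rows, csv_rows = split_rows(rows, ratios, seed)

    # Part 1: JSON array of objects (hypothetical file name).
    (out_dir / "part.json").write_text(
        json.dumps(json_rows, indent=2), encoding="utf-8"
    )

    # Part 2: SQLite table; column names are quoted, values are bound.
    cols = ", ".join('"' + c.replace('"', '""') + '"' for c in fields)
    marks = ", ".join("?" for _ in fields)
    conn = sqlite3.connect(out_dir / "part.db")
    try:
        with conn:
            conn.execute(f"CREATE TABLE IF NOT EXISTS records ({cols})")
            conn.executemany(
                f"INSERT INTO records VALUES ({marks})",
                [tuple(r[c] for c in fields) for r in db_rows],
            )
    finally:
        conn.close()

    # Part 3: CSV with the same header as the input.
    with open(out_dir / "part.csv", "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=fields)
        writer.writeheader()
        writer.writerows(csv_rows)
    return len(json_rows), len(db_rows), len(csv_rows)
```

Calling `split_csv("data.csv", "out", ratios=(0.5, 0.3, 0.2), seed=42)` would write the three outputs under `out/` and return the row count of each part. Passing a fixed `seed` makes the random assignment reproducible.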

In this project, I used logistic regression, a supervised machine learning algorithm, to predict whether a person has diabetes based on features such as age, blood pressure, glucose level, and body mass index. I used Python and popular libraries such as Pandas, Scikit-Learn, and Matplotlib to perform model building.

  • Updated Jan 26, 2024
  • Jupyter Notebook

This project focuses on cleaning and analyzing a loan application dataset to gain insights into the factors influencing loan defaults. Through systematic data cleaning, visualization, and merging with previous application data, it provides a robust foundation for further predictive modeling.

  • Updated Jul 27, 2024
  • Jupyter Notebook

Analyzed customer churn using transaction data. Built an ML model to predict lapses. The dataset includes customer status, collection/redemption info, and program tenure. Delivered a business presentation outlining the modeling approach, findings, and churn reduction strategies.

  • Updated Apr 18, 2024
