ai-training

Here are 31 public repositories matching this topic...

Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectures, execute training runs, and assess model performance for efficient and effective LLM pretraining.

  • Updated Aug 7, 2024
  • Jupyter Notebook

A tool that extracts plain (unformatted) multilingual text, redirects, links, and categories from Wikipedia dumps (backups). Designed to prepare clean training data for AI and machine learning software.

  • Updated Nov 11, 2023
  • Python

A machine learning project from my Summer 2019 internship, in which I used MATLAB to train AlexNet to perform real-time facial recognition and identify people. This was my first time using MATLAB.

  • Updated Sep 27, 2021
  • MATLAB

A step-by-step walkthrough of the inner workings of a simple neural network. The goal is to demystify the calculations behind neural networks by breaking them down into understandable components, including forward propagation, backpropagation, gradient calculations, and parameter updates.

  • Updated Dec 14, 2024
  • Jupyter Notebook
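
The four components named in this description (forward propagation, backpropagation, gradient calculation, and parameter update) can be illustrated with a minimal sketch. This is not code from the repository: it assumes a single sigmoid neuron with a squared-error loss, and the names `sigmoid`, `train_step`, and `train`, along with all default values, are hypothetical choices made only for illustration.

```python
import math


def sigmoid(z):
    """Logistic activation function."""
    return 1.0 / (1.0 + math.exp(-z))


def train_step(w, b, x, y, lr):
    """Run one gradient-descent step for a single sigmoid neuron.

    Hypothetical setup: scalar input x, scalar target y,
    loss = 0.5 * (a - y)^2 where a = sigmoid(w * x + b).
    """
    # Forward propagation: compute the prediction and the loss.
    z = w * x + b
    a = sigmoid(z)
    loss = 0.5 * (a - y) ** 2

    # Backpropagation: apply the chain rule from the loss back to z.
    dloss_da = a - y           # d(loss)/d(a)
    da_dz = a * (1.0 - a)      # derivative of the sigmoid
    dloss_dz = dloss_da * da_dz

    # Gradient calculation for each parameter.
    grad_w = dloss_dz * x
    grad_b = dloss_dz

    # Parameter update: move against the gradient.
    w -= lr * grad_w
    b -= lr * grad_b
    return w, b, loss


def train(x=1.5, y=1.0, w=0.1, b=0.0, lr=0.5, steps=50):
    """Repeat train_step and record the loss before each update."""
    losses = []
    for _ in range(steps):
        w, b, loss = train_step(w, b, x, y, lr)
        losses.append(loss)
    return w, b, losses


if __name__ == "__main__":
    w, b, losses = train()
    print(f"first loss: {losses[0]:.4f}, last loss: {losses[-1]:.4f}")
```

With a positive input and a target of 1, every step pushes the pre-activation upward, so the recorded loss shrinks from one step to the next. A multi-layer network repeats the same chain-rule step once per layer.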

Python tool for capturing and logging human-computer interactions. Generate rich datasets for training multi-modal LLMs in autonomous computer control. Features screenshot, mouse, keyboard, and audio recording.

  • Updated Sep 16, 2024
  • Python
