Skip to content
View timlautk's full-sized avatar

Highlights

  • Pro

Block or report timlautk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,045 445 Updated Oct 10, 2024

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 1,973 86 Updated Oct 21, 2024

Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient"

Python 127 15 Updated Dec 11, 2023

Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.

Python 2,026 166 Updated Oct 21, 2024

Best practices & guides on how to write distributed pytorch training code

Python 190 12 Updated Oct 18, 2024

📜 LaTeX Templates for Notes, Reports, CV/Resumes, and Beamers

TeX 100 11 Updated Oct 17, 2024

✍️ A way to integrate LaTeX, VS Code, and Inkscape in macOS

Python 352 23 Updated May 17, 2024

Animation engine for explanatory math videos

Python 68,631 6,100 Updated Oct 17, 2024

prime (previously called ZeroBand) is a framework for efficient, globally distributed training of AI models over the internet.

Python 180 21 Updated Oct 21, 2024

NanoGPT (124M) quality in 2.67B tokens

Python 755 43 Updated Oct 21, 2024

What would you do with 1000 H100s...

Jupyter Notebook 892 52 Updated Jan 10, 2024
Jupyter Notebook 500 54 Updated Oct 21, 2024
Python 42 Updated Sep 27, 2024

Model components of the Llama Stack APIs

Python 3,808 504 Updated Oct 21, 2024
Python 6,439 489 Updated Oct 14, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 72 5 Updated Sep 20, 2024

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 417 32 Updated Oct 8, 2024

Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for training large language models.

Python 40 2 Updated Aug 30, 2024

深度学习经典、新论文逐段精读

26,754 2,419 Updated Aug 8, 2024

Dermatology ddx dataset, Jax implementations of Monte Carlo conformal prediction, plausibility regions and statistical annotation aggregation from our recent work on uncertain ground truth (TMLR'23…

Python 538 33 Updated Mar 28, 2024

Distributed Training Over-The-Internet

663 25 Updated Aug 27, 2024

An extremely fast Python package and project manager, written in Rust.

Rust 23,449 674 Updated Oct 21, 2024

Efficient Triton Kernels for LLM Training

Python 3,252 173 Updated Oct 17, 2024

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Python 695 36 Updated Sep 24, 2024

Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment

Python 1,018 41 Updated May 31, 2024

OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training

Python 287 22 Updated Sep 26, 2024

DoubleML - Double Machine Learning in Python

Python 486 74 Updated Oct 15, 2024

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 2,284 151 Updated Aug 23, 2024

This is a library to use with Robinhood Financial App. It currently supports trading crypto-currencies, options, and stocks. In addition, it can be used to get real time ticker information, assess …

Python 1,694 459 Updated Oct 14, 2024
Next