Skip to content
View Robert-zwr's full-sized avatar

Block or report Robert-zwr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Efficient Triton Kernels for LLM Training

Python 3,086 158 Updated Sep 25, 2024

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Python 559 21 Updated Aug 17, 2024

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 7,333 392 Updated Sep 27, 2024

FlagGems is an operator library for large language models implemented in Triton Language.

Python 272 24 Updated Sep 27, 2024

Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling"

Jupyter Notebook 105 2 Updated Sep 21, 2024

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch

Python 243 21 Updated Sep 11, 2024

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Python 11,677 877 Updated Sep 27, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,695 112 Updated Sep 19, 2024

Your image is almost there!

Python 7,238 418 Updated Jul 26, 2024

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model.

Python 209 13 Updated Sep 22, 2024

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Python 1,229 66 Updated Sep 26, 2024

DUSt3R: Geometric 3D Vision Made Easy

Python 5,078 553 Updated Sep 20, 2024

[ICCV 2023] DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders

Python 1,039 117 Updated Aug 26, 2024

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

Python 3,617 306 Updated Sep 13, 2024

A massively parallel, high-level programming language

Rust 17,247 424 Updated Sep 27, 2024

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Python 3,319 285 Updated Aug 15, 2024

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Python 461 21 Updated Sep 27, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 14,702 1,348 Updated Sep 15, 2024

CoreNet: A library for training deep neural networks

Python 6,937 540 Updated May 28, 2024

LLM training in simple, raw C/CUDA

Cuda 23,582 2,639 Updated Sep 27, 2024

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 4,031 301 Updated Jul 16, 2024

One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more

Python 1,493 168 Updated Sep 8, 2024

[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"

Python 609 28 Updated Aug 13, 2024

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python 4,534 568 Updated Jul 2, 2024

Flash Attention in ~100 lines of CUDA (forward pass only)

Cuda 566 50 Updated Apr 7, 2024

Grok open release

Python 49,458 8,326 Updated Aug 30, 2024

Port of EVA-02-CLIP model in C/C

C 3 Updated Apr 29, 2023

Scalable Diffusion Models with State Space Backbone

Python 147 7 Updated Mar 7, 2024

Official Code for Stable Cascade

Jupyter Notebook 6,521 530 Updated Jul 25, 2024
Next