[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Python 534 19 Updated Aug 17, 2024

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 5,648 276 Updated Aug 18, 2024

FlagGems is an operator library for large language models implemented in Triton Language.

Python 203 12 Updated Aug 19, 2024

Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling"

Jupyter Notebook 94 2 Updated Jul 28, 2024

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in PyTorch

Python 206 21 Updated Aug 17, 2024

Mathematical Visual Instruction Tuning for Multi-modal Large Language Models

82 Updated Aug 5, 2024

A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.

Python 8,521 635 Updated Aug 16, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,658 104 Updated Aug 3, 2024

Your image is almost there!

Python 7,098 412 Updated Jul 26, 2024

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model.

Python 196 9 Updated Aug 6, 2024

Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Python 1,068 53 Updated Aug 19, 2024

DUSt3R: Geometric 3D Vision Made Easy

Python 4,873 539 Updated Aug 10, 2024

[ICCV 2023] DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders

Python 990 111 Updated Aug 19, 2024

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

Python 3,426 288 Updated Aug 7, 2024

A massively parallel, high-level programming language

Rust 17,061 420 Updated Aug 16, 2024

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Python 3,191 271 Updated Aug 15, 2024

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Python 437 19 Updated Aug 16, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 14,107 1,279 Updated Aug 17, 2024

CoreNet: A library for training deep neural networks

Python 6,881 531 Updated May 28, 2024

LLM training in simple, raw C/CUDA

Cuda 22,697 2,537 Updated Aug 16, 2024

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 3,931 297 Updated Jul 16, 2024

One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more

Python 1,400 153 Updated Aug 10, 2024

[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"

Python 552 26 Updated Aug 13, 2024

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python 4,426 554 Updated Jul 2, 2024

Flash Attention in ~100 lines of CUDA (forward pass only)

Cuda 530 43 Updated Apr 7, 2024

Grok open release

Python 49,336 8,322 Updated Aug 7, 2024

Port of EVA-02-CLIP model in C/C++

C 3 Updated Apr 29, 2023

Scalable Diffusion Models with State Space Backbone

Python 145 7 Updated Mar 7, 2024

Official Code for Stable Cascade

Jupyter Notebook 6,494 525 Updated Jul 25, 2024