Using FlexAttention to compute attention with different masking patterns

Python 37 Updated Sep 22, 2024

Efficient Triton Kernels for LLM Training

Python 3,132 161 Updated Oct 5, 2024

Megatron's multi-modal data loader

Python 90 6 Updated Oct 2, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

4,055 221 Updated Oct 5, 2024

NumPy tutorials & educational content in notebook format

Python 479 184 Updated Oct 2, 2024

"Deep Learning from Scratch ❸" (ゼロから作る Deep Learning ❸, O'Reilly Japan, 2020)

Python 735 291 Updated May 27, 2024

LLM related research papers curated by LLMs themselves

Python 13 9 Updated Oct 4, 2024

Neural Networks: Zero to Hero

Jupyter Notebook 11,652 1,457 Updated Aug 18, 2024

LLM101n: Let's build a Storyteller

29,167 1,599 Updated Aug 1, 2024

PyTorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).

Python 56 6 Updated Aug 21, 2024

Official implementation of AnimateDiff.

Python 10,366 849 Updated Jul 31, 2024

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,038 86 Updated Aug 6, 2024

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Python 1,211 92 Updated Aug 22, 2024

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in PyTorch

Python 460 27 Updated Aug 15, 2024

Here we will keep track of the latest AI Game Development Tools, including LLM, Agent, Code, Writer, Image, Texture, Shader, 3D Model, Animation, Video, Audio, Music, Singing Voice and Analytics. 🔥

442 51 Updated Oct 5, 2024

[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation

Python 5,191 436 Updated Sep 9, 2024

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)

Jupyter Notebook 1,667 95 Updated Sep 6, 2024

Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (2024)

Python 172 10 Updated May 28, 2024

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Python 11,292 1,009 Updated Oct 6, 2024

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

3,225 193 Updated Sep 21, 2024

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model.

Python 211 13 Updated Sep 22, 2024

[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)

Python 367 35 Updated Feb 29, 2024

[CVPR24 Highlights] Polos: Multimodal Metric Learning from Human Feedback for Image Captioning

Python 23 Updated Jul 12, 2024

Simple AI agents / assistants

Python 21 2 Updated Sep 23, 2024

A list of AI autonomous agents

9,975 728 Updated Sep 28, 2024

When do we not need larger vision models?

Python 321 9 Updated Aug 19, 2024

Official implementation of "HowToCaption: Prompting LLMs to Transform Video Annotations at Scale." ECCV 2024

Python 40 Updated Oct 2, 2024

Website for hosting the Open Foundation Models Cheat Sheet.

JavaScript 255 18 Updated Jun 26, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 21,769 2,109 Updated Aug 9, 2024

Open-source and strong foundation image recognition models.

Jupyter Notebook 2,776 271 Updated Aug 1, 2024