mengbingrock

Starred repositories

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,089 327 Updated Sep 27, 2024

Low-bit LLM inference on CPU with lookup table

C 457 33 Updated Sep 14, 2024

A simple, easy-to-hack GraphRAG implementation

Python 823 86 Updated Sep 26, 2024

Nightly release of ControlNet 1.1

Python 4,679 371 Updated Aug 8, 2024

12 Weeks, 24 Lessons, AI for All!

Jupyter Notebook 34,246 5,702 Updated Aug 30, 2024

A plugin for Jupyter Notebook to run CUDA C/C++ code

Jupyter Notebook 192 87 Updated Sep 13, 2024

An ML Systems Onboarding list

514 20 Updated Jul 23, 2024

Apple G13 GPU architecture docs and tools

HTML 537 38 Updated May 6, 2024

[CVPR 2024 Highlight] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models".

Jupyter Notebook 53 3 Updated Aug 1, 2024

flash attention tutorial written in python, triton, cuda, cutlass

Cuda 171 13 Updated Jun 18, 2024

LoRA (Low-Rank Adaptation) inspector for Stable Diffusion

Python 81 5 Updated Sep 20, 2024

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 9,946 817 Updated Jun 10, 2024

Metal Guide

Swift 81 9 Updated Sep 23, 2023

Generative Models by Stability AI

Python 24,212 2,697 Updated Sep 4, 2024

A simplified POC implementation of a RAG-based virtual assistant

Python 3 Updated May 15, 2024

Stable Diffusion web UI

Python 279 42 Updated Jun 26, 2024

Swift 423 33 Updated Sep 26, 2024

List of papers related to neural network quantization in recent AI conferences and journals.

423 37 Updated Sep 22, 2024

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 26,369 2,895 Updated Sep 29, 2024

Everything we actually know about the Apple Neural Engine (ANE)

2,019 75 Updated Sep 23, 2024

MLX: An array framework for Apple silicon

C++ 16,563 947 Updated Sep 28, 2024

List of Tech Company OAs. Save your time from finding them all over the internet.

1,286 81 Updated Sep 29, 2024

LLM training in simple, raw C/CUDA

Cuda 23,590 2,639 Updated Sep 27, 2024

Competitive programming algorithm template library by 灵茶山艾府 💭💡🎈

Go 5,006 546 Updated Sep 28, 2024

Apple GPU microarchitecture

Metal 459 17 Updated Sep 22, 2024

Python 750 137 Updated Nov 29, 2023

Distribute and run LLMs with a single file.

C++ 19,350 981 Updated Sep 28, 2024

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 1,833 147 Updated Sep 25, 2024

To practice your competitive programming skills, try solving daily Codeforces problems!

C 263 201 Updated Sep 29, 2024

User-friendly WebUI for AI (Formerly Ollama WebUI)

Svelte 40,691 4,783 Updated Sep 28, 2024