Jimmy-Lu
  • Shanghai Jiaotong University
  • Shanghai China

FlashInfer: Kernel Library for LLM Serving

Cuda 989 89 Updated Aug 17, 2024

Compiler for Dynamic Neural Networks

Python 42 2 Updated Nov 13, 2023

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 24,670 5,087 Updated Aug 18, 2024

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) on multi-GPU Clusters

Python 328 24 Updated Aug 16, 2024

ppl.cv is a high-performance image processing library of openPPL supporting various platforms.

C 488 108 Updated Jun 12, 2024

AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)

Python 73 10 Updated Jul 14, 2023

Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 5,665 516 Updated Aug 17, 2024

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,291 290 Updated Jul 14, 2024

SGLang is yet another fast serving framework for large language models and vision language models.

Python 4,185 273 Updated Aug 18, 2024
C 3 1 Updated Apr 10, 2021
Jupyter Notebook 8 2 Updated Aug 25, 2023

OneDiff: An out-of-the-box acceleration library for diffusion models.

Jupyter Notebook 1,534 92 Updated Aug 15, 2024

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

Python 724 35 Updated Jun 27, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 21,261 2,032 Updated Aug 9, 2024

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Python 11,128 989 Updated Aug 14, 2024

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Python 580 38 Updated Aug 14, 2024

A framework for few-shot evaluation of language models.

Python 6,159 1,631 Updated Aug 17, 2024

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Python 5,456 501 Updated Aug 14, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

982 21 Updated Jul 31, 2024

Measuring Massive Multitask Language Understanding | ICLR 2021

Python 1,117 86 Updated May 28, 2023

Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designed for the training and evaluation of automatic question ans…

Python 912 151 Updated Jul 30, 2021

A collection of benchmarks and datasets for evaluating LLMs.

203 17 Updated Jul 13, 2024
Jupyter Notebook 47 5 Updated Jul 23, 2024

A collection of AWESOME things about mixture-of-experts

887 69 Updated Jul 31, 2024

📖A curated list of Awesome LLM Inference Papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.

2,248 146 Updated Aug 17, 2024

🎉CUDA/C notes / hand-written CUDA kernels for large models / tech blog, updated as time allows: flash_attn, sgemm, sgemv, warp reduce, block reduce, dot product, elementwise, softmax, layernorm, rmsnorm, hist, etc.

Cuda 1,026 99 Updated Aug 12, 2024

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 3,896 351 Updated Aug 17, 2024

ATC23 AE

Python 41 4 Updated May 11, 2023

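AIGC-interview/CV-interview/LLMs-interview: a repository collecting interview questions and answers, which also includes new ideas, new questions, new resources, and new projects from work and research.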

1,388 140 Updated Aug 6, 2024

A LaTeX resume template designed for optimal information density and aesthetic appeal.

TeX 253 34 Updated Jun 26, 2024