View Zefan-Cai's full-sized avatar


Software that can generate photos from paintings, turn horses into zebras, perform style transfer, and more.

Lua 12,312 1,935 Updated Sep 12, 2023

✨✨ MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Python 63 5 Updated Sep 11, 2024

The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"

108 Updated Sep 12, 2024

Repo for the paper "ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models"

Python 43 2 Updated Aug 26, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 1,801 100 Updated Sep 13, 2024

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) on multi-GPU Clusters

Python 468 38 Updated Sep 13, 2024

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Python 659 33 Updated Aug 19, 2024
Jupyter Notebook 8 1 Updated Nov 29, 2023

aider is AI pair programming in your terminal

Python 17,960 1,678 Updated Sep 13, 2024

structured outputs for llms

Python 7,495 595 Updated Sep 13, 2024

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

530 34 Updated Sep 11, 2024

📚 A collection of resources and papers on Vector Quantized Variational Autoencoder (VQ-VAE) and its application

199 5 Updated Mar 27, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 10,647 849 Updated Aug 21, 2024

Utilities intended for use with Llama models.

Python 3,775 668 Updated Sep 12, 2024

Vector (and Scalar) Quantization, in Pytorch

Python 2,390 196 Updated Sep 4, 2024

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, MiniCPM, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc,…

Python 6,473 1,237 Updated Sep 13, 2024

Dense Connector for MLLMs

Python 98 3 Updated Aug 19, 2024

[ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain

Jupyter Notebook 98 3 Updated Mar 14, 2024

[MM 2024] Official code for VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness

Python 2 Updated Jul 24, 2024

Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.

24 1 Updated Jul 24, 2024

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 2,536 158 Updated Sep 13, 2024

The Memory layer for your AI apps

Python 21,602 1,965 Updated Sep 12, 2024

Official github repo for the paper "Compression Represents Intelligence Linearly"

Python 121 6 Updated Jun 9, 2024

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Python 1,173 61 Updated Sep 12, 2024

Efficiently Fine-Tune 100 LLMs in WebUI (ACL 2024)

Python 30,694 3,782 Updated Sep 13, 2024

LLM101n: Let's build a Storyteller

28,201 1,537 Updated Aug 1, 2024

📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623

Python 51 3 Updated Aug 25, 2024

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 639 36 Updated Aug 5, 2024

Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718

Python 243 18 Updated Sep 9, 2024

Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise descriptions to help readers g…

59 2 Updated Jul 12, 2024