Skip to content
View zhipeng-jia's full-sized avatar

Organizations

@ut-osa

Block or report zhipeng-jia

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。

52,811 13,524 Updated Jul 30, 2024

基于 ChatGPT API 的划词翻译浏览器插件和跨平台桌面端应用 - Browser extension and cross-platform desktop application for translation based on ChatGPT API.

TypeScript 23,910 1,734 Updated Nov 16, 2024

Disaggregated serving system for Large Language Models (LLMs).

Jupyter Notebook 356 40 Updated Aug 19, 2024

A large-scale simulation framework for LLM inference

Python 276 42 Updated Oct 10, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 1,442 135 Updated Nov 15, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C 8,668 990 Updated Nov 13, 2024

ThetaGang is an IBKR bot for collecting money

Python 2,008 266 Updated Nov 11, 2024

LLRT (Low Latency Runtime) is an experimental, lightweight JavaScript runtime designed to address the growing demand for fast and efficient Serverless applications.

JavaScript 8,122 359 Updated Nov 17, 2024

Building a quick conversation-based search demo with Lepton AI.

TypeScript 7,839 999 Updated Nov 14, 2024

Letta (formerly MemGPT) is a framework for creating LLM services with memory.

Python 12,771 1,394 Updated Nov 17, 2024

Official inference library for Mistral models

Jupyter Notebook 9,723 863 Updated Nov 12, 2024

A simple, performant and scalable Jax LLM!

Python 1,529 294 Updated Nov 17, 2024

NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.

C 113 13 Updated Nov 15, 2023

The Mojo Programming Language

Mojo 23,304 2,592 Updated Nov 17, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 12,140 2,526 Updated Nov 17, 2024

Tutel MoE: An Optimized Mixture-of-Experts Implementation

Python 734 93 Updated Nov 15, 2024

Boki: Stateful Serverless Computing with Shared Logs [SOSP '21]

C 80 11 Updated May 13, 2022

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 26,870 2,979 Updated Nov 17, 2024
Ruby 2,244 80 Updated Jun 15, 2024

Ongoing research training transformer models at scale

Python 10,573 2,364 Updated Nov 17, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 71,360 8,476 Updated Nov 13, 2024

Numbers every LLM developer should know

4,102 140 Updated Jan 16, 2024

LLM training code for Databricks foundation models

Python 4,049 528 Updated Nov 15, 2024

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Python 2,139 211 Updated Oct 8, 2024

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

1,621 127 Updated Sep 19, 2023

Code samples related to Intel(R) AMX

C 29 12 Updated Apr 8, 2024

Transformer related optimization, including BERT, GPT

C 5,886 893 Updated Mar 27, 2024

This repo includes ChatGPT prompt curation to use ChatGPT better.

HTML 112,770 15,382 Updated Nov 11, 2024

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,192 548 Updated Oct 28, 2024

The RethinkDNS resolver that deploys to Cloudflare Workers, Deno Deploy, Fastly, and Fly.io

JavaScript 1,950 1,739 Updated Nov 12, 2024
Next