Skip to content
View Duo-Lu's full-sized avatar
🏠
Working from home
🏠
Working from home

Organizations

@sfu-dis

Block or report Duo-Lu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 33,944 5,769 Updated Nov 12, 2024

AIFM: High-Performance, Application-Integrated Far Memory

C 108 35 Updated Feb 28, 2023

[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challen…

Python 13,682 1,387 Updated Nov 11, 2024

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 13,320 1,212 Updated Oct 30, 2024
C 63 1 Updated Mar 9, 2023

📰 Must-read papers on KV Cache Compression (constantly updating 🤗).

124 3 Updated Nov 11, 2024

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 15,904 1,559 Updated Oct 15, 2024

A high-performance, concurrent hash table

C 1,608 275 Updated Apr 6, 2024

A list of learning materials to understand databases internals

9,441 1,095 Updated Aug 29, 2024

Ensō is a high-performance streaming interface for NIC-application communication.

SystemVerilog 69 7 Updated Sep 21, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 30,023 4,539 Updated Nov 12, 2024
C 4,392 479 Updated Nov 11, 2024
C 1 Updated Aug 2, 2024

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 10,479 1,050 Updated Nov 3, 2024

A framework to enable multimodal models to operate a computer.

Python 8,836 1,183 Updated Aug 2, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 18,967 1,858 Updated Nov 12, 2024

Paper-reading notes for Berkeley OS prelim exam.

7 Updated Aug 28, 2024

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

6,791 408 Updated Jul 28, 2024

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 10,417 816 Updated Aug 20, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

1,112 23 Updated Jul 31, 2024

[EMNLP 2024] SHIELD: Evaluation and Defense Strategies for Copyright Compliance in LLM Text Generation

Python 6 1 Updated Aug 24, 2024

Supercharge Your LLM Application Evaluations 🚀

Python 7,182 732 Updated Nov 11, 2024

LLM inference in C/C

C 67,691 9,714 Updated Nov 12, 2024

LLM training in simple, raw C/CUDA

Cuda 24,377 2,753 Updated Oct 2, 2024

Performance monitoring and benchmarking suite

C 1,672 229 Updated Nov 11, 2024

🙌 OpenHands: Code Less, Make More

Python 36,003 4,103 Updated Nov 12, 2024

MSVBASE is a system that efficiently supports complex queries of both approximate similarity search and relational operators. It integrates high-dimensional vector indices into PostgreSQL, a relati…

C 84 6 Updated Jun 12, 2024

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Jupyter Notebook 1,124 128 Updated Nov 30, 2023

🎉 Modern CUDA Learn Notes with PyTorch: CUDA Cores, Tensor Cores, fp32/tf32, fp16/bf16, fp8/int8, flash_attn, rope, sgemm, hgemm, sgemv, warp/block reduce, elementwise, softmax, layernorm, rmsnorm.

Cuda 1,416 156 Updated Nov 12, 2024
Next