-
Simon Fraser University
- Boston
- https://duo-lu.github.io/
Stars
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
AIFM: High-Performance, Application-Integrated Far Memory
[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challen…
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
📰 Must-read papers on KV Cache Compression (constantly updating 🤗).
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
A list of learning materials to understand databases internals
Ensō is a high-performance streaming interface for NIC-application communication.
A high-throughput and memory-efficient inference and serving engine for LLMs
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
A framework to enable multimodal models to operate a computer.
A modular graph-based Retrieval-Augmented Generation (RAG) system
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
The official GitHub page for the survey paper "A Survey of Large Language Models".
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
[EMNLP 2024] SHIELD: Evaluation and Defense Strategies for Copyright Compliance in LLM Text Generation
Supercharge Your LLM Application Evaluations 🚀
MSVBASE is a system that efficiently supports complex queries of both approximate similarity search and relational operators. It integrates high-dimensional vector indices into PostgreSQL, a relati…
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
🎉 Modern CUDA Learn Notes with PyTorch: CUDA Cores, Tensor Cores, fp32/tf32, fp16/bf16, fp8/int8, flash_attn, rope, sgemm, hgemm, sgemv, warp/block reduce, elementwise, softmax, layernorm, rmsnorm.