-
@microsoft Research, Montréal
- Montréal
- https://xingdi-eric-yuan.github.io/
Block or Report
Block or report xingdi-eric-yuan
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. SmartPlay is designed to be easy to use, and to support futu…
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts at each layer. We stack two such layers, feeding the output…
A curated list of resources about generative flow networks (GFlowNets).
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Repo for paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration"
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
Used for adaptive human in the loop evaluation of language and embedding models.
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
Train transformer language models with reinforcement learning.
🦜🔗 Build context-aware reasoning applications
Compositional Differentiable Programming Library