Skip to content
View yqy2001's full-sized avatar

Organizations

@baaivision

Block or report yqy2001

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Next-Token Prediction is All You Need

Python 372 4 Updated Sep 28, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

3,873 213 Updated Sep 27, 2024

DSIR large-scale data selection framework for language model training

Python 223 19 Updated Apr 7, 2024

Library for fast text representation and classification.

HTML 25,865 4,712 Updated Mar 22, 2024

DataComp for Language Models

HTML 1,119 99 Updated Sep 5, 2024

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 388 30 Updated Sep 17, 2024

[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models

Python 285 19 Updated Sep 6, 2024

[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

Python 321 28 Updated Sep 6, 2024

Measuring Massive Multitask Language Understanding | ICLR 2021

Python 1,162 89 Updated May 28, 2023
Python 1,571 137 Updated Sep 27, 2024

GPQA: A Graduate-Level Google-Proof Q&A Benchmark

Jupyter Notebook 147 8 Updated Mar 29, 2024

Implementation for ICLR 2024 paper “Multimodal Molecular Pretraining via Modality Blending"

Python 2 Updated Sep 1, 2024

Code for Discovering Preference Optimization Algorithms with and for Large Language Models

Python 48 28 Updated Jun 13, 2024

Official implement of paper "AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation"

Python 403 34 Updated Jun 18, 2024

official code for "Large Language Models as Optimizers"

Python 391 38 Updated Aug 16, 2024

Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"

Python 73 9 Updated Sep 18, 2024

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,452 847 Updated Sep 23, 2024

System 2 Reasoning Link Collection

628 52 Updated Sep 14, 2024

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 167,014 44,179 Updated Sep 28, 2024

Let's Build A Simple Interpreter

Python 1,803 417 Updated Aug 4, 2021

Implementation for ICLR2024 Oral paper "Unified Generative Modeling of 3D Molecules with Bayesian Flow Networks"

Jupyter Notebook 32 2 Updated Jun 4, 2024

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 2,321 224 Updated Nov 26, 2023

Automated Design of Agentic Systems

Python 883 127 Updated Aug 20, 2024

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 685 78 Updated Sep 27, 2024

Ongoing research training transformer models at scale

Python 10,119 2,278 Updated Sep 28, 2024

Scalable toolkit for efficient model alignment

Python 519 58 Updated Sep 28, 2024

fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.

Go 23,191 2,457 Updated Sep 27, 2024

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Python 11,672 877 Updated Sep 27, 2024
Next