Stars
Virtual whiteboard for sketching hand-drawn like diagrams
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
A library for advanced large language model reasoning
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
O1 Replication Journey: A Strategic Progress Report – Part I
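Chinese and English sensitive words, language detection, domestic and international mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese personal name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, segmentation of continuous English text, various Chinese word vectors, comprehensive company name list, classical poetry corpus, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …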
[ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibration
📄 A collection of résumé templates suited to Chinese (LaTeX, HTML/JS, and so on), maintained by @hoochanlon
Follow the PyTorch tutorial to learn how to use nn.parallel.DistributedDataParallel to speed up training
A hands-on, step-by-step practical guide to Huggingface Transformers; course videos are updated in sync on Bilibili and YouTube
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Automated ticket purchasing for Damai (大麦网), with one-click Docker deployment. https://t.me/ 2EELgNTYiMYxMTFl
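AutoX.js scripts for ticket grabbing and remaining-ticket monitoring on Maoyan, Fenwandao, and Damai; runs on mobile phones and supports purchasing across all sessions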
A framework for few-shot evaluation of language models.
🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Wyfs7 / peft
Forked from huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.