Fage2016

zifa li Fage2016

2 followers · 12 following

xiamen,china

Starred repositories

karpathy / minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,153 853 Updated Jul 1, 2024

paschmann / rasa-ui

Rasa UI is a frontend for the Rasa Framework

JavaScript 959 330 Updated Dec 30, 2022

Linear95 / bert-intent-slot-detector

BERT-based intent and slots detector for chatbots.

Python 131 18 Updated May 10, 2024

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 36,797 4,538 Updated Oct 24, 2024

Vision-CAIR / MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,384 2,911 Updated Sep 2, 2024

nomic-ai / gpt4all

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

C 70,256 7,675 Updated Oct 25, 2024

fighting41love / funNLP

中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…

Python 68,672 14,483 Updated May 10, 2024

karpathy / minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 20,053 2,492 Updated Aug 15, 2024

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 36,972 5,852 Updated Aug 19, 2024

dmlc / xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

C 26,236 8,722 Updated Oct 28, 2024

tqchen / xgboost

Forked from dmlc/xgboost

https://github.com/dmlc/xgboost

C 571 260 Updated Jul 4, 2018

GitHubDaily / GitHubDaily

坚持分享 GitHub 上高质量、有趣实用的开源技术教程、开发者工具、编程网站、技术资讯。A list cool, interesting projects of GitHub.

32,322 3,553 Updated May 29, 2024

azl397985856 / leetcode

LeetCode Solutions: A Record of My Problem Solving Journey.( leetcode题解，记录自己的leetcode解题之路。)

JavaScript 54,643 9,464 Updated Oct 13, 2024

fchollet / deep-learning-with-python-notebooks

Jupyter notebooks for the code samples of the book "Deep Learning with Python"

Jupyter Notebook 18,676 8,650 Updated Jul 9, 2024

CyberZHG / keras-bert

Implementation of BERT that could load official pre-trained models for feature extraction and prediction

Python 2,428 513 Updated Jan 22, 2022

krahets / hello-algo

《Hello 算法》：动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C , C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新，English version ongoing

Java 97,740 12,385 Updated Oct 24, 2024

shibing624 / nlp-tutorial

自然语言处理（NLP）教程，包括：词向量，词法分析，预训练语言模型，文本分类，文本语义匹配，信息抽取，翻译，对话。

Jupyter Notebook 392 62 Updated May 7, 2022

shibing624 / textgen

TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型，实现了包括LLaMA，ChatGLM，BLOOM，GPT2，Seq2Seq，BART，T5，UDA等模型的训练和预测，开箱即用。

Python 930 108 Updated Sep 14, 2024

shibing624 / pytextclassifier

pytextclassifier is a toolkit for text classification. 文本分类，LR，Xgboost，TextCNN，FastText，TextRNN，BERT等分类模型实现，开箱即用。

Python 491 74 Updated Sep 25, 2024

shibing624 / text2vec

text2vec, text to vector. 文本向量表征工具，把文本转化为向量矩阵，实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型，开箱即用。

Python 4,457 395 Updated Oct 27, 2024

shibing624 / pycorrector

pycorrector is a toolkit for text error correction. 文本纠错，实现了Kenlm，T5，MacBERT，ChatGLM3，Qwen2.5等模型应用在纠错场景，开箱即用。

Python 5,560 1,093 Updated Oct 28, 2024

shenweichen / DeepCTR

Easy-to-use,Modular and Extendible package of deep-learning based CTR models .

Python 7,557 2,210 Updated Aug 9, 2024

wangzhegeek / DSSM-Lookalike

Python 192 56 Updated Mar 7, 2020

InsaneLife / dssm

DSSM and Multi-View DSSM

Python 660 230 Updated Dec 15, 2020

shenweichen / DeepMatch

A deep matching model library for recommendations & advertising. It's easy to train models and to export representation vectors which can be used for ANN search.

Python 2,224 530 Updated May 14, 2024