Skip to content
View lizhengbuaa's full-sized avatar

Block or report lizhengbuaa

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,588 1,087 Updated May 23, 2024

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 3,249 381 Updated Aug 19, 2024

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 9,706 952 Updated Oct 13, 2024

LangGPT: Empowering everyone to become a prompt expert!🚀 Structured Prompt,Language of GPT, 结构化提示词,结构化Prompt

Jupyter Notebook 6,098 512 Updated Oct 20, 2024

Retrieval and Retrieval-augmented LLMs

Python 7,207 523 Updated Oct 21, 2024

小红书关键词笔记搜索Python 爬虫 (csv保存)

Python 55 4 Updated Apr 13, 2023

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫

Python 17,098 5,405 Updated Oct 21, 2024

Code release for "LogME: Practical Assessment of Pre-trained Models for Transfer Learning" (ICML 2021) and Ranking and Tuning Pre-trained Models: A New Paradigm for Exploiting Model Hubs (JMLR 2022)

Python 201 18 Updated Oct 6, 2023

a simple example to learn tensorrt with dynamic shapes

Python 25 8 Updated Sep 13, 2021

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Python 1,132 45 Updated Oct 19, 2024

收集和梳理垂直领域的开源模型、数据集及评测基准。

2,200 167 Updated Dec 26, 2023

本『ChatGPT资源库(原理/微调/代码/论文)』的初始版本来自July CSDN博客上阅读量高达50万的ChatGPT系列,联合发起人:七月ChatGPT原理课学员,6月初正式对外发布

158 29 Updated May 27, 2024

A series of large language models developed by Baichuan Intelligent Technology

Python 4,079 292 Updated Jun 22, 2024

Multi-Task Deep Neural Networks for Natural Language Understanding

Python 2,229 411 Updated Mar 7, 2024

PyTorch original implementation of Cross-lingual Language Model Pretraining.

Python 2,882 497 Updated Feb 14, 2023

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,936 2,543 Updated Oct 10, 2024

👾 A library of state-of-the-art pretrained models for Natural Language Processing (NLP)

Python 23 27 Updated Aug 22, 2019

Unsupervised text tokenizer for Neural Network-based text generation.

C 10,197 1,170 Updated Oct 1, 2024

MGeo: Multi-Modal Geographic Language Model Pre-Training

Python 65 17 Updated Sep 14, 2023

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 36,824 5,823 Updated Aug 19, 2024

BERT Tokenizer in C

C 73 19 Updated Jan 14, 2021

SimCSE在中文上的复现,有监督 无监督

Python 263 48 Updated Dec 14, 2021

A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).

640 48 Updated Jan 7, 2024

全中文汉化latex简历。支持overleaf个性化编辑并生成pdf。适用于互联网求职产品、运营、算法、开发岗

TeX 79 11 Updated Aug 11, 2022

A TensorFlow Implementation of the Transformer: Attention Is All You Need

Python 4,269 1,294 Updated May 21, 2023

SimCSE的tensorflow版本实现,以及基础实验对比

Python 12 3 Updated Jul 22, 2021

基于“音形码”的中文字符串相似度计算方法

Python 222 65 Updated Jul 24, 2020

汉字字符特征提取工具,可以提取出字符中的字音(声母、韵母、声调)、字形(偏旁、部首)、四角编码等特征,同时可作为tensor输入到模型

Python 127 20 Updated May 25, 2020
Python 273 105 Updated Jan 6, 2021

Facilitating the design, comparison and sharing of deep text matching models.

Python 3,837 897 Updated Aug 2, 2024
Next