Skip to content
View STHSF's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report STHSF

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Use PEFT or Full-parameter to finetune 300 LLMs or 80 MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Python 3,335 279 Updated Sep 10, 2024

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 50,060 5,269 Updated Sep 10, 2024

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 26,023 2,842 Updated Sep 9, 2024

unified embedding model

Python 814 61 Updated Sep 1, 2023

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Jupyter Notebook 2,520 336 Updated Aug 22, 2024

pytorch memory track code

Python 992 155 Updated May 4, 2021

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

1,034 22 Updated Jul 31, 2024

code for piccolo embedding model from SenseTime

Python 88 4 Updated May 21, 2024

Code repository for the paper - "Matryoshka Representation Learning"

Jupyter Notebook 394 17 Updated Feb 19, 2024

The Memory layer for your AI apps

Python 21,463 1,953 Updated Sep 10, 2024

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Python 2,110 146 Updated Aug 21, 2024

AgentTuning: Enabling Generalized Agent Abilities for LLMs

Python 1,323 95 Updated Oct 31, 2023

PyTorch implementations of deep reinforcement learning algorithms and environments

Python 5,558 1,188 Updated Jul 25, 2024

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text

C 2,419 259 Updated Sep 10, 2024

SPLADE: sparse neural search (SIGIR21, SIGIR22)

Python 742 80 Updated May 3, 2024

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Python 3,845 840 Updated Mar 24, 2023

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 4,620 502 Updated Aug 29, 2024

https://hrl.boyuai.com/

Jupyter Notebook 2,291 513 Updated Nov 22, 2022

强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 9,021 1,812 Updated Sep 9, 2024

Parse files for optimal RAG

Python 2,419 245 Updated Sep 9, 2024

[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

Python 217 6 Updated May 28, 2024

对llama3进行全参微调、lora微调以及qlora微调。

Python 109 9 Updated Jul 28, 2024

text embedding

Python 130 6 Updated Sep 18, 2023

3D Visualization of an GPT-style LLM

TypeScript 3,761 415 Updated Aug 24, 2024

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 4,554 356 Updated Sep 10, 2024

Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning

Python 363 18 Updated May 17, 2024

A complete and graceful API for Wechat. 微信个人号接口、微信机器人及命令行微信,三十行即可自定义个人号机器人。

Python 25,428 5,624 Updated Sep 28, 2023

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C 8,135 897 Updated Sep 10, 2024

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

PostScript 17,585 2,144 Updated Feb 4, 2024
Next