Skip to content
View hhstore's full-sized avatar
:octocat:
Focusing
:octocat:
Focusing
Block or Report

Block or report hhstore

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…

C 22,688 1,726 Updated Aug 19, 2024

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…

Jupyter Notebook 11,361 1,598 Updated Aug 17, 2024

Model components of the Llama Stack APIs

Python 218 24 Updated Aug 19, 2024

Set of tools to assess and improve LLM security.

Python 2,426 396 Updated Aug 7, 2024

Utilities intended for use with Llama models.

Python 3,453 569 Updated Aug 19, 2024

Inference code for CodeLlama models

Python 15,781 1,844 Updated Aug 12, 2024

The official Meta Llama 3 GitHub site

Python 25,602 2,841 Updated Aug 12, 2024

Inference code for Llama models

Python 55,133 9,410 Updated Aug 18, 2024

Agentic components of the Llama Stack APIs

Python 3,007 283 Updated Aug 19, 2024

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 25,743 2,823 Updated Aug 19, 2024

MLX: An array framework for Apple silicon

C 16,206 924 Updated Aug 19, 2024

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 5,685 278 Updated Aug 19, 2024

port of Andrjey Karpathy's llm.c to Mojo

Mojo 267 11 Updated Jun 16, 2024

LLM training in simple, raw C/CUDA

Cuda 22,702 2,538 Updated Aug 16, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 15,508 1,417 Updated Aug 19, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,228 578 Updated Aug 19, 2024

Development repository for the Triton language and compiler

C 12,275 1,479 Updated Aug 19, 2024

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 14,446 953 Updated Aug 17, 2024

UniRec is an easy-to-use, lightweight, and scalable implementation of recommender systems. Its primary objective is to enable users to swiftly construct a comprehensive ecosystem of recommenders us…

Python 40 5 Updated Jul 6, 2024

Bridging LLM and Recommender System.

Jupyter Notebook 489 44 Updated Aug 4, 2024

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 4,249 318 Updated Aug 17, 2024

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 40,263 5,171 Updated Jun 27, 2024

中文LLaMA&Alpaca大语言模型 本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,078 1,849 Updated Apr 30, 2024

LLM inference in C/C

C 63,735 9,131 Updated Aug 19, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 25,024 3,613 Updated Aug 19, 2024

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

14,263 1,304 Updated Jul 21, 2024

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Python 55,645 6,799 Updated Aug 17, 2024

🎧 Open source Spotify client that doesn't require Premium nor uses Electron! Available for both desktop & mobile!

Dart 28,259 1,165 Updated Aug 19, 2024

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)

TypeScript 21,531 1,229 Updated Aug 19, 2024

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Python 9,029 500 Updated Aug 9, 2024
Next