🎯
Focusing
  • Hong Kong University of Science and Technology
  • Hong Kong
  • 13:20 (UTC +08:00)

Highlights

  • Pro

Organizations

@RapidsAtHKUST

TypeScript 2,775 244 Updated Oct 24, 2024

Wireguard client that exposes itself as a socks5 proxy

Go 4,466 266 Updated Sep 3, 2024

Examples using MLX Swift

Swift 993 106 Updated Nov 1, 2024

Material for gpu-mode lectures

Jupyter Notebook 2,919 289 Updated Oct 21, 2024

Distribute and run LLMs with a single file.

C++ 20,087 1,006 Updated Nov 2, 2024

Go ahead and axolotl questions

Python 7,842 863 Updated Nov 1, 2024

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

JavaScript 1,139 57 Updated Nov 2, 2024

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…

Jupyter Notebook 14,610 2,125 Updated Nov 1, 2024

Reaching LLaMA2 Performance with 0.1M Dollars

Python 961 79 Updated Jul 23, 2024
Python 282 35 Updated Apr 2, 2024

Locally running, hands-free ChatGPT UI

TypeScript 1,599 257 Updated May 8, 2024

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,947 410 Updated Sep 6, 2024

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 95,883 7,609 Updated Nov 1, 2024

A quick guide (especially) for trending instruction finetuning datasets

2,573 167 Updated Nov 28, 2023

vpnc-script replacement for easy and secure split-tunnel VPN setup

Python 742 87 Updated Sep 5, 2024

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,873 342 Updated Oct 18, 2024

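Extract WeChat chat history and export it to HTML, Word, or Excel documents for permanent storage; analyze the chat history to generate an annual chat report; and use the chat data to train a personal AI chat assistant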

Python 34,241 3,583 Updated Sep 23, 2024

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 1,884 176 Updated Oct 31, 2024

Inference Llama 2 in one file of pure C

C 17,419 2,078 Updated Aug 6, 2024

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,565 201 Updated Nov 1, 2024

Top ranked OpenAI GPTs

970 70 Updated Mar 13, 2024

How to optimize some algorithms in CUDA.

Cuda 1,561 128 Updated Nov 1, 2024

Machine Learning Engineering Open Book

Python 11,552 703 Updated Nov 1, 2024

Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.

Cuda 273 43 Updated Nov 28, 2021

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,546 970 Updated Nov 1, 2024

Parallel GDB developed for debugging HPC code at Lawrence Livermore National Laboratory.

Python 32 6 Updated Nov 3, 2015

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Python 4,551 369 Updated Oct 23, 2024

Bash script for Ubuntu (and derivatives) to easily (un)install kernels from the Ubuntu Kernel PPA

Shell 861 103 Updated Jan 23, 2024

Universal LLM Deployment Engine with ML Compilation

Python 19,094 1,566 Updated Nov 2, 2024

Updated list of public BitTorrent trackers

47,035 6,584 Updated Nov 1, 2024