Skip to content
View yqxerneas's full-sized avatar

Block or report yqxerneas

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C 7,898 406 Updated Sep 6, 2024

hip gemm optimization

C 5 1 Updated Feb 11, 2023

HIP: C Heterogeneous-Compute Interface for Portability

C 3,706 528 Updated Sep 27, 2024

Examples for HIP

C 201 89 Updated Sep 27, 2024

MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.

C 4,256 705 Updated Jul 29, 2024

Repository for the QUIK project, enabling the use of 4bit kernels for generative inference

C 169 12 Updated Apr 16, 2024

Awesome LLM compression research papers and tools.

1,080 65 Updated Sep 28, 2024