Popular repositories
-
Megatron-DeepSpeed (Public, forked from bigscience-workshop/Megatron-DeepSpeed)
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Python
-
Llama-X (Public, forked from AetherCortex/Llama-X)
Open Academic Research on Improving LLaMA to SOTA LLM
Python
-
openspg (Public, forked from OpenSPG/openspg)
OpenSPG is a Knowledge Graph Engine developed by Ant Group in collaboration with OpenKG, based on the SPG (Semantic-enhanced Programmable Graph) framework. Core Capabilities: 1) domain model constr…
Java
-
swift (Public, forked from modelscope/ms-swift)
ms-swift: Use PEFT or Full-parameter to finetune 300 LLMs or 50 MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
Python