Popular repositories Loading
-
Megatron-DeepSpeed-TT
Megatron-DeepSpeed-TT PublicForked from microsoft/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Python
-
-
astra-sim
astra-sim PublicForked from astra-sim/astra-sim
ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale
C
-
chakra
chakra PublicForked from mlcommons/chakra
Repository for MLCommons Chakra schema and tools
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.