- Aalto University
- Helsinki, Finland
- https://mustious.github.io/
- @mustious7
Stars
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, with automatic mixed precision (including fp8) and easy-to-configure FSDP and DeepSpeed support
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Ongoing research training transformer models at scale
Development repository for the Triton language and compiler
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Building blocks for foundation models.
OLMoE: Open Mixture-of-Experts Language Models
Efficient Triton Kernels for LLM Training
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)
A simple, performant and scalable JAX LLM!
A curated reading list of research in Adaptive Computation, Inference-Time Computation & Mixture of Experts (MoE).
Optax is a gradient processing and optimization library for JAX.
PyTorch extensions for high performance and large scale training.
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
Fast and memory-efficient exact attention
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
🚴 Call stack profiler for Python. Shows you why your code is slow!
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Agentic components of the Llama Stack APIs