Skip to content
View vinhowe's full-sized avatar

Highlights

  • Pro

Organizations

@EpicGames @byudevelopers @BYU-PCCL

Block or report vinhowe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…

Python 13,474 1,349 Updated Oct 11, 2024

NanoGPT (124M) quality in 2.67B tokens

Python 437 26 Updated Oct 14, 2024
Python 4 Updated May 24, 2024

🧩 The Browser Extension Framework

TypeScript 10,378 362 Updated Oct 8, 2024
Jupyter Notebook 38 7 Updated Jun 17, 2024

Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (2024)

Python 172 10 Updated May 28, 2024

An attempt to migrate Karpathy's llm.c to safe rust.

Rust 12 Updated Jun 4, 2024
WGSL 57 4 Updated Mar 15, 2024

Implementation code for the paper "Parallel Structures in Pre-training Data Yield In-Context Learning"

Python 5 2 Updated Jun 29, 2024

A cross-platform browser ML framework.

Rust 601 31 Updated Oct 4, 2024

Pure Typescript, dependency free, ridiculously slow implementation of GPT2 for educational purposes

TypeScript 42 1 Updated Apr 25, 2023

Tensor computation with WebGPU acceleration

TypeScript 581 17 Updated Jul 25, 2024

High-performance In-browser LLM Inference Engine

TypeScript 13,289 850 Updated Oct 7, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 6,883 998 Updated Oct 14, 2024

Modeling, training, eval, and inference code for OLMo

Python 4,511 451 Updated Oct 15, 2024

Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face 🤗 Transformers.

Python 560 48 Updated Nov 10, 2023

Repo for the EMNLP 2023 Findings paper "Transparency at the Source"

Java 6 1 Updated Apr 10, 2024

🥉 useful helpers for react-three-fiber

JavaScript 8,305 688 Updated Oct 12, 2024

Algorithms for explaining machine learning models

Python 2,396 251 Updated Jul 12, 2024

LLM training in simple, raw C/CUDA

Cuda 24,008 2,691 Updated Oct 2, 2024

Model interpretability and understanding for PyTorch

Python 4,866 490 Updated Oct 11, 2024

Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions

Python 617 59 Updated Oct 8, 2024

Code and Data Repo for the CoNLL Paper -- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State

Jupyter Notebook 16 1 Updated Jan 8, 2024

Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax

Python 501 80 Updated Oct 15, 2024
Python 4 Updated Dec 14, 2023

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 10,202 1,470 Updated Aug 8, 2024

Run GPT model on the browser with WebGPU. An implementation of GPT inference in less than ~1500 lines of vanilla Javascript.

JavaScript 3,625 209 Updated Jan 12, 2024

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,241 169 Updated Aug 21, 2024

Code for the paper "Efficient Training of Language Models to Fill in the Middle"

Python 163 33 Updated Apr 2, 2023
Next