Skip to content
View Smith42's full-sized avatar
🌳
Park life
🌳
Park life

Block or report Smith42

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Investigating attention masks learned from other tasks repurposed for cloud masking

Jupyter Notebook 4 1 Updated Oct 18, 2024

Training Sparse Autoencoders on Language Models

Jupyter Notebook 459 122 Updated Nov 15, 2024

A 100x faster SVD for PyTorch⚡️

C 447 36 Updated Oct 10, 2022

NanoGPT (124M) quality in 7.8 8xH100-minutes

Python 1,022 82 Updated Nov 14, 2024

This is the repository for the distill web framework

JavaScript 814 133 Updated Dec 5, 2022

RWKV centralised docs for the community

19 6 Updated Sep 3, 2024

Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI

Python 250 15 Updated Nov 7, 2024

Examples of how the GalaxiesML dataset can be used

Python 3 Updated Sep 28, 2024

A port of DOOM for a quantum computer

C 653 21 Updated Nov 7, 2024

Implementation of the Prithvi WxC Foundation Model and Downstream Tasks

Python 103 16 Updated Nov 12, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 84 6 Updated Nov 13, 2024

Expandable Datasets for Earth Observation

Jupyter Notebook 147 11 Updated Oct 21, 2024

Python routines for Machine Learning applications for Earth Observation

Python 10 3 Updated Sep 6, 2019

Implemenation of PQMass from Lemos et al. 2024

Jupyter Notebook 5 Updated Nov 11, 2024

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 8,175 1,144 Updated Nov 8, 2024

nanoGPT style version of Llama 3.1

Python 1,239 60 Updated Aug 8, 2024

PyTorch implementation of models from the Zamba2 series.

Python 158 17 Updated Nov 10, 2024
Python 40 1 Updated Oct 18, 2024

Code for Leung, Bovy & Speagle 2024

Jupyter Notebook 1 Updated Sep 25, 2024

Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and representation of player Elo.

Jupyter Notebook 192 16 Updated May 27, 2024

The Multilayer Perceptron Language Model

Python 522 45 Updated Aug 9, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 16,445 1,622 Updated Nov 12, 2024

A codebase dedicated to exploring multimodal learning approaches by integrating images of host galaxies of supernovae and their corresponding light-curves and spectra.

Jupyter Notebook 9 2 Updated Oct 21, 2024

[NeurIPS 2024] GenRL: Multimodal foundation world models allow grounding language and video prompts into embodied domains, by turning them into sequences of latent world model states. Latent state …

Python 58 Updated Jul 31, 2024

YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )

C 21,766 7,963 Updated Nov 6, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,321 56 Updated Aug 15, 2024

Video code lecture on building nanoGPT from scratch

Python 3,863 500 Updated Aug 13, 2024

Fine-tune a 7B LLM on cosmology datasets

Python 23 1 Updated Oct 11, 2024
Next