ArthurConmy

🏃‍♂️

Arthur Conmy ArthurConmy

🏃‍♂️

Personal account (I work at @google-deepmind)

94 followers · 303 following

London, UK
https://twitter.com/ArthurConmy
https://codeforces.com/profile/arthurconmy

Achievements

x2 x2

Achievements

x2 x2

Stars

xjdr-alt / entropix

Entropy Based Sampling and Parallel CoT Decoding

TypeScript 924 137 Updated Oct 7, 2024

cma1114 / representation_tuning

Internalizing steering vectors via fine tuning

Jupyter Notebook 3 Updated Sep 6, 2024

IBM / sae-steering

Code to enable layer-level steering in LLMs using sparse auto encoders

Python 1 Updated Sep 20, 2024

thestephencasper / everything-you-need

we got you bro

32 Updated Jul 29, 2024

jbloomAus / SAELens

Training Sparse Autoencoders on Language Models

Jupyter Notebook 396 108 Updated Oct 3, 2024

koayon / atp_star

PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)

Python 12 1 Updated Apr 16, 2024

UFO-101 / auto-circuit

A library for efficient patching and automatic circuit discovery.

Python 23 9 Updated Aug 24, 2024

callummcdougall / sae_vis

Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).

HTML 140 27 Updated Oct 5, 2024

mishajw / repeng

Experiments with representation engineering

Python 9 1 Updated Feb 28, 2024

callummcdougall / eindex

My interpretation of what einops indexing would look like (created to work on during my SERI MATS project).

Python 5 1 Updated Jul 7, 2024

callummcdougall / sae_visualizer

HTML 24 1 Updated Apr 4, 2024

stanfordnlp / pyvene

Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions

Python 610 59 Updated Oct 4, 2024

cgarciae / einop

Python 57 2 Updated Mar 8, 2022

neelnanda-io / 1L-Sparse-Autoencoder

Python 102 12 Updated Oct 28, 2023

LRudL / evalugator

(Model-written) LLM evals library

Python 15 2 Updated Jul 27, 2024

xrsrke / pipegoose

Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*

Python 77 17 Updated Dec 14, 2023

Aaquib111 / Automatic-Circuit-Discovery

Forked from ArthurConmy/Automatic-Circuit-Discovery

Fork of Arthur Conmy's Automatic-Circuit-Discovery for the purpose of conducting ACDC research

Jupyter Notebook 1 Updated Feb 11, 2024

Aaquib111 / edge-attribution-patching

Code for my NeurIPS 2024 ATTRIB paper titled "Attribution Patching Outperforms Automated Circuit Discovery"

Jupyter Notebook 22 7 Updated May 31, 2024

callummcdougall / SERI-MATS-2023-Streamlit-pages

Repo for hosting Streamlit pages for my 2023 SERI MATS project with Arthur Conmy (mentored by Neel Nanda).

HTML 7 1 Updated Feb 27, 2024

patrick-kidger / jaxtyping

Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/

Python 1,137 59 Updated Sep 1, 2024

aVariengien / causal-checker

Python 5 2 Updated Aug 24, 2023

callummcdougall / ARENA_2.0

Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.

HTML 189 77 Updated Feb 7, 2024

TransformerLensOrg / CircuitsVis

Mechanistic Interpretability Visualizations using React

Jupyter Notebook 185 29 Updated Jul 13, 2024

redwoodresearch / rust_circuit_public

Rust 60 2 Updated Feb 16, 2023

berkott / lucent

Forked from greentfrapp/lucent

Lucid library adapted for PyTorch with new features for ViTs and MLP-Mixers

Python 1 Updated Aug 29, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Arthur Conmy ArthurConmy

Achievements

Achievements

Block or report ArthurConmy

Stars

xjdr-alt / entropix

cma1114 / representation_tuning

IBM / sae-steering

thestephencasper / everything-you-need

jbloomAus / SAELens

koayon / atp_star

UFO-101 / auto-circuit

callummcdougall / sae_vis

mishajw / repeng

callummcdougall / eindex

callummcdougall / sae_visualizer

stanfordnlp / pyvene

cgarciae / einop

neelnanda-io / 1L-Sparse-Autoencoder

LRudL / evalugator

xrsrke / pipegoose

Aaquib111 / Automatic-Circuit-Discovery

Aaquib111 / edge-attribution-patching

callummcdougall / SERI-MATS-2023-Streamlit-pages

patrick-kidger / jaxtyping

aVariengien / causal-checker

callummcdougall / ARENA_2.0

TransformerLensOrg / CircuitsVis

redwoodresearch / rust_circuit_public

berkott / lucent

google-research / t5x

TransformerLensOrg / TransformerLens

redwoodresearch / Easy-Transformer

karpathy / makemore

alignedai / HappyFaces