WOMBO EdgeMaxxing Subnet: Optimizing AI Models for Consumer Devices

Enabling Millions to Contribute to a Decentralized Network

In the annals of the digital age, a grand saga unfolds. In a realm where the forces of artificial intelligence are harnessed by a select few, the question arises: shall this power remain concentrated, or shall it be distributed for the benefit of all humankind?

w.ai envisions a future where artificial intelligence is decentralized, democratized, and accessible to everyone. This vision is embodied in a global supercomputer composed of individual user devices—laptops, gaming rigs, and smartphones. By harnessing the untapped potential of these devices, w.ai aims to create a vast decentralized network of computing power, democratizing access to the most advanced AI technologies. This approach will foster a thriving ecosystem of AI applications, driving innovation and ensuring the benefits of AI are shared by all of humanity.

EdgeMaxxing Subnet

What is the goal?

The EdgeMaxxing subnet aims to create the world's most optimized AI models for consumer devices, starting with Stable Diffusion XL on the NVIDIA GeForce RTX 4090.

The subnet will expand to support optimization for various end devices, models, and modalities overtime.

Key Benefits of Optimized Models:

Optimizing AI models is crucial to realizing a vision of decentralized AI.

Accessibility: Enabling these advanced models to run on consumer devices, from smartphones to laptops, bringing AI capabilities to everyone.
Decentralization: Allowing millions of users to contribute their computing power, rather than relying on a small number of powerful miners, creating a truly distributed AI network.

By optimizing popular models like LLAMA3 and Stable Diffusion, we transform idle computing resources into valuable contributors to a global AI network. This democratizes both AI usage and creation, offering earning opportunities to millions.

Current Subnet Focus

Current GPU: NVIDIA GeForce RTX 4090
Current Model: stablediffusionapi/newdream-sdxl-20
Netuid: 39

Miners and Validators

Incentive Mechanism and Reward Structure

The EdgeMaxxing subnet defines specific models, pipelines, and target hardware for optimization. Miners and validators collaborate in a daily competition to improve AI model performance on consumer devices.

Miners are rewarded based on how optimized their submissions are relative to other miners and the baseline . Every day at 12 PM PST a contest is run.

Validators receive rewards for their consistent operation and accurate scoring.

Competition Structure

Miners submit optimized models
Validators score submissions
Contest runs daily at 12 PM PST
Miners receive rewards based on their ranking

Miners

Actively submit optimized checkpoints of the specified model or pipeline. No need for continuous operation; can wait for results after submission
Use custom algorithms or tools to enhance model performance
Aim to produce the most generically optimized version of the model

Validators

Must run on the specified target hardware (e.g., NVIDIA GeForce RTX 4090, M2 MacBook)
Collect all miner submissions daily
Benchmark each submission against the baseline checkpoint
Score models based on:
- Speed improvements
- Accuracy maintenance
- Overall efficiency gains
Select the best-performing model as the daily winner

Running Miners and Validators

To start working with a registered hotkey, clone the repository and install uv

# uv
if [ "$USER" = "root" ]; then
  apt install pipx
else
  sudo apt install pipx
fi

pipx ensurepath
pipx install uv

# Repository
git clone https://github.com/womboai/edge-maxxing
cd edge-maxxing

There is no need to manage venvs in any way, as uv will handle that.

Miner setup

Clone the base inference repository

    git clone --depth 1 https://github.com/womboai/sdxl-newdream-20-inference model

Make your own repository on a git provider such as GitHub or HuggingFace to optimize in
Edit the src/pipeline.py file to include any loading or inference optimizations, and commit when finished
After creating and optimizing your repository, submit the model, changing the options as necessary

cd miner
uv run submit_model \
    --netuid {netuid} \
    --subtensor.network finney \
    --wallet.name {wallet} \
    --wallet.hotkey {hotkey} \
    --logging.trace \
    --logging.debug

Follow the interactive prompts to submit the repository link, revision, and contest to participate in
Optionally, benchmark your submission locally before submitting (make sure you have the right hardware e.g. NVIDIA GeForce RTX 4090). uv and huggingface are required for benchmarking:

pipx ensurepath
pipx install uv
pipx install huggingface-hub[cli,hf_transfer]

Validators will collect your submission on 12PM New York time and test it in the remainder of the day. Updated weights are set at the beginning of the next contest.

Validator setup

The validator setup requires two components, an API container and a scoring validator

Dedicated Hardware

If your hardware is not accessed within a container(as in, can use Docker), then the easiest way to set the different components up is to use docker compose.

To get started, go to the validator, and create a .env file with the following contents:

VALIDATOR_ARGS=--netuid {netuid} --subtensor.network {network} --wallet.name {wallet} --wallet.hotkey {hotkey} --logging.trace --logging.debug
VALIDATOR_HOTKEY_SS58_ADDRESS={ss58-address}

Generate the compose file for the GPUs you have by editing compose-gpu-layout.json to include all CUDA device IDs and then running:

python3 ./generate_compose.py

And then start docker compose

docker compose up -d --build

RunPod/Containers

If running in a containerized environment like RunPod(which does not support Docker), then you need to run 2 pods/containers. The following setup assumes using PM2.

API Component

In one pod/container with a GPU, we'll set up the API component, start as follows:

    git clone https://github.com/womboai/edge-maxxing /api
    cd /api/validator

And then run as follows:

    export CUDA_VISIBLE_DEVICES=0
    export VALIDATOR_HOTKEY_SS58_ADDRESS={ss58-address}
    
    pm2 start ./submission_tester/start.sh --name edge-maxxing-submission-tester --interpreter /bin/bash -- \
      --host 0.0.0.0 \
      --port 8000 \
      submission_tester:app

Make sure port 8000(or whichever you set) is exposed!

The argument at the end is the name of the main PM2 process. This will keep your PM2 validator instance up to date as long as it is running.

You can run more APIs(and are recommended to do so) and link the scoring validator to them. You can set which CUDA devices or ports to use along with that.

Scoring Validator

In the another pod/container without a GPU, to run the scoring validator, clone the repository as per the common instructions, then do as follows

    cd validator
    pm2 start uv --name edge-maxxing-validator --interpreter none -- \
        run start_validator \
        --netuid {netuid} \
        --subtensor.network {network} \
        --wallet.name {wallet} \
        --wallet.hotkey {hotkey} \
        --logging.trace \
        --logging.debug \
        --benchmarker_api {API component routes, space separated if multiple}

Make sure to replace the API component route with the routes to the API containers(which can be something in the format of http://ip:port), refer to the instructions above at API Component

Proposals for Optimizations

There are several effective techniques to explore when optimizing machine learning models for edge devices. Here are some key approaches to consider:

Knowledge Distillation: Train a smaller, more efficient model to mimic a larger, more complex one. This technique is particularly useful for deploying models on devices with limited computational resources.
Quantization: Reduce the precision of the model's weights and activations, typically from 32-bit floating-point to 8-bit integers. This decreases memory usage and computational requirements, making it possible to run models on edge devices. Additionally, exploring low-precision representation for weights (e.g., using 8-bit integers) can reduce memory bandwidth usage for memory-bound models, even if the actual compute is done in higher precision (e.g., 32-bit).
TensorRT and Hardware-Specific Optimizations: Utilize NVIDIA's TensorRT to optimize deep learning models for inference on NVIDIA GPUs. This involves more than just layer fusion; it includes optimizing assembly, identifying prefetch opportunities, optimizing L2 memory allocation, writing specialized kernels, and performing graph optimizations. These techniques enhance performance and reduce latency by tailoring the model to the specific hardware configuration.
Hyperparameter Tuning: Optimize the configuration settings of the model to improve its performance. This can be done manually or through automated methods such as grid search or Bayesian optimization. While not a direct edge optimization, it is an essential step in the overall process of model optimization.

We encourage developers to explore these optimization techniques or develop other approaches to enhance model performance and efficiency specifically for edge devices.

Roadmap

Our mission is to create the world's most optimized AI models for edge devices, democratizing access to powerful AI capabilities. Here's our path forward:

Phase 1: Foundation (Current)

Perfect contest and benchmarking mechanisms
Establish a robust framework for measuring model performance across hardware
Cultivate a community of world-class miners skilled in optimizing models for edge devices

Phase 2: Expansion

Support a diverse range of AI models, pipelines, and consumer-grade hardware
Develop tools to lower entry barriers for new participants
Integrate initial set of optimized models into the w.ai platform

Phase 3: Mass Adoption and Accessibility

Launch user-friendly mobile app for widespread participation in the network
Implement intuitive interfaces for non-technical users to contribute and benefit from optimized AI models
Fully integrate EdgeMaxxing with w.ai, making all optimized models instantly available and usable on the platform

Long-term Vision

Transform EdgeMaxxing into a cornerstone of decentralized AI, where:
Any device, from smartphones to high-end GPUs, can contribute to and benefit from the network
Optimized models power a new generation of AI-driven applications
EdgeMaxxing becomes the go-to platform for rapid benchmarking and optimization of new AI models on diverse hardware

Through each phase, we'll continuously refine our techniques, expand hardware support, and push the boundaries of AI optimization for edge computing.

License

The WOMBO Bittensor subnet is released under the MIT License.

Connect with us on social media

Name		Name	Last commit message	Last commit date
Latest commit History 483 Commits
.github/workflows		.github/workflows
miner		miner
neuron		neuron
pipelines		pipelines
validator		validator
.dockerignore		.dockerignore
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
min_compute.yml		min_compute.yml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WOMBO EdgeMaxxing Subnet: Optimizing AI Models for Consumer Devices

Enabling Millions to Contribute to a Decentralized Network

In the annals of the digital age, a grand saga unfolds. In a realm where the forces of artificial intelligence are harnessed by a select few, the question arises: shall this power remain concentrated, or shall it be distributed for the benefit of all humankind?

Table of Contents

About WOMBO

About w.ai

Democratizing the Future of AI

EdgeMaxxing Subnet

What is the goal?

Key Benefits of Optimized Models:

Current Subnet Focus

Miners and Validators

Incentive Mechanism and Reward Structure

Competition Structure

Miners

Validators

Running Miners and Validators

Miner setup

Validator setup

Dedicated Hardware

RunPod/Containers

API Component

Scoring Validator

Proposals for Optimizations

Roadmap

License

About

Releases

Packages

Contributors 4

Languages

License

womboai/edge-maxxing

Folders and files

Latest commit

History

Repository files navigation

WOMBO EdgeMaxxing Subnet: Optimizing AI Models for Consumer Devices

Enabling Millions to Contribute to a Decentralized Network

In the annals of the digital age, a grand saga unfolds. In a realm where the forces of artificial intelligence are harnessed by a select few, the question arises: shall this power remain concentrated, or shall it be distributed for the benefit of all humankind?

Table of Contents

About WOMBO

About w.ai

Democratizing the Future of AI

EdgeMaxxing Subnet

What is the goal?

Key Benefits of Optimized Models:

Current Subnet Focus

Miners and Validators

Incentive Mechanism and Reward Structure

Competition Structure

Miners

Validators

Running Miners and Validators

Miner setup

Validator setup

Dedicated Hardware

RunPod/Containers

API Component

Scoring Validator

Proposals for Optimizations

Roadmap

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages