
Custom Hand Gesture Recognition Models (HGR)🖐️ with YOLOv8🚀 and MediaPipe👋 - Streamlit App

Author: jk4e | Last Update: August 2024


⚠️ Warning: This project/application is in preview and under active development. Expect bugs and errors, an unrefactored codebase, performance issues, and unoptimized model performance.


Introduction

This is a collection of demos, packaged as a Streamlit app, showcasing the capabilities of custom-trained HGR models built with freely available SDKs and frameworks for various HGR tasks. Select a model configuration from the sidebar to try out the different models and tasks. Have fun exploring!

Features

  • Support for Images🖼️, Videos🎞️, and Webcam📷 Photos and Video Streams
  • Custom YOLOv8 detection model for hand detection task (HGR)
  • Custom YOLOv8 classification model for hand classification task (HGR)
  • Custom MediaPipe hand gesture recognition task (HGR)
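For illustration, running one of the custom YOLOv8 models outside the app takes only a few lines with the Ultralytics Python API. This is a minimal sketch; my_hand_model.pt and hand.jpg are placeholders, not files shipped in this repo:

# Hypothetical example: run custom-trained YOLOv8 weights on an image.
from ultralytics import YOLO

model = YOLO("my_hand_model.pt")  # load custom detection or classification weights
results = model("hand.jpg")       # run inference on a single image
for r in results:
    r.show()                      # visualize the predictions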

Note on Webcam Support for Video Streams:
Real-time webcam video streaming is not yet supported in the web app. Streamlit Cloud does not support it natively; it would require a custom component such as streamlit-webrtc (sketched below). There is currently no fast and easy way to integrate a local webcam stream into the web app. (see 1, 2, 3)
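For local deployments, a component like streamlit-webrtc could be wired in roughly as follows. This is a sketch only, not part of the current codebase; the callback body is a placeholder for actual model inference:

# Hypothetical sketch using the streamlit-webrtc component (not in this repo).
import av
from streamlit_webrtc import webrtc_streamer

def video_frame_callback(frame: av.VideoFrame) -> av.VideoFrame:
    img = frame.to_ndarray(format="bgr24")  # convert the frame to a NumPy array
    # ... run the HGR model on 'img' here ...
    return av.VideoFrame.from_ndarray(img, format="bgr24")

webrtc_streamer(key="hgr", video_frame_callback=video_frame_callback)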

Streamlit WebApp

Streamlit app (MediapipeYolo-HGR) with MediaPipe and YOLOv8 hand gesture recognition demos using custom-trained models. The app is deployed on Streamlit Community Cloud. Check it out!

LINK TO APP: https://my-hgr.streamlit.app

Note: In the app settings, you can switch the theme between Light and Dark (the default is "Use system setting").

These screenshots show some app pages.

  • Home Page with project information and description
  • YOLOv8 Classification Page for Video Input
  • YOLOv8 Detection Page for Image Input
  • MediaPipe HGR Page for Image Input

Resources, Tools and Packages

Main Packages and Frameworks

This app uses custom-trained hand gesture models created with YOLOv8, MediaPipe Gesture Recognition Task, and Streamlit as the user interface.
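For orientation, recognizing a gesture with a custom MediaPipe task file looks roughly like this. This is a sketch; my_gesture_recognizer.task and hand.jpg are placeholders for your own exported model and test image:

# Hypothetical sketch of the MediaPipe Gesture Recognition Task API.
import mediapipe as mp
from mediapipe.tasks import python
from mediapipe.tasks.python import vision

options = vision.GestureRecognizerOptions(
    base_options=python.BaseOptions(model_asset_path="my_gesture_recognizer.task"))
recognizer = vision.GestureRecognizer.create_from_options(options)

image = mp.Image.create_from_file("hand.jpg")
result = recognizer.recognize(image)
if result.gestures:
    top = result.gestures[0][0]  # top category for the first detected hand
    print(top.category_name, top.score)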

Annotation Tool

For image annotation, I used CVAT (Computer Vision Annotation Tool). Several free alternatives are available, such as Roboflow Annotate.

Datasets for Customizing Models

Freely available training datasets can be found on public dataset platforms.

Alternatively, you can generate your own dataset using a webcam on your PC or notebook.
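A minimal sketch for collecting your own images with OpenCV follows; the output path and key bindings are arbitrary choices, not project conventions:

# Hypothetical helper: capture dataset images from the default webcam.
# Press 's' to save the current frame, 'q' to quit.
import cv2

cap = cv2.VideoCapture(0)  # open the default webcam
count = 0
while cap.isOpened():
    ok, frame = cap.read()
    if not ok:
        break
    cv2.imshow("capture", frame)
    key = cv2.waitKey(1) & 0xFF
    if key == ord("s"):
        cv2.imwrite(f"dataset/img_{count:04d}.jpg", frame)
        count += 1
    elif key == ord("q"):
        break
cap.release()
cv2.destroyAllWindows()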

Hardware for Training

For training, it's recommended to have access to one of the following (a training sketch follows this list):

  • A powerful NVIDIA GPU with CUDA support, or
  • Cloud compute instances (e.g., Google Colab Free Plan for beginners offers a cloud-based Jupyter notebook environment)
    • Google Colab provides access to GPU resources and pre-installed libraries
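As a sketch of what a training run looks like with the Ultralytics API (the dataset config hands.yaml and the hyperparameters are placeholders, not this project's actual training setup):

# Hypothetical YOLOv8 training run; 'hands.yaml' is a placeholder dataset config.
from ultralytics import YOLO

model = YOLO("yolov8n.pt")  # start from a pretrained checkpoint
model.train(data="hands.yaml", epochs=100, imgsz=640, device=0)  # device=0: first CUDA GPU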

Deployment

This app is deployed on Streamlit Community Cloud, which is free to use. Other deployment options include:

  • Private deployment on Streamlit Community Cloud
  • Local usage
  • Non-GUI options like Jupyter Notebook
  • Custom GUI apps using packages like Tkinter or PyQt

Sources

YOLOv8

MediaPipe

⚠️ Important Note on MediaPipe Model Maker Installation

As of August 2024, MediaPipe Model Maker cannot be installed natively on Windows; the workarounds below address this limitation.

Workarounds:

  1. Use a machine with Linux
  2. Set up WSL2 (Windows Subsystem for Linux) on a Windows machine
  3. Use Google Colab, a free cloud-based Jupyter notebook environment
    • Google Colab provides access to GPU resources and pre-installed libraries

When is MediaPipe Model Maker needed?

  • This package is required if you want to train custom models with your own dataset (a training sketch follows this list).
  • If you don't need to train or customize a model, you can simply install the MediaPipe package.
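For orientation, a custom training run with Model Maker follows the official workflow roughly like this. It is a sketch; dataset_dir (one subfolder per gesture label) and the split ratios are placeholders:

# Hypothetical Model Maker training run (requires Linux, WSL2, or Colab; see above).
from mediapipe_model_maker import gesture_recognizer

data = gesture_recognizer.Dataset.from_folder(
    dirname="dataset_dir",
    hparams=gesture_recognizer.HandDataPreprocessingParams())
train_data, rest = data.split(0.8)
validation_data, test_data = rest.split(0.5)

options = gesture_recognizer.GestureRecognizerOptions(
    hparams=gesture_recognizer.HParams(export_dir="exported_model"))
model = gesture_recognizer.GestureRecognizer.create(
    train_data=train_data, validation_data=validation_data, options=options)
model.export_model()  # writes a .task file to export_dir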

Streamlit

Local Installation and Requirements

This project requires Python 3.11 (currently the only tested version).

Create a new conda environment

# Create a new environment named 'my_hgr_env'
conda create -n my_hgr_env python=3.11 pip

# Activate the environment
conda activate my_hgr_env

Alternatively, you can use other virtual environment tools like venv or virtualenv.

Clone the repository

git clone https://github.com/jk4e/my-hgr
cd my-hgr

Install dependencies

pip install -r requirements.txt

This process may take several minutes depending on your internet connection and system performance.

Note for beginners: If these instructions are unfamiliar, consider searching for "Python beginner setup guide" or "How to use virtual environments in Python" for more detailed explanations. You can also use AI assistants like ChatGPT or Claude for step-by-step guidance tailored to your operating system.

Running the Application

Ensure you're in the project directory and your virtual environment is activated, then run:

streamlit run streamlit-app.py

This command will start the Streamlit server and automatically open the application in your default web browser. If it doesn't open automatically, you can manually navigate to the URL displayed in the terminal (usually http://localhost:8501).
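If port 8501 is already in use, the port can be overridden with Streamlit's standard --server.port flag, for example:

# Optional: run the app on a different port
streamlit run streamlit-app.py --server.port 8502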

📋 My TODO List: Things that can be fixed, improved or added

Software

  • Fix bugs and errors
  • Refactor codebase
  • Optimize GUI performance, response time, and design

Features

  • Add new YOLOv8 tasks (Instance Segmentation or Pose Estimation)
  • Add support for additional media input formats, e.g., YouTube videos
  • Add support for user-uploaded models
  • Collect, annotate, and preprocess new datasets for model training
  • Train models with higher accuracy for better results (access to better hardware required)

Disclaimer

This project is for educational purposes only.