GitHub - tsxgithub01/ML-HFT: High frequency trading (HFT) strategies built for futures using machine learning and deep learning techniques

High Frequency Trading Framework with Machine/Deep Learning

In this project, we provide a framework/pipeline for high frequency trading using machine/deep learning techniques. More advanced feature engineering (with depth trade and quote data) and models (such as pre-trained models) can be applied in this framework.

Target

Extract trading signals from multi-level orderbook data
Replicate well-designed high frequency trading (HFT) strategies using machine learning and deep learning techniques

Data

The SGX FTSE CHINA A50 INDEX Futures (新加坡交易所FTSE中国A50指数期货) tick depth data are used.

Strategy Pipline

Orderbook Signals

We use level-3 deep orderbook data to develop trading signals, including Depth Ratio, Rise Ratio, and Orderbook Imbalance (OBI).

Price Series

Feature Engineering & HFT Factors Design

Simple average depth ratio and OBI:

Weighted average depth ratio, OBI, and rise ratio:

Model Fitting

Models:
- RandomForestClassifier
- ExtraTreesClassifier
- AdaBoostClassifier
- GradientBoostingClassifier
- Support Vector Machines
- Other classifiers: Softmax, KNN, MLP, LSTM, etc.
Hyperparameters:
- Training window: 30min
- Test window: 10sec
- Prediction label: 15min forward

Performance Metrics

Prediction accuracy:

Prediction Accuracy Series:

Cross Validation Mean Accuracy:

Best Model:

PnL Visualization

Improvements

Feature Engineering

There are tons of potential powerful signals if we have both the trade and quote data, such as:

volume imbalance signal
trade imbalance signal
technical indicators of bid and ask series (RSI, MACD...)
WAP/WPR, weighted average price
volume imbalance signal
.....

These signals can also generate derivative version using techniques such as:

consider different weights on different level of orderbook data for a particular signal
consider moving average with period n (hyperparameter)
consider weighted average of signals, such as weighted average of trade imbalance and orderbook imbalance
.....

Models

More advanced classifiers are definitely welcomed! Include but not limit to:

CNN
GRU/LSTM
XGBoost, AdaBoost, GBDT, LightGBM
Attention, Auto-encoder
TabNet
GNN
Pre-trained models
.....

Performance Metrics The performance metrics are subject to amendment, including the PnL calculation, commission fee consideration, etc.

Final Words

There are tons of excellent features to be explored with trade data and depth ordebook data. So does the numerous powerful classifiers. In the Kaggle optiver volatility competition, the training data includes both trade and quote/orderbook, and it contains level-2 data. Many insightful feature engineering techniques and models can be discovered from the top solutions, which can also be applied in this framework.

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
Graph		Graph
data		data
images		images
HFT_factors.ipynb		HFT_factors.ipynb
README.md		README.md
SGX-FTSE-China-A50-Index-Futures.pdf		SGX-FTSE-China-A50-Index-Futures.pdf
data_process.ipynb		data_process.ipynb
data_visualization.ipynb		data_visualization.ipynb
feature_engineering.ipynb		feature_engineering.ipynb
model_fitting.ipynb		model_fitting.ipynb
order_book_3_2014_1_2.csv		order_book_3_2014_1_2.csv
order_book_4_2014_1_2.csv		order_book_4_2014_1_2.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

High Frequency Trading Framework with Machine/Deep Learning

Target

Data

Strategy Pipline

Orderbook Signals

Price Series

Feature Engineering & HFT Factors Design

Model Fitting

Performance Metrics

PnL Visualization

Improvements

Final Words

About

Releases

Packages

Languages

tsxgithub01/ML-HFT

Folders and files

Latest commit

History

Repository files navigation

High Frequency Trading Framework with Machine/Deep Learning

Target

Data

Strategy Pipline

Orderbook Signals

Price Series

Feature Engineering & HFT Factors Design

Model Fitting

Performance Metrics

PnL Visualization

Improvements

Final Words

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages