NFA Processing on GPU

Artifact for Why GPUs are Slow at Executing NFAs and How to Make them Faster @ ASPLOS '2k20

The artifact contains the source code and Python/shell scripts. The scripts generate the throughput results in Table 3 of our paper. We also provide our raw experimental data and the scripts to reproduce other figures in our paper.

Hardware dependencies

We developed and tested our work in two NVIDIA GPUs (Quadro P6000 and Tesla V100). We expect our code can run on GPUs with the compute capability no less than 5.0.

Software dependencies

Our work requires CUDA 9.2 SDK. Our work uses cmake later than 3.10 for compilation. We use Python 3 for our scripts.

Operating System

We test our work on Ubuntu 18.04. We expect it can work with mainstream Linux distributions.

Required Python packages:

matplotlib
numpy
pandas
scipy
xlrd

If you use conda, simply run

conda create --name gpunfa_env python=3.7 matplotlib numpy scipy pandas xlrd
conda activate gpunfa_env

to install all the required Python packages.

Datasets

All data sets are from public available benchmark suites. We convert the automata files of them to ANML format. To facilitate this step, we provide the data set that is ready to use. Our scripts will automatically unzip these datasets.

Benchmarks

If the benchmark could not be downloaded due to GitHub's quota, try to download it here.

Installation

The setup.sh will automatically download the datasets, build the executables, and set up environmental variables.

git clone https://github.com/bigwater/gpunfa-artifact.git 
cd gpunfa-artifact
source setup.sh

Our project contains three executables, infant, ppopp12, and obat. They are added to your PATH variable after the setup. The former two excutables are our implementations of iNFAnt and NFA-CG, respectively. The obat contains our schemes. They share the same options and arguments settings.

Basic Usage

obat -i [input_stream_file] -a [anml_file] -g [algorithm] ...

The algorithm could be:

obat2: the scheme NewTran
obat_MC: NewTran matchset compression
hotstart_ea_no_MC2: Hotstart
hotstart_ea: Howstart matchset compression

ppopp12 -i [input_stream_file] -a [anml_file] -g [algorithm] ...

The algorithm could be:

ppopp12: Our implementation of NFA-CG.

infant -i [input_stream_file] -a [anml_file] -g [algorithm] ...

The algorithm could be:

infant: Our implementation of iNFAnt.

Other options. You can check how to use other options by showing the help using obat -? or obat -h.

Experiments

To get Table 3 in our paper, simply run the following commands.

cd gpunfa-artifact
./run_experiments_get_table3.sh

The entire set requires several hours to finish. It uses the same configuration as the paper and generates Table 3 automatically. It will generate a csv file abs_throughput.csv containing the information of Table 3.

Regenerating the figures of our paper from our raw data

cd ${GPUNFA_ROOT}
./gen_figures.sh

The script will take around 20 min to finish.

Paper

Please refer to this paper for other details.

@inproceedings{gpunfa-asplos2020,
 author = {Liu, Hongyuan and Pai, Sreepathi and Jog, Adwait},
 title = {{Why GPUs are Slow at Executing NFAs and How to Make them Faster}},
 booktitle = {Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS)},
 year = {2020}
}

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
gpunfa_code		gpunfa_code
small_dataset		small_dataset
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
gen_figures.sh		gen_figures.sh
gpunfa_benchmarks.zip		gpunfa_benchmarks.zip
raw_data.zip		raw_data.zip
run_experiments_get_table3.sh		run_experiments_get_table3.sh
setup.sh		setup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NFA Processing on GPU

Artifact for Why GPUs are Slow at Executing NFAs and How to Make them Faster @ ASPLOS '2k20

Hardware dependencies

Software dependencies

Operating System

Required Python packages:

Datasets

Benchmarks

Installation

Basic Usage

Experiments

Regenerating the figures of our paper from our raw data

Paper

About

Releases

Packages

Languages

License

bigwater/gpunfa-artifact

Folders and files

Latest commit

History

Repository files navigation

NFA Processing on GPU

Artifact for Why GPUs are Slow at Executing NFAs and How to Make them Faster @ ASPLOS '2k20

Hardware dependencies

Software dependencies

Operating System

Required Python packages:

Datasets

Benchmarks

Installation

Basic Usage

Experiments

Regenerating the figures of our paper from our raw data

Paper

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages