prepare_data

Preparing Data

Please follow the four scripts to download and preprocess the MSCOCO data.

Download MSCOCO Image Data

sh step1_download_coco.sh

The script will download the image data from the MSCOCO official site and extract it to mscoco.

Generating Detection Results

You can download our pre-extracted object detection feature from here. Or you can extract the object feature by yourself using the following instructions:

sh step2_detection.sh

We use the TensorFlow Object Detection API to prepocess each image and save the detection results to disk. Note that it may takes about 12 hours to finish the preprocessing with a Nvidia V100 GPU.

Extract Image Feature

sh step3_image_feature_extraction.sh

It may takes 20 minutes to finish the feature extraction process.

Generate the NOC COCO Dataset

sh step4_transfer_coco_to_noc.sh

We transfer the original MSCOCO dataset to fit the novel object captioning setting.

We also provide our generated data (the training and testing splits for the held-out MSCOCO dataset), which can be downloaded here.

All the preprocessed results can be found in mscoco.

Name		Name	Last commit message	Last commit date
parent directory ..
mscoco		mscoco
utils		utils
.gitignore		.gitignore
README.md		README.md
step1_download_coco.sh		step1_download_coco.sh
step2_detection.sh		step2_detection.sh
step3_image_feature_extraction.sh		step3_image_feature_extraction.sh
step4_transfer_coco_to_noc.sh		step4_transfer_coco_to_noc.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

prepare_data

prepare_data

README.md

Preparing Data

Download MSCOCO Image Data

Generating Detection Results

Extract Image Feature

Generate the NOC COCO Dataset

Files

prepare_data

Directory actions

More options

Directory actions

More options

Latest commit

History

prepare_data

Folders and files

parent directory

README.md

Preparing Data

Download MSCOCO Image Data

Generating Detection Results

Extract Image Feature

Generate the NOC COCO Dataset