pipeline

New GO pipeline. For context, see:

Overview

  • Gathers externally contributed files (GAFs, GPADs, ...)
  • Performs filtering and checking
    • Uses ontobio validate for parsing and reporting
    • Afterwards runs owltools checks and inferences
  • Generates filtered files and derived files
    • GPAD, GAF and GPI
    • RDF for direct loading into blazegraph

Products

Everything is driven by a Makefile. It should be possible to run the Makefile locally, on an Amazon machine run as a Jenkins agent, and so on.

Environment

The pipeline/environment.sh script can be used to set up the environment before the Makefile is run. Run the script with source so that its settings apply to the current shell:

source environment.sh

or

. environment.sh
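
The reason for sourcing rather than executing can be seen in a small, self-contained sketch. It does not use the real environment.sh; it writes a throwaway script, and the variable name DEMO_VAR is a made-up placeholder. Running the script as a child process loses the exported variable, while sourcing it keeps the variable in the current shell.

```shell
# Create a throwaway script that exports one variable.
# DEMO_VAR is a hypothetical name, not something environment.sh is known to set.
tmp_env="$(mktemp)"
echo 'export DEMO_VAR=from_script' > "$tmp_env"

# Run the script in a child shell: the export disappears when the child exits.
unset DEMO_VAR
sh "$tmp_env"
after_exec="${DEMO_VAR:-unset}"

# Source the script: the export happens in the current shell and persists.
. "$tmp_env"
after_source="${DEMO_VAR:-unset}"

echo "after running:  $after_exec"
echo "after sourcing: $after_source"
rm -f "$tmp_env"
```

The same applies to any later make invocation: it inherits exported variables only from the shell that launched it.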

Status

Active. Running in parallel. The bulk of processing and reporting is done by ontobio validate.py.

Pipeline Re-Orientation

To support a more flexible processing environment and all annotation input types, we will change the basic structure of the pipeline, as outlined here: geneontology/pipeline#206

This new pipeline "kernel" will be a series of scripts/commands that each perform a simple step, file-by-file. These scripts/commands will be placed in the "go-association-transit" directory.

See the go-association-transit readme for more.
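
As a rough illustration of the file-by-file idea only, the sketch below applies one small step to each input file independently. Everything in it is assumed for the demo: the directory layout, the sample file contents, and the strip_header step are hypothetical stand-ins and are not taken from go-association-transit.

```shell
# Scratch area with two placeholder input files (contents invented for the demo).
work="$(mktemp -d)"
mkdir -p "$work/in" "$work/out"
printf '!header\nrow1\nrow2\n' > "$work/in/a.gaf"
printf '!header\nrow1\n' > "$work/in/b.gaf"

# Hypothetical single step: drop lines that start with "!".
# It reads exactly one file and writes exactly one file.
strip_header() {
    grep -v '^!' "$1" > "$2" || true
}

# Apply the step to each input file on its own.
processed=0
for src in "$work"/in/*.gaf; do
    strip_header "$src" "$work/out/$(basename "$src")"
    processed=$((processed + 1))
done

# Count the remaining rows in one output file, then clean up.
rows_a="$(wc -l < "$work/out/a.gaf" | tr -d ' ')"
echo "processed $processed files; a.gaf has $rows_a data rows"
rm -rf "$work"
```

Because each step takes one input file and produces one output file, steps of this shape can be chained or re-run for a single file without touching the rest.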