A workflow (also called a pipeline) is a series of steps performed by an external compute engine, often used for automated, bulk analysis such as aligning genomic reads. Pipelines run on Terra are written in Workflow Description Language (WDL), a workflow language designed to be easy for humans to read and write.
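To give a sense of what a WDL file looks like, here is a minimal sketch of a one-step workflow that counts the lines in a text file. It is not taken from any Terra workspace: the workflow name `CountLines`, the task name `count_lines`, the input names, and the `ubuntu:20.04` Docker image are all assumptions chosen for illustration.

```wdl
version 1.0

# Hypothetical example: all names below are illustrative.
workflow CountLines {
  input {
    # The file to analyze, e.g. a path in the workspace bucket
    File input_file
  }

  # Run the single task, passing the workflow input to it
  call count_lines { input: text_file = input_file }

  output {
    # Expose the task's result as a workflow-level output
    Int line_count = count_lines.n
  }
}

task count_lines {
  input {
    File text_file
  }

  # Shell command executed inside the container
  command <<<
    wc -l < ~{text_file}
  >>>

  output {
    # Parse the number printed to stdout
    Int n = read_int(stdout())
  }

  runtime {
    # Assumed image; any image that provides `wc` would do
    docker: "ubuntu:20.04"
  }
}
```

A real analysis pipeline follows the same pattern: one `workflow` block that wires together one or more `task` blocks, each of which declares its inputs, a command, its outputs, and a runtime environment.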
To run a pipeline in a workspace, you will need:
- A Workspace you've cloned using a Billing Project to which you have access (such as a free credit account)
- Input data, either stored in the workspace's Google bucket or linked to the workspace through the data table
- One or more workflows. If you clone a Workspace that already contains workflows, they will be present in your clone. If the Workflows tab of your Workspace is empty, you can import workflows from the code and workflows section of the Terra library.
One way to get up and running quickly is to clone and run the workflows in a featured workspace. For a quickstart example of running a pipeline on Terra, copy the Terra Quickstart workspace to your own billing account and try going through the exercises in Part 2, where you will learn:
- How to run a pre-configured workflow on a sample in the workspace table
- How to run the workflow on your own sample
- How to configure the attributes and variables of a workflow to run
This article walks you through the important steps. Once you get the hang of how pipelining works, you can practice running more complex pipelines by reproducing work in a workspace in the Showcases and Tutorials library. See this article on curated workspaces in Terra.