Workflows (also called pipelines) are series of steps performed by an external compute engine, often used for automated, bulk analysis (such as aligning genomic reads). Pipelines run on Terra are written in Workflow Description Language (WDL), a workflow processing language designed to be easy for humans to read and write.
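To give a feel for what WDL looks like, here is a minimal sketch of a one-step workflow. It is an illustration only, not a workflow that ships with Terra: the task name `count_lines`, the workflow name `count_lines_workflow`, the input `input_file`, and the `ubuntu:20.04` Docker image are all hypothetical choices made for this example.

```wdl
version 1.0

# Hypothetical task: counts the lines in a single text file.
task count_lines {
  input {
    File input_file
  }

  # The shell command run by the compute engine inside the container.
  command <<<
    wc -l < ~{input_file}
  >>>

  output {
    # Parse the number printed to stdout as an integer.
    Int line_count = read_int(stdout())
  }

  runtime {
    # Assumed container image; any image that provides `wc` would do.
    docker: "ubuntu:20.04"
  }
}

# Hypothetical workflow: a "series of steps" with just one step.
workflow count_lines_workflow {
  input {
    File input_file
  }

  call count_lines { input: input_file = input_file }

  output {
    Int line_count = count_lines.line_count
  }
}
```

In this sketch, the workflow-level `input_file` is the value you would supply when configuring the workflow (for example, a file path in a Google bucket), and `line_count` is the output the workflow reports back.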
To run a pipeline in a workspace, you will need:
- A workspace you've cloned using a Billing Project to which you have access (such as a free credit account)
- Input data (either stored in the Google bucket or linked to the workspace in the data table)
- One or more workflows. If you clone a workspace that already contains workflows, they will be present in your clone. If the Workflows tab of your workspace is empty, you can import workflows from the Terra library's code and workflows section.
One way to get up and running quickly is to clone a featured workspace and run the workflows it contains. For a quickstart example of running a pipeline on Terra, copy the Terra Quickstart workspace to your own billing account and go through the exercises in Part 2, where you will learn:
- How to link data in the cloud as input data for the workflow
- How to run a workflow on your data by configuring workflow inputs and outputs in the Terra interface
Once you get the hang of how pipelines work, you can practice running more complex pipelines by reproducing the work in a workspace from the Showcases and Tutorials library. See this article on curated workspaces in Terra.