GCP versus Azure: What are the Terra offerings?

Derek Caetano-Anolles
  • Updated

So you've decided to use Terra

Terra is a cloud-native platform that empowers you to access, analyze, and share your biomedical data efficiently.

Once you have decided to use Terra, you must then make a choice of how you would like to use Terra.

Terra currently offers two distinct offeringsTerra on GCP and Terra on Azure.

Under the hood, there are some important differences between both offerings that impact functionality and your cost.

What is Terra? Terra is a cloud service front-end designed to let you get your work done without needing to deal with the complexities of cloud vendors such as Google or Azure. Instead, Terra lets you run your pipelines, perform statistics, and visualize your data in a user-friendly environment that does the heavy lifting of interfacing with cloud vendors on your behalf.

Terra on the Cloud that makes sense for you

Terra on GCP

Banner_TerraGCP.png

Terra on GCP (Google Cloud Platform) is the best option for the majority of people.

With Terra on GCP, you can create workspaces and store/analyze your data using cloud storage and compute resources. Built-in features allow you to collaborate and share your data and tools securely with others.

    • Pay-as-you-go
      You are only responsible for paying for the cloud storage, egress, and compute resources you use when performing an analysis.

    • Grows with your project
      Shared cloud infrastructure scales to meet most data storage and analysis needs at an affordable price.

    • Permissions management
      Workspace permissions and Authorization Domains offer built-in mechanisms for sharing and collaborating securely.

When to choose Terra on GCP

Shared infrastructure is owned and managed by the Broad Institute and optimizes expenses, making Terra on GCP a perfect choice if:

    • You are sensitive to costs (for example, an academic researcher on a grant budget).
    • You do not require features available only on Terra on Azure.

For more information on how to get started with Terra on GCP, visit our Terra (GCP) support page.

Terra on Azure

Banner_TerraAzure.png

Terra on Microsoft Azure is the best option for people with specific security and data access needs.

Terra on Azure is a Managed Application deployed entirely within your individual or organizational Azure Cloud Subscription.

In this offering, you can engage in all of the features present on the Terra for GCP offering — create workspaces, analyze data using Azure compute resources, and share your results with others — but you can also benefit from these Azure-only features:

    • Data security & isolation
      All of your data is stored entirely within your own Azure subscription perimeter. While the Broad manages Azure resources, Terra itself has no access to your data. Data logging capabilities (logging, download alerts, data regionality restrictions) assists with your data security compliance. Enhanced security measures let you control data sharing at a granular scale.

    • Private hosting
      You can connect to your data where it lives within your own private IT network, rather than from external servers. There are options to run workflows from a private GitHub using Docker images from a private Azure Container Registry (ACR).

    • Azure-native
      Pipelines and notebooks developed and optimized for the Azure ecosystem will run immediately.

    • Large-scale
      User-owned infrastructure facilitates high capacity project support, ideal for genomic or clinical data engineering. Optimized for analyses of much larger quantities of data.

When to choose Terra on Azure

Terra on Azure includes both operational (working) and infrastructure costs, which may be prohibitive for most individual users who do not need added security and logging.

This makes Terra on Azure a perfect choice if:

    • You work within a larger organization, such as a pharmaceutical company or research hospital
    • You handle Protected Health Information (PHI)
    • You have very specific data security requirements
    • You are already heavily integrated into the Microsoft Azure ecosystem and do not want to (or cannot) move your data to Google Cloud.

For more information on how to get started with Terra on Azure, send us a message at terra-enterprise@broadinstitute.org and let us know your interest.

Terra offerings at a glance

 

GCP.png
Terra on GCP

Azure.png
Terra on Azure

Costs

Terra-owned, centralized services for workflow compute and data tables
User-owned, isolated services for workflow compute and data tables allows for shorter wait times for scheduled tasks, improved data table functionality and control of data residence
Users pay for data storage, egress and compute cloud resources based on usage

Security and Controls

Data and workspace permissions managed by individuals using self-service methods

FedRAMP Moderate Authorized

Technical features to enforce compliance requirements such as enhanced logging, data regionality restrictions, download alerts, etc.

Active Directory for centralized account management and permissioning through Terra

Data Management and Analysis

Data Tables


Up to 30,000 rows


Up to 1 million rows

Interactive Analysis VMs

Import public workflows from Dockstore

Import public workflows from GitHub

Import private workflows from GitHub

Use Dockers from private ACR

Workflow engine ownership

Shared Per user per workspace
Terra's data custodian application, which supports ongoing data versioning needs and complex data sharing

Can I transfer data, tools, or workspaces between Terra offerings?

Data in the cloud

Data can be transferred between GCP and Azure storage, but egress between cloud providers comes at a cost.

Each cloud service has its own costs to upload and download data. Remember to account for these as part of your data migration.

Terra workspaces and analysis tools

Terra workspaces and workspace resources (tools or tables) are a different story.

Due to the incompatibility in infrastructure between GCP and Azure, it is not (currently) possible to directly transfer workspaces/resources between Terra on GCP and Terra on Azure.

That said, as part of our support for Terra on Azure users, Terra developers can offer a manual transfer of your resources as part of the onboarding process, if your organization requires a transfer of workspace resources.

What is the difference in costs?

Billing for Terra on GCP is usage-based, and depends on the amount of data storage, egress, and computational resources you use. There are no additional costs.

Terra on Azure incorporates both working (usage-based) and static costs to cover enhanced service features and infrastructure needs of your isolated Terra Environment.

I know what I want! How do I get started?

Selecting the appropriate Terra offering requires a clear understanding of the specific requirements for you or your organization.

If you’ve decided to use the scalable, cost-effective solutions offered by Terra on GCP, head over to our Getting Started (GCP) documentation to learn how to register for your new account today.

If your organization is ready to use the robust, security-ready features of Terra on Azure, schedule a meeting with a Terra Team liaison at terra-enterprise@broadinstitute.org to discuss which features, plans, and licenses work best for your use case.

 

Was this article helpful?

2 out of 2 found this helpful

Comments

0 comments

Please sign in to leave a comment.