How to access GTEx data in Terra

Allie Cliffe
  • Updated

Step-by-step instructions to access GTEx data for analysis in a Terra workspace or download locally once your dbGaP request has been approved. These instructions will work for AnVIL and Biodata Catalyst researchers working in Terra. 

Overview

Requests for GTEx data can no longer be made through DUOS and must be submitted through dbGaP. Once your request has been approved, you can access the data from the DUOS Data Library (see instructions below).

Please note: If your request has been approved in dbGaP, you should now see a blue Export link in the View by Dataset tab under Export to Terra (right column) in the DUOS Data Library.

Step 1: Register in DUOS

1.1. Go to duos.org and click sign-up/sign in. You have the option to sign in with either a Microsoft or Google-backed account. Make sure to sign in with the same email account that was used to request GTEx access through dbGaP and is an institutional email.

DUOS-login_Screenshot-of-sign-in-page.png

Screenshot-of-DUOS-signin-screen.png

1.2. Accept the Terms of service.

DUOS_Accept-Terms-of-Service.png

Step 2 : Complete Your User Profile

2.1. In the Researcher Console, select Your Profile under your name (top right).

DUOS-Setup_Screenshot-of-your-profile-in-drop-down-menu.png

2.2. Once you land on the profile page, add your full name and select your institution from the dropdown (start to fill in the name). If your institution is not yet in DUOS, please email DUOS support at support@duos.org and we will add it for you!

DUOS-setup_Screenshot-of-Your-profile-page.png

DUOS-setup_Affiliation-and-role-section-with-'Your-institution-name'-in-the-institution-field_Screenshot.png

2.3. Link your NIH RAS Account. If you do not already have a RAS account, you can find instructions for obtaining one here. You’ll be taken to the external NIH page to sign into your RAS account. 

Please note: You do not need to obtain a Library Card to access GTEx data through DUOS

Step 3: Access GTEx data

Please remember that you must already have an approved dbGaP request to access GTEx data in DUOS. You can either access the data by cloning the associated workspace (preferred method) or by exporting the snapshot to Terra.

Option 1 (preferred method): Access GTEx data through the AnVIL workspace

3.1. Go to the DUOS Data Library (https://duos.org/datalibrary) and filter/search for the dataset you’re looking for. Click on the dataset name that you want to access.

3.2. You will be taken to a page containing information about the dataset. Click on the link to the associated AnVIL workspace.

3.3. You will then be taken to the AnVIL workspace. Click the three dots in the upper right hand corner of the workspace, then select "Clone" to clone the workspace. 

For more detailed instructions on cloning a Terra workspace, please see How to Clone Your Workspace

Option 2: Export the snapshot to Terra

3.4. If your request has been approved in dbGaP, you should now see a blue Export link in the View by Dataset tab under Export to Terra (right column) in the Data Library.

Access-GTEx-with-DUOS_Screenshot-of-data-library-with-export-link-highlighted.png

3.5. Clicking the button should allow you to export to an existing Terra workspace or to create a new one.

How-to-access-GTEX-data-in-Terra_Screenshot-of-option-to-export-data-to-an-existing-workspace-or-vreate-a-new-one.png

3.6. Note that if you're accessing controlled data, DUOS will automatically enable additional security on your new or existing workspace. 

DUOS-access-GTEx-data_Screenshot-of-Create-a-new-workspace-form-with-enable-additional-security-monitoring-highlighted.png

What to expect

DUOS will import the data snapshot to a new or existing workspace.

It will take a few minutes to export the snapshot. You’ll get a green popup (upper right) when data is in your workspace.

DUS-data-access_Screenshot-of-successful-data-import-popup.png

Once you refresh your page, you’ll see the data tables containing all the snapshot data and metadata in the Data tab of your workspace. Note the security shield at the top right indicating additional security monitoring.

DUOS-access-GTEx-data_Screenshot-of-data-tab-after-data-import.png

Download GTEx data to local machine

GTEx data, including controlled access data, can be downloaded from the Terra workspace GCP bucket using the CLI commands provided by Google (gsutil or gcloud storage).

Download caveats

  • The bucket has requester pays enabled.
  • You must be in the appropriate Authorization Domain to access these workspaces. 

GTEx v11 details

GTEx v10 details

GTEX v8 details

Step-by-step instructions

1. Install the gcloud CLI

For detailed instructions, see How to install gcloud on a local machine.

2. Authenticate with Google

Set up user credentials with the Google user identity you use when logging in to Terra (described in the article above).

3. Select and download the desired files

Was this article helpful?

0 out of 4 found this helpful

Comments

0 comments

Please sign in to leave a comment.