Need Help?

Search our documentation and community forum

Terra is a cloud-native platform for biomedical researchers to access data, run analysis tools, and collaborate.
Terra powers important scientific projects like FireCloud, AnVIL, and BioData Catalyst. Learn more.

hg19 and hg38 TCGA workspaces

Comments

8 comments

  • Avatar
    Sabrina Camp

    Also on another note, I am trying to use bam files from the hg38 cohort and see this note on the dashboard of the workspace "hg38 TCGA and TARGET workspaces reference files by their GDC UUIDs. In order to run analyses on the referenced data files, you will need to run workflows that retrieve the files from the GDC and copy them to your workspace bucket. See this forum post for instructions on the running of these workflows." 

    The link to the forum post (https://gatkforums.broadinstitute.org/firecloud/discussion/10382/populating-hg38-tcga-and-target-workspaces-with-data-files#latest) is broken. Is there an updated link? 

    0
    Comment actions Permalink
  • Avatar
    Samantha (she/her)

    Hi Sabrina Camp,

     

    Thanks for writing in. Unfortunately, we do not have any documentation on the curation of the hg19 and hg38 controlled-access TCGA workspaces. The CGA team did the original pull of the data, but have since given up ownership of the workspaces. It seems that getting the hg19 data which is now in legacy archive was really complicated to parse programmatically because metadata was not always homogeneously present. It may just be a result of QC metrics or something else along the lines of not enough or not correctly formatted metadata.

    To your second question, the link to the forum post points to our legacy GATK forum which is now defunct. We do not have an updated link, but are working on new documentation for these hg38 and hg19 workspaces. I will be sure to keep you updated on that.

    Please let me know if you have any questions.

     

    Best,

    Samantha

    0
    Comment actions Permalink
  • Avatar
    Maha Shady

    Hi Samantha,

    I wanted to follow up on this thread because I also need to use some hg38 bams. Are there any updates regarding workflows to retrieve those files from the GDC portal? The documentation link on the hg38 workspaces is still broken, and the columns do not include bam files. 

    Thank you!

    0
    Comment actions Permalink
  • Avatar
    Emil Furat

    Hi Maha,

     

    Thanks for writing in. I'm reaching out on behalf of the Terra support team to help with answering your question. If you want to retrieve data from the GDC portal you'll need to update the GDC UUIDs in your data tables to be DRS URIs which you can then access from your notebook and/or workflow. To do so you will need to:

    Please note: You must have your NIH account + CRDC Framework Services linked to your Terra account to access the TCGA DRS data.

    If you have any other questions please let us know!

     

    Kind regards,

    Emil

    0
    Comment actions Permalink
  • Avatar
    Maha Shady

    Hi Emil, 

    Thank you for the clarifications, I followed the steps you mentioned. However, I continue to have the problem that the the data tables do not include any columns for bam files. How can I modify the workspace to also be able to retrieve bam files from the GDC portal?

    Thanks so much!

    0
    Comment actions Permalink
  • Avatar
    Emil Furat

    Hi Maha,

     

    Could you please share with me the TCGA workspace that you are using? Not your workspace specifically, but the original workspace that you cloned to create your workspace. 

     

    Kind regards,

    Emil

    0
    Comment actions Permalink
  • 0
    Comment actions Permalink
  • Avatar
    Emil Furat

    Hi Maha,

     

    Sorry for the delay getting back to you, our support staff has lost access to TCGA protected workspaces so we are unable to troubleshoot your issue at this time. We hope to regain access and get back to you with a solution as soon as possible.

     

    Kind regards,

    Emil

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk