How to download data
Hi,
We've been granted access to the phs003200.v1.p1 dataset. We would now like to download the BAM files to analyze them on our HPC, but it seems we have to set up billing (see image below). Before going through this procedure, I would like to know: is this "Files" tab the correct place to download the BAM files?
I was also wondering how we can download the data. Is it only possible via gsutil on the command line? What command should we use?
Comments
2 comments
Thanks for writing in to the community forum, Arthur Dondi. You need to have billing enabled to download the data you're requesting because it's in a Requester Pays bucket. This storage arrangement keeps the data custodian from paying additional fees, since you (the person accessing the data) are responsible for any networking (egress) costs associated with downloading it. See Using Requester Pays workspaces in Terra Support for more details.
To download the data, you'll use the gcloud storage cp command in a local terminal. The article above includes the commands you will need. For step-by-step instructions to install gcloud, see this article.
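As a rough sketch of what the copy step can look like for a Requester Pays bucket: the --billing-project flag tells gcloud which Google Cloud project to charge for egress. The function name download_bam, the DRY_RUN switch, and the example project ID, bucket path, and output directory are all hypothetical placeholders, not values from this dataset. Copy the real gs:// path from the workspace.

```shell
# Sketch only: download_bam and DRY_RUN are illustrative helpers, not part of gcloud.
# $1 = billing project ID (the project that pays egress)
# $2 = source object path, e.g. gs://<bucket>/<path>/<sample>.bam
# $3 = local destination directory
download_bam() {
  if [ "${DRY_RUN:-0}" = "1" ]; then
    # Print the command instead of running it, so it can be checked first.
    echo "gcloud storage cp --billing-project=$1 $2 $3"
  else
    mkdir -p "$3"
    gcloud storage cp --billing-project="$1" "$2" "$3"
  fi
}

# Example with placeholder values (assumptions, replace with your own):
#   DRY_RUN=1 download_bam my-billing-project gs://example-bucket/sample.bam ./bam_files
#
# Equivalent with the older gsutil tool, where -u names the billing project:
#   gsutil -u my-billing-project cp gs://example-bucket/sample.bam ./bam_files
```

Running it once with DRY_RUN=1 is a cheap way to confirm the project ID and path before any egress charges apply.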
To estimate the cost of downloading the data locally, see Google Pricing (for example, downloading 0-1 TB to destinations worldwide other than Asia and Australia costs $0.12/GB).
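For a quick back-of-the-envelope figure, the example rate above can be multiplied by the total size of the files. The helper below is a minimal sketch: estimate_egress_cost is a hypothetical name, it assumes a flat per-GB rate (defaulting to the $0.12/GB example), and it ignores tiering above 1 TB, so check the current pricing page for your destination.

```shell
# Sketch only: flat-rate estimate, assuming the $0.12/GB example rate by default.
# $1 = total download size in GB
# $2 = price per GB in USD (optional, defaults to 0.12)
estimate_egress_cost() {
  awk -v gb="$1" -v rate="${2:-0.12}" 'BEGIN { printf "%.2f\n", gb * rate }'
}

# Example: 500 GB of BAM files at the default rate
estimate_egress_cost 500   # prints 60.00
```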
Hi again, Arthur Dondi!
We haven't heard back from you (hopefully because you got the help you needed to solve your issue) so we're going to close out this ticket. If you still require assistance, simply respond to this comment and we'll be happy to pick up where we left off!
Cheers,
Allie