AnVIL on GCP: Data Submitters' guide

Allie Cliffe
  • Updated

An overview of the data submission process to help AnVIL Data Submitters (GCP) get started staging and uploading data to TDR.

PrerequisitesThis document assumes you have already registered your study data with AnVIL and defined the data model for your dataset. For new projects that have not yet been approved, data submitters would complete the AnVIL Onboarding Application.

Process overview and requirements

The video below outlines the process for submitting data to AnVIL (Google cloud). 

For additional data submission support, reach out to the AnVIL Support team at anvil-data@broadinstitute.org

AnVIL provides data submitters with a submission workspace where you will stage data for ingestion (large data files such as omics and image files and CSV/TSV files for each dataset table).

As the data submitter, you’re expected to abide by the following guidelines Only upload data from the current approved data submission.

Use a separate workspace to run any compute or analysis on this data unless you have prior approval from the AnVIL program. Note that cloning the data deposit workspace is not allowed, as the clones will not have access protection for controlled data. Users who wish to analyze AnVIl data should import data snapshots from TDR. 

Don’t copy or move primary data from this workspace without prior approval from the AnVIL program. 

Next steps: Accessing the data

Once the data is ingested, you will be able to access it in TDR for analysis. Please do NOT clone this workspace for long-term use. This workspace will be deleted once your submission is complete.

Step-by-step instructions

Ready to submit data to AnVIL? See the resources below. 

Additional data model resources

Was this article helpful?

0 out of 0 found this helpful

Comments

0 comments

Please sign in to leave a comment.