This article gives an overview of the data submission process to help AnVIL Data Submitters (GCP) get started staging and uploading data to the Terra Data Repository (TDR).
Prerequisites
This document assumes you have already registered your study data with AnVIL and defined the data model for your dataset. If your project has not yet been approved, complete the AnVIL Onboarding Application first.
Process overview and requirements
The video below outlines the process for submitting data to AnVIL (Google Cloud).
For additional data submission support, reach out to the AnVIL Support team at anvil-data@broadinstitute.org.
AnVIL provides data submitters with a submission workspace where you will stage data for ingestion. Staged data includes large data files (such as omics and image files) and CSV/TSV files for each dataset table.
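The per-table CSV/TSV files mentioned above can be produced with any tool. As a minimal sketch, the Python snippet below serializes rows for one table to TSV text using only the standard library. The column names, row values, and bucket path are hypothetical placeholders, not part of the AnVIL data model; substitute the tables and columns from your own approved data model.

```python
import csv
import io

# Hypothetical column names for illustration only; use the columns
# defined in your own data model.
SAMPLE_COLUMNS = ["sample_id", "subject_id", "cram_path"]


def rows_to_tsv(columns, rows):
    """Serialize a list of row dicts to TSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=columns, delimiter="\t", lineterminator="\n"
    )
    writer.writeheader()
    for row in rows:
        # Fail early if a row lacks a column, rather than writing a blank cell.
        missing = [c for c in columns if c not in row]
        if missing:
            raise ValueError(f"row is missing columns: {missing}")
        writer.writerow(row)
    return buffer.getvalue()


# Hypothetical rows; the gs:// paths are placeholders.
sample_rows = [
    {"sample_id": "S1", "subject_id": "P1", "cram_path": "gs://example-bucket/S1.cram"},
    {"sample_id": "S2", "subject_id": "P2", "cram_path": "gs://example-bucket/S2.cram"},
]

tsv_text = rows_to_tsv(SAMPLE_COLUMNS, sample_rows)
```

Writing `tsv_text` to a file named after the table (for example `sample.tsv`, again a hypothetical name) gives one file per dataset table. Checking for missing columns before writing helps catch rows that do not match the data model before they are staged.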
As the data submitter, you’re expected to abide by the following guidelines:
- Only upload data from the current approved data submission.
- Use a separate workspace to run any compute or analysis on this data unless you have prior approval from the AnVIL program. Note that cloning the data deposit workspace is not allowed, as the clones will not have access protection for controlled data. Users who wish to analyze AnVIL data should import data snapshots from TDR.
- Don’t copy or move primary data from this workspace without prior approval from the AnVIL program.
Next steps: Accessing the data
Once the data is ingested, you will be able to access it in TDR for analysis. Please do NOT clone this workspace for long-term use. This workspace will be deleted once your submission is complete.
Step-by-step instructions
Ready to submit data to AnVIL? See the resources below.
Additional data model resources
- Set up a Data Model in the AnVIL portal (the tables that hold your data)
- Managing data with tables
- Overview: Entity types and the standard genomic model