Blog posts
Product updates, announcements and stories from the trenches.
-
The Terra blog has moved
As part of an overhaul of the Terra.bio website, we've moved the blog to https://terra.bio/blog, which we hope will provide you with a more satisfying experience. (The app itself, the rest of the user guide and the community forum are unchanged.) ...
-
Upcoming event: Panel on Genomic Data Sharing Policies
On Thursday, Nov 19, 2020, our colleague Jonathan Lawson will participate in a panel on Genomic Data Sharing Policies hosted by the US National Human Genome Research Institute (NHGRI). Jonathan will contribute the deep experience he has gained thr...
-
Update your Terra Notebook Utilities for continued access to data via DRS/DOS
TL;DR: If you have been using the Terra Notebook Utilities (TNU) to access data through DRS/DOS URIs, you need to update your version of TNU before December 1, 2020 as described further below. Some of the many data repositories that are accessibl...
-
Identifying viral insertions with GATK PathSeq in Terra
In this guest blog post, Tiffany Miller describes an analysis project she undertook in Terra in collaboration with her colleague Mark Walker, in which they applied a metagenomic approach to identify viral insertion sites in human genome sequencing...
-
A demo workspace for working with gnomAD data in Terra
Last week we were very excited for our colleagues in the gnomAD team, who announced on their blog that the entire gnomAD dataset is now available for direct use or download from Google Cloud as well as Amazon Web Services and Microsoft Azure. If ...
-
Dealing with different data models: challenges and solutions
In my last blog post, I gave an overview of how Terra's data tables can help you streamline and scale up your data processing operations through the use of a data model that describes your dataset in a structured way. In this follow-up, I want to...
-
New resources for unlocking the power of Terra's Data tables
"This is not a question. I just wanted to say that I've avoided learning how to use Terra Data tables for a long time, primarily because I've used workflows that require sample sheet files, but I finally got to try it out and I must say it was qui...
-
Introducing WARP: A collection of cloud-optimized workflows for biological data processing and analysis
Guest blog post by Kylee Degatano, Product Manager for the Lantern Pipelines team in the Data Sciences Platform at the Broad Institute. I'm very excited to announce the recent release of WDL Analysis Research Pipelines (WARP), a brand new, public ...
-
The freedom of portable workflows
One of the foundational principles of Terra is that it's designed to be an open ecosystem, not a walled garden -- there are no lock-in mechanisms. If after a while you decide to leave, you can take all the analysis tooling you've been using here a...
-
Terra at ASHG 2020: Workshop plans and a video preview
Every year we run a workshop at the annual meeting of the American Society of Human Genetics, and it's a highlight of our year for many reasons. My favorite aspect of it is that we use the ASHG workshop to drive development of brand new education...
-
Synthetic phenotypes for 1000 Genomes: Updated dataset for testing, training, and learning
One of the challenges we face in human genomic research is that there is a lot of data that we can't freely share with one another, for legal and ethical reasons. This can be particularly vexing for tool developers who need data for testing, and f...
-
Community-maintained Notebook environments in Terra
Ever since we introduced Jupyter Notebooks in Terra, we've sought to provide default environments pre-loaded with software packages that are likely to interest you, to minimize the amount of setup necessary to get your work going. However, we've f...
-
GA4GH Interoperability standards in action
Today is the second day of the 8th plenary meeting of GA4GH, aka the Global Alliance for Genomics & Health, an international collaborative effort that has been driving the development of standards for infrastructure, policy, and security in the ge...
-
Behind the scenes: Bringing the analysis of COVID-19 data from greater Boston into the cloud
Christine Loreth is a project manager in the Data Sciences Platform at the Broad Institute. In this guest blog post, she tells the story of how she and colleagues in the DSP helped members of the Sabeti Lab, a leading infectious diseases research ...
-
Update to Jupyter Notebook environment in Terra: Persistent Disk storage now available
This week, we released one of those changes that looks small on the face of it but is actually a really big deal. Specifically, we upgraded the cloud environment (previously called "runtime") that we provide in Terra for running Jupyter Notebooks ...
-
Terra videos on YouTube
What do the Teen Titans and a German docu-series called Terra X have in common? That's right, they're the top hits you get on Google when you search for "Terra intro videos YouTube". So, you might have already searched for Terra videos and come up...
-
Coming Soon - Faster, cheaper workflows
Whether you’re processing ten data files or ten thousand, making your workflows run faster and cost less is always a goal. The Terra Workflow (aka “Batch”) team has been working on some cost and performance improvements. These aren’t available qui...
-
COVID-19 viral genomics: Updated public workspace and Boston outbreak preprint in medRxiv
As we have discussed previously, our collaborators in the Sabeti Lab at the Broad Institute have been analyzing SARS-CoV-2 viral genomes from COVID-19 cases in the Boston area, in partnership with the MA Department of Public Health and Massachuset...
-
Making large-scale single-cell RNASeq analysis scalable and cost-effective with Cumulus
In this guest blog post, Bo Li, Principal Investigator at Massachusetts General Hospital and Assistant Professor of Medicine at Harvard Medical School, explains how using Terra enabled his group to develop a new single-cell transcriptomic analysi...
-
“Delete intermediates” option now available for Workflows in Terra
Summary: Intermediate files generated by your workflow may be an unexpected source of storage costs. Fortunately, you now have an easy option to delete these files immediately after your workflow runs. Just select the new “delete intermediate outp...
-
Faster creation of Notebook environments with Google Compute Engine VMs
Summary: We know that the 4 minutes it takes to create a cloud environment to perform a Jupyter Notebook analysis can feel like a long time. To reduce this time and save you cost, Terra has added support for using standard Google Compute Engine Vi...
-
Single-cell transcriptome atlas of COVID-19 in primates arms scientists in the fight against SARS-CoV-2
In this guest blog post, Longqi Liu from Beijing Genomics Institute-Research and Miguel A. Esteban from the Guangzhou Institutes of Biomedicine and Health (Chinese Academy of Sciences) discuss their efforts in developing a better understanding of...
-
Announcing the NCI Cancer Data Aggregator (CDA) - A new collaboration led by Broad Institute in partnership with NCI, SBG, and ISB
In the last decade, we’ve seen exponential growth in the amount and breadth of cancer research data. We’ve harnessed the power of cloud repositories to host large oncology datasets, including genomics, proteomics, imaging, and more! Although this ...
-
Successes (and stumbles) in cloud-based research: Why we moved to Terra and what we learned along the way
In this guest blog post, Timothy Majarian, a Computational Associate from the Manning lab, gives us a glimpse into the lab's journey from its local high-performance compute cluster to cloud computing with Terra. Back in ...
-
COVID-19 Genomic Surveillance in the Boston Area - Powered by Terra
As the global battle against COVID-19 continues, researchers in the viral genomics group at the Broad Institute have been hard at work using viral sequencing and genomic epidemiology to understand the spread of SARS-CoV-2 close to home, yielding n...
-
COVID-19 Integrated Analyses using Single Cell Data
Recently, we’ve been writing here about the work of the Viral Genomics group at Broad and how they’re using Terra to support their workflows for genome assembly and phylogenomic analysis of novel coronavirus (SARS-CoV-2) genomes recovered from pat...
-
A textbook for life sciences in the cloud
I remember when I originally started hearing about Docker containers; I didn't really understand what they were and anything I googled on the topic seemed awfully complicated. And I'll admit, getting used to working with cloud storage was rough to...
-
Workflow updates to the COVID-19 workspace: Better viral assembly and phylogenetics with Nextstrain
In our last blog post, we featured a public workspace containing best-practices workflows for viral genome analysis developed by Dr. Danny Park's Viral Genomics group, and used to process COVID-19 research data. We have been working with the viral...
-
Broad scientists release COVID-19 best-practices workflows and analysis tools in Terra
Like you, we are adapting to a different way of living and working as the 2019 novel coronavirus (COVID-19) spreads, claiming many lives, sickening many more, and upending daily life around the world. We are heartened to see that the scientific co...
-
Funding opportunity: Analyze TOPMed and GTEx datasets with support from NHLBI BioData Catalyst
It’s the holidays, which means the season of giving is here. And we’re excited to hear that our Data Biosphere partners over at the National Heart, Lung, and Blood Institute (NHLBI) are leading the way with a timely offering. No, it’s not the late...