This release note corresponds to the dates May 17, 2024, May 24, 2024. This release includes back-end updates to workflows, interactive analysis (Notebooks, Galaxy, RStudio), user interface, Data Repository, and Google and Azure integrations to improve upcoming features.
Cloud Integrations
- Users of Terra workflows on GCP may have noticed jobs failing under low quota conditions with messages such as "PAPI error code 9", "quota too low", and/or "no available zones". Terra's design is that quota exhaustion should not cause jobs to fail, they should wait to run. This regression is due to a recent behavior change in Google Life Sciences API, and Cromwell has now implemented support to detect and retry affected jobs. No WDL update is required, the retry is built-in and applies to all users. There is no cost to the retry, because jobs that fail do so before starting a VM.
- In Terra on Azure, workflows handle DRS input files in a cloud-aware way. This means that Terra will preferentially download files co-located in the same cloud as the Terra workspace, if they exist. The same behavior already exists for GCP.
Interactive Analysis
-
The R version for notebooks on Azure has been upgraded to v4.4.0 due to a security vulnerability with the previous version of R.
-
The latest GATK and Bioconductor images have been updated with Bioconductor v3.19, which includes R 4.4.0.
Workflows
- We resolved a hotspot in Cromwell that could cause the WDL `size()` engine function to perform very slowly on arrays of files, resulting in high IO load. Common examples of file arrays could include globs or scatter-gather results. This enhancement applies only to WDL 1.0 and later, because that's when `size()` added support for arrays.