Genomics workflows, Part 7: analyze public RNA sequencing data using AWS HealthOmics
Architecture Blog
This article discusses how life science organizations can use AWS HealthOmics and AWS Step Functions to automate and analyze public RNA sequencing data for clinical diagnostics. It allows researchers to quickly test hypotheses against larger datasets.
Specifically, the article covers:
- The use case of analyzing RNA sequencing (RNA-Seq) data from the Gene Expression Omnibus (GEO) repository using Nextflow workflows
- The solution architecture involving AWS HealthOmics for compute and workflow management, and AWS Step Functions for orchestrating data ingestion and analysis
- Implementation details for dataset ingestion using AWS Batch jobs, tracking status in DynamoDB, and generating sample sheets for HealthOmics analysis
- The HealthOmics RNA-Seq analysis workflow steps, including FASTQ file ingestion, monitoring status, and storing output BAM files
- Conclusion highlighting how HealthOmics simplifies gaining insights from omics data, combined with Step Functions for end-to-end automation
The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.
Related articles
May 30
2024
2024
Unlock Powerful Genomic Insights with AWS HealthOmics Analytics and Amazon EMR
Jun 9
2025
2025
Reduce Genomic Discovery Time and Costs with AWS HealthOmics Run Analyzer
Nov 20
2025
2025
Accelerating genomics variant interpretation with AWS HealthOmics and Amazon Bedrock AgentCore
Jul 18
2025
2025
Benchmarking PacBio whole genome sequencing variant pipeline analysis with AWS HealthOmics workflows
The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.