Protein language model training with NVIDIA BioNeMo framework on AWS ParallelCluster

HPC Blog

This article demonstrates how to train the ESM-1nv protein language model using the NVIDIA BioNeMo framework on an AWS ParallelCluster cluster with GPU-accelerated instances. It provides a step-by-step guide for setting up the cluster, configuring the framework and datasets, and running the pre-training job.

Specifically, the article covers:

Creating an HPC cluster using AWS ParallelCluster with GPU instances, Amazon FSx for Lustre, and Elastic Fabric Adapter (EFA)
Configuring the cluster with the BioNeMo framework and downloading the UniRef50 dataset
Running the ESM-1nv pre-training job and monitoring its progress
Conclusion highlighting alternative options for deploying BioNeMo on AWS

Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Aug 11
2025

Federated learning-based protein language models with Apheris on AWS

Mar 12
2024

Find the Next Blockbuster with NVIDIA BioNeMo Framework on Amazon SageMaker

May 1
2024

Accelerate drug discovery with NVIDIA BioNeMo Framework on Amazon EKS

May 29
2024

Large scale training with NVIDIA NeMo Megatron on AWS ParallelCluster using P5 instances

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Protein language model training with NVIDIA BioNeMo framework on AWS ParallelCluster

Related articles