Home icon

The art and science of hyperparameter optimization on Amazon Nova Forge

Machine Learning Blog



This article provides a comprehensive guide to hyperparameter optimization for Amazon Nova Forge, AWS's platform for building custom domain-specific language models. It addresses the critical challenges of fine-tuning while preserving general capabilities.

  • Catastrophic forgetting occurs when domain training overwrites general capabilities; data mixing prevents this
  • Learning rate is the most sensitive hyperparameter; service defaults are recommended starting points
  • Three customization techniques: Continued Pre-Training (CPT), Supervised Fine-Tuning (SFT), Reinforcement Fine-Tuning (RFT)
  • Checkpoint selection is the most impactful decision for CPT; match flexibility to data scale
  • Data mixing should balance customer data at ~50% with Nova curated data; always include reasoning-instruction-following
  • LoRA training offers lower cost and faster iteration; graduate to Full Rank after validation
  • Recommended workflow: SFT with LoRA, then RFT for optimal results on labeled data
  • Batch size targets: 2-20 million tokens per step for CPT; monitor validation loss for overfitting
  • Common mistakes: skipping SFT before RFT, deviating from default learning rates, poor reward functions
  • Data and reward quality matter more than volume; prioritize quality over scale

Successful Nova Forge customization requires balancing strategic decisions (checkpoint, data mixing, training mode) with systematic hyperparameter tuning, while maintaining data quality and reward function rigor.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Apr 3
2024
Hyperparameter optimization for quantum machine learning with Amazon Braket
Jul 24
2025
Benchmarking Amazon Nova: A comprehensive analysis through MT-Bench and Arena-Hard-Auto
Jun 10
2025
Leveraging LLMs as an Augmentation to Traditional Hyperparameter Tuning
Apr 29
2025
Improve Amazon Nova migration performance with data-aware prompt optimization

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.