Home icon

Perform generative AI-powered data prep and no-code ML over any size of data using Amazon SageMaker Canvas

Machine Learning Blog



This article introduces the new capability of Amazon SageMaker Canvas to handle petabyte-scale datasets for data preparation, model building, and inference. It provides a step-by-step guide on how to leverage this new feature using a sample flight dataset.

Specifically, the article covers:

  • Prerequisites and setup
  • Importing data into SageMaker Canvas and analyzing data quality
  • Preparing data using the new "Chat for data prep" feature and other transforms
  • Optionally exporting the prepared data to Amazon S3 using Amazon EMR Serverless
  • Creating a machine learning model with AutoML and running batch predictions
  • Conclusion emphasizing the democratization of machine learning with this new capability


Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Aug 16
2024
SageMaker Canvas unlocks no-code ML and data preparation at petabyte-scale
Aug 21
2024
Unlock the power of data governance and no-code machine learning with Amazon SageMaker Canvas and Amazon DataZone
Aug 20
2024
Amazon SageMaker Canvas now supports data flows import, and faster data prep for ML
Apr 3
2024
Seamlessly transition between no-code and code-first machine learning with Amazon SageMaker Canvas and Amazon SageMaker Studio

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.