Home icon
AWS performs fine-tuning on a Large Language Model (LLM) to classify toxic speech for a large gaming company

Blog



This article details how AWS built a toxic speech classifier for a gaming company using fine-tuned Large Language Models (LLMs).

  • Gaming company needed automated detection of toxic language in player communications
  • Challenge: Only 100 labeled samples available, far below recommended 1,000 for LLM fine-tuning
  • Solution used transfer learning with BERTweet models pre-trained on 850 million English tweets
  • AWS GAIIC tested three models: bertweet-base, bertweet-base-offensive, bertweet-base-hate
  • Initial two-stage approach achieved 91% precision but had maintenance and cost drawbacks
  • AWS ProServe MLDT consolidated to single-stage model with additional 5,000 labeled samples
  • Final one-stage model achieved 88% precision with improved maintainability and lower costs
  • Solution deployed using Amazon SageMaker for scalable production implementation

AWS successfully demonstrated that fine-tuning pre-trained LLMs with limited labeled data effectively solves real-world content moderation challenges while balancing accuracy, cost, and operational efficiency.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.