Exclusive Content:

Haiper steps out of stealth mode, secures $13.8 million seed funding for video-generative AI

Haiper Emerges from Stealth Mode with $13.8 Million Seed...

“Revealing Weak Infosec Practices that Open the Door for Cyber Criminals in Your Organization” • The Register

Warning: Stolen ChatGPT Credentials a Hot Commodity on the...

VOXI UK Launches First AI Chatbot to Support Customers

VOXI Launches AI Chatbot to Revolutionize Customer Services in...

Customize Amazon Nova in Amazon SageMaker with Direct Preference Optimization

Unlocking the Power of Customization: A Deep Dive into Amazon Nova Foundation Models at AWS Summit NYC

Exploring Model Customization Capabilities: A Comprehensive Overview

Direct Preference Optimization: Enhancing Model Outputs with Your Preferences

Streamlined Customization Workflow: Leveraging Amazon SageMaker AI

Adapting Amazon Nova Micro: A Practical Implementation Walk-Through

Preparing Your Dataset: Key Steps for Optimal Model Training

Fine-Tuning with DPO: Achieving Superior Model Performance

Evaluating Model Performance: A Comparative Analysis of Evaluation Metrics

Deploying Your Custom Model: Seamless Integration with Amazon Bedrock

Conclusion: Embracing Advanced AI Customization with Amazon Nova

Meet the Authors: Insights from the Team Behind Amazon Nova

Unlocking the Future of AI: Customizing Amazon Nova Models with DPO Recipes at AWS Summit NYC

At the recent AWS Summit in New York City, a significant announcement resonated throughout the AI community: Amazon introduced a comprehensive suite of model customization capabilities for the Amazon Nova foundation models. With the launch of ready-to-use recipes on Amazon SageMaker AI, users can now adapt various models—Nova Micro, Nova Lite, and Nova Pro—throughout the model training lifecycle, encompassing pre-training, supervised fine-tuning, and alignment.

A Sneak Peek into Customization Recipes

This multi-post series will delve into these customization recipes, starting with an alignment technique called Direct Preference Optimization (DPO). DPO allows you to finely tune model outputs based on your preferences through an intuitive method: supplying the model with two responses—one preferred and the other less so. By guiding the model with this structured feedback, you can foster outputs that resonate better with your brand’s tone, style, and guidelines. Depending on your data volume and budget, you can choose between parameter-efficient or full model DPO approaches.

Once trained, these customized models can be seamlessly deployed to Amazon Bedrock for inference, utilizing provisioned throughput, while the parameter-efficient version supports on-demand inference. The flexibility extends further—Nova customization recipes are compatible with SageMaker training jobs and SageMaker HyperPod, allowing you to tailor the environment to your scale requirements.

Streamlined Customization with SageMaker Training Jobs

Today, we’re focusing on a streamlined approach to customizing Amazon Nova Micro using SageMaker training jobs. Here’s how the workflow operates:

  1. Selecting a Recipe: Users initiate by selecting a specific Nova customization recipe, which offers detailed configurations for training parameters, model settings, and distributed training strategies. Default configurations optimized for the SageMaker AI environment can be utilized or modified for experimentation.

  2. API Request Submission: An API request is sent to the SageMaker AI control plane, carrying the chosen Amazon Nova recipe configuration.

  3. Managed Compute Cluster Execution: SageMaker executes the recipe on a managed compute cluster through a training job launcher script. It provisions the necessary infrastructure and orchestrates distributed training, automatically decommissioning the cluster upon job completion.

This comprehensive architecture simplifies the customization experience, allowing users to swiftly define training parameters while SageMaker handles the underlying infrastructure—billed only for the duration of training in seconds.

Business Impact: Optimizing for Function Calling

The implementation walkthrough zeroes in on adapting the Amazon Nova Micro model for structured function calling, proving to significantly enhance performance metrics. Our findings indicate an 81% increase in the F1 score and up to 42% improvement in ROUGE metrics. Such enhancements mean that these models can effectively drive various business applications, like AI customer support systems, intelligent digital assistants, and workflow automation in fields such as e-commerce and finance.

Training the Model: Data Preparation & Execution

The initial step involves preparing the dataset using the nvidia/When2Call dataset, designed for training AI assistants to make optimal tool-use decisions. You’ll format user prompts, preferred outputs, and non-preferred outputs into a suitable training structure.

Next, the DPO training takes place on SageMaker, leveraging the PyTorch Estimator class for fine-tuning. You’ll identify the instance type, configure input channels, and submit the fitting job—all while maintaining oversight through Amazon CloudWatch logs for robust observability.

Evaluation and Results

After fine-tuning, model evaluation follows. Using evaluation recipes allows for an in-depth assessment of performance across various metrics. Our evaluation shows marked improvements:

  • F1 Score: Increased by 81%
  • F1 Quasi: Improved by 40%
  • ROUGE Scores: Gains of 39% and 42% for ROUGE-1 and ROUGE-2, respectively.

Deployment on Amazon Bedrock

Finally, deploying the fine-tuned model using the Amazon Bedrock CreateCustomModel API enables on-demand inference with the integrated tooling. The process involves creating a custom model and configuring it for real-time operational efficiency.

Conclusion: A New Era of Customization

This blog post illustrates how businesses can leverage DPO recipes to customize Amazon Nova models effectively. The exploration highlights the capability to achieve dramatic performance improvements—culminating in a powerful tool for meeting industry-specifications and enhancing customer interactions.

To kickstart your journey with the Nova customization recipes, check out the SageMaker HyperPod recipes repository and Amazon Nova Samples repository. The AWS team is continually expanding this suite based on customer feedback, ensuring alignment with evolving machine learning trends.

About the Authors

  • Mukund Birje: Sr. Product Marketing Manager at AWS.
  • Karan Bhandarkar: Principal Product Manager, focusing on model customization.
  • Kanwaljit Khurmi: Principal Worldwide Generative AI Solutions Architect.
  • Bruno Pistone: Senior Generative AI/ML Specialist Solutions Architect.

Stay tuned for the next posts in this series, where we will explore additional customization techniques and real-world applications!

Latest

I Asked ChatGPT About the Worst Money Mistakes You Can Make — Here’s What It Revealed

Insights from ChatGPT: The Worst Financial Mistakes You Can...

Can Arrow (ARW) Enhance Its Competitive Edge Through Robotics Partnerships?

Arrow Electronics Faces Growing Challenges Amid New Partnership with...

Could a $10,000 Investment in This Generative AI ETF Turn You into a Millionaire?

Investing in the Future: The Promising Potential of the...

Don't miss

Haiper steps out of stealth mode, secures $13.8 million seed funding for video-generative AI

Haiper Emerges from Stealth Mode with $13.8 Million Seed...

VOXI UK Launches First AI Chatbot to Support Customers

VOXI Launches AI Chatbot to Revolutionize Customer Services in...

Investing in digital infrastructure key to realizing generative AI’s potential for driving economic growth | articles

Challenges Hindering the Widescale Deployment of Generative AI: Legal,...

Microsoft launches new AI tool to assist finance teams with generative tasks

Microsoft Launches AI Copilot for Finance Teams in Microsoft...

Leverage Amazon SageMaker HyperPod and Anyscale for Next-Gen Distributed Computing Solutions

Optimizing Large-Scale AI Deployments with Amazon SageMaker HyperPod and Anyscale Overview of Challenges in AI Infrastructure Introducing Amazon SageMaker HyperPod for ML Workloads The Integration of Anyscale...

Vxceed Creates the Ideal Sales Pitch for Scalable Sales Teams with...

Revolutionizing Revenue Retention: AI-Powered Solutions for Consumer Packaged Goods in Emerging Markets Collaborating for Change in CPG Loyalty Programs The Challenge: Addressing Revenue Retention in Emerging...

Streamline the Creation of Amazon QuickSight Data Stories with Agentic AI...

Streamlining Decision-Making with Automated Amazon QuickSight Data Stories Overview of Challenges in Data Story Creation Introduction to Amazon Nova Act Automating QuickSight Data Stories: A Step-by-Step Guide Best...