Exclusive Content:

Haiper steps out of stealth mode, secures $13.8 million seed funding for video-generative AI

Haiper Emerges from Stealth Mode with $13.8 Million Seed...

Running Your ML Notebook on Databricks: A Step-by-Step Guide

A Step-by-Step Guide to Hosting Machine Learning Notebooks in...

“Revealing Weak Infosec Practices that Open the Door for Cyber Criminals in Your Organization” • The Register

Warning: Stolen ChatGPT Credentials a Hot Commodity on the...

Unveiling Detailed Cost Attribution for Amazon Bedrock

Understanding Granular Cost Attribution for Amazon Bedrock Inference: A Guide to Tracking and Optimizing Cloud Expenses

Key Takeaways:

  • Overview of Amazon Bedrock’s new cost attribution feature
  • How to effectively trace inference costs to specific IAM principals
  • Step-by-step guide for implementing cost tracking and aggregation

Introduction

As AI inference gains prominence in cloud spending, understanding the driving factors behind these costs has become crucial for organizations eager to optimize finances. This post introduces Amazon Bedrock’s granular cost attribution capabilities.

How Granular Cost Attribution Works

  • Understanding IAM Principal Attribution
  • Incorporating Cost Allocation Tags for Better Insights

Tracking Costs by Access Patterns

Scenario 1: IAM Users and API Keys

Scenario 2: Applications with IAM Roles

Scenario 3: Federated Authentication

Scenario 4: LLM Gateway Implementations

Steps for Cost Aggregation and Reporting

  • Setting Up Cost Tags
  • Enabling IAM Principal Data in AWS Billing
  • Using AWS Cost Explorer for Visualization

Conclusion

Leverage Amazon Bedrock’s cost attribution capabilities to enhance your financial planning and optimize AI inference expenses effectively.

About the Authors

  • Profiles of the team behind Amazon Bedrock’s cost management features.

Understanding Granular Cost Attribution for Amazon Bedrock Inference

As AI inference offerings expand, they’re taking on a larger share of cloud expenses. This surge underscores the necessity for organizations to grasp precisely who and what are driving these costs, allowing for effective chargebacks, cost optimization, and financial forecasting. This brings us to Amazon Bedrock’s latest feature: granular cost attribution for inference calls.

What’s New?

Today, we’re thrilled to announce that Amazon Bedrock now automatically attributes inference costs to the Identity and Access Management (IAM) principals that initiate these calls. An IAM principal could be an IAM user, an application-assumed role, or a federated identity from providers like Okta or Entra ID. This attribution flows seamlessly into your AWS Billing, accommodating various models, without requiring additional resource management or altering your existing workflows.

Additionally, through optional cost allocation tags, you can further categorize costs by teams, projects, or any custom dimension using AWS Cost Explorer and AWS Cost and Usage Reports (CUR 2.0).

How Granular Cost Attribution Works

When you enable IAM principal data in your CUR 2.0 configuration, you’ll see which IAM principals are interacting with Amazon Bedrock and the costs associated with their usage. For instance:

Line Item IAM Principal Usage Type Unblended Cost
arn:aws:iam::123456789012:user/alice USE1-Claude4.6Sonnet-input-tokens $0.069
arn:aws:iam::123456789012:user/alice USE1-Claude4.6Sonnet-output-tokens $0.214
arn:aws:iam::123456789012:user/bob USE1-Claude4.6Opus-input-tokens $0.198
arn:aws:iam::123456789012:user/bob USE1-Claude4.6Opus-output-tokens $0.990

In this example, Alice and Bob’s expenses for different models and token types are clearly detailed, allowing organizations to track usage effectively.

Adding Tags for Aggregation and Cost Exploration

To aggregate costs by team, project, or cost center, you can incorporate tags into your IAM principals.

  • Principal Tags: These are attached directly to IAM users or roles and apply consistently across all requests from that principal.
  • Session Tags: These are more dynamic, applied when a user or application assumes a role for temporary credentials.

After activating these tags in AWS Billing, they will begin appearing in CUR 2.0 and can be utilized in Cost Explorer.

Quickstart by Scenario

Your approach will depend on how users and applications engage with Amazon Bedrock. Here’s a breakdown:

Your Setup CUR 2.0 Attribution Tagging for Aggregation
Developers with IAM users or API keys Each user’s ARN appears in CUR 2.0 Attach tags to IAM users
Applications with IAM roles Each role’s ARN appears in CUR 2.0 Attach tags to IAM roles
Users authenticate through an IdP Session name in ARN identifies users Pass session name and tags from your IdP
LLM gateway proxying to Bedrock Only shows gateway’s role (one identity for all users) Add per-user AssumeRole with session name and tags

Detailed Scenarios

  1. Per-User Tracking with IAM Users and API Keys

    • Ideal for small teams or development environments. Each user’s spending is directly linked to their IAM credentials, making it easy to track costs.
  2. Per-Application Tracking with IAM Roles

    • Suitable for production workloads. Applications assume specific IAM roles, providing visibility into costs associated with each application.
  3. Per-User Tracking with Federated Authentication

    • Best for enterprises leveraging identity providers. Users can utilize a shared IAM role while still maintaining individual visibility into their costs.
  4. Per-User Tracking Through an LLM Gateway

    • For organizations using a gateway that processes requests before contacting Amazon Bedrock. Implementing per-user session management is key for accurate cost attribution.

Getting Started

To utilize Amazon Bedrock’s new cost attribution feature, follow these steps:

  1. Identify Your Access Pattern: Determine whether you’re using IAM users, API keys, or roles.
  2. Enable IAM Principal Data: Configure your CUR 2.0 to include IAM principal data.
  3. Add Tags for Aggregation: If aggregation is needed, implement tags within IAM users, roles, or configure your IdP for session tagging.
  4. Analyze: Within 24-48 hours post-activation, your tags will be visible in Cost Explorer and CUR 2.0, allowing for sophisticated analysis.

Conclusion

Understanding where costs are accruing from inference requests is essential for effective financial planning, chargebacks, and optimization strategies. With Amazon Bedrock’s granular cost attribution, businesses can pinpoint spending back to specific users, applications, or identities. This visibility empowers organizations to not only track their costs but also to optimize their spending strategies moving forward.


About the Authors

Ba’Carri Johnson is a Sr. Technical Product Manager at Amazon Bedrock, dedicated to cost management and governance for AWS AI.

Vadim Omeltchenko serves as a Sr. Solutions Architect focused on helping AWS customers innovate.

Ajit Mahareddy has a rich background in product management, specializing in AI/ML products from leading technology firms.

Sofian Hamiti leads technological initiatives with over a decade of experience in AI solutions.


Stay tuned for more insights as we continue to empower organizations to optimize and scale their AI capabilities!

Latest

Real-Time Voice Agents Using Stream Vision Agents and Amazon Nova 2 Sonic

Building Production-Grade Real-Time Voice Agents with Stream and Amazon...

Go.Compare Introduces Insurance App Powered by ChatGPT

Go.Compare Launches ChatGPT App for Effortless Insurance Comparison Go.Compare Launches...

Dstl-Backed Robotics Innovation Revolutionizes Military Manufacturing – A Case Study

Revolutionizing Manufacturing: Rivelin Robotics’ Innovations in Precision Finishing for...

Understanding Patient Sentiment in Atopic Dermatitis Management

Insights into Patient Sentiment and Treatment Perceptions in Atopic...

Don't miss

Haiper steps out of stealth mode, secures $13.8 million seed funding for video-generative AI

Haiper Emerges from Stealth Mode with $13.8 Million Seed...

Running Your ML Notebook on Databricks: A Step-by-Step Guide

A Step-by-Step Guide to Hosting Machine Learning Notebooks in...

Investing in digital infrastructure key to realizing generative AI’s potential for driving economic growth | articles

Challenges Hindering the Widescale Deployment of Generative AI: Legal,...

VOXI UK Launches First AI Chatbot to Support Customers

VOXI Launches AI Chatbot to Revolutionize Customer Services in...

Real-Time Voice Agents Using Stream Vision Agents and Amazon Nova 2...

Building Production-Grade Real-Time Voice Agents with Stream and Amazon Bedrock Co-Authored by Neevash Ramdial, Technical Marketing Leader at Stream Creating natural and responsive production-grade voice agents...

Create Financial Document Processing Solutions Using Pulse AI and Amazon Bedrock

Transforming Financial Document Processing: Leveraging Pulse AI and Amazon Bedrock for Accurate Data Extraction Introduction Financial institutions process thousands of complex documents daily. Optical Character Recognition...

Automating Schema Creation for Smart Document Processing

Streamlining Document Processing: Introducing Multi-Document Discovery for Intelligent Document Processing (IDP) Overcoming Schema Challenges in Large Document Collections The IDP Accelerator: Revolutionizing Document Processing Automated Solution Overview...