Exclusive Content:

Haiper steps out of stealth mode, secures $13.8 million seed funding for video-generative AI

Haiper Emerges from Stealth Mode with $13.8 Million Seed...

Running Your ML Notebook on Databricks: A Step-by-Step Guide

A Step-by-Step Guide to Hosting Machine Learning Notebooks in...

“Revealing Weak Infosec Practices that Open the Door for Cyber Criminals in Your Organization” • The Register

Warning: Stolen ChatGPT Credentials a Hot Commodity on the...

Cisco achieves 50% reduction in latency with Amazon SageMaker’s faster autoscaling feature

Enhancing Contact Center Experiences with Generative AI and Amazon SageMaker Inference_SPEEDY AUTOSCALING RELEASE REFERENCE

Cisco’s Webex Collaboration AI team is at the forefront of leveraging AI-driven features to enhance its products and services. With a focus on generative AI and large language models (LLMs), the team has been able to improve productivity and user experiences, particularly in the realm of customer engagement solutions like Webex Contact Center. However, as the models grew in size and complexity, the team faced challenges in efficiently allocating resources and scaling applications.

To address these challenges, Cisco worked with Amazon SageMaker Inference to optimize its AI/ML infrastructure. By migrating LLMs to SageMaker, Cisco was able to improve speed, scalability, and price-performance. This architectural shift allowed for better resource utilization and streamlined development, testing, and deployment of new AI-powered features for the Webex portfolio.

One notable improvement came in the form of faster autoscaling with SageMaker’s new predefined metric types. By utilizing high-resolution metrics like SageMakerVariantConcurrentRequestsPerModelHighResolution, Cisco saw up to a 50% improvement in end-to-end inference latency. This enhancement enabled faster detection of scaling needs and more efficient allocation of resources, ultimately leading to improved performance and efficiency for their critical Generative AI applications.

Looking ahead, Cisco plans to continue working with SageMaker Inference to drive further improvements in variables that impact autoscaling latencies, such as model download and load times. With this new feature, Cisco looks forward to broadening its rollout in multiple regions and delivering even more impactful generative AI features to its customers.

The collaboration between Cisco and Amazon SageMaker highlights the power of AI-driven innovation in enhancing collaboration experiences and customer engagement solutions. With a focus on leveraging advanced technologies like LLMs and generative AI, Cisco is paving the way for more efficient and personalized customer interactions. As the partnership continues to evolve, we can expect to see even more exciting developments in the realm of AI-driven collaboration.

Latest

Transforming Isolated Data into Cohesive Insights: Cross-Account Athena Access for Amazon QuickSight

Harnessing Cross-Account Athena Access for Amazon Quick: A Comprehensive...

I Used ChatGPT to Overcome Daily Decision-Making Anxiety, and My Stress Plummeted Almost Instantly

Breaking Free from the Chains of Overthinking: Strategies for...

Exyn Technologies Seeks NASDAQ IPO with Autonomous Robotics and 3D Mapping Software — TradingView News

Exyn Technologies Launches Initial Public Offering on Nasdaq: A...

Mindful Anger Management Through Generative AI Tools Like ChatGPT

Harnessing AI for Anger Management: A Promising Tool for...

Don't miss

Haiper steps out of stealth mode, secures $13.8 million seed funding for video-generative AI

Haiper Emerges from Stealth Mode with $13.8 Million Seed...

Running Your ML Notebook on Databricks: A Step-by-Step Guide

A Step-by-Step Guide to Hosting Machine Learning Notebooks in...

Investing in digital infrastructure key to realizing generative AI’s potential for driving economic growth | articles

Challenges Hindering the Widescale Deployment of Generative AI: Legal,...

VOXI UK Launches First AI Chatbot to Support Customers

VOXI Launches AI Chatbot to Revolutionize Customer Services in...

Transforming Isolated Data into Cohesive Insights: Cross-Account Athena Access for Amazon...

Harnessing Cross-Account Athena Access for Amazon Quick: A Comprehensive Guide Overview of Amazon Quick and Its Components Amazon Quick: An AI-focused service for unified data analysis...

Real-Time Voice Agents Using Stream Vision Agents and Amazon Nova 2...

Building Production-Grade Real-Time Voice Agents with Stream and Amazon Bedrock Co-Authored by Neevash Ramdial, Technical Marketing Leader at Stream Creating natural and responsive production-grade voice agents...

Create Financial Document Processing Solutions Using Pulse AI and Amazon Bedrock

Transforming Financial Document Processing: Leveraging Pulse AI and Amazon Bedrock for Accurate Data Extraction Introduction Financial institutions process thousands of complex documents daily. Optical Character Recognition...