Exclusive Content:

Haiper steps out of stealth mode, secures $13.8 million seed funding for video-generative AI

Haiper Emerges from Stealth Mode with $13.8 Million Seed...

Running Your ML Notebook on Databricks: A Step-by-Step Guide

A Step-by-Step Guide to Hosting Machine Learning Notebooks in...

“Revealing Weak Infosec Practices that Open the Door for Cyber Criminals in Your Organization” • The Register

Warning: Stolen ChatGPT Credentials a Hot Commodity on the...

Improving Just Walk Out Technology with Multi-Modal AI

Revolutionizing Shopping with Just Walk Out Technology by Amazon: A Multi-Modal AI Approach

Revolutionizing Shopping with Just Walk Out Technology by Amazon

Since its launch in 2018, Just Walk Out technology by Amazon has completely transformed the shopping experience. Imagine entering a store, picking up the items you need, and simply walking out without having to wait in line to pay. This revolutionary checkout-free technology is now available in over 180 third-party locations worldwide, spanning various industries such as travel, sports, entertainment, and healthcare.

The latest generation of Just Walk Out technology is powered by a multi-modal foundation model (FM) that leverages a transformer-based architecture similar to that used in generative artificial intelligence (AI) applications. This advanced model enables retailers to automatically generate highly accurate shopping receipts using data from multiple inputs such as overhead video cameras, weight sensors on shelves, digital floor plans, and catalog images of products.

The Challenge: Complex Shopping Scenarios

One of the key challenges in developing the Just Walk Out system was ensuring accuracy in complex, long-tail shopping scenarios. Previous generations of the system utilized a modular architecture that segmented the shopper’s visit into discrete tasks. While this approach delivered accurate receipts, it required significant engineering efforts to address new, complex situations, limiting scalability.

The Solution: Just Walk Out Multi-Modal AI

To address these challenges, a new multi-modal FM was introduced specifically for retail environments. This enhanced model improves accuracy and generalization to new store formats, products, and customer behaviors. By incorporating continuous learning, the system adapts and learns from challenging scenarios to maintain high performance.

Key elements of the Just Walk Out multi-modal AI model include flexible data inputs, multi-modal AI tokens to represent shopper journeys, and the ability to continuously update receipts based on shopper interactions. Training the FM involved feeding vast amounts of data into the model and utilizing auxiliary tasks to enhance its performance.

Training the Just Walk Out FM

Effective training of the FM involved selecting challenging data sources, leveraging auto labeling for efficiency, pre-training the model on diverse tasks, and fine-tuning the model to optimize performance. The data flywheel methodology continuously improves the model by identifying and incorporating high-quality, challenging cases.

Conclusion

The introduction of multi-modal AI represents a significant advancement for Just Walk Out technology. This innovative approach simplifies and scales AI systems, moving away from traditional modular architectures. The new system sets a higher standard for accuracy and applicability across different store environments, ultimately enhancing the shopping experience for customers worldwide.

For more information on Just Walk Out technology and AWS AI services, visit the official Amazon announcements and product pages. The future of shopping is here, powered by cutting-edge AI innovation.

About the Authors

Tian Lan is a Principal Scientist at AWS, leading research efforts for Just Walk Out technology.

Chris Broaddus is a Senior Manager at AWS, overseeing research projects related to Just Walk Out technology and deep learning.

Latest

Advancements in Large Model Inference Container: New Features and Performance Improvements

Enhancing Performance and Reducing Costs in LLM Deployments with...

I asked ChatGPT if the remarkable surge in Lloyds share price has peaked, and here’s what it said…

Assessing the Future of Lloyds Banking: Insights and Reflections Why...

Cows Dominate Robots on Day One: The Tech Revolution Transforming Dairy Farming in Rural Australia

Revolutionizing Dairy Farming: Automated Milking Systems Transform the Lives...

AI Receptionist for Answering Services

Certainly! Here’s a suitable heading for the section you...

Don't miss

Haiper steps out of stealth mode, secures $13.8 million seed funding for video-generative AI

Haiper Emerges from Stealth Mode with $13.8 Million Seed...

Running Your ML Notebook on Databricks: A Step-by-Step Guide

A Step-by-Step Guide to Hosting Machine Learning Notebooks in...

VOXI UK Launches First AI Chatbot to Support Customers

VOXI Launches AI Chatbot to Revolutionize Customer Services in...

Investing in digital infrastructure key to realizing generative AI’s potential for driving economic growth | articles

Challenges Hindering the Widescale Deployment of Generative AI: Legal,...

Advancements in Large Model Inference Container: New Features and Performance Improvements

Enhancing Performance and Reducing Costs in LLM Deployments with AWS Updates Navigating the Challenges of Token Growth in Modern LLMs LMCache Support: Transforming Long-Context Inference Performance Benchmarks...

Reinforcement Fine-Tuning for Amazon Nova: Educating AI via Feedback

Unlocking Domain-Specific Capabilities: A Guide to Reinforcement Fine-Tuning for Amazon Nova Models Bridging the Gap Between General-Purpose AI and Business Needs A New Paradigm: Learning by...

Creating a Personal Productivity Assistant Using GLM-5

From Idea to Reality: Building a Personal Productivity Agent in Just Five Minutes with GLM-5 AI A Revolutionary Approach to Application Development This headline captures the...