AWS ML Blog

Shared infrastructure, isolated tenants: Pool model multi-tenancy with Amazon Bedrock AgentCore

June 23, 2026•16 min read•

Level:Intermediate

For:AI Engineers

✦TL;DR

The Amazon Bedrock AgentCore enables the implementation of production-ready multi-tenant systems with complete tenant isolation, service tier differentiation, and granular cost tracking. The solution demonstrates a three-level hierarchy: Tier → Tenant → User, with isolation enforced at every layer using native AWS capabilities. The example solution implements two service tiers, Basic and Premium, using different models, Mistral Ministral 3 8B Instruct and OpenAI GPT OSS 120B, to cater to diverse customer needs. This approach allows for efficient resource utilization and scalable multi-tenant AI architectures.

⚡ Key Takeaways

The pool model multi-tenancy approach maximizes resource utilization by sharing underlying infrastructure and compute resources among tenants.
The use of native AWS capabilities, such as AWS-managed services, enables complete tenant isolation and granular cost tracking.
The tiering strategy allows for service tier differentiation with minimal custom code, using models like Mistral Ministral 3 8B Instruct for Basic Tier and OpenAI GPT OSS 120B for Premium Tier.
The solution implements a three-level hierarchy: Tier → Tenant → User, to enforce isolation at every layer.
The GitHub repo provides sample code for the solution, demonstrating the implementation of multi-tenant systems using Amazon Bedrock AgentCore.

💡 Why It Matters

The ability to implement production-ready multi-tenant systems with complete tenant isolation and service tier differentiation is crucial for building scalable and efficient AI applications, particularly in industries like healthcare where data privacy and security are paramount. This solution enables developers to build multi-tenant AI systems that cater to diverse customer needs while maintainin

✅ Practical Steps

Implement a three-level hierarchy: Tier → Tenant → User, to enforce isolation at every layer using native AWS capabilities.
Use a pool isolation model to maximize resource utilization and share underlying infrastructure and compute resources among tenants.
Utilize the tiering strategy to differentiate service tiers with minimal custom code, using models like Mistral Ministral 3 8B Instruct and OpenAI GPT OSS 120B.

Want the full story? Read the original article.

Read on AWS ML Blog ↗

Shared infrastructure, isolated tenants: Pool model multi-tenancy with Amazon Bedrock AgentCore

⚡ Key Takeaways

✅ Practical Steps

More like this

Enterprise-grade AI image generation in 2 seconds is here: Krea 2 Raw and Turbo available as open weights under custom license

Build a protein research copilot with Amazon Bedrock AgentCore

Reliability fail: No automated zone failover for Coinbase’s global trading service

How Businesses Are Building Specialized AI They Can Trust