← Back
NVIDIA Blog

NVIDIA Unlocks AI Compute at Scale, Inviting Partners to Power the AI Infrastructure Buildout

4 min read
#compute#inference#nvidia#enterprise
NVIDIA Unlocks AI Compute at Scale, Inviting Partners to Power the AI Infrastructure Buildout
Level:Intermediate
For:AI Engineers
TL;DR

NVIDIA is introducing a new business model to provide large-scale, multi-tenant accelerated computing to the AI ecosystem, enabling startups, model builders, enterprises, and research organizations to access compute infrastructure without significant capital investments. This model allows AI cloud companies to sell cloud services delivered through NVIDIA DSX AI factories, aligning economics through revenue-sharing and credit-support. With this initiative, companies like Sharon AI and Firmus are already building DSX AI factories, deploying up to 40,000 NVIDIA Grace Blackwell GB300 GPUs and scaling to 360 megawatts and 170,000 NVIDIA GPUs, respectively. This development has significant implications for engineers building AI systems, as it provides faster access to full-stack accelerated computing and enables the scaling of AI infrastructure.

⚡ Key Takeaways

  • NVIDIA is introducing a new business model to provide access to large-scale, multi-tenant accelerated computing.
  • AI cloud companies will sell cloud services delivered through NVIDIA DSX AI factories, with revenue-sharing and credit-support models.
  • Sharon AI is deploying up to 40,000 NVIDIA Grace Blackwell GB300 GPUs, while Firmus is building a DSX AI factory campus scaling to 360 megawatts and 170,000 NVIDIA GPUs.
  • The new model provides a capital-efficient path for AI cloud companies to scale and offers NVIDIA a new recurring, usage-linked earnings stream.
  • Companies like Baseten, Fireworks AI, and Together AI require immediate access to AI cloud capacity for model training, post-training, fine-tuning, and high-volume agentic inference.
💡 Why It Matters

This development has significant implications for engineers building AI systems, as it provides faster access to full-stack accelerated computing and enables the scaling of AI infrastructure, allowing them to focus on model development and deployment rather than infrastructure management. The new business model also enables AI cloud companies to provide reliable access to large-scale NVIDIA accele

✅ Practical Steps

  1. Contact Sharon AI and Firmus to secure compute capacity and build and deploy AI models.
  2. Learn more about NVIDIA Cloud Partners and AI factories to understand the new business model and its benefits.
  3. Explore the possibilities of using NVIDIA DSX AI factories for large-scale AI compute infrastructure.

Want the full story? Read the original article.

Read on NVIDIA Blog

More like this

Enterprises lost Claude Fable 5 for a few weeks. New data shows two-thirds had already built their hedge

VentureBeat AI#anthropic

The Pulse: a new trend, smart model routing

Pragmatic Engineer#llm

How Amazon Bedrock catches AI-generated phishing

AWS ML Blog#amazon

Context vs. Memory Engineering in Agentic AI Systems

Machine Learning Mastery#agents

EXPLORE AI NEWS

Daily hand-picked stories on LLMs, RAG, agents and production AI — curated for engineers who ship.

BROWSE NEWS

GET THE WEEKLY DIGEST

Join engineers getting the Monday signal-over-noise AI breakdown. No spam, unsubscribe anytime.

LEARN AI ENGINEERING

Curated courses, research papers, repos and tutorials built for engineers leveling up in AI.

START LEARNING