HomeNvidia

Nvidia

14 curated articles on Nvidia for AI engineers

14 articles
NVIDIA and AWS Collaborate to Bring AI to Production at Scale
NVIDIA Blog· 4 min read· 3 days ago
NVIDIA and AWS Collaborate to Bring AI to Production at Scale

NVIDIA and AWS have collaborated to bring AI to production at scale, addressing constraints such as low-latency inference, fast vector search, and strong GPU price-performance. The NVIDIA RTX PRO 4500 Blackwell Server Edition GPUs power new Amazon EC2 G7 instances, delivering up to 4.6x AI inference performance and up to 2.1x graphics performance compared to G6 instances. The NVIDIA cuVS library accelerates the retrieval layer by making GPU-powered vector indexing the default in OpenSearch Serverless, resulting in vector indexing up to 10x faster at a quarter of the cost. This collaboration provides enterprises with practical paths to deploy AI at production scale, enabling lower-latency inference and faster vector search.

Accelerating Transformers Fine-Tuning with NVIDIA NeMo AutoModel
Hugging Face Blog· 5 min read· 3 days ago
Accelerating Transformers Fine-Tuning with NVIDIA NeMo AutoModel

Not mentioned. The title suggests a technical improvement but lacks specific details. Not mentioned. Not mentioned. The practical implication for engineers building AI systems is not mentioned.

How Businesses Are Building Specialized AI They Can Trust
NVIDIA Blog· 4 min read· 4 days ago
How Businesses Are Building Specialized AI They Can Trust

The NVIDIA Agent Toolkit provides a foundation for building specialized AI agents that can be customized, controlled, and trusted by enterprises and developers. This toolkit includes models, tools, skills, and a secure runtime, enabling the creation of digital AI coworkers that can reason, use tools, and take action. With the NVIDIA Agent Toolkit, businesses can build specialized AI agents that fit their specific workflows, leading to increased efficiency and productivity. The practical implication for engineers building AI systems is that they can now create customized AI agents that can be integrated into existing systems and workflows.

NVIDIA Powers Over 400 of the World’s 500 Fastest Supercomputers
NVIDIA Blog· 4 min read· 4 days ago
NVIDIA Powers Over 400 of the World’s 500 Fastest Supercomputers

NVIDIA technologies power over 400 of the world's 500 fastest supercomputers, with 81% of the TOP500 and 90% of new systems on the list utilizing NVIDIA technology. The top eight systems on the Green500 run on NVIDIA GPUs, with the No. 1 system, KAIROS, using a single NVIDIA Grace Hopper Superchip to achieve 73.3 gigaflops per watt. NVIDIA's momentum in new deployments is driven by a preference for machines built for AI, simulation, and science, with NVIDIA systems delivering more than 2x the AI training and nearly 3x the AI inference throughput of every other platform combined. This trend has significant implications for engineers building AI systems, as accelerated computing becomes the foundation for systems tackling demanding workloads.

NVIDIA Brings Trusted, 24/7 AI Agents to Telecom Operations
NVIDIA Blog· 5 min read· 4 days ago
NVIDIA Brings Trusted, 24/7 AI Agents to Telecom Operations

NVIDIA is bringing trusted, 24/7 AI agents to telecom operations, enabling autonomous networks and operations where AI agents proactively watch for problems and coordinate changes across network, IT, and business systems. The company is demonstrating the building blocks of a secure, telecom autonomy platform, including synthetic data, telecom-domain models, secure agent runtimes, and simulations. This platform allows agents to understand operator intent, act safely across business and network domains, and keep humans in control of policy. The practical implication for engineers building AI systems is the ability to create more autonomous, resilient networks and power richer AI-driven services for consumers and businesses.

Optimize model training on Amazon SageMaker AI with NVIDIA Blackwell
AWS ML Blog· 13 min read· 2 days ago
Optimize model training on Amazon SageMaker AI with NVIDIA Blackwell

The introduction of NVIDIA Blackwell GPUs on Amazon SageMaker AI enables the optimization of model training for large AI models by reducing constraints such as batch sizes limited by GPU memory and sequence lengths cut short to avoid out-of-memory errors. With Blackwell's expanded memory and new precision formats, users can train models with larger batch sizes, longer sequence lengths, and reduced model sharding, resulting in improved throughput and reduced communication overhead. The use of PyTorch Fully Sharded Data Parallel (FSDP) and strategic application of activation checkpointing can further optimize training configurations. This leads to faster iteration cycles, less networking overhead, and lower infrastructure costs. By properly configuring Blackwell training jobs, users can process larger batch sizes without aggressive sharding and achieve better results for long-range depende

NAIRR Science Program Reshapes Scientific Research, Powered by NVIDIA AI Infrastructure
NVIDIA Blog· 4 min read· 5 days ago
NAIRR Science Program Reshapes Scientific Research, Powered by NVIDIA AI Infrastructure

The National Artificial Intelligence Research Resource (NAIRR) pilot program, powered by NVIDIA AI infrastructure, has successfully driven over 700 innovative research projects across the U.S. in the past two years, with applications in protein prediction and infectious disease outbreak management. This program leverages NVIDIA's AI capabilities to accelerate scientific research, demonstrating the potential of AI in driving breakthroughs in various fields. By harnessing the power of AI infrastructure, researchers can now focus on complex scientific problems, leading to more efficient and effective discovery processes. This initiative has the potential to revolutionize the way scientific research is conducted, enabling faster and more accurate results.

NVIDIA Vera CPU Opens the Way for Agentic Scientific AI at Los Alamos National Laboratory
NVIDIA Blog· 4 min read· 5 days ago
NVIDIA Vera CPU Opens the Way for Agentic Scientific AI at Los Alamos National Laboratory

NVIDIA has integrated its Vera CPU into Los Alamos National Laboratory's (LANL) new supercomputers, leveraging HPE's Cray Supercomputing GX5000 architecture to accelerate scientific discovery and unlock agentic AI for science. The Vera CPU is designed to provide a significant boost in performance and efficiency, enabling researchers to tackle complex scientific problems. This integration marks a significant step towards the development of autonomous, agentic AI systems that can drive scientific innovation. The resulting supercomputers will be capable of processing vast amounts of data and executing complex tasks, driving breakthroughs in fields such as physics, chemistry, and materials science.

From Materials Simulation to Experimental Astronomy, New NVIDIA AI Software Unlocks Scientific Discoveries
NVIDIA Blog· 5 min read· 5 days ago
From Materials Simulation to Experimental Astronomy, New NVIDIA AI Software Unlocks Scientific Discoveries

NVIDIA has introduced the DAQIRI library and ALCHEMI NIM microservices, accelerating AI for scientific discoveries in fields like chemistry, materials science, and astronomy. The new software leverages NVIDIA's cuPhoton reference code and can be used for tasks such as materials simulation and experimental astronomy. This technology has the potential to unlock groundbreaking discoveries, but its adoption may be limited by the complexity of integrating it with existing research pipelines. The DAQIRI library and ALCHEMI NIM microservices are designed to be highly scalable and can be easily integrated into large-scale scientific simulations.

Nvidia bets on agentic AI to turbocharge biotech discovery
SiliconANGLE AI· 3 days ago
Nvidia bets on agentic AI to turbocharge biotech discovery

Nvidia is betting on agentic AI to accelerate biotech discovery, as announced at the Bio International Convention in San Diego. The company's vice president and general manager of healthcare and life sciences, Kimberly Powell, made the case for agentic AI in a special address. Not mentioned are specific numbers, model names, or benchmark results. The practical implication for engineers building AI systems is the potential application of agentic AI in biotech discovery. Agentic AI may enable more efficient and effective discovery processes.

Nvidia and DDN target the economics of AI infrastructure
SiliconANGLE AI· 4 days ago
Nvidia and DDN target the economics of AI infrastructure

Nvidia and DDN have introduced a joint solution to address the economic challenges of AI infrastructure, leveraging their combined expertise in data and compute to optimize performance and reduce costs. Their partnership aims to enable enterprises to extract maximum value from their AI investments by streamlining data movement and processing. This joint solution is designed to handle massive amounts of data and scale with growing AI workloads, making it an attractive option for large-scale AI deployments. By combining Nvidia's high-performance GPUs with DDN's storage solutions, the partnership has achieved significant performance improvements and cost reductions, setting a new standard for AI infrastructure economics.

Hotter Than a Hot Tub: The 45°C Breakthrough to Cool AI’s Biggest Machines
NVIDIA Blog· 7 min read· 5 days ago
Hotter Than a Hot Tub: The 45°C Breakthrough to Cool AI’s Biggest Machines

NVIDIA's newest AI servers can run their cooling liquid at up to 45 degrees Celsius, making them more energy efficient and achieving 100% liquid cooling with no fans in the system. The Rubin generation of NVIDIA AI infrastructure is the first to achieve this, and it is outlined in the NVIDIA DSX AI factory reference design. This liquid cooling methodology enables data centers to reduce cooling energy consumption, making a significant difference in overall data center energy use. The practical implication for engineers building AI systems is that they can design more efficient and sustainable data centers using liquid-cooled infrastructure.

France Advances Europe’s AI Future With NVIDIA Technologies
NVIDIA Blog· 6 min read· Jun 18, 2026
France Advances Europe’s AI Future With NVIDIA Technologies

France has successfully deployed AI infrastructure, leveraging NVIDIA technologies to establish national compute capacity and enable the development of open frontier models and industrial platforms, with AI agents now running in production. This marks a significant milestone in advancing Europe's AI future. The deployment combines NVIDIA's AI expertise with France's strategic investment, fostering innovation and driving economic growth. This achievement serves as a model for other European countries to follow, demonstrating the potential of collaborative efforts between governments and tech giants.

Hands Free, AIs Forward: NVIDIA XR AI Brings Agents to AR Glasses
NVIDIA Blog· 4 min read· Jun 16, 2026
Hands Free, AIs Forward: NVIDIA XR AI Brings Agents to AR Glasses

NVIDIA XR AI is now available in public beta, providing a framework for developers to build multimodal AI agents for AR glasses and XR devices. This framework enables the creation of agents that can interact with users in a hands-free manner. Not mentioned are specific numbers, model names, or benchmark results. The practical implication for engineers building AI systems is the ability to create more interactive and immersive experiences for users. The public beta release of NVIDIA XR AI allows developers to start building and testing their own multimodal AI agents.

EXPLORE AI NEWS

Daily hand-picked stories on LLMs, RAG, agents and production AI — curated for engineers who ship.

BROWSE NEWS

GET THE WEEKLY DIGEST

Join engineers getting the Monday signal-over-noise AI breakdown. No spam, unsubscribe anytime.

LEARN AI ENGINEERING

Curated courses, research papers, repos and tutorials built for engineers leveling up in AI.

START LEARNING