AINewsHubENGINEERING · DAILY
TRENDING
HomeCompute

Compute

AI infrastructure and compute: GPU availability, cloud pricing, hardware releases, and how compute constraints shape model architecture decisions.

8 articles

Perceptron Mk1 shocks with highly performant video analysis AI model 80-90% cheaper than Anthropic, OpenAI & Google
VentureBeat AI· 8 min read· Today
Perceptron Mk1 shocks with highly performant video analysis AI model 80-90% cheaper than Anthropic, OpenAI & Google

Perceptron Mk1, a highly performant video analysis AI model, achieves 80-90% cost savings compared to Anthropic, OpenAI, and Google's offerings, while delivering robust video understanding capabilities. This model can process live feeds with high accuracy, making it suitable for security, surveillance, and content moderation applications. The practical implication for engineers building AI systems is the potential to deploy high-quality video analysis capabilities at a significantly lower cost. Perceptron Mk1's efficiency and cost-effectiveness make it an attractive solution for enterprises and organizations seeking to leverage AI-driven video analysis.

Thinking Machines shows off preview of near-realtime AI voice and video conversation with new 'interaction models'
VentureBeat AI· 7 min read· Yesterday
Thinking Machines shows off preview of near-realtime AI voice and video conversation with new 'interaction models'

Is AI leaving the era of "turn-based" chat? Right now, all of us who use AI models regularly for work or in our personal lives know that the basic interaction mode across text, imagery, audio, and video remains the same: the human user provides an input, waits anywhere between milliseconds...

AI agents are running hospital records and factory inspections. Enterprise IAM was never built for them.
VentureBeat AI· 10 min read· 2 days ago
AI agents are running hospital records and factory inspections. Enterprise IAM was never built for them.

A doctor in a hospital exam room watches as a medical transcription agent updates electronic health records, prompts prescription options, and surfaces patient history in real time. A computer vision agent on a manufacturing line is running quality control at speeds no human inspector can match. Bot...

Secure short-term GPU capacity for ML workloads with EC2 Capacity Blocks for ML and SageMaker training plans
AWS ML Blog· 1 min read· 6 days ago
Secure short-term GPU capacity for ML workloads with EC2 Capacity Blocks for ML and SageMaker training plans

In this post, you will learn how to secure reserved GPU capacity for short-term workloads using Amazon Elastic Compute Cloud (Amazon EC2) Capacity Blocks for ML and Amazon SageMaker training plans. These solutions can address GPU availability challenges when you need short-term capacity for load tes...

Secure AI agents with Amazon Bedrock AgentCore Identity on Amazon ECS
AWS ML Blog· 1 min read· May 5, 2026
Secure AI agents with Amazon Bedrock AgentCore Identity on Amazon ECS

AI agents in production require secure access to external services. Amazon Bedrock AgentCore Identity, available as a standalone service, secures how your AI agents access external services whether they run on compute platforms like Amazon ECS, Amazon EKS, AWS Lambda, or on-premises. This post imple...

NetApp and Nutanix say storage has become the last line of defense in the AI era
SiliconANGLE AI· 1 min read· Apr 9, 2026
NetApp and Nutanix say storage has become the last line of defense in the AI era

Companies are rethinking their technology foundations as AI infrastructure modernization and security demands grow. The result is surging demand for flexible platforms that can run legacy and modern applications simultaneously while keeping data secure and AI-ready. NetApp Inc. and Nutanix Inc. are ...

Blaize launches AI Services platform to move enterprise AI from pilot to production
SiliconANGLE AI· 1 min read· Apr 9, 2026
Blaize launches AI Services platform to move enterprise AI from pilot to production

Artificial intelligence computing company Blaize Holdings Inc. today announced the launch of Blaize AI Services, a new platform designed to help AI infrastructure providers and enterprises deploy production-ready, application-level AI services without building the underlying AI stack from scratch. M...

New technique makes AI models leaner and faster while they’re still learning
MIT News AI· 1 min read· Apr 9, 2026
New technique makes AI models leaner and faster while they’re still learning

Researchers use control theory to shed unnecessary complexity from AI models during training, cutting compute costs without sacrificing performance....