Home›LLM

LLM

Large Language Models (LLMs) are the foundation of modern AI applications. Coverage includes model releases, fine-tuning techniques, inference optimization, and production deployment patterns.

41 articles

VentureBeat AI· 4 min read· Today

Prompt injection is exploiting enterprise AI's biggest design flaws by targeting agents, RAG pipelines and model routers

The increasing adoption of large language models (LLMs) in enterprises has led to a rise in prompt injection attacks, which exploit the disconnect between assumptions about LLMs and their actual characteristics. According to the OWASP LLM Top 10 (2025), prompt injection is the most critical category of LLM-specific vulnerabilities, and CrowdStrike's 2026 Global Threat Report documented over 90 organizations affected by prompt injection attacks in 2025. These attacks have evolved to target multi-agent architecture, retrieval-augmented generation (RAG) pipelines, model routers, and long-term memory capabilities, making it essential for engineers to address this threat when deploying AI systems at scale. The practical implication for engineers is to develop strategies to mitigate prompt injection attacks and ensure the secure deployment of LLMs.

Prompt injection is exploiting enterprise AI's biggest design flaws by targeting agents, RAG pipelines and model routers

Using Local Coding Agents

LLMs help robots understand vague instructions and focus on key details

Healthcare Benchmarks Are Only as Good as Their Assumptions

Better Experiments with LLM Evals — A funnel, not a fork

Claude Code turned every engineer into three. Now companies need more product thinkers

I Pitted XGBoost Against Logistic Regression on 358 Matches. The Boring Model Won.

LLM Research Papers: The 2026 List (January to May)

How Cara pioneers domain-specific AI for enterprise insurance brokerages with AWS

Exclusive: LucidLink launches MCP server to give AI agents shared access to distributed files

Clustering Unstructured Text with LLM Embeddings and HDBSCAN

How Businesses Are Building Specialized AI They Can Trust

New agentic memory framework uses 118K tokens per query. LangMem burns through 3.26M.

Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention

Real-world grounding in agentic AI

How to Build a Powerful LLM Knowledge Base

Building an End-to-End Sentiment Analysis Pipeline with Scikit-LLM

Bridging intent and execution in agentic systems

OpenAI unveils GPT-5.6 Sol, Terra and Luna models — but only accessible to limited preview partners for now, per US Gov

From Local LLM to Tool-Using Agent

In game theory, generalists sometimes win out over specialists

Liquid AI's smallest model yet LFM2.5-230M beats models 4X its size at data extraction, can run 'anywhere'

Could AI tell you where you left your keys?

Multi-Label Text Classification with Scikit-LLM

Diverse reasoning traces teach LLMs to make better decisions

Grammarly parent Superhuman buys AI detector GPTZero

Making LLMs faster without sacrificing accuracy

OpenAI, Broadcom debut custom Jalapeño chip for AI inference

The Hot Path Belongs to GBDTs, Agents Own the Cold Path: A Payment-Fraud Benchmark

Beyond the Straight Line: Choosing Between OLS, Interaction Terms, and Tweedie Regression

3 Agents. 3 LLMs. 1 Aging GPU: Engineering Parallel Inference on Bare Metal

An LLM as arbiter in RAG retrieval: picking the right candidate with reasons

How Loka Built a Natural, Low-Latency Voice Agent with Amazon Nova 2 Sonic

Anthropic debuts Claude Tag, a more capable AI teammate that lives within Slack

Build a protein research copilot with Amazon Bedrock AgentCore

Momentic raises the bar for software testing with agentic quality platform

Embed the world: Multimodal AI for searchable aerial imagery at scale

Running ComfyUI workflows on Amazon SageMaker AI processing jobs

Monitor and debug generative AI inference with SageMaker detailed metrics and Insights dashboard on CloudWatch

At Cannes Lions, NVIDIA Partners Reshape Advertising and Marketing With AI

The consequences of relying on AI for accurate news