AWS ML Blog

How Loka Built a Natural, Low-Latency Voice Agent with Amazon Nova 2 Sonic

June 24, 2026•11 min read•

Level:Intermediate

For:AI Engineers

✦TL;DR

Loka built a conversational AI agent with Amazon Nova 2 Sonic, achieving high speech reasoning accuracy and low latency, outperforming traditional voice AI pipelines. The native speech-to-speech model processed audio end-to-end, capturing tone, emotion, and subtle cues, and scored 87.0 on the Big Bench Audio benchmark. This approach solved the common frustration of robotic, slow voice assistants, delivering natural and responsive experiences. The practical implication for engineers building AI systems is that native speech-to-speech models can provide a better solution for voice AI adoption, with lower costs and faster response times.

⚡ Key Takeaways

Amazon Nova 2 Sonic achieved a speech reasoning score of 87.0 on the Big Bench Audio benchmark.
Native speech-to-speech models can process audio end-to-end, capturing tone, emotion, and subtle cues.
Traditional voice assistants introduce compounding delays at every step, resulting in a 3 to 5 second pause before responding.
The combination of poor experience and high cost has limited voice AI adoption.
Amazon Nova 2 Sonic outperformed Gemini 2.5 Flash Native Audio and GPT Realtime on the Big Bench Audio benchmark.

💡 Why It Matters

The use of native speech-to-speech models like Amazon Nova 2 Sonic can significantly improve the customer experience and reduce costs for businesses, making voice AI adoption more feasible. This approach can help engineers building AI systems to deliver more natural and responsive voice interactions.

✅ Practical Steps

Consider using native speech-to-speech models like Amazon Nova 2 Sonic for voice AI applications.
Evaluate the performance of native speech-to-speech models using benchmarks like Big Bench Audio.
Design voice AI systems that can process audio end-to-end, capturing tone, emotion, and subtle cues.

Want the full story? Read the original article.

Read on AWS ML Blog ↗

How Loka Built a Natural, Low-Latency Voice Agent with Amazon Nova 2 Sonic

⚡ Key Takeaways

✅ Practical Steps

More like this

The fuel of the future is already here: Why TRISO matters

Huntington Bank: Redacting sensitive data from 400M+ documents with AWS

Why I Stopped Using One Agent and Built a Multi-Agent Pipeline Instead

Anthropic debuts Claude Tag, a more capable AI teammate that lives within Slack