Stream Blog

Open Vision Agents by Stream: Open Source SDK for Building Low-Latency Vision AI Apps

Vision Agents is a new open-source framework from Stream that helps developers quickly build low-latency vision AI applications. The project is completely open source and ships with over ten out-of-the-box integrations, including day-one support for leading real-time voice and video models like OpenAI Realtime and Gemini Live. Text-to-speech, speech-to-text, and speech-to-speech models are also natively
Read more ->
4 min read

The 8 Best Platforms To Build Voice AI Agents

Voice assistants like Siri and Alexa are great for non-trivial everyday personal assistive tasks. However, they are limited in providing accurate answers to complex questions, real-time information, handling turns, and user interruptions. Try asking Siri about the best things
Read more ->
13 min read

The 6 Best LLM Tools To Run Models Locally

Running large language models (LLMs) like DeepSeek Chat, ChatGPT, and Claude usually involves sending data to servers managed by DeepSeek, OpenAI, and other AI model providers. While these services are secure, some businesses prefer to keep their data offline for greater privacy.
Read more ->
12 min read

Using Stream to Build a Livestream Chat App in Next.js

I always wondered how to create the dynamic chat experience of livestreams, like those found on YouTube, but with the added convenience of allowing anyone to participate without logging in. With Next.js and Stream, I was able to successfully create that experience.
Read more ->
8 min read

How Low-Latency Video Streaming Works

These days, low-latency video streaming is so deeply embedded in current culture that, for those accustomed to TikTok, Twitch, YouTube, or even straightforward video chats, the idea that video could be anything other than millisecond-perfect seems ridiculous. Of course, all those platforms, and really the entire concept of video streaming and video chat, are only

Read more ->
10 min read

How We Tested Our Video SDK with TestDevLab

Building a Video SDK is an interesting engineering challenge, a geek’s dream of lower-level concurrency primitives, synchronization mechanisms, and intelligent throttling, all working in tight loops. Additionally, you’re negotiating codecs (H.264, VP8/9, AV1), managing lock-free queues to maintain frame flow, and performing bandwidth estimation to ensure video remains smooth and audio stays in sync.

Read more ->
8 min read

Build Voice Agents With MCP: The Top 4 Frameworks and APIs

Voice AI technologies have recently become central to communication between customers, small businesses, and enterprises. To extend the capabilities of these systems, the Model Context Protocol (MCP) becomes a must-have. Utilizing MCP can enhance the capabilities of voice systems to ensure they provide users with satisfactory responses. Continue reading to discover the APIs, open-source frameworks,

Read more ->
12 min read

Shipping Real-Time Therapy Conversations with Stream

A senior Android engineer at Argentina’s second-largest fintech company, Juan Andrade spent his days shipping features used by millions. But at night, he opened Xcode, taught himself SwiftUI from scratch, and coded until the early hours. His goal was to create a mental health companion that went beyond surface-level validation to provide evidence-based support. This

Read more ->
4 min read

How to Build Automated Moderation From Basic Rules to LLMs

"Discord’s AutoMod feature is amazing! How did we get by before that was a thing…" -Reddit user croc122 If you’ve ever set up a community online, even a small one, you’ll have seen the bad side of people. "croc122" knows this. It’s not just the flame wars and toxic comments that cause problems; it’s also

Read more ->
12 min read

Build a Google Docs-Style Collaborative Editor

Real-time collaboration has become the standard for modern productivity tools, transforming how we work and interact. In this tutorial, you’ll learn how to build a Google Docs–style collaborative editor that lets multiple users create, edit, and share documents in real time. We’ll use QuillJS to integrate a rich WYSIWYG editor, Next.js for full-stack development, and

Read more ->
18 min read

How to Build a Group Chat with Moderation Using React Native

Building a safe and respectful group chat experience is more important than ever, especially in online gaming communities where conversations can scale quickly. A dedicated group moderator can make this easier by enforcing community standards, reducing toxic behavior, and keeping discussions welcoming for everyone. With Stream’s AI Moderation built into the Chat API, moderation becomes

Read more ->
14 min read

Walkie Talkie Scales Community Safety with Stream AI Moderation

For any community app, safety is non-negotiable. Toxic interactions can derail conversations, drive away users, and damage retention. But for smaller, fast-growing companies, building the kind of human moderation teams that Facebook or TikTok employ simply isn’t feasible. Walkie Talkie, a global social audio app for Gen Z, faced

Read more ->
3 min read