Product: Video

Shipping WebRTC Video From a $10 Microcontroller: Challenges Building the Stream Video ESP32 SDK

We recently open-sourced the Stream Video ESP32 SDK, which lets an ESP32-S3 or ESP32-P4 join a Stream Video call, capture camera and microphone input, encode H.264 + Opus in real time, and publish it over WebRTC. Someone on a browser or mobile device can then see and hear the ESP32 live. If you're
15 min read

Live Commerce Explained: Features, Benefits, and Examples

TL;DR: Live commerce is a trillion-dollar market, led by China's $680B in 2023 sales but growing fast in the U.S. with a 36% jump projected by 2026. The core tech stack matters more than the platform: low-latency video, real-time chat, and moderation are what separate a polished experience from a broken one. Buyers who watch
22 min read

Scaling WebRTC Video to 100,000 Participants: Stream’s Latest Video Benchmarks

Stream has reached a major milestone in real-time video infrastructure: successfully scaling a single WebRTC-based livestream to 100,000 concurrent participants while maintaining ultra-low latency, stable frame rates, and zero packet loss. Today, Stream powers real-time chat, activity feeds, moderation, audio, and video for applications serving over one billion end users worldwide, backed by a 99.999%
2 min read

How Low-Latency Video Streaming Works

These days, low-latency video streaming is so deeply embedded in current culture that, for those accustomed to TikTok, Twitch, YouTube, or even straightforward video chats, the idea that video could be anything other than millisecond-perfect seems ridiculous. Of course, all those platforms, and really the entire concept of video streaming and video chat, are only
10 min read

Open Vision Agents by Stream: Open Source SDK for Building Low-Latency Vision AI Apps

Vision Agents is a new, open-source framework from Stream that helps developers quickly build low-latency vision AI applications. The project is completely open-source and ships with over ten out-of-the-box integrations, including day-one support for leading real-time voice and video models like OpenAI Realtime and Gemini Live. Text-to-speech, speech-to-text, and speech-to-speech models are also natively
4 min read

In-App Voice and Video Calling: Complete Implementation Guide for Product Managers

We've all been in the situation where we're typing out a message in an app and think, "actually, I just need to call this person." In-app voice and video calling is how you add this functionality to your product. With in-app video, your users can talk to each other without jumping to a different app.
13 min read

HLS, MPEG-DASH, RTMP, and WebRTC: Which Protocol is Right for Your App?

Picture this: Netflix serves over 15 billion hours of content monthly, Twitch handles an average of 2.55 million concurrent viewers (with peaks above 3.8 million during major events), and Zoom facilitates billions of meeting minutes—all relying on fundamentally different streaming technologies under the hood. Whether you’re binge-watching Netflix or tuning into a live broadcast, live
21 min read

Stream Releases VP9 SVC Codec for Enhanced Video Call Efficiency

At Stream, we’re dedicated to making video calls smooth and efficient for everyone, especially those using older devices or lower-powered CPUs. To this end, we’re excited to announce that the VP9 SVC codec is now available for Stream Video users on the JavaScript, React, and React Native SDKs. This addition delivers significant bandwidth and performance optimizations,
2 min read