Stream Blog
Vision Agents v0.5.0 Release: Local Hardware I/O, Anam Avatars, and Faster Deepgram TTS
Stream’s AI Moderation Roadmap: What We’re Building Next
How to Build an App Like TikTok Shop (+ Turn Livestreams into Revenue)
The 8 Best Platforms To Build Voice AI Agents
Community Sift Moderation Alternatives – Top 6 Competitors Compared
Community Sift has been one of the most purpose-built content moderation platforms for gaming and online communities. If you’re evaluating whether it’s still the right fit, or your trust and safety team is looking at what else is out there, this guide gives you an honest comparison of the strongest moderation alternatives available today. We’ll
Popup Frees Creators from the Algorithm with Stream’s Livestreaming Infrastructure
Popup was founded in early 2025 with a simple but powerful premise: give creators a branded virtual space to connect with and monetize their audiences directly—no algorithm standing between them and their communities. The idea emerged from a clear shift happening across the creator economy. For years, creators have depended on brand sponsorships and social
How To Design AI Voices in Minutes Using Qwen3-TTS
Before You Start To begin, ensure that you meet these requirements and have the following credentials. Python 3.13 or a later version. An Apple Silicon Mac (recommended) or any modern laptop. Stream API credentials (for realtime audio and video communication). A HuggingFace Account and access token (HF_TOKEN). A Deepgram API key (for speech-to-text). A Google
Shipping WebRTC Video From a $10 Microcontroller: Challenges Building the Stream Video ESP32 SDK
We recently open-sourced the Stream Video ESP32 SDK — an SDK that lets an ESP32-S3 or ESP32-P4 join a Stream Video call, capture camera and microphone input, encode H.264 + Opus in real-time, and publish it over WebRTC. Someone on a browser or mobile device can then see and hear the ESP32 live. If you’re
Where LLM Training Data Comes From (And Why It Matters)
Everyone talks about models. New architectures, larger parameter counts, faster inference—those tend to dominate the conversation. But if you’re actually building AI systems (or evaluating vendors), you quickly realize something else matters more: The data. Not just how much of it you have, but where it comes from, how it’s processed, and how it evolves
HIPAA-Compliant Chat: How to Build Secure Messaging for Telemedicine Apps
TL;DR: Any chat message pairing a patient identifier with health information is PHI, making your entire messaging infrastructure a HIPAA compliance surface. HIPAA’s three rules translate into concrete engineering requirements: role-based access, encryption, tamper-evident audit logs, and PHI-safe push notifications. Building compliant chat from scratch needs 2-3 dedicated engineers; a HIPAA-eligible API like Stream compresses
The 6 Best On-Device TTS Models for Voice AI
When building voice AI applications, you have industry-leading cloud options for text-to-speech, such as Cartesia Sonic 3 and Grok TTS. For privacy and to avoid sharing your business’s data with these commercial text-to-speech (TTS) providers, your team may want to use free, open-source solutions that run locally on mobile and desktop devices. Continue reading to
Vision Agents v0.5.0 Release: Local Hardware I/O, Anam Avatars, and Faster Deepgram TTS
It’s been a busy period since our last release, and now it’s time to share Vision Agents v0.5.0 — a step toward making production-grade multimodal AI agents easy to build and deploy. While previous versions laid the groundwork for real-time voice, video, and Vision Agents, v0.5.0 focuses on stability at scale and even more expressive
