Stream Blog
Open Vision Agents by Stream: Open Source SDK for Building Low-Latency Vision AI Apps
The 8 Best Platforms To Build Voice AI Agents
The 6 Best LLM Tools To Run Models Locally
Using Stream to Build a Livestream Chat App in Next.js
Peerspace Scales Messaging Safely With Stream Chat & AI Moderation
Peerspace is the leading marketplace for booking unique spaces for meetings, productions, and events. The platform connects guests with hosts through real-time, in-app messaging, enabling seamless coordination, faster bookings, and stronger trust on both sides of the marketplace. For Peerspace, keeping conversations inside the platform is a strategic priority. In-app messaging reduces reliance on third-party
Build a Voice AI App in Python: Grok-4 + Fish Audio + Deepgram
xAI’s Grok-4 delivers strong reasoning with a 256k context window, native tool use, and multimodal support. We love it for natural, low-latency voice conversations. Pair it with Fish Audio’s high-quality, expressive TTS (known for realistic prosody, emotion control, and voice cloning via short references) and Deepgram’s fast, accurate STT, and you get a custom voice
The 2026 Python Libraries for Real-Time Multimodal Agents
Every vision-language model tutorial shows you the same thing: send an image to GPT-4o, get a description back. Ten lines of Python. Done.

    response = client.chat.completions.create(
        model="gpt-4o",
        messages=[{
            "role": "user",
            "content": [
                {"type": "image_url", "image_url": {"url": f"data:image/jpeg;base64,{img_b64}"}},
                {"type": "text", "text": "What's in this image?"}
            ]
        }]
    )

Real applications need something different. A security camera
Seeing with GPT‑4o: Building with OpenAI’s Vision Capabilities
Over the last few years, developers have gone from using language models for text-only chat to relying on them as general-purpose perception systems. You’re not only building chatbots; you’re building apps that use text, audio, and vision to understand and act on the world around them. GPT-4o is the most capable step yet: a single
Lessons from Redesigning a Multi-Product Developer Dashboard
B2B dashboards tend to evolve quietly. New features get added. New data appears. Navigation grows more complex over time. Eventually, what started as a focused interface becomes a dense surface area that’s difficult to extend, harder to learn, and increasingly fragile to change. At Stream, we recently rebuilt our dashboard from the ground up to
Clone MedTalk: HIPAA-Ready Video and Chat Consultations in Flutter
Telehealth is transforming the way patients and providers connect, offering faster access to care and reducing barriers caused by distance or scheduling. A critical part of this experience is enabling secure, real-time video consultations alongside features like chat messaging for sharing updates, questions, and follow-ups. With Stream’s healthcare chat solution, developers can build HIPAA-ready communication
Build vs. Buy In-App Chat: The Ultimate Decision Guide
Adding in-app chat is one of the most common (and underestimated) product decisions teams face. Today, AI has made it easier than ever to prototype messaging features quickly. A small team can scaffold a working chat experience in days, not months. But shipping a demo is not the same as running a production chat system.
From Cameras to Action: Real‑World Applications of Vision and Speech AI
You’re working in a warehouse when you see an automated forklift barreling towards a coworker. You whip out your phone and type "STOP!" into the app controlling the vehicle. You add another exclamation point to make sure it knows it’s an emergency. That’s not good enough, and it’s not how things have to be. AI
