Build low-latency Vision AI applications using our new open-source Vision AI SDK. ⭐️ on GitHub ->

Engineering: AI

Open Vision Agents by Stream: Open Source SDK for Building Low-Latency Vision AI Apps

Vision Agents is a new, open-source framework from Stream that helps developers quickly build low-latency vision AI applications. The project is completely open-source and ships with over ten out-of-the-box integrations, including day one support for leading real-time voice and video models like OpenAI Realtime and Gemini Live. Text-to-speech, speech-to-text, and speech-to-speech models are also natively
Read more ->
4 min read

Top 5 Real-Time Speech-to-Speech APIs and Libraries To Build Voice Agents

There are two ways to build conversational voice agents for enterprise and production use cases. Developers can use a real-time API or speech-to-speech (STS), that takes audio input from a user and sends it to a large language model (LLM) to return a voice output. Or they can use a turn-based architecture, which consists of
Read more ->
13 min read

AI Chatbots for eCommerce: Best Platforms, Examples & Use Cases

Throughout the eCommerce customer journey, from product discovery to post-purchase support, there are many opportunities for revenue loss. While product managers might be aware of where these leaks occur and that chatbots offer potential solutions, choosing the right provider requires a thorough analysis to find the best fit. The best eCommerce AI chatbot platforms balance
Read more ->
8 min read

The 8 Best Platforms To Build Voice AI Agents

Voice assistants like Siri and Alexa are great for non-trivial everyday personal assistive tasks. However, they are limited in providing accurate answers to complex questions, real-time information, handling turns, and user interruptions. Get started! Activate your free Stream account today and start prototyping your own voice AI agent! Try asking Siri about the best things
Read more ->
13 min read

Build a SaaS AI Agent Platform

Imagine a language tutor, a startup coach, or a sales assistant, all available on-demand through a video call.  In this tutorial series from Code with Antonio, we'll guide you through building a video calling application powered entirely by AI agents.  Beyond generating summaries or transcripts after a call, every meeting in this app is a
Read more ->
7 min read

Compare the Top 5 Agentic CLI Coding Tools

Agentic AI coding tools vary in how they help you write, debug, and ship code. Some, like Lovable and Bolt, allow developers to build web and mobile apps using prompts quickly. Others, like Cursor and Windsurf, provide developers with a fully AI-featured IDE to solve engineering problems. Generally, AI coding platforms can be categorized into
Read more ->
12 min read

Build an AI Meeting Assistant with Stream Video and LLMs

In today's business environment, virtual meetings consume a significant portion of professionals' workdays. Yet, much of their value evaporates once participants leave the virtual room. Critical insights, decisions, and action items often get lost amid daily tasks, leading to misalignment, missed deadlines, and the dreaded "Wait, what did we decide?" follow-up communications. AI meeting assistants
Read more ->
18 min read

Building a Personalized AI Chatbot with Stream’s UI Components

Wouldn't it be nice to talk to your own AI chatbot all day? With it sitting at the ready to listen to your slightest whim, providing instant answers, suggestions, and companionship whenever you need it. Of course, this already exists as a billion-dollar industry. Companies such as Character.ai provide personalized chat apps that can respond
Read more ->
9 min read