Tutorials

How to Build a Slack Clone with React Native

In this three-part tutorial series, you'll build a fully functional Slack clone for iOS and Android using React Native and Stream Chat. By the end, you'll have a working app that closely mirrors Slack's UI and functionality, complete with dark mode support, emoji reactions, GIF previews, and a message action sheet. Note: This tutorial is
2 min read

Slack Clone Part 1: Project Setup & Messaging UI

In this tutorial, you will build a clone of Slack, a workplace messaging platform, in React Native. The tutorial series consists of three parts and includes several features. This first part covers the following UI/UX features: Channel List: Similar to the contacts list in popular messaging apps like WhatsApp. Input Box: For composing messages. Message Row: For displaying incoming
11 min read

Slack Clone Part 3: Threads & Search

Welcome to the final part of the Slack clone tutorial series. In part 2, we covered how to build Slack-like navigation, channel lists, channel screens, a reaction picker, and an action sheet. In part 3, we will build the various search screens and the message thread screen. Below are a few resources to assist along the
9 min read

Slack Clone Part 2: Navigation & Message Actions

In part 1 of this tutorial series, you learned how to set up your environment, build a channel list, define content and navigation, and build the message composer and message UI, alongside other UI/UX features. In this part, you will focus on adding icons and symbols, the channel list header, the compose button, and the message action sheet.
7 min read

Build a Local AI Agent with Qwen 3.5 Small on macOS

Qwen 3.5 Small is a new family of lightweight, high-performance models from Alibaba (0.8B, 2B, 4B, and 9B parameters) that are now available on Ollama. These models support multimodal input, native tool calling, and strong reasoning, all while running efficiently on laptops, Macs, and even mobile/IoT devices. In this demo, the agent runs completely locally
3 min read

Using Opus 4.6: Vibe Code a Custom Python Plugin for Vision Agents

Vision Agents has out-of-the-box support for the LLM services and providers developers need to build voice, vision, and video AI applications. The framework also makes it easy to integrate custom AI services — either by following a step-by-step guide or by vibe coding them using SoTA models. Let’s use Claude Opus 4.6 to create a
9 min read

Build an AI Travel Advisor That Speaks with Gemini 3.1 Pro

Most LLMs are great at thinking, but making them speak naturally is a different challenge. Gemini 3.1 Pro changes that. This new model from Google brings significantly improved reasoning, longer context, and better tool-use capabilities, making it one of the best choices (at the time of writing) for building conversational voice agents. In this guide,
2 min read

Add Text-to-Speech to Apps with Cartesia Sonic 3 & Vision Agents

Realistic text-to-speech has long been one of the hardest parts of building voice agents. Most models either sounded robotic, introduced noticeable latency, or required complex integration that slowed down prototyping. Cartesia Sonic 3 changes that equation. Released in late 2025, it combines sub-200 ms first-chunk latency, strong emotional expressiveness, multilingual support, and the ability to clone voices from
2 min read