Tutorials
Using AI Agent Skills: Build an iOS Chat Messaging App With a Single Prompt
As developers, we typically spend time reading docs and tutorials, and watching YouTube videos to integrate APIs and SDKs to add specific functionality to apps and services. These integrations can now be completed much more quickly using AI Agent Skills. Agent Skills are sets of instructions, scripts, and reference documents that equip AI models to
12 min read
Gemini Live API & Lyria 3: Generate Music From Text, Phone & Video Calls
The instrumental background music in the video below is AI-generated using Lyria 3 by Google DeepMind. Lyria 3 allows anyone to generate AI music from text and image prompts. The music demos in this article take it further by adding another input modality: your voice. Let’s proceed to generate your first music with Lyria
15 min read
How to Clone Any Voice in Minutes Using Voxtral TTS
What You Will Build: This tutorial demonstrates how to build an AI speech app with in-app voice cloning support. You can clone your favorite voice by supplying a reference audio clip of about 3 seconds. Here is a demo. Voice cloning example demonstrating reference and agent's output voices
9 min read
How To Design AI Voices in Minutes Using Qwen3-TTS
Before You Start: To begin, ensure that you meet these requirements and have the following credentials: Python 3.13 or later; an Apple Silicon Mac (recommended) or any modern laptop; Stream API credentials (for real-time audio and video communication); a Hugging Face account and access token (HF_TOKEN); a Deepgram API key (for speech-to-text); a Google
11 min read
HIPAA-Compliant Chat: How to Build Secure Messaging for Telemedicine Apps
TL;DR: Any chat message pairing a patient identifier with health information is PHI, making your entire messaging infrastructure a HIPAA compliance surface. HIPAA's three rules translate into concrete engineering requirements: role-based access, encryption, tamper-evident audit logs, and PHI-safe push notifications. Building compliant chat from scratch needs 2-3 dedicated engineers; a HIPAA-eligible API like Stream compresses
16 min read
The 6 Best On-Device TTS Models for Voice AI
When building voice AI applications, you have industry-leading cloud options for text-to-speech, such as Cartesia Sonic 3 and Grok TTS. For privacy and to avoid sharing your business’s data with these commercial text-to-speech (TTS) providers, your team may want to use free, open-source solutions that run locally on mobile and desktop devices. Continue reading to
21 min read
Build a Restaurant Reservation AI Agent With Turbopuffer and Twilio
Let’s build a restaurant reservation system that lets callers speak with a voice agent over a real-time phone call. The service will have three main features: Agent Outbound Call: The agent can act as both a customer helper and a restaurant assistant. For example, it can be configured as an AI restaurant employee that calls customers back
12 min read
Grok TTS + Vision: Build a Healthcare Appointment Agent
This step-by-step guide will help you build an AI front-desk receptionist that interacts with patients through conversations, assesses their conditions, and advises whether to visit a doctor or seek online medical advice. When an agent can see the patient’s condition in real time, it can make a smarter recommendation, saving patients an unnecessary trip to
12 min read
