How to use speechtranscriber
Asked on 2025-06-11
1 search
To use the Speech Transcriber, you can follow these steps as outlined in the session "Bring advanced speech-to-text to your app with SpeechAnalyzer" from WWDC 2025:
-
Configure the Speech Transcriber: Initialize the Speech Transcriber with a locale object and the necessary options. The locale's language code should correspond to the language you want to transcribe. You can also set options like audio time range to sync text playback with audio.
-
Ensure the Model is Present: Check if the Speech Transcriber supports the language you want to transcribe. If the language is supported but not downloaded, you can request the model via the Asset Inventory API. The transcription is entirely on-device, but models need to be fetched.
-
Handle the Results: The Speech Transcriber returns results via an async stream. You can track volatile and finalized results. Volatile results are real-time guesses, while finalized results are the best guesses. Use the async stream to process these results and update your application accordingly.
For more detailed guidance, you can refer to the session chapter on Build a speech-to-text feature (09:06) in the "Bring advanced speech-to-text to your app with SpeechAnalyzer" session.

Bring advanced speech-to-text to your app with SpeechAnalyzer
Discover the new SpeechAnalyzer API for speech to text. We’ll learn about the Swift API and its capabilities, which power features in Notes, Voice Memos, Journal, and more. We’ll dive into details about how speech to text works and how SpeechAnalyzer and SpeechTranscriber can enable you to create exciting, performant features. And you’ll learn how to incorporate SpeechAnalyzer and live transcription into your app with a code-along.

What’s new in visionOS 26
Explore exciting new features in visionOS 26. Discover enhanced volumetric APIs and learn how you can combine the power of SwiftUI, RealityKit and ARKit. Find out how you can build more engaging apps and games using faster hand tracking and input from spatial accessories. Get a sneak peek at updates to SharePlay, Compositor Services, immersive media, spatial web, Enterprise APIs, and much more.

Optimize for the spatial web
Discover how to make the most of visionOS capabilities on the web. Explore recent updates like improvements to selection highlighting, and the ability to present spatial photos and panorama images in fullscreen. Learn to take advantage of existing web standards for dictation and text-to-speech with WebSpeech, spatial soundscapes with WebAudio, and immersive experiences with WebXR.