Can I do voice-to-text transformation?
Asked on 06/19/2025
Yes. Apple's SpeechAnalyzer API, introduced in iOS 26, lets you add speech-to-text to your app. It supports a wide range of use cases, including long-form and distant audio such as lectures and meetings. It runs entirely on-device, so audio does not have to leave the device.
For more details, see the WWDC 2025 session "Bring advanced speech-to-text to your app with SpeechAnalyzer". It gives an overview of the SpeechAnalyzer API and demonstrates how to build a speech-to-text feature. A good place to start is the session's "SpeechAnalyzer API" chapter.

Bring advanced speech-to-text to your app with SpeechAnalyzer
Discover the new SpeechAnalyzer API for speech to text. We’ll learn about the Swift API and its capabilities, which power features in Notes, Voice Memos, Journal, and more. We’ll dive into details about how speech to text works and how SpeechAnalyzer and SpeechTranscriber can enable you to create exciting, performant features. And you’ll learn how to incorporate SpeechAnalyzer and live transcription into your app with a code-along.

Discover machine learning & AI frameworks on Apple platforms
Tour the latest updates to machine learning and AI frameworks available on Apple platforms. Whether you are an app developer ready to tap into Apple Intelligence, an ML engineer optimizing models for on-device deployment, or an AI enthusiast exploring the frontier of what is possible, we’ll offer guidance to help select the right tools for your needs.

What’s new in visionOS 26
Explore exciting new features in visionOS 26. Discover enhanced volumetric APIs and learn how you can combine the power of SwiftUI, RealityKit and ARKit. Find out how you can build more engaging apps and games using faster hand tracking and input from spatial accessories. Get a sneak peek at updates to SharePlay, Compositor Services, immersive media, spatial web, Enterprise APIs, and much more.