speech

Asked on 06/13/2025

1 search

Apple's WWDC 2025 introduced the new Speech Analyzer API, which enhances speech-to-text capabilities across Apple platforms. This API, part of the Speech Framework, allows developers to perform speech-to-text processing with minimal code, entirely on-device. It supports a wide range of use cases, including long-form and distant audio like lectures and meetings. The Speech Analyzer API is designed to work asynchronously, allowing applications to process audio input and display results independently.

For more detailed information, you can refer to the session titled "Bring advanced speech-to-text to your app with SpeechAnalyzer," which covers the API's design, usage, and integration into apps. The session also includes a live coding demo to help developers get started with building speech-to-text features.

If you're interested in exploring this further, you can check out the session chapters: