speech framework 中有怎么区分对话的 api 吗

Asked on 2025-06-16

3 searches

Tailscan

This post is brought to you by Tailscan,
the best devtool for Tailwind CSS.

Want to sponsor? We'd love to chat →

在WWDC 2025中，Apple介绍了一个新的API，名为Speech Analyzer，它是Speech Framework的一部分。这个API在iOS 26中引入，支持更多的用例，并且比之前的SF Speech Recognizer更快、更灵活。Speech Analyzer特别适合处理长时间和远距离的音频，如讲座、会议和对话。

Speech Analyzer API的设计包括Speech Analyzer类和几个其他类。你可以将一个模块类添加到分析会话中以执行特定类型的分析。例如，添加一个转录模块可以使会话成为一个转录会话，执行语音到文本的处理。你可以将音频缓冲区传递给分析器实例，它会通过转录器和其语音到文本模型进行路由。模型预测与所说音频相匹配的文本，并将其返回给你的应用程序。

如果你想了解更多关于Speech Analyzer的详细信息，可以查看Bring advanced speech-to-text to your app with SpeechAnalyzer (02:41) 这一章节。

Discover machine learning & AI frameworks on Apple platforms

Discover machine learning & AI frameworks on Apple platforms

Tour the latest updates to machine learning and AI frameworks available on Apple platforms. Whether you are an app developer ready to tap into Apple Intelligence, an ML engineer optimizing models for on-device deployment, or an AI enthusiast exploring the frontier of what is possible, we’ll offer guidance to help select the right tools for your needs.

Bring advanced speech-to-text to your app with SpeechAnalyzer

Bring advanced speech-to-text to your app with SpeechAnalyzer

Discover the new SpeechAnalyzer API for speech to text. We’ll learn about the Swift API and its capabilities, which power features in Notes, Voice Memos, Journal, and more. We’ll dive into details about how speech to text works and how SpeechAnalyzer and SpeechTranscriber can enable you to create exciting, performant features. And you’ll learn how to incorporate SpeechAnalyzer and live transcription into your app with a code-along.

What’s new in visionOS 26

What’s new in visionOS 26

Explore exciting new features in visionOS 26. Discover enhanced volumetric APIs and learn how you can combine the power of SwiftUI, RealityKit and ARKit. Find out how you can build more engaging apps and games using faster hand tracking and input from spatial accessories. Get a sneak peek at updates to SharePlay, Compositor Services, immersive media, spatial web, Enterprise APIs, and much more.