Is there any SDK or API that can recognize a book's title and author from a photo?

Asked on 06/12/2025

Yes. Apple has introduced a new API in the Vision framework, RecognizeDocumentsRequest, that detects and extracts structured elements and important information from images of documents, including text. It could be used to read a book's title and author from a photo of the cover. The API recognizes text in 26 languages, groups lines of text into paragraphs, detects machine-readable codes, and identifies important information such as email addresses, phone numbers, and URLs. Note that it returns the recognized text; deciding which part is the title and which is the author would be up to your app.
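As a rough illustration of the approach described above, here is a minimal Swift sketch. It assumes the property names shown in the WWDC session (observations.first?.document, document.paragraphs, transcript), which should be checked against the current SDK. The BookCoverGuess type and the guessTitleAndAuthor heuristic are hypothetical helpers written for this example and are not part of Vision; real covers vary a lot, so treat the "first paragraph is the title, second is the author" rule as a placeholder only.

```swift
import Foundation
import Vision

/// Hypothetical result type for this example; not part of Vision.
struct BookCoverGuess {
    var title: String?
    var author: String?
}

/// Hypothetical, deliberately naive heuristic: after dropping empty entries,
/// treat the first paragraph as the title and the second as the author.
func guessTitleAndAuthor(from paragraphs: [String]) -> BookCoverGuess {
    let cleaned = paragraphs
        .map { $0.trimmingCharacters(in: .whitespacesAndNewlines) }
        .filter { !$0.isEmpty }
    return BookCoverGuess(title: cleaned.first, author: cleaned.dropFirst().first)
}

/// Runs RecognizeDocumentsRequest on encoded image data (for example JPEG or PNG)
/// and passes the recognized paragraphs to the heuristic above.
/// Assumption: property names follow the WWDC session sample code.
@available(iOS 26.0, macOS 26.0, *)
func readBookCover(imageData: Data) async throws -> BookCoverGuess {
    let request = RecognizeDocumentsRequest()
    let observations = try await request.perform(on: imageData)

    // No document detected: return an empty guess instead of throwing.
    guard let document = observations.first?.document else {
        return BookCoverGuess(title: nil, author: nil)
    }

    // Each paragraph groups recognized lines; transcript is its plain text.
    let paragraphs = document.paragraphs.map { $0.transcript }
    return guessTitleAndAuthor(from: paragraphs)
}
```

Keeping the heuristic separate from the Vision call means it can be swapped for something more robust (for example, ranking text by size or position on the cover) without touching the recognition code.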

For more details, see the session "Read documents using the Vision framework" (00:01:29).

Read documents using the Vision framework

Learn about the latest advancements in the Vision framework. We’ll introduce RecognizeDocumentsRequest, and how you can use it to read lines of text and group them into paragraphs, read tables, etc. And we’ll also dive into camera lens smudge detection, and how to identify potentially smudged images in photo libraries or your own camera capture pipeline.

Discover machine learning & AI frameworks on Apple platforms

Tour the latest updates to machine learning and AI frameworks available on Apple platforms. Whether you are an app developer ready to tap into Apple Intelligence, an ML engineer optimizing models for on-device deployment, or an AI enthusiast exploring the frontier of what is possible, we’ll offer guidance to help select the right tools for your needs.

Keep colors consistent across captures

Meet the Constant Color API and find out how it can help people use your app to determine precise colors. You’ll learn how to adopt the API, explore its scientific and marketing potential, and discover best practices for making the most of the technology.