New documents capture api

Asked on 06/11/2025

1 search

The new documents capture API is part of the Vision framework enhancements introduced at WWDC 2025. This API, known as the RecognizeDocuments Request, allows developers to extract structural elements and important information from documents. It can detect structures such as tables and lists, group lines of text into paragraphs, and identify machine-readable codes like QR codes. This API supports text recognition in 26 languages and is designed to provide a better understanding of document structures, making it easier to parse with fewer lines of code.

For more details, you can refer to the session titled "Read documents using the Vision framework" from WWDC 2025. The relevant chapter for reading documents starts at 00:01:22.