Information about the machine learning technologies behind "Select Subject" and "Visual Look Up"

Asked on 2025-08-14

1 search

The machine learning technologies behind "Select Subject" and "Visual Look Up" are primarily powered by Apple's Vision framework. This framework provides a range of capabilities for visual intelligence, including object detection and image analysis, which are crucial for features like "Select Subject" and "Visual Look Up."

  1. Vision Framework: The Vision framework offers APIs that allow developers to integrate machine learning into their apps for various use cases, such as person and object detection, body and hand pose tracking, and trajectory analysis. These APIs run entirely on-device, ensuring performance and security. The framework includes requests like detect face rectangles and saliency image requests, which help in identifying and highlighting important objects in an image.

  2. Machine Learning on Apple Platforms: Apple's platforms leverage machine learning models that are optimized for on-device execution. This allows for efficient and low-latency inference, which is essential for real-time features like "Select Subject" and "Visual Look Up."

  3. Apple Intelligence: The intelligence built into Apple's operating systems powers many features, including visual intelligence capabilities. These features are designed to be highly interactive while maintaining user privacy by keeping data on-device.

For more detailed information on how these technologies are integrated and used, you can refer to the session Explore machine learning on Apple platforms (03:55) and Discover Swift enhancements in the Vision framework (01:07).