Can I run an LLM on my iPhone?

Asked on 07/16/2024

169 searches

Yes, you can run a large language model (LLM) on your iPhone. Apple provides the necessary tools and frameworks to facilitate this. You can start with any PyTorch model and convert it into the Core ML format using Core ML tools. These tools offer various optimization techniques, such as quantization and efficient key-value caching in LLMs, to ensure the model runs efficiently on Apple devices. Core ML optimizes hardware-accelerated execution across the CPU, GPU, and neural engine, making it possible to run a wide array of models, including large language models, on iOS devices.

For more details, you can refer to the Platforms State of the Union session at WWDC 2024.