Can I run llm in my phone?

Yes, you can run large language models (LLMs) on your phone. Apple provides tools and frameworks to help you import and run AI models, including large language models, on various Apple devices such as iPhones.

You can start with any PyTorch model and convert it into the Core ML format using Core ML tools. These tools offer various optimization techniques, such as quantization and efficient key-value caching in LLMs, to ensure the model runs efficiently on Apple hardware. Core ML optimizes hardware-accelerated execution across the CPU, GPU, and neural engine, making it suitable for running AI models on iOS devices.

For more details, you can refer to the Platforms State of the Union session from WWDC 2024.