NSCache

Asked on 01/09/2025

1 search

The context provided does not mention NSCache specifically. However, it does discuss the use of caching techniques in the context of machine learning models, particularly the use of a key-value cache (KV cache) to improve the efficiency of transformer models on Apple silicon. This technique is used to store key and value vectors to avoid redundant calculations, thereby speeding up the decoding process in large language models.

If you are interested in learning more about caching techniques in machine learning models on Apple platforms, you might want to check out the session titled "Bring your machine learning and AI models to Apple silicon," which discusses transformer optimization and the use of stateful models to enhance performance. You can find more details in the chapter on Transformer optimization.

For more information on this topic, you can refer to the session Bring your machine learning and AI models to Apple silicon (17:08).