Francesco Cariaggi

The uncompromising intro to KV caching

A basic optimization for autoregressive generation with Transformers

By Francesco Cariaggi

Posted on January 7, 2026

In this blog post, I will try my best to explain key-value caching, or KV caching in short, in a beginner-friendly way, but with enough technical depth to make it enjoyable for non-beginners too. The “uncompromising” part of the title refers to the intention of avoiding shortcuts or simplifications that... [Read More]

🏷️

Exotic floating-point formats

Training large-scale neural networks with low precision

By Francesco Cariaggi

Posted on September 13, 2025

In this blog post, I will strive to provide a down-to-earth introduction to “exotic” floating-point formats, with a special focus on how they can be leveraged to efficiently train large-scale AI models. By the end of this blog post, you will be aware of both the positive and negative implications... [Read More]

🏷️

Neural Audio Codecs & (Residual) Vector Quantization

The technology behind State-of-the-Art Audio AI models

By Francesco Cariaggi

Posted on February 7, 2025

In this blog post, I’ll take you through two important concepts behind modern Audio AI models such as Google’s AudioLM and VALL-E, Meta’s AudioGen and MusicGen, Microsoft’s NaturalSpeech 2, Suno’s Bark, Kyutai’s Moshi and Hibiki, and many more: Neural Audio Codecs and (Residual) Vector Quantization. [Read More]

🏷️