Transformers
The Mathematics Behind Transformer Attention
A deep dive into the self-attention mechanism — from scaled dot-product attention to multi-head projections and why positional encoding matters.