x_t = x₀ + ε
x_t = x₀ masked at rate t/T
Standard diffusion operates in continuous space and predicts a correction vector — the noise to subtract. Text diffusion operates in discrete space: there is nothing to subtract from a token ID. Instead the model predicts a probability distribution over the clean vocabulary at each masked position, then reveals tokens by confidence. Same high-level intuition, completely different mathematical objective.