Exploding gradients: This happens if the gradient is simply too massive, making an unstable design. In such a case, the model weights will grow too large, and they're going to inevitably be represented as NaN. . The gradient admits multiple generalizations to more normal functions on manifolds; see § Generalizations. https://elizabethu368xci6.mappywiki.com/user