Vanishing gradients : shallow networks :: exploding gradients : ?

Question

Seekh · Accepted Answer

Vanishing gradients are a problem that mainly shows up in shallow networks, where the signal from the output layer fades as it travels back through the layers. In the same way, exploding gradients are a problem that mainly shows up in deep networks, where the signal grows uncontrollably as it propagates backward. Both issues arise when the product of many weight matrices multiplies a gradient, causing it to shrink or blow up. For example, if a deep network has many layers with weights slightly larger than one, the gradient can grow exponentially, making training unstable. Thus, the analogy is Vanishing gradients : shallow networks :: Exploding gradients : deep networks.

Vanishing gradients : shallow networks :: exploding gradients : ?

Learning Path

Choose the Best Answer

Understanding the Answer

Key Concepts

Deep Dive: Vanishing/Exploding Gradients Problem

Definition

Topic Definition

Ready to Master More Topics?