Chapter 16 Quiz: Recurrent Neural Networks
Learning objectives
- Test your understanding of RNN hidden states, LSTM gates, vanishing gradients, and sequence modeling
This quiz covers the lecture material from Chapter 16.
Key Concepts Review
- RNN: Hidden state carries information across time steps.
- Vanishing Gradient: multiplied at each step causes gradients to shrink exponentially.
- LSTM: 3 gates (forget, input, output). Cell state with additive updates preserves gradients.
- GRU: 2 gates (update, reset). Simpler than LSTM, similar performance.
- Bidirectional: Process sequence forward and backward. Use when full sequence is available.
References
- Goodfellow, I., Bengio, Y., Courville, A. (2016). Deep Learning, ch. 10. MIT Press.
- Hochreiter, S., Schmidhuber, J. (1997). Long short-term memory. Neural Comput. 9(8), 1735–1780.
- Mousavi, S.M., Beroza, G.C. (2022). Deep-learning seismology. Science 377, eabm4470.