Conference Proceedings

Efficient orthogonal parametrisation of recurrent neural networks using householder reflections

Z Mhammedi, A Hellicar, A Rahman, J Bailey

34th International Conference on Machine Learning, ICML 2017 | Published : 2017


© 2017 by the author(s). The problem of learning long-term dependencies in sequences using Recurrent Neural Networks (RNNs) is still a major challenge. Recent methods have been suggested to solve this problem by constraining the transition matrix to be unitary during training which ensures that its norm is equal to one and prevents exploding gradients. These methods either have limited expressiveness or scale poorly with the size of the network when compared with the simple RNN case, especially when using stochastic gradient descent with a small mini-batch size. Our contributions are as follows; we first show that constraining the transition matrix to be unitary is a special case of an ortho..

View full abstract

Citation metrics