A Bayesian Interpretation of the Light Gated Recurrent Unit
- Submitted by: Alexandre Bittar
- Last updated: 6 July 2021 - 10:00am
- Document Type: Presentation Slides
- Document Year: 2021
- Presenters: Alexandre Bittar
- Paper Code: 2242
We summarise previous work showing that the basic sigmoid activation function arises as an instance of Bayes’s theorem, and that recurrence follows from the prior. We derive a layer-wise recurrence without the assumptions of previous work, and show that it leads to a standard recurrence with modest modifications to reflect the use of log-probabilities. The resulting architecture closely resembles the light gated recurrent unit (Li-GRU), which is the current state of the art for automatic speech recognition (ASR). Although the contribution is mainly theoretical, we show that it outperforms the state of the art on the TIMIT and AMI datasets.
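The claim that the sigmoid arises from Bayes’s theorem can be checked numerically in a textbook setting. The sketch below is not taken from the slides: it assumes a two-class problem with one scalar feature and Gaussian class-conditional densities that share a standard deviation, and all function and parameter names are illustrative. Under those assumptions the posterior computed directly from Bayes’s theorem equals a sigmoid of a linear function of the input, and the prior appears only as a log-odds term in the bias.

```python
import math


def gaussian_pdf(x, mean, std):
    """Density of a univariate Gaussian at x."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def posterior_bayes(x, mean0, mean1, std, prior1):
    """P(class 1 | x) computed directly from Bayes's theorem."""
    joint1 = gaussian_pdf(x, mean1, std) * prior1
    joint0 = gaussian_pdf(x, mean0, std) * (1.0 - prior1)
    return joint1 / (joint0 + joint1)


def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))


def posterior_sigmoid(x, mean0, mean1, std, prior1):
    """The same posterior written as sigmoid(weight * x + bias).

    Assumption: both classes share the same std, so the quadratic
    terms in x cancel and the log-odds are linear in x.
    """
    weight = (mean1 - mean0) / std**2
    # The likelihood contributes a constant; the prior contributes its log-odds.
    bias = (mean0**2 - mean1**2) / (2.0 * std**2) + math.log(prior1 / (1.0 - prior1))
    return sigmoid(weight * x + bias)


if __name__ == "__main__":
    # Illustrative parameter values, chosen arbitrarily.
    params = dict(mean0=-1.0, mean1=2.0, std=1.5, prior1=0.3)
    for x in (-3.0, 0.0, 0.5, 4.0):
        direct = posterior_bayes(x, **params)
        via_sigmoid = posterior_sigmoid(x, **params)
        print(f"x={x:+.1f}  bayes={direct:.6f}  sigmoid={via_sigmoid:.6f}")
```

Changing `prior1` shifts only the bias, which is consistent with the statement above that recurrence follows from the prior: if an earlier posterior is fed back as the prior, it enters through this term.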