RNNs can keep an internal state that captures information regarding the past inputs, that makes them nicely-suited for tasks for instance speech recognition, purely natural language processing, and language translation.RNNs use a backpropagation as a result of time (BPTT) algorithm to find out the gradients, which is a bit different from regular ba