• two different pruning criteria employed to accelerate computation and conserve memory

    • during backward pass, at any time, values falling more than a beam width below the maximum value at that time are ignored, in the following forward pass, values are computed only if the corresponding backward probabilities are valid

    • at any time, if the ratio of the product of forward and backward probabilities divided by the full utterance probability below a fixed threshold, both values are ignored