`athena.models.asr.speech_conformer_ctc`

speech transformer implementation

Module Contents

class athena.models.asr.speech_conformer_ctc.SpeechConformerCTC(data_descriptions, config=None)

Standard implementation of a SpeechTransformer. The model mainly consists of the x_net for input preparation and the transformer itself.

decode(samples, hparams, lm_model=None)

Initializes the model for decoding; the decoder is called here to create predictions.

Parameters

Returns:

predictions: the corresponding decoding results

argmax(samples, hparams)

Argmax decoding for the Conformer CTC model.
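As a rough illustration of what argmax (greedy) CTC decoding generally involves, the sketch below picks the best class per frame, collapses consecutive repeats, and drops blanks. It is not Athena's implementation: the function name `ctc_greedy_decode`, the plain-list input format, and the `blank_id` parameter are assumptions made for this example only.

```python
from typing import List, Sequence


def ctc_greedy_decode(frame_scores: Sequence[Sequence[float]], blank_id: int) -> List[int]:
    """Greedy CTC decoding over per-frame class scores (hypothetical helper)."""
    # Step 1: take the highest-scoring class index for every frame.
    best_path = [max(range(len(frame)), key=frame.__getitem__) for frame in frame_scores]

    # Step 2: collapse consecutive repeats, then drop blank symbols.
    decoded: List[int] = []
    previous = None
    for token in best_path:
        if token != previous and token != blank_id:
            decoded.append(token)
        previous = token
    return decoded


# Toy input: 5 frames, 3 classes, with class 2 assumed to be the blank.
example_scores = [
    [0.1, 0.8, 0.1],
    [0.1, 0.7, 0.2],
    [0.1, 0.1, 0.8],
    [0.2, 0.7, 0.1],
    [0.9, 0.05, 0.05],
]
greedy_result = ctc_greedy_decode(example_scores, blank_id=2)
```

In the toy input the blank in the third frame separates two runs of class 1, so both occurrences survive the collapse step.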

Parameters

ctc_prefix_beam_search(samples, hparams, ctc_final_layer) → List[int]
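The page gives no description for this method, so the following is only a generic sketch of the CTC prefix beam search technique, written in plain Python over per-frame log-probabilities. It does not reflect Athena's internals: `ctc_prefix_beam_search_sketch`, `log_add`, `blank_id`, and the list-based input are all hypothetical names and assumptions for illustration.

```python
import math
from collections import defaultdict
from typing import Dict, List, Sequence, Tuple

NEG_INF = float("-inf")


def log_add(*values: float) -> float:
    """Numerically stable log(sum(exp(v))) that tolerates -inf entries."""
    finite = [v for v in values if v != NEG_INF]
    if not finite:
        return NEG_INF
    top = max(finite)
    return top + math.log(sum(math.exp(v - top) for v in finite))


def ctc_prefix_beam_search_sketch(
    log_probs: Sequence[Sequence[float]], beam_size: int, blank_id: int
) -> List[int]:
    """Return the most probable label prefix (hypothetical helper)."""
    # Each prefix maps to (log P of paths ending in blank, log P ending in non-blank).
    beams: Dict[Tuple[int, ...], Tuple[float, float]] = {(): (0.0, NEG_INF)}
    for frame in log_probs:
        candidates = defaultdict(lambda: (NEG_INF, NEG_INF))
        for prefix, (p_blank, p_non_blank) in beams.items():
            for token, log_p in enumerate(frame):
                if token == blank_id:
                    # A blank keeps the prefix unchanged.
                    c_b, c_nb = candidates[prefix]
                    candidates[prefix] = (
                        log_add(c_b, p_blank + log_p, p_non_blank + log_p), c_nb)
                elif prefix and token == prefix[-1]:
                    # Repeat without a blank in between: prefix stays the same.
                    c_b, c_nb = candidates[prefix]
                    candidates[prefix] = (c_b, log_add(c_nb, p_non_blank + log_p))
                    # Repeat after a blank: prefix grows by one token.
                    extended = prefix + (token,)
                    e_b, e_nb = candidates[extended]
                    candidates[extended] = (e_b, log_add(e_nb, p_blank + log_p))
                else:
                    # A new token always extends the prefix.
                    extended = prefix + (token,)
                    e_b, e_nb = candidates[extended]
                    candidates[extended] = (
                        e_b, log_add(e_nb, p_blank + log_p, p_non_blank + log_p))
        # Keep only the beam_size most probable prefixes.
        ranked = sorted(candidates.items(), key=lambda item: log_add(*item[1]), reverse=True)
        beams = dict(ranked[:beam_size])
    best_prefix = max(beams.items(), key=lambda item: log_add(*item[1]))[0]
    return list(best_prefix)


# Toy input: 2 frames, class 0 assumed blank (p=0.6), class 1 a label (p=0.4).
toy_log_probs = [[math.log(0.6), math.log(0.4)]] * 2
wide_beam = ctc_prefix_beam_search_sketch(toy_log_probs, beam_size=2, blank_id=0)
narrow_beam = ctc_prefix_beam_search_sketch(toy_log_probs, beam_size=1, blank_id=0)
```

The toy input shows why merging paths per prefix matters: the empty output has probability 0.36, while the three alignments that yield the single label sum to 0.64, so a beam of 2 recovers the label that a beam of 1 prunes away after the first frame.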

freeze_ctc_prefix_beam_search(samples, ctc_final_layer, hparams=None, beam_size=1) → List[int]

freeze_beam_search(samples, beam_size)

Beam search for freeze; only supports batch=1.

Parameters

restore_from_pretrained_model(pretrained_model, model_type='')

Restore from a pretrained model.