`athena.data.datasets.kws.speech_wakeup_kaldiio_av`¶

Module Contents¶

Classes¶

SpeechWakeupDatasetKaldiIOBuilderAVCE

Dataset builder for RNN model. The builder mix the spliced frame in one dim

class athena.data.datasets.kws.speech_wakeup_kaldiio_av.SpeechWakeupDatasetKaldiIOBuilderAVCE(config=None)¶

Bases: athena.data.datasets.base.BaseDatasetBuilder

Dataset builder for RNN model. The builder mix the spliced frame in one dim For example (1, 1323) The input data format is (batch, t, dim, channel) For example (b, t, 1323, 1) The output data format is (batch, timestep)

property sample_type¶: example types

property sample_shape¶: examples shapes

property sample_signature¶: examples signature

default_config¶

preprocess_data(data_dir='')¶: loading data

video_scp_loader(scp_dir)¶: load video list from scp file return a dic

__getitem__(index)¶

splice_feature(feature, input_left_context, input_right_context)¶

splice features according to input_left_context and input_right_context input_left_context: the left features to be spliced,

repeat the first frame in case out the range

input_right_context: the right features to be spliced,: repeat the last frame in case out the range

Parameters: feature – the input features, shape may be [timestamp, dim, 1]
Returns: the spliced features
Return type: splice_feat

athena.data.datasets.kws.speech_wakeup_kaldiio_av¶

Module Contents¶

Classes¶

`athena.data.datasets.kws.speech_wakeup_kaldiio_av`¶