athena.data.datasets.kws.speech_wakeup_kaldiio_av

Module Contents

Classes

SpeechWakeupDatasetKaldiIOBuilderAVCE

Dataset builder for RNN model. The builder mix the spliced frame in one dim

class athena.data.datasets.kws.speech_wakeup_kaldiio_av.SpeechWakeupDatasetKaldiIOBuilderAVCE(config=None)

Bases: athena.data.datasets.base.BaseDatasetBuilder

Dataset builder for RNN model. The builder mix the spliced frame in one dim For example (1, 1323) The input data format is (batch, t, dim, channel) For example (b, t, 1323, 1) The output data format is (batch, timestep)

property sample_type

example types

property sample_shape

examples shapes

property sample_signature

examples signature

default_config
preprocess_data(data_dir='')

loading data

video_scp_loader(scp_dir)

load video list from scp file return a dic

__getitem__(index)
splice_feature(feature, input_left_context, input_right_context)

splice features according to input_left_context and input_right_context input_left_context: the left features to be spliced,

repeat the first frame in case out the range

input_right_context: the right features to be spliced,

repeat the last frame in case out the range

Parameters

feature – the input features, shape may be [timestamp, dim, 1]

Returns

the spliced features

Return type

splice_feat