athena.data.datasets.mpc.mpc_speech_set_kaldiio

audio dataset

Module Contents

Classes

MpcSpeechDatasetKaldiIOBuilder

MpcSpeechDatasetKaldiIOBuilder

class athena.data.datasets.mpc.mpc_speech_set_kaldiio.MpcSpeechDatasetKaldiIOBuilder(config=None)

Bases: athena.data.datasets.mpc.mpc_speech_set.MpcSpeechDatasetBuilder

MpcSpeechDatasetKaldiIOBuilder This data builder is a offline feature data builder and is used to mcp training

default_config
preprocess_data(file_path, apply_sort_filter=True)

generate a list of tuples (feat_key, speaker).

__getitem__(index)
compute_cmvn_if_necessary(is_necessary=True)

compute cmvn file