athena.data.datasets.asr.speech_recognition_batch_bins_kaldiio
¶
audio dataset
Module Contents¶
Classes¶
SpeechRecognitionDatasetBatchBinsKaldiIOBuilder |
Functions¶
|
data loader |
- athena.data.datasets.asr.speech_recognition_batch_bins_kaldiio.data_loader(dataset_builder, batch_size=1, num_threads=1)¶
data loader
- class athena.data.datasets.asr.speech_recognition_batch_bins_kaldiio.SpeechRecognitionDatasetBatchBinsKaldiIOBuilder(config=None)¶
Bases:
athena.data.datasets.asr.speech_recognition_kaldiio.SpeechRecognitionDatasetKaldiIOBuilder
SpeechRecognitionDatasetBatchBinsKaldiIOBuilder
- property sample_shape_batch_bins¶
@property
- Returns
sample_shape of the dataset:
{ "input": tf.TensorShape([None, None, dim, nc]), "input_length": tf.TensorShape([None]), "output_length": tf.TensorShape([None]), "output": tf.TensorShape([None, None]), }
- Return type
dict
- default_config¶
- preprocess_kaldi_data(file_dir, apply_sort_filter=True)¶
- read_shape_file(file_dir=None)¶
- __getitem__(index)¶
- __len__()¶
- as_dataset(batch_size=16, num_threads=1)¶
return tf.data.Dataset object
- shard(num_shards, index)¶
creates a Dataset that includes only 1/num_shards of this dataset
- batch_wise_shuffle(batch_size=1, epoch=-1, seed=917)¶
Batch-wise shuffling of the data entries.
- Parameters
batch_size (int, optional) – an integer for the batch size. Defaults to 1
. (in batch_bins mode) –