athena.data.datasets.asr.speech_recognition_batch_bins
¶
audio dataset
Module Contents¶
Classes¶
SpeechRecognitionDatasetBatchBinsBuilder |
Functions¶
|
data loader |
- athena.data.datasets.asr.speech_recognition_batch_bins.data_loader(dataset_builder, batch_size=1, num_threads=1)¶
data loader
- class athena.data.datasets.asr.speech_recognition_batch_bins.SpeechRecognitionDatasetBatchBinsBuilder(config=None)¶
Bases:
athena.data.datasets.asr.speech_recognition.SpeechRecognitionDatasetBuilder
SpeechRecognitionDatasetBatchBinsBuilder
- property sample_shape_batch_bins¶
@property
- Returns
sample_shape of the dataset:
{ "input": tf.TensorShape([None, None, dim, nc]), "input_length": tf.TensorShape([None]), "output_length": tf.TensorShape([None]), "output": tf.TensorShape([None, None]), }
- Return type
dict
- default_config¶
- preprocess_data(file_path)¶
- __getitem__(index)¶
- __len__()¶
- as_dataset(batch_size=16, num_threads=1)¶
return tf.data.Dataset object
- shard(num_shards, index)¶
creates a Dataset that includes only 1/num_shards of this dataset
- batch_wise_shuffle(batch_size=1, epoch=-1, seed=917)¶
Batch-wise shuffling of the data entries.
- Parameters
batch_size (int, optional) – an integer for the batch size. Defaults to 1
. (in batch_bins mode) –