speechcommand_dataset.Rd
Speech Commands Dataset
speechcommand_dataset(
root,
url = "speech_commands_v0.02",
folder_in_archive = "SpeechCommands",
download = FALSE,
normalization = NULL
)
(str): Path to the directory where the dataset is found or downloaded.
(str, optional): The URL to download the dataset from,
or the type of the dataset to dowload.
Allowed type values are "speech_commands_v0.01"
and "speech_commands_v0.02"
(default: "speech_commands_v0.02"
)
(str, optional): The top-level directory of the dataset. (default: "SpeechCommands"
)
(bool, optional): Whether to download the dataset if it is not found at root path. (default: FALSE
).
(NULL, bool, int or function): Optional normalization. If boolean TRUE, then output is divided by 2^31. Assuming the input is signed 32-bit audio, this normalizes to [-1, 1]. If numeric, then output is divided by that number. If function, then the output is passed as a paramete to the given function, then the output is divided by the result. (Default: NULL)
a torch::dataset()