Speech Commands Dataset

speechcommand_dataset(
  root,
  url = "speech_commands_v0.02",
  folder_in_archive = "SpeechCommands",
  download = FALSE,
  normalization = NULL
)

Arguments

root: (str): Path to the directory where the dataset is found or downloaded.
url: (str, optional): The URL to download the dataset from, or the type of the dataset to dowload. Allowed type values are "speech_commands_v0.01" and "speech_commands_v0.02" (default: "speech_commands_v0.02")
folder_in_archive: (str, optional): The top-level directory of the dataset. (default: "SpeechCommands")
download: (bool, optional): Whether to download the dataset if it is not found at root path. (default: FALSE).
normalization: (NULL, bool, int or function): Optional normalization. If boolean TRUE, then output is divided by 2^31. Assuming the input is signed 32-bit audio, this normalizes to [-1, 1]. If numeric, then output is divided by that number. If function, then the output is passed as a paramete to the given function, then the output is divided by the result. (Default: NULL)

a torch::dataset()