transform_mel_scale.Rd
Turn a normal STFT into a mel frequency STFT, using a conversion matrix. This uses triangular filter banks.
transform_mel_scale(
  n_mels = 128,
  sample_rate = 16000,
  f_min = 0,
  f_max = NULL,
  n_stft = NULL
)
n_mels (int, optional): Number of mel filterbanks. (Default: 128)
sample_rate (int, optional): Sample rate of audio signal. (Default: 16000)
f_min (float, optional): Minimum frequency. (Default: 0)
f_max (float or NULL, optional): Maximum frequency. If NULL, sample_rate %/% 2 is used. (Default: NULL)
n_stft (int, optional): Number of bins in STFT. Calculated from the first input if NULL is given. See n_fft in the Spectrogram transform. (Default: NULL)
tensor: Mel frequency spectrogram of size (..., n_mels, time).
forward param: specgram (Tensor): Spectrogram tensor of dimension (..., freq, time).
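To make the conversion matrix and the triangular filter banks concrete, here is a minimal base R sketch of the idea. It assumes the HTK-style mel formula (2595 * log10(1 + f / 700)), which is a common choice but not confirmed by this page. hz_to_mel, mel_to_hz and mel_filterbank are hypothetical names for this illustration and are not torchaudio functions. The sketch works on a plain (freq, time) matrix without leading batch dimensions.

```r
# Assumed HTK-style mel scale; the package may use a different variant.
hz_to_mel <- function(f) 2595 * log10(1 + f / 700)
mel_to_hz <- function(m) 700 * (10^(m / 2595) - 1)

# Hypothetical helper: builds an (n_mels x n_stft) matrix of triangular filters.
mel_filterbank <- function(n_stft, n_mels, sample_rate, f_min = 0,
                           f_max = sample_rate %/% 2) {
  # Frequency of each STFT bin, from 0 Hz up to Nyquist.
  bin_freqs <- seq(0, sample_rate %/% 2, length.out = n_stft)
  # n_mels + 2 points equally spaced on the mel scale: each filter uses
  # three consecutive points as its left edge, peak, and right edge.
  edges <- mel_to_hz(seq(hz_to_mel(f_min), hz_to_mel(f_max),
                         length.out = n_mels + 2))
  fb <- matrix(0, nrow = n_mels, ncol = n_stft)
  for (m in seq_len(n_mels)) {
    rising  <- (bin_freqs - edges[m]) / (edges[m + 1] - edges[m])
    falling <- (edges[m + 2] - bin_freqs) / (edges[m + 2] - edges[m + 1])
    # A triangle is the lower of the two slopes, clipped at zero.
    fb[m, ] <- pmax(0, pmin(rising, falling))
  }
  fb
}

fb <- mel_filterbank(n_stft = 201, n_mels = 40, sample_rate = 16000)
specgram <- matrix(runif(201 * 100), nrow = 201, ncol = 100)  # (freq, time)
mel_specgram <- fb %*% specgram                                # (n_mels, time)
dim(mel_specgram)
```

The matrix product replaces the freq axis with n_mels and leaves the time axis untouched, which matches the (..., freq, time) to (..., n_mels, time) shapes documented above.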