sliding-window Cepstral Mean Normalization (functional) — functional_sliding_window

Apply sliding-window cepstral mean (and optionally variance) normalization per utterance.

functional_sliding_window_cmn(
  waveform,
  cmn_window = 600,
  min_cmn_window = 100,
  center = FALSE,
  norm_vars = FALSE
)

Arguments

waveform: (Tensor): Tensor of audio of dimension (..., freq, time)
cmn_window: (int, optional): Window in frames for running average CMN computation (int, default = 600)
min_cmn_window: (int, optional): Minimum CMN window used at start of decoding (adds latency only at start). Only applicable if center == FALSE, ignored if center==TRUE (int, default = 100)
center: (bool, optional): If TRUE, use a window centered on the current frame (to the extent possible, modulo end effects). If FALSE, window is to the left. (bool, default = FALSE)
norm_vars: (bool, optional): If TRUE, normalize variance to one. (bool, default = FALSE)

tensor: Tensor of freq of dimension (..., frame)