Datasets |
|
---|---|
CMU Arctic Dataset |
|
Speech Commands Dataset |
|
YesNo Dataset |
|
Transforms |
|
Axis Masking |
|
Amplitude to DB |
|
Complex Norm |
|
Delta Coefficients |
|
Fade In/Out |
|
Frequency-domain Masking |
|
Inverse Mel Scale |
|
Mel Scale |
|
Mel Spectrogram |
|
Mel-frequency Cepstrum Coefficients |
|
Mu Law Decoding |
|
Mu Law Encoding |
|
Signal Resample |
|
Sliding-window Cepstral Mean Normalization |
|
Spectrogram |
|
Time Stretch |
|
Time-domain Masking |
|
Convert an audio object into a tensor |
|
Voice Activity Detector |
|
Add a volume to a waveform |
|
Functionals |
|
Combine Max (functional) |
|
Normalized Cross-Correlation Function (functional) |
|
Find Max Per Frame (functional) |
|
Wave Table Generator (functional) |
|
Median Smoothing (functional) |
|
Noise Shaping (functional) |
|
All-pass Biquad Filter (functional) |
|
Amplitude to DB (functional) |
|
Angle (functional) |
|
Probability Distribution Apply (functional) |
|
Two-pole Band Filter (functional) |
|
Band-pass Biquad Filter (functional) |
|
Band-reject Biquad Filter (functional) |
|
Bass Tone-control Effect (functional) |
|
Biquad Filter (functional) |
|
Complex Norm (functional) |
|
Delta Coefficients (functional) |
|
Contrast Effect (functional) |
|
DCT transformation matrix (functional) |
|
Frequency Bin Conversion Matrix (functional) |
|
DB to Amplitude (functional) |
|
DC Shift (functional) |
|
ISO 908 CD De-emphasis IIR Filter (functional) |
|
Detect Pitch Frequency (functional) |
|
Dither (functional) |
|
Biquad Peaking Equalizer Filter (functional) |
|
Flanger Effect (functional) |
|
Gain (functional) |
|
Griffin-Lim Transformation (functional) |
|
High-pass Biquad Filter (functional) |
|
IIR Filter (functional) |
|
Low-pass Biquad Filter (functional) |
|
Magnitude and Phase (functional) |
|
Mask Along Axis (functional) |
|
Mask Along Axis IID (functional) |
|
Mel Scale (functional) |
|
Mu Law Decoding (functional) |
|
Mu Law Encoding (functional) |
|
Overdrive Effect (functional) |
|
Phase Vocoder (functional) |
|
Phasing Effect (functional) |
|
RIAA Vinyl Playback Equalisation (functional) |
|
Sliding-window Cepstral Mean Normalization (functional) |
|
Spectrogram (functional) |
|
Treble Tone-control Effect (functional) |
|
Voice Activity Detector (functional) |
|
Kaldi API |
|
Linear Resample Indices And Weights |
|
Linear Resample Output Samples |
|
Kaldi's Resample Waveform |
|
List available audio backends |
|
Neural network modules |
|
MelResNet |
|
ResBlock |
|
Stretch2d |
|
UpsampleNetwork |
|
WaveRNN |
|
Audio I/O |
|
Load Audio File |
|
Informational |
|
Audio Information |
|
Utilities |
|
Extract Archive |
|
Misc |
|
Linear to mel frequency |
|
Mel to linear frequency |