torchaudio_load.Rd
Loads an audio file from disk into a tensor
torchaudio_load( filepath, out = NULL, normalization = TRUE, channels_first = TRUE, num_frames = 0L, offset = 0L, signalinfo = NULL, encodinginfo = NULL, filetype = NULL )
filepath | (str): Path to audio file |
---|---|
out | (Tensor): An optional output tensor to use instead of creating one. (Default: |
normalization | (bool, float or function): Optional normalization.
If boolean |
channels_first | (bool): Set channels first or length first in result. (Default: |
num_frames | (int): Number of frames to load. 0 to load everything after the offset. (Default: |
offset | (int): Number of frames from the start of the file to begin data loading. (Default: |
signalinfo | (str): A sox_signalinfo_t type, which could be helpful if the
audio type cannot be automatically determined. (Default: |
encodinginfo | (str): A sox_encodinginfo_t type, which could be set if the
audio type cannot be automatically determined. (Default: |
filetype | (str): A filetype or extension to be set if sox cannot determine it
automatically. (Default: |
list(Tensor, int): An output tensor of size `[C x L]` or `[L x C]` where L is the number of audio frames and C is the number of channels. An integer which is the sample rate of the audio (as listed in the metadata of the file)
if (FALSE) { if(torch::torch_is_installed()) { mp3_filename <- system.file("sample_audio_2.mp3", package = "torchaudio") data = torchaudio_load(mp3_filename) print(data[[1]]$size()) norm_fun <- function(x) torch::torch_abs(x)$max() data_vol_normalized = torchaudio_load(mp3_filename, normalization= norm_fun) print(data_vol_normalized[[1]]$abs()$max()) } }