torch
torchaudio
librosa
numpy
huggingface_hub
resemble-perth
einops
scipy
tokenizers
soundfile
s3tokenizer
conformer